Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2023 Jan 10;119:105820. doi: 10.1016/j.engappai.2023.105820

Prognosticating various acute covid lung disorders from COVID-19 patient using chest CT Images

Suganya D 1,, Kalpana R 1
PMCID: PMC9829610  PMID: 36644478

Abstract

The global spread of coronavirus illness has surged dramatically, resulting in a catastrophic pandemic situation. Despite this, accurate screening remains a significant challenge due to difficulties in categorizing infection regions and the minuscule difference between typical pneumonia and COVID (Coronavirus Disease) pneumonia. Diagnosing COVID-19 using the Mask Regional-Convolutional Neural Network (Mask R-CNN) is proposed to classify the chest computerized tomographic (CT) images into COVID-positive and COVID-negative. Covid-19 has a direct effect on the lungs, causing damage to the alveoli, which leads to various lung complications. By fusing multi-class data, the severity level of the patients can be classified using the meta-learning few-shot learning technique with the residual network with 50 layers deep (ResNet-50) as the base classifier. It has been tested with the outcome of COVID positive chest CT image data. From these various classes, it is possible to predict the onset possibilities of acute COVID lung disorders such as sepsis, acute respiratory distress syndrome (ARDS), COVID pneumonia, COVID bronchitis, etc. The first method of classification is proposed to diagnose whether the patient is affected by COVID-19 or not; it achieves a mean Average Precision (mAP) of 91.52% and G-mean of 97.69% with 98.60% of classification accuracy. The second method of classification is proposed for the detection of various acute lung disorders based on severity provide better performance in all the four stages, the average accuracy is of 95.4%, the G-mean for multiclass achieves 94.02%, and the AUC is 93.27% compared with the cutting-edge techniques. It enables healthcare professionals to correctly detect severity for potential treatments.

Keywords: Mask R-CNN, Covid-19, Meta learning, Few-shot learning technique, ResNet-50, ARDS

1. Introduction

Many respiratory viral diseases have spread over the last two decades, such as Severe-Acute-Respiratory-Syndrome (SARS), Middle-East-Respiratory-Syndrome (MERS), and other respiratory diseases. In 2019, major pathogens mainly targeting the respiratory system for humans include novel coronaviruses (CoV) like SARS-CoV, SARS-CoV-2, MERS, MERS-CoV, and influenza-A viruses. Their infections can cause anything from minor respiratory infections to a serious case of pneumonia, even leading to respiratory failure. The first beta coronavirus (SARS-CoV) outbreak in November 2002 led to 774 deaths. COVID-19, a new coronavirus disease that gives rise to a new beta coronavirus (SARS-CoV-2) in 2019, rapidly spreads around the world. Almost 613 million people were affected globally, which equates to about 6.5 million deaths as of September 2022. SAR-CoV-2 is considered more transmissible and virulent than the other two beta-coronaviruses, and it has immediately turned into a pandemic that has become a global medical crisis. Corona virus can be identified using several methods such as antigen testing, antibody testing, Reverse Transcription Polymerase Chain Reaction (RT-PCR) etc. But the challenge is to find the appropriate disease and its severity, which leads to death. Imaging modalities are essential in exposing censorious findings related to the continuation of serious illness. A diverse committee of medical professionals and radiologists from various countries has issued a directive to use chest X-ray and computerized tomography on Covid patients beginning in April 2020. If a person’s breathing is getting worse or they are at risk for a serious breathing illness, they should get chest imaging to find out how bad it is Aswathy et al. (2021). Several acute respiratory syndromes can be identified and classified based on the severity of the corona virus positive patients. Fig. 1 shows the statistical report of covid-19 severity using a CT score categorized by different age groups.

Fig. 1.

Fig. 1

Age category of Covid-19 using CT Severity score.

Finding the severity of chronic obstructive pulmonary disease (COPE) at an early stage is still a big challenge. This work helps to categorize the severity of disease and save lives. Fig. 2 explains the general workflow of this research paper, where it initially does a binary classification. It tells whether the victim is affected by covid-19 or not affected by covid-19 using a pre-trained model of the Mask R-CNN algorithm. Madani et al. (2022) reviewed the importance of artificial intelligence to detect breast cancer with different imaging modalities and explained how imaging modalities are efficient in advanced diagnosis. Several AI-based networks have been used for various diagnoses; Mask R-CNN plays an efficient role in image segmentation and classification. Johnson JW proposed automatic nucleus segmentation using Mask R-CNN (Johnson, 2020). Segmentation and detection of pulmonary nodules using 3D CT scan images were discussed by Kopelowitz and Engelhard (2019) and Liu et al. (2018). Mask R-CNN also provides better results in liver segmentation from multi-model images, as proposed by Mulay et al. (2019). Dhieb et al. (2019) suggest using mask r-CNN deep learning models to create a framework for automatically classifying and counting blood cells. Anantharaman et al. (2018) talk about how an improved mask (R-CNN) can be used to find and divide up oral diseases. Hand images can be segmented at different viewpoints and tracked using the mask R-CNN combination proposed by Nguyen et al. (2018). Endoscopic images are used to diagnose gastric cancer at an early stage, as discussed by Shibata et al. (2020), and the identification of breast cancer and classifying its severity are evaluated using Mask R-CNN by Chiao et al. (2019).

Fig. 2.

Fig. 2

General workflow of Research.

Meta-based few-shot learning: Previous works used meta-learning to construct a model or optimizer that can rapidly modify and update the values for adaptation to the unseen tasks outlined by Podder et al. (2021) in order to address the data insufficiency in few-shot learning. Ye et al. (2020), for instance, talked about the Meta Network, which gathered meta-level information for quick generalization. The Long Short Term Memory (LSTM) meta-learner model trains the optimization technique, as discussed by Munkhdalai and Yu (2017). By learning only the initial learner parameters, the model-agnostic meta-learning model (Finn et al., 2017, Antoniou et al., 2018) simplified the aforementioned meta-network and achieved rapid adaptation with respect to those initial values and good generalizability to the new tasks. The similarities between two images are also learned using meta-learning methods. Matching-Net (Vinyals et al., 2016) proposed to map a limited labeled​ support set to its label and find the closest labeled example, in addition to figuring out the class of each instance in the query set. With the use of the class-wise mean and the Euclidean distance, Proto-Net was developed by Snell et al. (2017), who generalized Matching Net from one-shot learning to few-shot learning. Relation-Net was made by Sung et al. (2018) to learn useful metrics. It uses few-shot graph neural network models and CNN-based relation modules.

Non-Meta-based Few-shot learning: In addition to those non-meta methods based on meta-learning, there are other non-meta methods that use high-density characterizations of image regions to calculate the distances, as in Zhang et al., 2020a, Zhang et al., 2020b or use cosine similarity measures to predict the novel classifier with a weight generator and instantly set the weights based on the activations of the embedding layer (Qi et al., 2018). In order to attain competitive performance, Chen et al. recommended limiting intra-class variation as well as constraining similarity. Both the meta and non-meta techniques used a fixed feature extractor trained on the based classes, which is insufficient to account for the domain difference between the base and novel classes (Chen et al., 2020). Approaching the few-shot problem by finding a meta-learning scheme through adaptive search that is compatible with both meta and non-meta approaches as opposed to learning more sophisticated optimizers or classification metrics

Here the severity of covid-19 affected person can be classified using non-meta based few shot leaning with the help of cosine similarity. The proposed model not only tells the severity level of a patient but also tells the infections which cause risk. The four main infections which help to differentiate the severity of covid positive patients are discussed below.

  • Sepsis: COVID-19 has the potential to cause sepsis. When an infection causes the immune system to overreact, which can lead to tissue damage, organ failure, or even death, it is a dangerous complication that can be caused by an infection.

  • Acute Respiratory Distress Syndrome (ARDS): Bacteria can be broken down more easily because of their longer lifespan. The core purpose of the monocyte cell is to provide a piece of microbe to the T cell so that it can recognize it again in potential attacks.

  • Covid-Pneumonia: It causes immune responses such as hives and hay fever, as well as diseases of the nervous and collagenous systems, the spleen, parasitic inflammation, and other types of allergies; however, their primary prey is parasites such as tapeworms and hook worms.

  • Covid-Bronchitis: Basophils help with allergic and antigen reactions by releasing histamine, which is a sign of an allergy. This causes blood vessels to widen and helps the immune system.

Train the model by fusing multi-model data, which includes the CT images of sepsis, ARDS, Covid Pneumonia, Covid Bronchitis etc. It is possible to forecast the exactingness level of Corona virus positive individuals by combining these multi-model data sets, allowing for better treatment. The novelty of this work is detecting not only the severity of the particular patient but also identifying the disease that is causing that severity. If the patient is affected by COVID-Sepsis, then the individual is in the critical stage, if the patient is affected by ARDS, they are in the severe stage. If the appropriate treatment is started quickly, it can prevent the loss. Patients with COVID pneumonia are in a moderate stage of illness, so proper medication will help them recover quickly. Covid-Bronchitis is in the early or mild stage, many people affected during this stage of the pandemic will be cured easily.

The main contributions of the proposed work are as follows:

  • Classification-1: The mask R-CNN is used to detect COVID-19 using chest CT scan images; the CNN used here is ResNet-50, which classifies whether the patient is affected by the coronavirus or not.

  • Classification-2: Prognosticate the severity of covid-19 as four classes using the four main infections (Sepsis, ARDS, Covid-Pneumonia and COVID-Bronchitis), which help to classify the covid-positive patients’ severity as critical, severe, moderate, and mild using the few-shot learning method using the cosine similarity of base classifier ResNet-50.

  • The output of classification-1 (Covid-19 positive) is used as input in classification-2 to find its severity. It not only classifies the severity but also classify the disease (infections) which causes the severity. It empowers healthcare professionals to correctly identify the detection of severity for potential treatments.

The remainder of the paper is systemized in this way. Section 2 discuss about the recent works related to the proposed models and literature survey. Section 3 explains about the working of proposed approach. The experimentation findings, implementations and evaluation metrics were discussed in Section 4. The final section ended with conclusion and future enhancement.

2. Related work and literature survey

Many research studies have been conducted for prediction or detection of corona virus as wells as for prognosing the covid-19 patient’s severity in orders to provide appropriate treatment. RT-PCR is the recommended, fast and majorly used diagnosing method for diagnosis covid-19. Easy to use, quick and accurate solutions for covid-19 diagnosis are Chest X-ray and Chest CT-images. Rapid growth in the research field of artificial intelligence, machine learning and deep neural learning algorithms, Covid-19 diagnosis and its severity have been proposed.

2.1. Deep learning methodology for Covid-19 detection

Deep learning is an authentic diagnosis method for classifying COVID cases (Bai et al., 2020). Due to the busy schedules of medical practitioners and the involvement of high-risk patients, the DL serves as an adjunct method for detecting the coronavirus and its severity. Bai et al. explain the efficiency of computational resources in terms of time.  The importance of imaging modalities like chest computed tomography images for covid-19 detection is discussed by Tenda et al. (2020). Ozsahin et al., 2020a, Ozsahin et al., 2020b discuss the ensemble of various Deep Neural Networks models for diagnosing covid19 using CT scan images. It explains the comparison of various pre-trained models like VGG16, ImageNet, ResNet, CovidNet, InceptionNet, etc. Deep learning provides a great solution for the classification of medical images for clinical treatment. Raghu et al. (2019) explain in detail how transfer learning techniques can be used in the analysis of medical images. The dataset of ImageNet is being trained; this pre-trained technique provides better speedup and scaling. Polsinelli et al. (2020) use light CT to detect covid19 using a chest CT scan with a very short detection time. The chest CT scan yields both conventional and non-conventional results. Yadav and Jadhav (2019) discuss how CNN provides a deep solution for diagnosing various diseases by classifying medical images. It uses CXR images to classify the different types of pneumonia and comes to the conclusion that Deep CNN is better than Convolutional Neural Networks at classifying images with small datasets.

A pre-trained model excelled in the prognosis of various diseases using medical images. Sajja et al. (2019) discusses the prediction of cancer in the lungs using chest CT scan images. It provides better accuracy when compared to CNN and other machine learning models. The transfer learning model works well with the availability of a small covid-19 dataset. The availability of lung CT scan images for using a pre-trained neural network to diagnose COVID-19 patients is extremely limited. The data augmentation technique increases the images by creating high resolution synthetic images to provide better performance. Di et al. (2021) discusses the information collected from the CT images with the help of hypergraph learning. It contains radiometric and geographic information that a weighted vertex hypergraph learning technique can analyze and detect whether the new individual has suffered from covid19 or not. Suganya and Kalpana (2022) explain the automated detection of the Corona virus using deep multiple-instance learning (DMIL). Due to the limited datasets, this method uses the technique of CycleGAN to balance the unbalanced dataset by creating several high resolution synthetic images, which helps to obtain good accuracy comparatively. The extreme learning machine used for detecting covid-19 consequently produces better accuracy. Non-iterative and fast deep learning: multilayer extreme learning machines was discussed by Zhang et al. (2020b), which clearly explains the importance of predicting the error rate rather than finding the accuracy. According to Turkoglu (2021), the COVID-19 detection using Extreme Learning Machine (ELM) with multiple kernels based on deep neural networks achieves an accuracy of 98.36 and an F1 score of 98.25. Rajpal et al. (2022) proposed COV-ELM with a local interpretable model agnostic explanation (LIME) for detecting corona coronavirus, which produces a sensitivity of 94% and an F1 score of 95%. Inam et al. (2021) discussed COVID detection using real-time statistical deep extreme learning machine (RTS-DEL), which achieves an accuracy of 98.18%. Table 1 shows the performance and datasets used in various networks to detect covid-19.

Table 1.

Various methodologies for covid-19 detection.

Authors Dataset used Methodology Performance metrics
He et al. (2020) 350 COVID-19
397 WITHOUT-COVID-19
DenseNet – 201 Accuracy is 86.57%
F1 Score is 85.32%

Yang et al. (2020) 350 COVID-19
465 WITHOUT-COVID-19
Multiple tasking learning Accuracy is 89.10%
F1 Score is 90.03%

Loey et al. (2020) 742 CT IMAGES
345 COVID-19
397 NON-C0VID
VGGNet-16, VGGNet-19 AlexNet, ResNet-50, GoogleNet Accuracy is 83.74%
Sensitivity is 78.54%
Specificity is 88.92%

Liu et al. (2020) 560 COVID-19
659 WITHOUT-COVID-19
VGG-16 based DNN Accuracy is 88.61%
F1-Score is 88.38%
Sensitivity is 89.91%

Wang et al. (2020) 315 COVID-19
229 WITHOUT-COVID-19
U-Net Accuracy is 90.72%
Specificity is 91.87%
Sensitivity is 90.70%

Pathak et al. (2022) 415 COVID-19
440 WITHOUT-COVID-19
ResNet50 with 2-Dimensional CNN Accuracy is 93.79%
Specificity is 95.23%
Sensitivity is 91.81%

Han et al. (2020) 230 COVID-19
130 NORMAL
Attenuation based Deep 3D-Multiple Instance Learning Accuracy is 98.12%
Sensitivity is 97.93%
F1 Score is 97.30%

Harmon et al. (2020) 1030 COVID-19
1695 WITHOUT-COVID-19
AH-Net DenseNet-201 Accuracy is 91.41%
Specificity is 93.04%
Sensitivity is 84.67%

Jin et al. (2020) 495 COVID-19
1385 WITHOUT-COVID-19
Convolutional Neural Network Accuracy is 95.54%
Specificity is 96.97%
Sensitivity is 94.81%

Ahuja et al. (2021) 350 COVID-19
397 NON- COVID-19
ResNet18 Accuracy is 99.00%
Specificity is 98.51%
Sensitivity is 99.03%
F1 Score is 99.02%

Suganya and Kalpana (2022) 230 COVID-19
130 NON- COVID-19
DMIL with CycleGAN Accuracy is 98.60%
Specificity is 96.51%
F1 Score is 96.32%

Turkoglu (2021). 1745 COVID-19
1985 NON-COVID-19
MK-CLM-DNN Accuracy is 98.36%
Specificity is 98.44%
Sensitivity is 98.28%
F1 Score is 98.25%
AUC is 98.36%

Rajpal et al. (2022) 520 COVID-19
520 NORMAL
COV-ELM with LIME Sensitivity is 94.42%
F1 Score is 95.40%

Inam et al. (2021) 547 EMR FROM SERVICE HOSPITAL, PAKISTAN D2C-RTS-DELM Accuracy is 98.18%
Selectivity is 98.44%
Error rate is 1.12%

Various deep transfers learning technique is used for the image classification. In deep learning, several convolutions filter helps to extract the feature by itself. It automatically generates various feature maps to get solution to the problem by varying the size of the filters, activation function, convolutional layers, pooling layers etc. Several pre-trained deep learning architecture were used to predict the corona virus.

2.2. Deep learning methodology for COVID-19 severity detection

The use of multi-class classification to determine the severity of COVID-19 is discussed. A.L. Aswathy et al. discussed a technique for predicting the covid-19 patient’s severity with the help of back propagation neural networks and transfer learning techniques (Aswathy et al., 2021). Lung infections can be quantified using a VB-Net neural network, whose process of instance segmentation was discussed by Shan et al. (2021). The infected region is segmented and quantified to produce an accurate classification of severity. It uses the dice similarity coefficient as a metric to label the infections. Tang et al. (2021) explore the possibility of detecting the severity of COVID-19 using a random forest algorithm where quantitative features are extracted from chest CT scan images of the patients. It classifies and labels the output as severe or non-severe. Xiao et al. (2020) discussed the DL model of ResNet-34 that is used to classify the severity. In order to avoid over-fitting, it uses 5-fold cross-validation to train the datasets and uses the binary-classification method to label them as severe or non-severe. A bi-directionally elastic-registered algorithm is used to find the severity and progression of covid-19 as discussed by Pu et al. (2020). Initially, it segmented the boundary of the lung and vessels using U-Net architecture to identify the region affected by common and covid pneumonia and found the result by calculating the threshold value using the average lung density. For corona virus severity detection, it calculates the Pearson correlation between ground truth and prediction values. Shen et al. (2020) explored the adaptive region model to segment the lungs’ volume using thresholding, and then the lung region is cut out by segmenting pulmonary vessels. Finally, pneumonia detection occurs, and the authors evaluate the Pearson correlation for lung lesions and lung volumes using the proposed method (Ozsahin et al., 2020a).

Several existing systems compare the severity of covid-19 in Table 2. The main challenges in the existing system are to predict covid-19 with high accuracy by handling unbalanced datasets and predicting the severity accurately. Obtaining better accuracy with imbalanced classification is the most important challenge. Li et al. (2022) discussed an imbalanced matrix to handle the unbalanced class data using extreme learning machine. Multi-objective optimization-based adaptive class-specific cost-extreme learning machine for imbalanced classification use an imbalanced matrix and penalty adjustment matrix to make a cost-effective decision boundary. Previous articles have mostly classified severity into two or three classes or labels, namely severe/non-severe and severe/mild/non-severe. But the proposed approach categorizes the severity into four classes based on the positive chest CT samples. The difficulties in detecting COVID in patients and its severity are being investigated as a result of these studies. By overcoming these challenges and using the proposed methods, we can achieve better results for severity detection.

Table 2.

Various model used to predict the severity of Covid-19.

Authors Dataset used Classes Method Performance metrics
Aswathy et al. (2021) Covid-CT public dataset with 349 images Covid-19 severity (High, Moderate, Low) ResNet-50, DenseNet-201 Accuracy is 97.81% Sensitivity is 95.12% Specificity is 97.31% Precision is 94.12%

Shan et al. (2021) 550 CT volumes Infected region (segmentation and quantification) VBNet Dice similarity co-efficient value 92.41% 10.0

Tang et al. (2021) 176 patients CT images Severe
Non-Severe
Random forest Algorithm Accuracy is 87.52%

Xiao et al. (2020) 23 810 Covid-19 images Severe
Non-Severe
ResNet-34 Precision is 81.52%
AUC is 98.32%

Pu et al. (2020) 72 covid-19 images and 120 other volume images Severity of covid-19 and its progression UNet-BER Algorithm Sensitivity is 95.76% Specificity is 85.92%

Shen et al. (2020) CT images from 50 patients Severe
Non-Severe
Thresholding and AGR algorithm Pearson correlation coefficient r ranges from 0.77 to 0.84, P<0.05

3. Proposed approach

This research presents a classification of Covid-19 detection and its severity in two stages using novel methods. Initially, Covid-19 detection process occurs using a pre-trained model Mask R-CNN which classify whether the image is affected by covid-19 +ve or ve. In second stage, it detects the severity of the corona virus positive patients. It uses a few shot learning method with a base classifier of ResNet-50 to detect the severity by using softmax as an activation function to achieve better accuracy.

3.1. COVID-19 detection

In first stage, COVID-19 is detected by inputting the chest Computerized Tomographic scan images to the architecture to identify whether the patient is affected by covid-19 or not. The pre-trained network, Mask R-CNN is used to predict the covid-19 +ve and ve.

3.1.1. Mask R-CNN

Mask R-CNN generates segmentation mask with high quality for each instances from the object. Initially Chest CT images were taken as input image, the Feature-Pyramid-Network helps to extract the essential features in the chest image. In general Mask R-CNN performs pixel level segmentation, here the Chest CT image is segmented into pixel and classifies the images accurately (Podder et al., 2021). The feature maps directly send the extracted feature to ROI Align as well as to the Region Proposal Network (RPN). RPN process the featured image with the help of 3 3 Convolution layers. The CNN used here is ResNet-50 which consists of 48 Convolution (Conv) layer along with 1 Max-Pool and 1 average (avg)-pool layer. It splits into two 1 1 Conv layer, one with Boundary Box Regression and other using Softmax function. The proposal network is created from the output of Bounding-Box (bbox) Regression and Softmax activation function. The proposal output is send to ROI Align for further classification. Then the image from ROI align sent to fully connected layer, it classify the bbox regression as well as the classification which is covid or not covid from the chest image with the mask. Fig. 3 depicts the framework of Mask R-CNN for classifying the image into covid-19 and Not-covid-19.

Fig. 3.

Fig. 3

Framework for Mask R-CNN for image classification.

The Mask R-CNN not only creates bounding-boxes on the targeting images but also create masks which classify the boxes based on the pixels inside it.

This deep learning mechanism is used to building up the entire model. Bounding boxes (bbox) is created to classify the object into different classes and then create a mask to the output objects. The multiple masking loss-function in each case is given as

Lfunc=Lbbox+Lmask+Lclass (1)

Here Lf is the loss_function, Lbbox is the regression loss of bbox, Lmask is the prediction loss during masking, Lclass is the loss occurs during classification. These three components are composed in order to minimize the loss_function. The following Eq. (2) defines the classification loss. The loss during classification Lclass_loss of each anchor is the loss using log function, to find the lung area which is calculated from Eq. (3)

Lclass=1|Nclass_loss|iLclass_loss(di,di) (2)
Lclass_loss(di,di)=di,logdi(1di)log(1di) (3)

The following Eq. (4) explains about the bbox regression loss.

Lbbox=1|Nbbox|idiL1(pi,pi) (4)

Here the mathematical expressions indicates that anchor index as i, detection probability that anchor the lung is denoted as di, the truth value is denoted as di, if the anchor shows of the lung it produces the value as 1, if the value is 0, it does not produce any lung image. The parameterized co-ordinates of the bounding box is predicted and denoted as pi. The ground truth co-ordinates of the positive anchor are represented as pi. Here the mini batch size is Nclass_loss and Nbbox denotes the number of anchor location; where L1 loss is used as robust for outliers. Lmask define the cross entropy loss for binary as shown in Eq. (5) these are the normalization terms.

Lmask=1|m2|1i,jmyijlogyijk+1yijlog1yijk. (5)

Here the value of label cell is (i,j) where the size of region n n is given by yij. kth class of that particular cell is predicted using the value of yijk. During dataset training, the minimized loss function value nearly 0 is observed, which clearly indicates that this model performs well with no over-fitting problems.

3.2. Covid-19 severity detection

3.2.1. Few shot learning

A Few short learning is a simple method which classifies new data with very few training samples with supervised information. It mainly focuses on pre-training and fine turning the images. Initially few Chest CT images were taken to train the pretrained model which gain experiences from others that is why it is characterized as a Meta learning problem. The base classifier is mainly required to classify the images in the training stage. ResNet-50 used as base classifier to classify the images accurately, fully connected layer is replaced by 1 1 convolutional layers to achieve the final classification effectively as shown in Fig. 4. It uses N-number of way K- number of shot method for classification to discriminate between N-Classes with K-Samples.

Fig. 4.

Fig. 4

Workflow of Few shot learning with ResNet-50 base classifier.

The Support-Set samples consists of Nl-number of labels, where each and every label contains K-labeled images, a Query-Set contains query images names as Q. The average loss calculation for each task is calculated. During the classification task Q among N-classes are given and Nl K images in the Support-Sets. It helps to fine tune f on Support-set quickly.

The fine-tuned f in the query set is send to cosine similarity where the loss results is calculated from the classification error throughout the function and update that as ‘θ’. Finally softmax function is employed throughout the training. The cosine similarities are multiplied by learnable scalar in the training to get the prediction of probability prediction. To classify the severity of lung disorder with few shot learning task using cosine similarity of base classifier ResNet-50 with the image from query set ‘f’ achieves better accuracy.

3.2.2. Base classifier

The base classifier used in this model for predicting the severity is ResNet-50, which is a pre-trained architecture (Aswathy et al., 2021). ResNet-50 is a CNN network with 50 layers deep. In the few-shot learning technique, a cosine nearest method is used to train the classifier while taking in account for loss with respect to the base classes. An encoder “f” is acquired by removing the fully connected layer from the pre-trained model and adding 1 1 convolution layers on the top of it. Encoder “f” is in charge of mapping each input with respect to its embedding, which aids in providing training for the classifiers corresponding to all base classes in addition to conventional cross-entropy loss.

Xd=1|Pd|aPdfa (6)

Let us take a look at the supported pair for task “P”, a few-shot. The corresponding few-shot samples over the class are taken as “Pd” and are represented as “d” for each class that is present across the dataset. The above equation describes how the base classifier calculates the average embedding, or class centroid, ‘Xd’:

pB=d|a=expfa,Xddexpfa,Xd (7)

In a few-shot task, the query “a” is used to provide a sample. As shown in Eq. (7), it aids in determining the likelihood of samples belonging to the class “d” with respect to cosine similarity and centroid. The value fa,Xd in the Chest CT scan image denotes the cosine similarity of two vector values. According to Eq. (8) cosine similarity is obtained as

fa,Xd=fa.XdfaXd (8)

When determining logits, it may be helpful to normalize the value beforehand using the Softmax function throughout training because the confidence interval for cosine similarity is [1, 1]. The learnable scalar is multiplied by the cosine similarity and scalar during training to obtain the probability prediction.

pB=d|a=expθ.fa,Xdtexpθ.fa,Xd (9)

A few-shot task classifies a CT image in the query set using cosine similarity as a proximity metric after evaluating the average features for samples of each classes using the support set “p” as input (9). For classifying the severity level effectively, the convergence pre-trained base classifier is further optimized.

4. Experimentation and result discussion

4.1. Dataset

In this study two distinct datasets were analyzed. The first dataset is derived from publicly available datasets which included in this research comprised of original chest CT-scan images of 377 persons. The data-set contains 15 589 and 48 260 CTScan images which belongs to 282 non covid person and 95 covid-19 persons. The first dataset for Covid-19 detection source is available in the repository https://github.com/mr7495/COVID-CTset from Rahimzadeh et al. (2021). These images are 16 bit gray-scale DICOM format with a pixel resolution of 512 512. From this dataset, pre-classification process occurs into covid +ve and covid ve.

Then the second data-sets used in this research work which is created by fusing multiple images from different source and created a new datasets, where the total dataset consisting of 12 455 images. Totally four classes is being created in this dataset which includes sepsis image for critical stage, ARDS for severe stage, Covid pneumonia for moderate stage and common bronchitis for mild stage. Each class consists of different number of images. For instance, class covid pneumonia comprises maximum of 7685 images and class sepsis comprises of only seventy two images.

Then the datasets must have to re-classify the covid-19 images into these four classes. Initially, pre-process the images and make them to same size, then it is been fused to perform different classes classification. The images contain all the four classes which is shown below in Fig. 5. The dataset’s properties are listed in Table 3, Table 4 accordingly.

Fig. 5.

Fig. 5

Chest CT images of Sepsis, ARDS, Covid Bronchitis and Covid Pneumonia.

Table 3.

Properties of Dataset-1.

Properties of dataset Respective values
Image size 512 × 512 pixel
Nature of image Gray scale images
Entire classes 2
Covid +ve 95
Covid ve 282

Table 4.

Properties of Dataset-2.

Properties of dataset Respective values
Image size 512 × 512 pixel
Nature of image Gray scale images
Entire classes 4
Critical (Sepsis) 72
Severe (ARDS) 856
Moderate (Covid Pneumonia) 7685
Mild (Covid Bronchitis) 3839

4.2. Evaluation metrics

Classification models have a discrete output. Evaluation metrics is needed to compare the different classes (discrete) in some form. Classification metrics can evaluate the model’s performance, each metrics evaluate in different way.

Accuracy=Truepositive+TrueNegativeTruepositive+TrueNegative+FalsePositive+FalseNegative (i)

Accuracy perhaps the simplest and most important metric to implement, it is expounded as the number of true prediction divided by total number of prediction (i).

Sensitivity=TruePositiveTruePositive+FalseNegative (ii)

Sensitivity metrics defined as the total proportion of observed true positive that were predicted to be a positives (ii). It helps to make better decisions with various outcomes. In simple term, the true positive divided by true positive and false negative.

Precision=TruePositiveTruePositive+FalsePositive (iii)

Precision is the classification metric which evaluates the number of true positive predictions occurs (iii). It is expounded as the no. of true +ve divid by number of true +ve and false +ve.

Specificity=TrueNegativeTrueNegative+FalsePositive (iv)

Specificity metrics defined as the total proportion of observed true positive that were predicted to be a negatives (iv). In simple term, the true negative divided by true negative and false positive.

MatthewsCorrelationCoefficient
=TP×TN+FP×FN(TP+FP)(TP+FN)(TN+FP)(TN+FN) (v)

Matthews’s correlation coefficient (MCC) is a single value classification metric helps to summarize the confusion matrix (v). It works well in binary classification with unbalanced datasets. Here TP denotes true positive, FP denotes false positive, TN denotes true negative and FN denotes false negative.

F1Score=2×Precision×RecallPrecision+Recall (vi)

F1 score is a Harmonic mean of precision and recall. It is a more desirable performance measure for classification than accuracy (vi). When data is unbalanced, such as when the number of samples belonging to one class far outnumbers those observed in the other class.

mAP=1Ni=1NAPi (vii)

Mean Average Precision (mAP) is a standard metric by which the accuracy of object detection models can be judged. Recall values between 0 and 1 are used to determine the mean AP value (vii). The mAP takes into account both false positives (FP) and false negatives (FN), incorporating the trade-off between precision and recall.

(Gmean)binary=(specificity×recall) (viii)

The G-mean is the square root of the product of the sensitivities of the different classes. This metric’s goal is to achieve the highest possible accuracy for each class while maintaining parity among them. G-mean is calculated by taking the square root of the product of the specificity and sensitivity for a binary classification (viii).

(Gmean)=TPR×1FPR (ix)

It is the square root of the product of the sensitivities across classes for a multi-class situation, where TPR denotes true positive rate and FPR denotes false positive rate. Eq. (ix) is an un-biased metric for imbalanced datasets.

AUC=12TPTP+FNTNTN+FP (x)

The Area under Curve (AUC) is calculated with the help of confusion matrix in Eq. (x) as discussed by Idrees et al. (2017). The AUC is calculated by summing up the area under the ROC curve (a concept from integral calculus) in a two-dimensional space extending from (0, 0) to (1, 1).

4.3. Implementation and result discussion

4.3.1. Covid-19 detection

The experimentation on the deep learning models was performed on chest CT scan images to identify whether the patient is affected by covid-19 or not. Initially Covid-19 is detected using a pre-trained Mask R-CNN algorithm with a base classifier of ResNet-50, which were trained for 100 epochs. Various ResNet models were compared for detecting covid-19 as shown in Table 5, where ResNet-50 performs well when compared other models (Podder et al., 2021).

Table 5.

Performance of Covid-19 detection using various ResNet models.

Base classifiers F1 Score Specificity Accuracy
ResNet-41 0.9454 0,8903 0.9316
ResNet-50 0.9693 0.9736 0.9698
ResNet-65 0.9574 0.9179 0.9435
ResNet-101 0.9763 0.9264 0.9523

For evaluating Mask R-CNN model with ResNet-50, 5-fold cross validation is performed. In each round, we split the dataset into 5 parts, one part is for validation, and the remaining 4 parts are merged into training subsets for model evaluation. The holdout method is repeated 5 times. Performance is evaluated for each round, the average of all the metrics are computed together with their standard deviation as shown in Table 6 based on binary classification model. Binary classification (Covid-19 and not-Covid-19) task is computed and the performance of each fold is calculated. Fig. 6 shows the results of the confusion matrix for all the 5 folds.

Table 6.

Performance of Covid-19 detection using Mask R-CNN with ResNet-50.

Measures Fold-1 Fold-2 Fold-3 Fold-4 Fold-5 Average/Mean (μ) Standard deviation (σ)
Accuracy 0.9775 0.9850 0.9930 0.9885 0.9860 0.9860 0.0050
Sensitivity 0.9870 0.9780 1.0000 0.9840 0.9820 0.9862 0.0074
Precision 0.9686 0.9919 0.9862 0.9929 0.9899 0.9859 0.0089
Specificity 0.9680 0.9920 0.9861 0.9930 0.9900 0.9858
0.0092
F1 Score 0.9777 0.9849 0.9930 0.9884 0.9859 0.9859 0.0049
False positive rate 0.0320 0.0080 0.0139 0.0070 0.0100 0.0141 0.0092
False negative rate 0.0130 0.0220 0.0000 0.0160 0.0180 0.0138
0.0074
False discovery rate 0.0314 0.0081 0.0138 0.0071 0.0101 0.0141 0.0089
Matthews correlation coefficient 0.9552 0.9701 0.9861 0.9770 0.9720 0.9728 0.0100
Fig. 6.

Fig. 6

Comparison of Covid-19 and Not-Covid-19 confusion matrices (a) Confusion matrix of Fold-1 (b) Confusion matrix of Fold-2 (c) Confusion matrix of Fold-3 (d) Confusion matrix of Fold-4 (e) Confusion matrix of Fold-5.

The performance of the proposed work for detecting covid-19 or not-covid-19 is evaluated, metrics of the entire 5-fold is evaluated and their average is illustrated in Table 6. It achieves the performance of 98.60% for Accuracy, 98.58% of Specificity, 98.59% of Precision, 98.62% of Sensitivity, 98.59% of F1-Score, 0.0141 of False Positive Rate, 0.0138 of False Negative Rate, 0.0141 of False Discovery Rate. Metrics such as accuracy and F1 score derived from confusion matrix or error matrix are widely used in binary classification. On unbalanced datasets, these techniques can exhibit overoptimistic inflated results. So, we calculate Matthews Correlation Coefficient which is a single value classification metric helps to summarize the confusion matrix. It reveals when the classifier fails to correctly label negative-class samples. The Matthews Correlation Coefficient value is calculated using Eq. (vii) obtain the value as 0.9728. The geometric mean (G-Mean) is a single score that takes into account both sensitivity and specificity for binary classification as given in the Eq. (viii). The average value of sensitivity and specificity is 0.9862 and 0.9859. By taking squared root of both values, we achieve a G-Mean value as 0.9769 with an optimal threshold value of 0.0131.

The Quality of a classifier’s results can be measured using AUC Receiver Operating Characteristic (ROC) curve. AUC ROC curve is an efficient method to summarize the overall accuracy. The greater the area the better is the classifier. The Y-axis of ROC curve represents the true positive rate whereas X-axis represents the false positive rate. Because of this, the “ideal” point is located in the upper left corner of the plot, where the false positive rate is zero and the actual positive rate is one. We calculate the AUC values from the error matrix for all the folds and get an average value of 0.9721 for covid-19 detection. Fig. 7 shows the analysis of a classifier performance using ROC curve for binary classification. Here the binary classification occurs between covid-19 and not covid-19.

Fig. 7.

Fig. 7

AUC-ROC curve for Covid-19 detection.

In object detection task, Intersection over Union (IOU) is focused mainly for evaluating the degree of overlap between the predicted bounding box coordinates to the ground truth box. If the IOU value is higher, the actual ground frame βgf i.e. ground truth of the target and predicted frame βpf coordinates resembles close to each other. From these two frames, we calculate the IOU using the formula IOU=area(βgfβpf)area(βgfβpf), It helps to check whether the detection frame meets the required condition. IOU ranges between 0 and 1, it is calculated with thresholding i.e. we need a threshold (α) to decide whether the detection is correct or not. To determine the average precision (AP), we use the weighted mean of the precisions at each threshold, where the weight represents the progress in recall from the previous threshold By using the basic definition in Eq. (vii) mAP is the average of AP in each classes. Adjusting different confidence threshold, we predict different values with different accuracies. Instead of changing the threshold value, we take a global threshold mAP@0.5.

By using the threshold of (α=0.5), the Mask R-CNN model with base classifier of ResNet-50 achieves the value of 0.9152. It achieves greater accuracy when compared to other state of art methods as shown in Table 5.

The output of covid-19 detection using Mask R-CNN is shown in Fig. 8. Initially it creates a bounding box to an image, and then creates a mask image to classify whether the patient is affected by covid +ve or not covid. Fig. 9.A, Fig. 9.B shows the performance of the classification task of covid-19 detection which has an accuracy and loss value. The testing set’s loss is diminishing over time, as can be shown. Though there is a difference in training and test accuracy between epochs 20 and 40, it improves after 50.

Fig. 8.

Fig. 8

Covid-19 detection-CT image of patients and its classification using Mask R-CNN.

Fig. 9.A.

Fig. 9.A

Covid-19 detection accuracy.

Fig. 9.B.

Fig. 9.B

Covid-19 detection loss.

The mean Average Precision is the standard metric for object detection model; it is calculated and compared with various states of art object detection models shown in Table 7. The mAP is calculated with the global threshold of (α=0.5). The proposed work achieves mAP of 0.9152 which is 0.1550, 0.4479, 0.2922, and 0.0281 greater than EfficientNet (Shamila Ebenezer et al., 2022), Faster R-CNN, YOLO v5 (Qu et al., 2021) and RYOLO v4tiny (Kumar, 2022) respectively.

Table 7.

Performance of various object detection models for covid-19 detection.

Detection models mAP@0.5 F1-Score (%) Accuracy (%)
EfficientNet 0.7602 89.94 90.15
EfficientNet with CLAHE 93.02 94.56
Faster RCNN 0.4673 92.56 95.60
YOLO v5 0.6237 93.01 95.37
RYOLO v4tiny 0.8871 93.42 94.74
MaskR-CNN 0.9152 98.59 98.60

When compare to various state of arts models, the proposed Mask R-CNN with a base classifier of ResNet-50 gives better accuracy and good overall results for detecting covid-19.

4.3.2. Covid-19 severity detection

Various lung disorders may arise for the corona virus affected person. Few-short learning algorithm is implemented to find its severity with a base classifier of ResNet-50. Initially, pre-train the model and fine-tune with an evolutionary algorithm is used here to get better accuracy. For Meta-learning methods, sample episodes with 4 classes from the target dataset. For each class, k instances taken as support set sample and k-1 instances as the query set for k-shot task. In training, 12 000 images were trained for 1-shot and 8000 episodes for 5 shot task on the base datasets. In search of samples 20 episodes of images from the validation dataset. Repeated k-Fold cross validation method is performed here to minimize errors and estimate the mean model performance. Typical performance criteria in image classification investigation are employed in this study to examine the classification reliability and accuracy. It is also used to evaluate models’ performance by looking at the AUC of the receiver operation characteristic curve (AUC of ROC).

For deeper network, more number of layers required to be fine-tuned for few shot learning. Fine tuning the network with 100 iterations for support-set and evaluates the network on the query-set. In the evaluation, the fine tune layer searched configuration on the support-set and evaluate on the query-set images from novel dataset. The feature extractor is trained for 100 epochs with the batch sized of 32 on the novel dataset is maintained throughout the process. For Algorithm 1, set the maximum iteration as 40, population size as 40 and other samples like fine tuning vector random sampling, crossover images as 100.

graphic file with name fx1_lrg.jpg

ResNet-50 is used as ground classifier, which become trained on ImageNet database. For training, Adam optimizer is used to reduce the loss function with the learning rate of 1e−4, a momentum of 0.9, and decay factors of 0.1. In fine tuning, Adam optimizer with 0.001 learning rate is fixed throughout the classification. Also, every training batch includes a small quantity of few shot tasks for the purpose of calculating the average loss. The cosine scaling parameter is initially set to a value of 10. It is crucial to evaluate performance of classification in image classification interrogations in order acquire scientific evidence for the study’s findings. Otherwise, the classification study might be abandoned in the classroom, leaving it inadequate. It is typical practice in few-shot learning to employ the episode training strategy.

A five-way, 25-shot approach with 25 training images for each class is used to complete the classification-2 task. The training set is divided into four categories, from which a random sample of twenty images from each class was drawn. The standard geometric transformations like random flip, horizontal or vertical crop, rotation or translation of images occurs during data augmentation

The result obtained for severity of covid positive patients is detected as shown in Table 8 which evaluates the accuracy and F-1 score of various pre-trained models. The accuracy obtained by VGG-19 is 93.54%, AlexNet is 93.70%, VGG-16 is 94.64%, ResNet-101 is 95.35%, DenseNet-201 is 96.34%, ResNet-50 is 94.65% and Few short Learning + ResNet-50 is 95.47%. When compare to various existing techniques, the proposed a few short Learning with the base classifier of ResNet-50 gives better accuracy and goosrad overall results for detecting the severity covid-19 positive patients.

Table 8.

Various pre-trained network used for severity detection.

Pre-trained network used for severity detection Accuracy F1 Score
VGG-19 93.54% 89.23%
AlexNet 93.70% 91.45%
VGG-16 94.64% 93.94%
ResNet-101 95.35% 93.86%
DenseNet-201 96.34% 95.84%
ResNet-50 94.65% 94.01%
Few short Learning + ResNet-50 95.47% 95.65%

Various pretrained model for detection severity from the covid-19 positive patients is represented in the above chart in Fig. 10. It compares the accuracy of various pre-trained model with the proposed model which helps to analyze the results clearly

Fig. 10.

Fig. 10

Various pre-trained network used for severity detection.

From the output of covid-19 patient, it further classifies the severity using few shot learning with a base classifier of ResNet-50. Train the model by fusion multi-model datasets which includes the CT images of sepsis, ARDS, Covid Pneumonia, Covid Bronchitis as shown in Fig. 11. The proposed algorithm prognosis the severity based on four classes, Critical class leads to Sepsis, Severe class leads to ARDS, Moderate class leads to Covid pneumonia and Mild class leads to covid Bronchitis. It classifies the severity of the patient from the positive sample and the performances obtained are shown Table 9, which helps to provide better treatment.

Fig. 11.

Fig. 11

Four classes of Covid-19 severity images.

Table 9.

Performance evaluation for various severity detection classes.

Severity detection classes Accuracy (%) Specificity (%) Sensitivity (%) Precision (%) F1 Score (%) AUC
Critical (Sepsis) 98.80±0.9 99.33±0.5 97.67±1.2 97.86±1.1 97.43±1.4 0.0966±0.0472
Severe (ARDS) 92.33± 2.1 98.75±1.2 92.84±4.3 98.34±0.8 93.53±2.9 0.0914±0.0294
Moderate (Covid Pneumonia) 94.57±3.2 97.30±2.1 94.97±3.1 97.33±1.3 95.38±2.1 0.0943±0.0388
Mild (Covid Bronchitis) 96.18±2.5 98.48±1.0 97.33±1.4 96.28±2.6 96.27±2.4 0.0934±0.0175
Average value of severity detection 95.47±2.3 98.49±0.6 95.87±1.7 97.47±0.6 95.65 ± 2.2 0.0932±0.0323

The classification task of covid-19 severity detection has been performed, where the accuracy and loss value is calculated in both training and testing phase. The testing set’s loss is diminishing over time, as can be shown in Fig. 12.a, Fig. 12.b. Though there is a difference in training and test accuracy between epochs 20 and 40, it improves after 70.

Fig. 12.a.

Fig. 12.a

Covid-19 severity detection accuracy.

Fig. 12.b.

Fig. 12.b

Covid-19 severity detection loss.

With the help of fusing these multi-model datasets, it is possible to predict the severity level of the covid-19 positive patients accurately. The proposed model achieves an accuracy of 98.8% for critical (sepsis), 92.3% for severe, 94.5% for moderate and 96.1% for mild. It achieves the sensitivity of 97.6% for critical, 92.8% for severe, 94.9% for moderate (covid pneumonia) and 97.3% for mild. The precision obtained here is 97.4% for critical, 93.5% for severe, 95.3% for moderate and 96.21% for mild (covid bronchitis). The specificity of this model is 99.3% for critical, 98.7% for severe (Acute Respiratory distress syndrome), 94.9% for moderate and 97.3% for mild. From the above Table, the critical stage of covid positive patients can be able to analyze accurately.

The AUC value is calculated for each class for covid-19 severity detection. From the value of True-Positive-Rate and False-Positive-Rate, ROC curve is drawn for classes 0–3 i.e. 4 classes (Critical, severe, moderate, mild) as shown in Fig. 13. The mean (μ) value of severity classes are 95.47 whereas the standard deviation (σ) value is 2.3. This equation is used to calculate Standard Error of Mean (SEM) is 0.0243 at 95% confidence level. Based on the SEM, the margin of errors (or confidence intervals) of different confidence levels is calculated for all the metrics.

Fig. 13.

Fig. 13

AUC-ROC curve for Covid-19 severity detection.

Depending on the field of study, a confidence level of 95% (or statistical significance of 5%) is typically used for data representation. It shows that the margin of error depends on its confidence level (CL), when the CL value increases, error rate will get decrease as shown in Table 10. The unbiased metric for imbalanced data is Geometric mean. The G-Mean value for multiclass can be calculated from the value of TPR and FPR for all the subsets using the formula of TPR×1FPR. G-Mean is calculated for all the four subsets, the subset-1 (Critical) achieves a G-Mean of 0.9416 and error rate of 0.0432. The subset-2 (severe) achieves a G-Mean of 0.9352 and error rate of 0.0231. The subset-3 (moderate) and subset-4 (mild) achieves a G-Mean of 0.9367, 0.9475 and error rate of 0.0185, 0.0226 respectively. Table 11 shows the average G-Mean and Error rate value for Covid-19 severity classification model i.e. 0.9402 and0.0243 respectively.

Table 10.

Margin of errors calculation.

Accuracy
Specificity
Sensitivity
Confidence level Margin of error Confidence level Margin of Error Confidence level Margin of Error
68.3%, σx¯ 95.42±1.185 (±1.24%) 68.3%, σx¯ 98.42±0.29 (±0.30%) 68.3%, σx¯ 95.82±0.795 (±0.83%)
90%, 1.645σx¯ 95.42±1.949 (±2.04%) 90%, 1.645σx¯ 98.42±0.478 (±0.49%) 90%, 1.645σx¯ 95.82±1.307 (±1.36%)
95%, 1.960σx¯ 95.42±2.432 (±2.43%) 95%, 1.960σx¯ 98.42±0.569 (±0.58%) 95%, 1.960σx¯ 95.82±1.558 (±1.63%)
99%, 2.576σx¯ 95.25±3.053 (±3.20%) 99%, 2.576σx¯ 98.42±0.748 (±0.76%) 99%, 2.576σx¯ 95.82±2.047 (±2.14%)
99.9%, 3.291σx¯ 95.42±3.9 (±4.09%) 99.9%, 3.291σx¯ 98.42±0.956 (±0.97%) 99.9%, 3.291σx¯ 95.82±2.615 (±2.73%)
99.99%, 3.891σx¯ 95.42±4.611 (±4.83%) 99.99%, 3.891σx¯ 98.42±1.13 (±1.15%) 99.99%, 3.891σx¯ 95.82±3.092 (±3.23%)

F1-Score
Precision
AUC
Confidence level Margin of Error Confidence level Margin of Error Confidence level Margin of Error

68.3%, σx¯ 97.4±0.311 (±0.32%) 68.3%, σx¯ 95.6±0.712 (±0.74%) 68.3%, σx¯ 0.0935±0.00901 (±0.96%)
90%, 1.645σx¯ 97.4±0.512 (±0.53%) 90%, 1.645σx¯ 95.6±1.17 (±1.22%) 90%, 1.645σx¯ 0.0935±0.00148 (±1.59%)
95%, 1.960σx¯ 97.4±0.61 (±0.63%) 95%, 1.960σx¯ 95.6±1.395 (±1.46%) 95%, 1.960σx¯ 0.0935±0.00177 (±1.89%)
99%, 2.576σx¯ 97.4±0.801 (±0.82%) 99%, 2.576σx¯ 95.6±1.833 (±1.92%) 99%, 2.576σx¯ 0.0935±0.00232 (±2.48%)
99.9%, 3.291σx¯ 97.4±1.024 (±1.05%) 99.9%, 3.291σx¯ 95.6±2.342 (±2.45%) 99.9%, 3.291σx¯ 0.0935±0.00297 (±3.17%)
99.99%, 3.891σx¯ 97.4±1.211 (±1.24%) 99.99%, 3.891σx¯ 95.6±2.768 (±2.90%) 99.99%, 3.891σx¯ 0.0935±0.00351 (±3.75%)
Table 11.

G-Mean and Error rate for severity classification model.

Class subsets G-Mean value Error rate
Critical 0.9416 0.0332
Severe 0.9352 0.0231
Moderate 0.9367 0.0185
Mild 0.9475 0.0226
Average 0.9402 0.0243

The Fig. 14- shows the performance of classification model for severity detection using Few-shot learning method with a base classifier of ResNet-50. The data labels for critical and mild are mentioned in the above figure. This model can perform both detection and access the severity of the patient condition with a single architecture and achieves excellent accuracy.

Fig. 14.

Fig. 14

Performance graph for severity detection.

5. Conclusion

The world is still struggling with different mutants of Corona viruses. The most difficult challenge for doctors is still accurate diagnosis and prognosis of COPD and lung disorders caused by the Corona virus. Patients with deteriorating breathing or those considered at risk for illness progression must undergo chest imaging for diagnosis. As a solution to the problem of classifying coronavirus and its severity, Mask R-CNN is used to predict covid +ve and ve from the chest computerized tomographic image, and severity is detected with a few-shot learning approach. Using a combination of pre-trained base classifiers and meta-learning, a framework is proposed. It is far simpler and more convenient than other methods for classifying the severity of positive patients. From the experimentation outcome, the proposed model achieves the G-Mean and MCC for binary classification as 97.69% and 97.28%, respectively. For multiclass detection, it achieves a G-Mean of 94.02% with an error rate of 2.43%. When building this model, the cosine similarity distance is used to estimate how far apart the trained meta-learning model and the query images are from each other. It enables healthcare professionals to accurately detect severity in order to recommend appropriate treatments. The main challenge in this model is imbalanced classification. In the future, the severity of COVID can be detected with a greater number of infections (class labels) like pneumothorax, asthma, etc. by overcoming the problem of imbalanced classification.

CRediT authorship contribution statement

Suganya D.: Conceptualization, Methodology, Writing – original draft. Kalpana R.: Visualization, Investigation.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data availability

Data will be made available on request.

References

  1. Ahuja S., Panigrahi B.K., Dey N., Rajinikanth V., Gandhi T.K. Deep transfer learning-based automated detection of COVID-19 from lung CT scan slices. Appl. Intell. (Dordrecht, Netherlands) 2021;51(1):571–585. doi: 10.1007/s10489-020-01826-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Anantharaman R., Velazquez M., Lee Y. Utilizing mask R-CNN for detection and segmentation of oral diseases. 2018 IEEE International Conference on Bioinformatics and Biomedicine; BIBM; IEEE; 2018. pp. 2197–2204. [Google Scholar]
  3. Antoniou A., Edwards H., Storkey A. 2018. How to train your MAML. arXiv preprint arXiv:1810.09502. [Google Scholar]
  4. Aswathy A.L., et al. COVID-19 diagnosis and severity detection from CT-images using transfer learning and back propagation neural network. J Infect Public Health. 2021 doi: 10.1016/j.jiph.2021.07.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bai H.X., et al. Artificial intelligence augmentation of radiologist performance in distinguishing COVID-19 from pneumonia of other origin at chest CT. Radiology. 2020;296(3):E156–E165. doi: 10.1148/radiol.2020201491. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Chen, Yinbo, Liu, Zhuang, Xu, Huijuan, Darrell, Trevor, Wang, Xiaolong, 2020. Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning. In: 2021 IEEE/CVF International Conference on Computer Vision. ICCV, pp. 9042–9051.
  7. Chiao J.Y., Chen K.Y., Liao K.Y.K., et al. Detection and classification the breast tumors using mask R-CNN on sonograms. Medicine (Baltimore) 2019;98 doi: 10.1097/MD.0000000000015200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Dhieb N., Ghazzai H., Besbes H., et al. An automated blood cells counting and classification framework using mask R-CNN deep learning model. 2019 31st International Conference on Microelectronics; ICM; IEEE; 2019. pp. 300–303. [Google Scholar]
  9. Di D., et al. Hypergraph learning for identification of COVID-19 with CT imaging. Med. Image Anal. 2021;68 doi: 10.1016/j.media.2020.101910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Finn C., Abbeel P., Levine S. Proceedings of the 34th International Conference on Machine Learning, Vol. 70. Proceedings of Machine Learning Research; 2017. Model-agnostic meta-learning for fast adaptation of deep networks; pp. 1126–1135. Available from https://proceedings.mlr.press/v70/finn17a.html. [Google Scholar]
  11. Han Z., Wei B., Hong Y., et al. Accurate screening of COVID19 usingattention-based deep 3D multiple instance learning. IEEE Trans. Med. Imaging. 2020;39(8):2584–2594. doi: 10.1109/TMI.2020.2996256. [DOI] [PubMed] [Google Scholar]
  12. Harmon S.A., Sanford T.H., Xu S., et al. Artificial intelligence for the detection of COVID-19 pneumonia on chest CT using multinational datasets. Nature Commun. 2020;11(1):4080. doi: 10.1038/s41467-020-17971-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. He X., Yang X., Zhang S., Zhao J., Zhang Y., Xing E.P., Xie P. 2020. Sample-efficient deep learning for COVID-19 diagnosis based on CT scans. [DOI] [Google Scholar]
  14. Idrees Fauzia, Rajarajan Muttukrishnan, Conti Mauro, Chen Thomas M., Rahulamathavan Yogachandran. Pindroid: A novel android malware detection system using ensemble learning methods. Comput. Secur. 2017;68:36–46. doi: 10.1016/j.cose.2017.03.011. https://www.sciencedirect.com/science/article/pii/S0167404817300640. [DOI] [Google Scholar]
  15. Inam Aaqib, Sarwar Ayesha, Salah-Ud-Din, Atta Ayesha, Naseer Iftikhar, Siddiqui Shahan, Khan Muhammad. Detection of COVID-19 enhanced by a deep extreme learning machine. Intell. Autom. Soft Comput. 2021;27:701–712. doi: 10.32604/iasc.2021.014235. [DOI] [Google Scholar]
  16. Jin C., Chen W., Cao Y., Xu Z., Tan Z., Zhang X., Deng L., Zheng C., Zhou J., Shi H., Feng J. Development and evaluation of an artificial intelligence system for COVID-19 diagnosis. Nature Commun. 2020;11(1):5088. doi: 10.1038/s41467-020-18685-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Johnson, J.W., 2020. Automatic Nucleus Segmentation with Mask-RCNN. In: Proceedings of the 2019 Computer Vision Conference, Vol. 2. 10.1007/978-3-030-17798-0_32. [DOI]
  18. Kopelowitz E., Engelhard G. Medical Imaging with Deep Learning. 2019. Lung nodules detection and segmentation using 3D mask-RCNN. arXiv preprint arXiv:1907.08612. [Google Scholar]
  19. Kumar Akhil. RYOLO v4-tiny: A deep learning based detector for detection of COVID and non-COVID pneumonia in CT scans and X-RAY images. Optik. 2022;268 doi: 10.1016/j.ijleo.2022.169786. https://www.sciencedirect.com/science/article/pii/S0030402622010610. [DOI] [Google Scholar]
  20. Li Yanjiao, Zhang Jie, Zhang Sen, Xiao Wendong, Zhang Zhiqiang. Multi-objective optimization-based adaptive class-specific cost extreme learning machine for imbalanced classification. Neurocomputing. 2022;496:107–120. doi: 10.1016/j.neucom.2022.05.008. https://www.sciencedirect.com/science/article/pii/S0925231222005409. [DOI] [Google Scholar]
  21. Liu M., Dong J., Dong X., et al. Segmentation of lung nodule in CT images based on mask R-CNN. 2018 9th International Conference on Awareness Science and Technology; ICAST; IEEE; 2018. pp. 1–6. [Google Scholar]
  22. Liu, B., Gao, X., He, M., Liu, L., Yin, G., 2020. A fast online COVID-19 diagnostic system withchest CT scans. In: Proceedings of KDD 2020, New York.
  23. Loey M., Manogaran G., Khalifa N.E.M. A deep transfer learning model with classical data augmentation and CGAN to detect COVID-19 from chest CT radiography digital images. Neural Comput. Appl. 2020 doi: 10.1007/s00521-020-05437-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Madani M., Behzadi M.M., Nabavi S. The role of deep learning in advancing breast cancer detection using different imaging modalities: A systematic review. Cancers. 2022;14:5334. doi: 10.3390/cancers14215334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Mulay S., Deepika G., Jeevakala S., et al. International Workshop on Multiscale Multimodal Medical Imaging, Vol. 11977. Springer; Cham: 2019. Liver segmentation from multimodal images using HED-mask R-CNN; pp. 68–75. [Google Scholar]
  26. Munkhdalai T., Yu H. Proceedings of the 34th International Conference on Machine Learning, Vol. 70. JMLR.org; 2017. Meta networks; pp. 2554–2563. [PMC free article] [PubMed] [Google Scholar]
  27. Nguyen D.H., Le T.H., Tran T.H., et al. Hand segmentation under different viewpoints by combination of mask R-CNN with tracking. 2018 5th Asian Conference on Defense Technology; ACDT; IEEE; 2018. pp. 14–20. [Google Scholar]
  28. Ozsahin I., Sekeroglu B., Musa M.S., Mustapha M.T., Uzun Ozsahin D. Review on diagnosis of COVID-19 from chest CT images using artificial intelligence. Comput. Math. Methods Med. 2020;26 doi: 10.1155/2020/9756518(2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Ozsahin I., Sekeroglu B., Musa M.S., Mustapha M.T., Uzun Ozsahin D. Review on diagnosis of COVID-19 from chest CT images using artificial intelligence. Comput. Math. Methods Med. 2020;(2020) doi: 10.1155/2020/9756518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Pathak Y., Shukla P.K., Tiwari A., Stalin S., Singh S., Shukla P.K. Deep transfer learning based classification model for COVID-19 disease. Ingen. Rec. Biomed.: IRBM = Biomed. Eng. Res. 2022;43(2):87–92. doi: 10.1016/j.irbm.2020.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Podder Soumyajit, Bhattacharjee Somnath, Roy Arijit. An efficient method of detection of COVID-19 using mask R-CNN on chest X-ray images. AIMS Biophys. 2021;8(3):281–290. doi: 10.3934/biophy.2021022. [DOI] [Google Scholar]
  32. Polsinelli Matteo, Cinque Luigi, Placidi Giuseppe. A light CNN for detecting COVID-19 from CT scans of the chest. Pattern Recognit. Lett. 2020;140:95–100. doi: 10.1016/j.patrec.2020.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Pu J., Leader J.K., Bandos A., et al. Automated quantification of COVID-19 severityand progression using chest CT images. Eur. Radiol. 2020 doi: 10.1007/s00330-020-07156-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Qi, H., Brown, M., Lowe, D.G., 2018. Low-shot learning with imprinted weights. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 5822–5830.
  35. Qu, R., Yang, Y., Wang, Y., 2021. COVID-19 Detection Using CT Image Based On YOLOv5 Network. In: 2021 3rd International Academic Exchange Conference on Science and Technology Innovation. IAECST, pp. 622–625. 10.1109/IAECST54258.2021.9695714. [DOI]
  36. Raghu M., Zhang C., Kleinberg J., Bengio S. 2019. Transfusion: understanding transfer learning for medical imaging. arXiv:1902.07208. [Google Scholar]
  37. Rahimzadeh Mohammad, Attar Abolfazl, Sakhaei Seyed Mohammad. A fully automated deep learning-based network for detecting COVID-19 from a new and large lung CT scan dataset. Biomed. Signal Process. Control. 2021;68 doi: 10.1016/j.bspc.2021.102588. https://www.sciencedirect.com/science/article/pii/S1746809421001853. Datasets : https://github.com/mr7495/COVID-CTset. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Rajpal Sheetal, et al. 2022. COV-ELM classifier: An extreme learning machine based identification of COVID-19 using chest X-ray images; pp. 193–203. [Google Scholar]
  39. Sajja T.K., Devarapalli R., Kalluri H.K. Lung cancer detection based on CT scans images by using deep transfer learning. Trait. Signal. 2019;36:339–344. [Google Scholar]
  40. Shamila Ebenezer A., Deepa Kanmani S., Sivakumar Mahima, Jeba Priya S. Effect of image transformation on EfficientNet model for COVID-19 CT image classification. Mater. Today: Proc. 2022;51(Part 8):2512–2519. doi: 10.1016/j.matpr.2021.12.121. https://www.sciencedirect.com/science/article/pii/S2214785321078469. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Shan F., Gao Y., Wang J., Shi W., Shi N., Han M., Xue Z., Shen D., Shi Y. Abnormal lung quantification in chest CT images of COVID-19 patients with deep learning and its application to severity prediction. Med. Phys. 2021;48(4):1633–1645. doi: 10.1002/mp.14609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Shen C., et al. Quantitative computed tomography analysis for stratifyingthe severity of coronavirus disease 2019. J. Pharm. Anal. 2020;10(2):123–129. doi: 10.1016/j.jpha.2020.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Shibata T., Teramoto A., Yamada H., et al. Automated detection and segmentation of early gastric cancer from endoscopic images using mask R-CNN. Appl. Sci. 2020;10:3842. [Google Scholar]
  44. Snell J., Swersky K., Zemel R. Advances in Neural Information Processing Systems. 2017. Prototypical networks for few-shot learning; pp. 4077–4087. [Google Scholar]
  45. Suganya, D., Kalpana, R., 2022. Automated COVID-19 diagnosis using Deep Multiple Instance Learning with CycleGAN. In: 2022 International Conference on Smart Technologies and Systems for Next Generation Computing. ICSTSN, pp. 1–6. 10.1109/ICSTSN53084.2022.9761334. [DOI]
  46. Sung, F., Yang, Y., Zhang, L., Xiang, T., Torr, P.H., Hospedales, T.M., 2018. Learning to compare: Relation network for few-shot learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 1199–1208.
  47. Tang Z., Zhao W., Xie X., Zhong Z., Shi F., Ma T., Liu J., Shen D. Severity assessment of COVID-19 using CT image features and laboratory indices. Phys. Med. Biol. 2021;66(3) doi: 10.1088/1361-6560/abbf9e. [DOI] [PubMed] [Google Scholar]
  48. Tenda E.D., et al. The importance of chest CT scan in COVID-19. Acta Med. Indones. 2020;52(1):68–73. PMID: 32291374. [PubMed] [Google Scholar]
  49. Turkoglu M. COVID-19 detection system using chest CT images and multiple kernels-extreme learning machine based on deep neural network. IRBM. 2021;42(4):207–214. doi: 10.1016/j.irbm.2021.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Vinyals O., Blundell C., Lillicrap T., Wierstra D., et al. Advances in Neural Information Processing Systems. 2016. Matching networks for one shot learning; pp. 3630–3638. [Google Scholar]
  51. Wang X., Deng X., Fu Q., et al. A weakly-supervised framework for COVID-19classification and lesion localization from chest CT. IEEE Trans. Med. Imaging. 2020;39(8):2615–2625. doi: 10.1109/TMI.2020.2995965. [DOI] [PubMed] [Google Scholar]
  52. Xiao L.S., Li P., Sun F., Zhang Y., Xu C., Zhu H., Cai F.Q., He Y.L., Zhang W.F., Ma S.C., Hu C., Gong M., Liu L., Shi W., Zhu H. Development and validation of a deep learning-based model using computed tomography imaging for predicting disease severity of coronavirus disease 2019. Front. Bioeng. Biotechnol. 2020;8(898) doi: 10.3389/fbioe.2020.00898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Yadav S.S., Jadhav S.M. Deep convolutional neural network based medical image classification for disease diagnosis. J. Big Data. 2019;6:113. [Google Scholar]
  54. Yang Xingyi, Zhao Jinyu, Zhang Yichen, He Xuehai, Xie Pengtao. 2020. COVID-CT-dataset: A CT scan dataset about COVID-19. arXiv abs/2003.13865. n. pag. [Google Scholar]
  55. Ye, H.-J., Hu, H., Zhan, D.-C., Sha, F., 2020. Few-shot learning via embedding adaptation with set-to-set functions. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 8808–8817.
  56. Zhang, Chi, Cai, Yujun, Lin, Guosheng, Shen, Chunhua, 2020a. DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover’s Distance and Structured Classifiers. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. CVPR, pp. 12200–12210.
  57. Zhang Jie, Li Yanjiao, Xiao Wendong, Zhang Zhiqiang. Non-iterative and fast deep learning: Multilayer extreme learning machines. J. Franklin Inst. B. 2020;357(13):8925–8955. doi: 10.1016/j.jfranklin.2020.04.033. https://www.sciencedirect.com/science/article/pii/S0016003220302751. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data will be made available on request.


Articles from Engineering Applications of Artificial Intelligence are provided here courtesy of Elsevier

RESOURCES