Deep-Precognitive Diagnosis: Preventing Future Pandemics by Novel Disease Detection With Biologically-Inspired Conv-Fuzzy Network

Aviral Chharia; Rahul Upadhyay; Vinay Kumar; Chao Cheng; Jing Zhang; Tianyang Wang; Min Xu

doi:10.1109/ACCESS.2022.3153059

. 2022 Feb 21;10:23167–23185. doi: 10.1109/ACCESS.2022.3153059

Deep-Precognitive Diagnosis: Preventing Future Pandemics by Novel Disease Detection With Biologically-Inspired Conv-Fuzzy Network

Aviral Chharia ¹, Rahul Upadhyay ², Vinay Kumar ², Chao Cheng ³, Jing Zhang ⁴, Tianyang Wang ⁵, Min Xu ^6,^7,^✉

PMCID: PMC8967064 NIHMSID: NIHMS1786284 PMID: 35360503

Abstract

Deep learning-based Computer-Aided Diagnosis has gained immense attention in recent years due to its capability to enhance diagnostic performance and elucidate complex clinical tasks. However, conventional supervised deep learning models are incapable of recognizing novel diseases that do not exist in the training dataset. Automated early-stage detection of novel infectious diseases can be vital in controlling their rapid spread. Moreover, the development of a conventional CAD model is only possible after disease outbreaks and datasets become available for training (viz. COVID-19 outbreak). Since novel diseases are unknown and cannot be included in training data, it is challenging to recognize them through existing supervised deep learning models. Even after data becomes available, recognizing new classes with conventional models requires a complete extensive re-training. The present study is the first to report this problem and propose a novel solution to it. In this study, we propose a new class of CAD models, i.e., Deep-Precognitive Diagnosis, wherein artificial agents are enabled to identify unknown diseases that have the potential to cause a pandemic in the future. A de novo biologically-inspired Conv-Fuzzy network is developed. Experimental results show that the model trained to classify Chest X-Ray (CXR) scans into normal and bacterial pneumonia detected a novel disease during testing, unseen by it in the training sample and confirmed to be COVID-19 later. The model is also tested on SARS-CoV-1 and MERS-CoV samples as unseen diseases and achieved state-of-the-art accuracy. The proposed model eliminates the need for model re-training by creating a new class in real-time for the detected novel disease, thus classifying it on all subsequent occurrences. Second, the model addresses the challenge of limited labeled data availability, which renders most supervised learning techniques ineffective and establishes that modified fuzzy classifiers can achieve high accuracy on image classification tasks.

Keywords: Deep learning, COVID-19, medical imaging, computer-aided diagnosis, pandemics

I. Introduction

Throughout human history, there have been several pandemics, the most recent and ongoing being the SARS-CoV-2. The first case of the disease was reported in late December 2019 in Wuhan, China [1], [2].

Ever since the first case of COVID-19 was reported, the world has seen over 219 million cases and 4.547 million deaths due to the virus [3]. Despite tremendous improvements in our understanding of microbial hazards over the last two decades, humans remain as vulnerable as ever to unexpected attacks by unknown viruses. The World Health Organization (WHO) has adopted ‘Disease X’ as a placeholder name for hypothetical, unknown diseases that might trigger a future epidemic. This list is periodically updated and includes diseases such as Middle East-respiratory syndrome coronavirus (MERS-CoV) and Severe Acute Respiratory Syndrome (SARS), among others. Past studies [4] suggest that the increasing scale of anthropogenic destruction of biodiversity is paving the way to a ‘pandemic era’. In such a situation, it is projected that roughly 850,000 distinct viruses might spread from animals to people, with SARS-CoV-2 being just one of them [4]. The study has been strengthened by the increased frequency with which novel disease outbreaks have occurred in the last two decades. This has prompted an urgent need for reconsideration, as well as a need for substantial actions to be implemented to address a health catastrophe of such magnitude. Table 1 enlists various recent novel zoonotic virus outbreaks [5]–[7].

TABLE 1. Increased Rate of Spread of Numerous Novel Zoonotic Viruses in Recent Disease Outbreaks, Suggestive of the Approaching . Data Taken From [5]–[7].

Disease	First Reported	Rate of Spread	Time period	Total Infected	Total Deaths	Fatality (%)	Countries Affected
SARS-CoV-1	Feb 2003, Foshan, China	High	Major outbreak lasted eight months; declared contained in July 2003; but cases reported till date	8096	774	9.6%	29 Countries
MERS-CoV	June 2012, Jeddah, Saudi Arabia	Low	June 2012 - till date	2574	885	34.4%	27 Countries
SARS-CoV-2	Dec 2019, Wuhan, China	Very High	Dec, 2019 - till date	219 Million	4.547 Million	2–3%	Worldwide (Pandemic)

Open in a new tab

A. Background

Presently, numerous researchers are involved in developing new methods for rapid disease detection to increase the rate of daily tests. Real-time Reverse-transcription polymerase chain reaction (rRT-PCR) [8] is the most widely used test for diagnosing COVID-19. However, many times, the test fails to detect the disease in case of a newly evolved coronavirus strain before extracting the new virus’s DNA sequence, potentially delaying testing [9]. Today, deep learning (DL) has been used in a wide range of areas which includes developing solutions for aiding the visually impaired [10], solving a 50-year-old grand protein folding challenge [11], analyzing macromolecules from cellular electron Cryo-tomography [12], [13], developing complex intrusion-based detection systems [14]–[18], enabling IoT-based systems [19], [20], etc. DL-based computer-aided diagnosis (CAD) has drawn immense attention in recent years for its capability to enhance diagnosis performance and elucidate complex clinical tasks.

During the pandemic, researchers have focused on developing various computational models for the rapid detection of SARS-CoV-2 on CXR scans with high accuracy and sensitivity [21]–[26]. Initial analysis of patients with severe symptoms showed signs of pneumonia [27]. Others have attempted to predict disease severity from X-rays. Figure 1 shows CXR scans of patients infected with various respiratory diseases- bacterial pneumonia, SARS-CoV-1, MERS-CoV, and SARS-CoV-2 (or COVID-19). The use of computed tomography (CT) is another way for detecting COVID-19 infection. However, the high radiation doses (also comparatively higher costs) makes it unsuitable for usage, particularly for pregnant women and children [28].

FIGURE 1. — Representative images from the dataset [48], [49] showing Anteroposterior CXR scans of patients diagnosed with (a) Bacterial Pneumonia (b) SARS-CoV-1 (c) MERS-CoV (d) SARS-CoV-2 caused by the novel coronavirus (n-CoV).

B. Motivation

Although these methods help and provide the radiologist with assistance, they act only after a pandemic outbreak has occurred. Presently, little/ no study exists on developing a model that can detect novel diseases that have the potential to cause a future pandemic at their preliminary stage of spread itself. Such a model acts before a pandemic outbreak. This task, nevertheless, remains challenging.

Other questions that remain unaddressed and hinders the development of next-generation CAD models [29] include, first, all supervised learning models rely heavily on the availability of labeled medical datasets. Since highly skilled experts perform their collection, it requires considerable time and resources. Moreover, it is difficult to gather a large dataset of positive disease samples in an early stage of dissemination [29]. Therefore, an accurately labeled dataset can be available only after the disease outbreak has taken place, its multiple incidences have been reported, researched upon, and its pathological tests are available. A future disease outbreak may be a highly infectious one (viz. SARS-CoV-2) and spread at an even faster rate. Such an outbreak may not provide time to re-develop and deploy CAD models. Second, the task of detecting novel diseases is hard to visualize as a supervised learning problem in the absence of labeled training cases. Third, each time a novel disease outbreak ensues, new models need to be developed and updated throughout the medical system, following a similar pipeline. Fourth, many DL X-ray/ CT-scan models have low accuracy over cross-validation datasets [30], possibly since these were trained on a single dataset. Fifth, the emergence of newly mutated viruses results in a major change in the dataset, severely restricting the performance of conventional classification-based approaches. These complexities pose significant challenges in the development of a next-generation CAD model (one which aims to detect a novel disease before its outbreak) and motivates the research community towards this problem.

C. Contributions

In the present study, we highlight the utility of the proposed model with the hypothetical outbreak of three pathogens (SARS-CoV-1, MERS-CoV, and COVID-19) that are unseen by the model, i.e., ‘Disease X’ as suggested by the WHO to be a potential cause of a future major epidemic. Along with COVID-19 as the unseen disease, the efficacy of the proposed model is demonstrated experimentally on SARS-CoV-1 and MERS-CoV, present in WHO’s list. Here it has been assumed that a future pandemic might be caused by a novel coronavirus on the similar lines of previous disease outbreaks, impacting lungs, and could be captured through CXR scans. The significant contributions of this paper include-

•
The concept of ‘Deep-Precognitive diagnosis’ is introduced as a new class of CAD having broad applicability in developing future diagnosis models capable of detecting novel diseases at preliminary stage of spread.
•
A novel solution is proposed for the introduced problem. The task is formulated as a class membership lookup problem using a biologically-inspired Conv-fuzzy network. The model’s ability to detect occurrences of novel diseases on CXR scans with state-of-the-art (SOTA) accuracy, is shown on three coronaviruses, as unseen diseases.
•
The proposed algorithm eliminates the need for re-training for each new disease outbreak. Real-time architecture update enables new class creation for the detected novel disease, classifying it on all further occurrences.
•
The challenge of limited labeled data availability is addressed. The proposed model has the ability to learn complex feature space on small datasets, unlike DL models. Further, as the architecture is semi-supervised, a labeled dataset is not needed for every sample, reducing heavy dependence on labeled datasets. Unlike conventional models, the proposed model requires single-pass training. Thus, there is a massive reduction in training time is observed.
•
According to our knowledge, the present work is the first to adapt Fuzzy reflex classifiers for image classification tasks and quantitatively establish that they achieve comparable performance to SOTA ML classifiers.

The remaining paper is organized as follows. Section II introduces the relevant related works. Section III describes the proposed model, its architecture design, and algorithm. The experimental settings, experiments performed, and the results are elaborated in Section IV. This is followed by the discussion in Section V. Section VI discusses the limitations and the future work of the study. Finally, the conclusion is presented in Section VII.

II. Related Work

A. Manual Detection of Novel Diseases

Presently, computed tomography (CT) scans offering 3D anatomy are examined by expert radiologists to find abnormal features in the thoracic region suggestive of novel diseases, based on the type, and extent of lesions [31]. These are further sent for clinical tests to confirm new disease presence. Even for detection of COVID-19, firstly most common manifestation and patterns of lung abnormality were used for manual classification [32]. The initial screening is mainly dependent on expert radiologists that may not be present in all diagnostic centers. Presently, as per our knowledge, there exists no end-to-end model that has been proposed to address the problem. If compared with CT-scans, CXR scans do not provide three-dimensional anatomy, but pneumonia and most other diseases can be differentiated, though many Radiologists consider CXR scans as “the most challenging plain film to interpret correctly” [33]. However, due to its fast imaging speed, low radiation, and low cost, X-rays are the most widely used CAD imaging modality. Thus, X-rays have been used in this study to demonstrate the effectiveness of the proposed model.

B. Deep Learning Breakthrough in CAD

Today, DL-based CAD has been extensively used to improve the accuracy of prediction for screening both infectious and non-infectious diseases [34]. The development of the first CAD model traces back to 1976 when the first CAD model called MYCIN was developed. It used 450 rules designed for bacterial infection and suggested antibiotics to patients [35]. The recent development of DL has triggered a new era in CAD, including breakthroughs in detection, delineation of pathological abnormalities, disease progression monitoring, and therapy response. Many such models have been developed by researchers for classification of various communicable diseases including SARS, EBOLA [36], HIV [37], [38], COVID-19 [23], [39] among others.

C. Challenges in Next-Generation Diagnosis

Nearly all CAD models involve three sub-steps: data feature extraction (in medical image analysis), their classification, and the diagnosis. However, presently, many challenges are faced in CAD development that inhibits the progress in this field of research [29]. The first major challenge is the availability of adequately labeled medical datasets for novel disease outbreaks. While the quantity of Electronic health records (EHRs) has increased by multiple folds due to proper data collection; data records are frequently missing, recorded incorrectly, or improperly disposed of [40]. Second, it is difficult to get positive samples in an early stage of novel disease outbreak. Annotating such samples further requires great expertise. Studies by [41], [42] shows that general intuition of better classifier performance for large training dataset is misleading and models can attain good accuracy on limited datasets. But the development of CAD models that can work on limited dataset still remains a challenging task. Third, the emergence of newly mutated viruses results in a significant change in the dataset, severely restricting the performance of classification-based approaches.

D. Fuzzy Logic in Computer-Aided Diagnosis

Even though the study and application of fuzzy logic has received immense attention in recent years [43]–[45], the use of fuzzy logic for classification in CAD models is an open research area [29]. Fuzzy min-max neural networks, first introduced by [46], have been used in many applications. Various improvements have been proposed [47] in the original network to improve the model’s overall accuracy. One of the areas of past work includes fuzzy reflex classifiers [48], where a self-supervised network learns on data and classifies it. However, this network has not been investigated for tasks involving high-dimensional data like image classification, including medical image classification in CAD models. We demonstrate how the recent rise of DL-based supervised feature extraction bridges the gap between fuzzy reflex classifiers and their machine learning (ML) counterparts.

E. Deep Learning for Novel Disease Detection

Today, very few works focus on developing CAD models capable of detecting unseen novel diseases. This problem has the potential for future pandemic prevention and control. Recent attempts have been made by [49] on how to learn classifiers to predict or choose to defer the classification decision to a downstream expert. Work by [50] has investigated unseen disease detection using DL on CXR scans, but the study is limited to an internal validation set without an external test set including different unseen diseases. Moreover, the proposed model architecture cannot be updated in real-time and has to be re-developed for adding new classes each time a novel disease is detected.

III. Proposed Methodology

A. Problem Formulation

In traditional models, an input test sample Inline graphic , is classified amongst the previously learnt classes . These classification models are based on a ‘closed world’ assumption [50], where no new classes are present in the test set, which hardly is the case for real-world medical applications. Therefore, such models suffer from a major disadvantage, i.e., samples belonging to a new class (i.e., high probability of a novel disease), which the model has not seen or not been trained on previously, get wrongly classified in one of the existing classes, leading to diagnostic failure. Moreover, for every introduction of a new disease, these models need to be re-trained and developed.

Unlike conventional models, Deep-Precognitive diagnosis (DPD) refers to developing intelligent CAD models that can detect the occurrences of new diseases at their preliminary stage of spread. The proposed model (refer Figure 2(b)) has the ability to detect new disease classes and create a new class Inline graphic for them in real-time. Such a model will have the capability to control the spread of novel diseases with an early warning of a future pandemic. Figure 2 shows the conventional CAD models in contrast with the proposed Deep-Precognitive diagnosis model.

FIGURE 2. — (a) Traditional CAD models are based on a “closed world” assumption, where no new classes are present in the test set. This is hardly the case for real-world medical applications. These models are limited to classify input test sample which may belong to a new class of disease as one amongst the previously learnt classes having the highest probability. This leads to wrong diagnosis which may cause disease spread/ patient’s death. Even after clinical tests establish existence of a new disease, to classify it, complete model retraining is required (b) Deep-Precognitive diagnosis detects this novel disease, creates new class in real time and classifies the novel disease to this class. The model’s accuracy tend to increase with time over this new class as it sees its more samples. Therefore, the need for model re-training is eliminated.

Inline graphic — (a) Traditional CAD models are based on a “closed world” assumption, where no new classes are present in the test set. This is hardly the case for real-world medical applications. These models are limited to classify input test sample which may belong to a new class of disease as one amongst the previously learnt classes having the highest probability. This leads to wrong diagnosis which may cause disease spread/ patient’s death. Even after clinical tests establish existence of a new disease, to classify it, complete model retraining is required (b) Deep-Precognitive diagnosis detects this novel disease, creates new class in real time and classifies the novel disease to this class. The model’s accuracy tend to increase with time over this new class as it sees its more samples. Therefore, the need for model re-training is eliminated.

DPD is difficult to be thought of as a supervised learning problem. Since the dataset of the novel disease, which the model is trying to detect, cannot be available previously. Instead of a supervised learning problem, we frame the task as a class membership lookup problem to solve it. For each input, the model learns both contrasting and similar visual features between different classes Inline graphic . The input feature vector is mapped in a high dimensional feature space (here, dimensional) where its classification becomes feasible. This is particularly different from Convolutional Neural Network (CNN) [51] based classification, which focuses mainly on learning those image features which can potentially be useful in distinguishing between two classes.

B. Disease Feature Extraction and Mapping

The proposed model architecture (refer Figure 3) combines CNN-based CXR feature extraction with a fuzzy classification network for real-time image classification. From each sample, the image feature vector Inline graphic containing prominent disease characteristics like ground-glass opacities, crazy paving pattern, etc., are extracted.

Deep transfer learning is used on modified VGG-16 [52] CNN architecture. A Global Max Pooling (GMP) layer is introduced after the fifth ‘MaxPooling2D’ layer of the original network, as shown in Table 2. Subsequent layers, i.e., the flatten, fc1, fc2, predictions, and dense_1 layers, are removed from the original network, which were primarily trained for classifying ImageNet [53] into 1000 different classes. Recent studies [54] have confirmed the effectiveness of transfer learning, which surprisingly offers higher accuracy in medical imaging tasks. Unlike standard dense (layer) in conventional CNNs, using fuzzy classification enables online learning and allows single-pass training compared to CNNs, which require multiple epochs to obtain high accuracy. Figure 5 shows the layer-specific feature representation maps generated by the feature extraction network.

TABLE 2. Architecture of the Modified VGG-16 CNN for Extracting 512-Dimensional CXR Feature Vector. Here, Represents Stage, is the Number of Stacked Layers in the Operator, Represents the Weight/ Height/ Channel and GMP is the Global Max Pooling Layer.

S	Operator	L
1	Conv3-64	2
2	MaxPooling2D	1
3	Conv3-128	2
4	MaxPooling2D	1
5	Conv3-256	3
6	MaxPooling2D	1
7	Conv3-512	3
8	MaxPooling2D	1
9	Conv3-512	3
10	MaxPooling2D	1
	GMP	1

Open in a new tab

FIGURE 5. — Layer-specific feature representation maps generated by the modified VGG-16 CXR feature extraction network.

C. Fuzzy Classification

The fuzzy classification network [46] learns by forming hyperbox fuzzy sets. A hyperbox Inline graphic [55] is a simple geometrical-shape defined in the -dimensional feature space. The size of is controlled by the hyperbox expansion coefficient ( ), which lies between 0 and 1. Figure 6 shows a hyperbox for .

FIGURE 6. — A Hyperbox in 3-dimensional feature space i.e., . Here, ‘ ’ represents min coordinate and ‘ ’ the max coordinate of .

1). Point Hyperbox ( ) Creation

During training, the extracted feature vector Inline graphic for each training sample is passed to the input nodes of the fuzzy classifier after normalization. The classifier creates hyperboxes with co-ordinate and co-ordinate in the 512-dimensional feature space. Assuming is the training sample, is a hyperbox for class ; we initialize Inline graphic having and class label , i.e., during training, for the first sample, a point hyperbox is created.

2). Class Membership Computation

For each sample, the extracted feature vector is passed to the Classifying Neurons ( Inline graphic ), for the classification of the learned data using - hyperboxes [46]. A membership function [46] is defined with respect to the and co-ordinates of . In , neuron represents hyperbox fuzzy set .

In classifying section nodes, to compute the class memberships, the activation function proposed by [56] is used to assign membership value equal to 1 when the test sample falls within Inline graphic . In other cases, when the test sample lies outside , the model calculates membership value based on its distance from extreme coordinates of . Figure 4(a) enlists the activation functions of the classifying neuron. Here, represents a 2-parameter ramp threshold function and ‘ ’ represents the fuzziness control parameter. It is to be noted that as the limit of the maximum allowed size of hyperbox ( Inline graphic ) increases, number of hyperboxes created during training reduces and vice-versa. This is confirmed experimentally in Section VI.

FIGURE 4. — The three categories of Neuron (a) Classifying Neuron ( ) with its activation function (b) node and its activation function (c) node and its activation function , used in the proposed model architecture; and represents the threshold functions.

3). Significance of Inter-Node Connections

In the middle layer of the classifier, the input nodes and the hyperbox nodes are connected together. These connections represents the Inline graphic - coordinates and of the 512-dimensional hyperbox fuzzy set [48]. During training, the middle layer neurons are created dynamically. Connection between the hyperbox node to a class node , is represented by matrix , where, if , else .

4). Training Learning Classifier

Whenever a training sample is encountered by the model that does not belong to the classes it has learned so far, a hyperbox node is created in the Inline graphic section. During training, the model tries to accommodate subsequent samples in the previous hyperboxes belonging to the same class using the conditions discussed below, provided the hyperbox size does not exceeds a specified maximum limit (given by expansion coefficient ) [46]. If the expansion of any of the existing hyperboxes ( Inline graphic ) which belongs to that class is not feasible, a new hyperbox is added to the model; i.e., for a new training sample , a hyperbox is found such that or which has the highest membership value and satisfies following conditions-

Adjust min-max coordinates of Inline graphic , as, , , where and if and then .

If no suitable Inline graphic is present then a novel hyperbox for class is created with ; i.e., a point hyperbox is created. Since the high dimensional feature space contains all the learned visual features of the image, a possible case of hyperbox overlap can occur. This can be explained as a common visual feature between two different diseases. Table 3 enlists both the common and distinguishing visual features on CXR scans for various diseases.

TABLE 3. Comparative Analysis of Visually Similar and Distinguishing Features of COVID-19, SARS-CoV-1 and MERS Respiratory Diseases on CXR Scans. Data Taken From [57]–[62].

Disease	Image Performance	Normal Radiography	Crazy Paving Pattern	Ground Glass Opacity	Consolidation	Septal Thickening	Air Bronchogram	Pleural Effusion	Pneumo-thorax
SARS-CoV	Unilateral, focal; unilateral, multifocal; bilateral; peripheral distribution	18.40%	46.27%	68.48%	65.65%	55.22%	37.04%	17.31%	9.62%
MERS-CoV	Bilateral, multifocal; isolated unilateral; peripheral distribution	20.00%	26.67%	86.36%	50.00%	40.91%	-	54.55%	Rare
COVID-19	Bilateral, multifocal peripheral distribution	19.90%	8.56%	68.92%	26.64%	34.54%	34.54%	3.57%	Rare

Open in a new tab

D. Biologically-Inspired Reflex Section

The Reflex section contains the Overlap Compensation Neurons ( Inline graphic ) and Containment Compensation Neurons ( ) [48]. These neurons become active only when a case of hyperbox overlap and containment is encountered, respectively. The reflex mechanism is biologically inspired from that of the human brain, which unconsciously gains control of the human body in hazardous conditions.

1). Reflex Section Architecture

Inline graphic represents of size equal to the overlapped space between two of different classes. The section is active only if the test data lies within the overlap space. It generates two compensation outputs, one each for the two overlapping classes. The section, which overcomes the hyperbox containment case, represents Inline graphic of size equal to the overlapping space between the two classes. activates when a test sample falls inside the overlapped space. Figure 4 represents these nodes with their respective activation functions used in the model architecture.

2). Inter-Node Connections and Training

The connection between the hyperbox nodes and class nodes in the reflex section is represented by matrix Inline graphic and , respectively (refer Figure 3). Whenever a situation of overlap/ partial or full containment of is encountered, the hyperbox node is created dynamically in the reflex section’s middle layer. Overlap or containment between a labeled hyperbox and unlabeled hyperbox is allowed and does not create any Inline graphic or nodes. This is used to label the unlabeled hyperboxes. The number of output layer nodes, present in the section, represents the total number of classes learned by the model.

3). Final Membership Computation

The final membership value [48] for the Inline graphic class node is computated as , where is the membership of the class in classifying layer ( ) section; i.e., and is the compensation for the class.

4). Disease Visual Feature Overlap and Containment

The introduction of this biologically-inspired section helps in obtaining more explainable class memberships. This is discussed in detail in the Ablation Study in Section IV. In the case of two diseases having similar visual features, a condition of hyperbox overlap may occur. Suppose a hyperbox Inline graphic , which is expanded in any previous step, is compared with all other hyperboxes . If and , the overlap and contraction test are performed as explained in Test 2 [48]. They follow the principle of minimum disturbance by computing the dimension with minimum overlap ‘ ’ and contracting it. Otherwise, Test 1 is performed [48]. Figure 7 illustrates the algorithm as a flowchart.

FIGURE 7. — Algorithm Flowchart, I. Training Algorithm, II. Online learning for dynamically created class, III. Novel disease detection.

5). Hyperbox Isolation Condition

If Inline graphic or holds for any , then, are isolated. If the condition does not hold, containment test is performed.

6). Hyperbox Containment Condition

If Inline graphic or holds for any , then Hyperboxes are contained and a node is formed dynamically. If hyperboxes are not contained, an node is created.

7). Hyperbox Overlap Test

Initial value of Inline graphic is set as 1.

Case 1:

=

Case 2:

=

Case 3:

=

Case 4:

=

If overlaps exist and Inline graphic , then, else .

8). Hyperbox Contraction Test

If overlap exists and is minimum along Inline graphic dimension, the hyperboxes are contracted using the following given conditions:

Case 1:

=

Case 2:

=

Case 3:

and then else

Case 4:

and then else

E. Novel Disease Detection System

This section describes how the model detects and classifies novel diseases, without explicit training.

1). De Novo Disease Detection

For a particular disease, hyperboxes occupy a large extent of space in an Inline graphic -dimensional feature space. For many diseases, since the type and extent of lesions tend to be either slightly or considerably similar, a portion of visual features is mapped to an -dimensional feature space, which is common for more than one disease category. Most techniques tend to distinguish different disease classes on the basis of their differentiating visual features [51]. In the case of fuzzy classifiers, both common and differentiating features of the disease are mapped in the Inline graphic -dimensional feature space using hyperboxes. Even though transfer learning is used to obtain feature vectors, since these are of high dimension, they encode a large amount of visual information of the disease [54]. Exploiting this property, we classify those diseases as novel which tend to occupy an overlapping space less than a set threshold Inline graphic , where , in this -dimensional feature space.

2). Negation Operation on Intuitionistic Membership Grades

The regular membership function Inline graphic denotes the value of an input sample belonging to a particular class of disease that CAD model has previously learned. The fundamental property, and , where denotes the value by which the entity does not belong to class . Using the negation operation, for intuitionistic membership grades, we tend to compute the input samples where Inline graphic is above-set threshold , i.e, samples for which , where . Widely used Pythagorean membership grade [63] defined by is not employed since it allows for representation on a larger body of non-standard membership grades.

3). Online Architecture Update for Class

Upon identifying a novel disease, the model adds a new class Inline graphic to the classification network, which is similar to class addition during model training. Further, when samples of this novel disease are detected in the future, the model constructs hyperboxes using the same principle discussed. This enables online architecture update and increases the model’s ability to classify the samples of the novel disease.

4). Midway Labeled Data Introduction

Furthermore, since the architecture is semi-supervised, labeled training samples of the novel disease can also be introduced at any point to improve the accuracy for this class without the need for complete re-training. Algorithm 1 is used for the detection of novel disease samples, online learning, and midway introduction of a labeled dataset. In detecting novel diseases, one of the challenges faced is determining accurate model parameters best suited for classification. Increasing the fuzziness control parameter ( Inline graphic ) leads to more fuzzy classification while decreasing it leads to a crisp classification. Algorithm 1 discusses the implementation of the methodology in detail.

Algorithm 1 Detecting and Classifying Novel Disease Occurrences

Input and initialization:
Model Pre-trained on labeled Train Set
Input test sample
, , Model Parameters
Current classes learnt
empty lists for feature vector, class index
User-input value for Online Learning
Computing maximum class membership ( ):
,
Novel disease detection:
if ( Threshold ) then
, i.e., Novel Disease Detected
.append( ), .append( )
else
, for which is
Online Learning update for dynamically created class:
if (elements in equals ) then
for ( to len( )) do
if ( [ ] classes learned) then
Add Hyperbox
Classes learnt , i.e.,
else
Calculate expanded index
Follow standard algorithm
, = [], []
Go to next test sample.
Midway introduction of labeled dataset:
Train ( ),

IV. Experiments and Results

A set of experiments are performed to demonstrate the effectiveness of the proposed approach. In this section, firstly, we discuss the experiments and the results of the model’s ability to detect novel diseases on three unseen novel disease datasets: SARS-CoV-1, MERS-CoV, and COVID-19 to demonstrate the strong generalizability of the approach. In additional experiments, the model’s classification ability is evaluated on two tasks: binary classification and multi-class classification of CXR diseases. We also discuss the hyperparameters chosen during these experiments. Lastly, we provide an ablation study to evaluate the contribution of key components of the proposed model.

A. Experimental Settings

This section describes in detail the experimental setting.

1). Dataset

To assess the proposed method, dataset from two popular open-source repositories- COVID-Chestxray set [64], and kaggle-chest-xray-dataset [65], extensively used in the research literature for training and testing of CAD models for COVID-19 CXR classification were employed. Table 4 contains the statistical details of posterior-anterior (PA) CXR scans used in the experiments. Initially, all inputs are pre-processed, which includes resizing Inline graphic and format conversion. Pre-processed high-quality images are selected and divided into two subsets: the training set (80%) and the test set (20%). The images are evenly distributed in different classes for classification experiments.

TABLE 4. Details of the CXR Radiograph Dataset Used to Assess the Proposed Method. The Proposed Model is Trained on Limited Dataset.

	Disease Category	Ref	Total CXR Scans	Training Set (80%)	Test Set (20%)
1	COVID-19	[64]	196	157	39
2	Bacterial Pneumonia	[65]	196	157	39
3	SARS-CoV-1	[64]	08	-	08
4	MERS-CoV	[64]	08	-	08
5	Normal	[65]	196	157	39

Open in a new tab

2). Implementation

The work is implemented using Keras [66] with Tensorflow [67] as backend. Nvidia K80 GPU with 12GB RAM workbench was used for conducting the experiments. ‘zscore’ was used as the normalization method which is calculated as Inline graphic . ‘yeo-johnson’ transformation was applied while training the ML classifiers for comparison.

3). Compared Methods

Various SOTA ML classifiers are implemented on the same dataset to compare the classification results. 15-fold cross-validation was used for implementing the classifiers to distinguish between COVID-19 and non-COVID-19 CXR scans compared to the proposed model on the same dataset. Similarly, the models were implemented for performance comparison on the multi-class classification task. ‘Accuracy’ was used as the metric for optimizing the hyperparameters used for training.

4). Evaluation Metrics

Confusion matrix-based metrics is used assess the classification performance of the proposed model. This includes accuracy, precision, recall, and F1-score as described below.

1)
Accuracy: It estimates the ratio of correctly classified diseases to the entire test dataset. If accuracy is higher, a model has better performance. It lies between [0, 1] and is generally reported as percentage (%).
2)
Precision: It estimates the ratio of a particular disease that has been correctly classified over CXR scan to the total number of that particular disease identified by the model. Like accuracy, precision also lies between [0, 1] and is generally reported as a percentage (%).
3)
Recall: The recall is the measure of a model correctly identifying true positives. Thus, for all the patients who actually have a particular disease over CXR scan, recall tells how many were correctly identified having that particular disease.
4)
F1-Score: It is defined as the harmonic mean of Precision and Recall. If the F1-Score is higher, a model is better. F1-Score and is generally reported as percentage (%).

where, Inline graphic , , , are the number of true positives, true negatives, false positives and false negatives respectively. For multi-class classification task, the discussed metrics is used as class-wise and macro-average.

B. Exp 1: Evaluation of Novel Disease Detection Ability

In this experiment, the model is trained to classify normal X-ray scans from bacterial pneumonia X-rays. To evaluate the ability of the model to detect novel diseases, CXR samples of diseases, like SARS-CoV-1, MERS-CoV, and SARS-CoV-2 (COVID-19), not seen by the model before, are inputted along with the original test set images.

Severe acute respiratory syndrome (SARS-CoV-1) is a viral respiratory disease reported around the end of February 2003. SARS-CoV-1 samples are used as novel disease sample inputs to the proposed model along with the test set. Figure 8(a) shows the graph obtained (at Inline graphic and ) between the max-memberships value to the pre-defined class; i.e., for different input test samples . It can be inferred from the graph that samples belonging to the classes that the model was pre-trained for, i.e., normal and bacterial pneumonia, have a higher range of membership values over predefined classes. However, the input test samples of novel disease are found to have max class memberships below a certain set threshold. This is due to different visual features than those of the previously learned diseases. CXR scans of SARS-CoV-1 patients show bilateral airspace consolidation (observed in 66.7%–70.6% patients), demonstrated by multi-focal opacity [68]. Focal opacity is also detected predominantly in the middle, lower and peripheral zones of the lungs. This is significantly different from COVID-19 and MERS-CoV, where consolidation is 26.64% and 50% respectively, as compared to 65.65% in SARS-CoV-1 (refer Table 3). The SARS-CoV-1 input samples are classified into a new dynamically created class during testing with Inline graphic , and threshold . Moreover, each time this new disease is further encountered in the future, the model classifies it to the newly created class.

FIGURE 8. — Graph between max membership to predefined class and number of input test samples for (a) SARS-CoV-1, (b) MERS-CoV and (c) SARS-CoV-2 datasets. Unseen novel disease samples detected by the proposed model and classified separately in a newly created class. Here, max memberships to predefined class, i.e., for the novel disease samples are in red and to classes learned previously are in blue.

Similarly, to demonstrate the model’s generalization ability, MERS-CoV and COVID-19 CXR scans are used with the proposed model as novel disease samples. Note that the model has not seen these images before and has never been trained on them. The model detects them as novel diseases, classifying them to a newly created class. Figures 8(b) and 8(c) show the graph obtained between max-memberships value Inline graphic to pre-defined class for MERS-CoV and COVID-19. Further, Figure 9 shows the results obtained from t-Distributed Stochastic Neighbor Embedding (t-SNE) feature visualization for the Normal vs Bacterial Pneumonia classification and the de novo disease detection experiments with CXR of COVID-19, MERS-CoV and SARS-CoV-1 input as novel disease. The performance of the model, along with the hyperparameters used, are discussed in Table 5.

FIGURE 9. — Visual illustration of feature visualization. (a) Normal vs Bacterial Pneumonia classification. Here yellow represents normal samples whereas Violet shows the Bacterial Pneumonia samples. (b) COVID-19 (in green) is input as novel disease, previously unseen by the model. (c) MERS (in yellow) and (d) SARS (in yellow) is input as novel disease, previously unseen by the model. In both (c) and (d) green represents the normal CXR scans whereas Violet shows the Bacterial Pneumonia samples.

TABLE 5. Performance of the Proposed Model on COVID-19, SARS-CoV-1 and MERS-CoV Datasets for Novel Disease Detection Experiment Along With Hyperparameters Used. The Model was Pre-Trained to Classify Normal and Bacterial Pneumonia Classes.

Novel disease (unseen by the model)				Novel Disease Detection Acc. ( )	Bacterial Pneu. vs Normal CXR classification Acc. ( )
COVID-19	0.75	1	0.70	82.50%	81.01%
SARS-CoV-1	0.75	1	0.50	100.00%	96.20%
MERS-CoV	0.75	1	0.45	100.00%	98.73%

Open in a new tab

C. Exp 2: Evaluation of Classification Ability

In this experiment, the proposed model is tested to assess its performance on binary and multi-class classification tasks to demonstrate that the model is not only capable of detecting de novo diseases but also classifying test samples belonging to other pre-trained classes Inline graphic . Binary classification is performed on non-COVID-19 and COVID-19 CXR scans, whereas multi-class classification is performed between normal, bacterial pneumonia and COVID-19 CXR samples.

The obtained results are compared with the various ML classifiers implemented on the same dataset. Table 6 shows the 15-fold cross-validation performance of SOTA ML classifiers to distinguish between COVID-19 and non-COVID-19 CXR scans compared to the proposed model. Table 7 compares the performance of the proposed model with various ML classifiers on the multi-class classification task. The results demonstrate that the performance of the proposed model is at par with other SOTA models in both classification tasks. Figure 11 illustrates the obtained confusion matrix for both the classification tasks. Further, Figure 10 shows the results obtained from the t-SNE feature visualization for both the classification tasks.

TABLE 6. Comparison of Binary Classification Results on CXR Scans With Various ML Classifiers. Here, , for Proposed Method. Feature Vectors are Obtained From CXR Feature Extraction Network.

Method	Accuracy ( )	Recall ( )	Precision ( )	F1 Score ( )	TT (sec) ( )
Quadratic Discriminant Analysis (QDA)	65.38	59.26	67.15	62.10	0.0507
Linear Discriminant Analysis (LDA)	78.85	80.74	79.26	77.82	0.0747
Gradient Boosting Classifier (GBC)	93.41	93.33	93.76	93.31	1.7200
Decision Tree (DT)	95.98	97.04	95.31	96.01	0.0467
Random Forest (RF)	96.32	97.04	95.78	96.33	0.5780
Extreme Gradient Boosting (XG-Boost)	96.35	96.30	96.52	96.24	1.7660
Support Vector Machine (SVM)	97.06	97.78	96.71	97.11	0.0400
Naive Bayes (NB)	97.06	100.0	94.91	97.26	0.0280
Proposed Method	97.47	97.46	97.46	97.46	0.7878

Open in a new tab

TABLE 7. Comparison of Multi-Class Classification Results on CXR Scans With Various ML Classifiers. Here, , for Proposed Method. Feature Vectors are Obtained From CXR Feature Extraction Network.

Method	Accuracy ( )	Recall ( )	Precision ( )	F1 Score ( )	TT (sec) ( )
Quadratic Discriminant Analysis (QDA)	42.78	57.14	43.94	41.96	0.0480
Linear Discriminant Analysis (LDA)	63.75	64.07	65.11	63.07	0.0907
Decision Tree (DT)	76.13	76.10	77.61	76.00	0.0793
Naive Bayes (NB)	89.48	89.59	90.06	89.48	0.0253
Gradient Boosting Classifier (GBC)	90.51	90.64	91.21	90.47	8.4060
Proposed Method	90.68	90.70	90.67	90.63	5.7184
Extreme Gradient Boosting (XG-Boost)	91.72	91.76	92.30	91.70	10.817
Random Forest (RF)	91.95	91.93	92.55	91.95	0.6253
Support Vector Machine (SVM)	93.20	93.26	93.87	93.16	0.0700

Open in a new tab

FIGURE 11. — The obtained Confusion matrix for the (a) Binary classification and (b) multi-class classification task.

FIGURE 10. — Visual illustration of feature visualization for (a) Normal vs COVID-19 binary classification task. Here yellow represents COVID-19 samples, whereas normal samples are shown in Violet (b) Normal vs COVID-19 vs Bacterial Pneumonia multi-class classification task. Here green shows the Bacterial Pneumonia class.

D. Ablation Studies

The proposed model contains four key components: The modified VGG-16 based CXR feature extractor, fuzzy classifier, biologically-inspired reflex section for class membership generation, and the novel disease detection framework. Here, we provide an ablation study to explore the contribution of the key components of the proposed model.

1). Global Max-Pooling vs. Global Average Pooling

Although pneumonia and most other diseases can be differentiated, X-rays are still considered as the “most challenging plain film to interpret correctly” [33]. Therefore, extracting discriminating features on CXR scans is a challenging task. Instead of using the standard global average pooling (GAP) layer in the feature extraction network, the global max-pooling layer (GMP) is preferred. During the ablation study, the GMP layer was replaced with the GAP layer to study its contribution. Unlike conventional models, in which GAP outperforms GMP, in the present model, GMP is found to generate more representable features from the input image and shows significant performance over GAP. This mainly occurs when visual features of two image classes are very near to each other with very minute differences. Therefore, in such cases averaging the features, i.e., using GAP fails.

2). Varying CNN Models and Curse of Dimensionality

The model is found to perform optimally when CXR feature vector dimensionality Inline graphic . As the dimensionality is increased above 512, the model accuracy decreases, and the sample testing time is also found to increase significantly. This is due to multi-fold increase in the volume of the high dimensional space, such that the available data becomes sparse. Further, modified VGG-16 [52] CNN pre-trained on ImageNet [53] performs optimally over other feature extraction networks like ResNet [69], MobileNet [70], etc.

3). Explainability of Class Memberships

Introduction of biologically-inspired section helps in obtaining more explainable class memberships. Unlike fuzzy min-max neural networks [46], which contracts an hyperbox in case of overlaps, the reflex mechanism produces compensation outputs using Inline graphic and neurons. This brings the model near reality since most diseases have some visually similar features on CXR scans lost in FMNN due to hyperbox contraction.

V. Discussion

A. Parametric Study & Hyperparameter Tuning

An in-depth parametric study was performed to evaluate the effects of various model parameters and propose a strategy for hyperparameter tuning. Figure 14 shows plots obtained from the parametric study; i.e., the effect of various parameters including the hyperbox expansion coefficient ( Inline graphic ) and fuzziness control parameter ( ) on the model accuracy, number of hyperboxes ( ) created during model training, total model training time ( ) and the sample testing time ( ) for both the classification tasks performed to evaluate the model’s classification ability. The obtained results are:

1)
At higher values of hyperbox expansion coefficient ( ), the model shows better performance on image classification tasks (refer Figure 14(a-b)). In classification tasks where the feature vectors are mapped to low-dimensional space, i.e., , low values of expansion coefficient ( ) produces optimal results.
2)
Further, it can be inferred from Figure 14(a-b) that decreasing the fuzziness control parameter ( ) generally shows better model performance.
3)
As hyperbox expansion coefficient ( ) increases, the number of hyperboxes created during training shows an ‘exponential’ increase rather than ‘linear’ (refer Figure 14(g)). On the other hand, the model training time first shows a sharp rise until , after which its value decreases ‘exponentially’ in both the binary and multi-class classification tasks (refer Figure 14(h)).

FIGURE 14. — Plots obtained from parametric study. Graph between (a) Accuracy (%) and expansion coefficient ( ) for for binary classification (b) Accuracy (%) and expansion coefficient ( ) for for multi-class classification (c) Sample testing time (sec) and expansion coefficient ( ) for binary classification (d) Sample testing time (sec) and expansion coefficient ( ) for multi-class classification (e) Total training time (sec) and expansion coefficient ( ) for binary classification (f) Total training time (sec) and expansion coefficient ( ) for multi-class classification (g) Number of hyperboxes formed and expansion coefficient ( ) (h) Training time sec) and expansion coefficient ( ).

The results obtained from the study quantifies the model hyperparameters for image classification tasks and are helpful in hyperparameter tuning. Figure 12 shows a three-dimensional plot between the obtained classification accuracy, hyperbox expansion coefficient ( Inline graphic ), and the fuzziness control parameter ( ) for both the classification tasks, carried out to find the best fit model.

FIGURE 12. — Graph between the obtained accuracy, fuzziness control parameter ( ) and expansion coefficient ( ) to find the best fit model for (a) Binary Classification (b) Multi-class Classification tasks.

The set threshold ( Inline graphic ) plays a significant role in the detection of de novo diseases. Figure 13 compares the accuracy vs. set threshold in this regard. For class memberships, the threshold ( ) is determined experimentally. For SARS-CoV-1, a threshold ( ) is set at 0.5, MERS-CoV at 0.45, and 0.70 for COVID-19. The best fit values of the thresholds, obtained from the graph shown in Figure 13, are the point of intersection between the classification accuracy and class memberships for the novel disease. It can be inferred from the plot that when the threshold ( Inline graphic ) for the novel disease is kept very low, novel disease samples are classified as one of the previously trained classes. However, when the threshold is kept very high, input samples belonging to the pre-trained classes are incorrectly classified as novel disease samples.

B. Time Complexity Analysis

Along with the parametric study, a time complexity analysis is performed. In this study, the sample testing time (sec) is calculated by varying the hyperparameters Inline graphic and for both the binary and multi-class classification tasks. The experiment is repeated to analyze the total model training time (refer Figure 14(c-f)). The obtained results quantified that though the total training time of the model is extremely less (i.e., to 20 sec), the sample test time is relatively high and varies from Inline graphic to 300 sec. The same is observed for both the binary and multi-class image classification tasks. In the case of low-dimensional data classification tasks, such a large difference is not observed.

C. Robustness to Adversarial Attacks

Adversarial attacks [84] involves generating modified image by making subtle imperceptible changes in the original image. To boost robustness, existing defensive measures include: leveraging network distillation to extract information from the trained feature extractor [85], and using innovative training methods (such as IMA) that can expand the margins of training samples in the input space [86]. Since the feature extraction network was pre-trained on ImageNet [53], adversarial noise may not significantly affect the performance of the network. The model performance establishes strong generalizability of the approach through tests for a set of three novel diseases (taken from two different open-source datasets [64], [65]): COVID-19, SARS-CoV-1 and MERS-CoV. The main adversarial attack algorithms identified to target the proposed model include the L-BFGS algorithm and Fast gradient sign method (FGSM) [82]. Moreover, quantitatively analysing the vulnerability of the model to possible adversarial attacks w.r.t. existing defense method suitability, along with demonstrating robustness on various adversarial attacks can be an interesting extension for this research.

D. State-of-the-Art Performance Comparison

To compare the performance of the proposed model with unsupervised clustering approaches, K-means clustering was performed on the dataset with normal, bacterial pneumonia, and COVID-19 X-ray feature vectors. The feature vectors are extracted using the same feature extraction network as used in the proposed model. K-means clustering was found to incorrectly detect 4 clusters as shown in t-SNE plot [83] in Figure 15. It illustrates the failure of conventional clustering approaches and how semi-supervised fuzzy classifiers are effective. For hyperbox expansion coefficient Inline graphic , the fuzzy classifier is the k-nearest neighbor classifier.

FIGURE 15. — Results of the k-means clustering showing 04 identified clusters, depicting the failure of conventional clustering approaches.

A comparative analysis of the proposed model was performed with existing SOTA techniques developed employing CXR images. Table 8 summarizes the study’s findings. It shows both quantitatively and qualitatively the out-performance of the proposed model over other models in present literature. Moreover, the present work is the first to identify the challenging task of deep-precognitive diagnosis and propose a novel solution to it.

TABLE 8. Performance Comparison With SOTA Techniques in Literature Developed Using CXR Images. The Present Work is First to Identify the Challenging Task of Novel Disease Detection and Propose a Novel Solution to it.

Study	Number of class/ Samples	Method	Accuracy (%)	Novel Disease Detection
Sahinbas & Catak [71]	2-class/ 140 (70 each of COVID-19 & Normal)	VGG16, VGG19, ResNet, DenseNet, InceptionV3	80.00
Panwar et al. [72]	2-class/ 570 (206 COVID-19 & 364 Normal)	CNN Transfer Learning	89.47
Hemdan et al. [73]	2-class/ 50 (25 each of COVID-19 & Normal)	COVIDX-Net	90.00
Medhi et al. [74]	2-class/ –	Deep CNN	93.00
Waheed et al. [75]	2-class/ 1124 (403 COVID-19, 721 Normal)	CovidGAN	95.00
Ahishali et al. [76]	2-class/ 13609 (1065 COVID-19, 1254 Normal)	CSEN	95.13
Vaid et al. [77]	2-class/ 545 (181 COVID-19, 364 Normal)	VGG16 Transfer Learning	96.30
Lv et al. [78]	2-class/ 270 (105 COVID-19 & 165 Normal)	Cascade Network	97.14
Proposed Study	2-class/392 (196 each of Normal & COVID-19)	Proposed Model	97.47
Civit-Masot et al. [79]	3-class/ 396 (132 each COVID-19, pneumonia, healthy)	VGG16 based DL model	86.00
Ozturk et al. [80]	3-class/ 1125 (125 COVID-19, 500 Normal, 500 Pneumonia)	DarkCovidNet	87.02
M. Qjidaa et al. [81]	3-class/ 300 (100 COVID-19, 100 Normal, 100 Pneumonia)	VGG-16 based model	87.50
Proposed Study	3-class/588 (196 each of Normal, Pneumonia & COVID-19)	Bio-Inspired Conv-Fuzzy Net	90.68
Proposed Study	392 (196 Normal & 196 Bacterial Pneumonia)	Proposed Model	–	COVID-19 82.5% SARS-CoV-1 100% MERS-CoV 100%

Open in a new tab

VI. Limitations and Future Work

One of the limitations of the proposed model is that its sample testing time (i.e., Inline graphic to 300 sec) is comparatively higher than the total model training time (i.e., to 20 sec). Future work may look into ways to reduce the high sample testing time of the model. Second, an algorithm can be developed to avoid manual interventions for updating the hyperbox expansion coefficient ( Inline graphic ) and fuzziness coefficient ( ). Moreover, we aim to increase the model sensitivity to novel diseases and improve classification accuracy by modifying the model architecture in future work. Future research directions also include expanding the proposed model over other diseases that are detectable through CXR features in high-dimensional vector spaces. Although in the future, there may be disease outbreaks that might impact some other human organ that can only be caught by scanning a particular organ or through blood chemistry, physiological analysis, CT-Scans, MRI, etc. Nevertheless, the proposed model can be further extended by changing or augmenting this kind of data. Future research in such areas will slowly help us move towards a universal DPD model that can take different kinds of data and predict new disease existence.

VII. Conclusion

Currently, most works on DL-based CAD models are limited to increasing classification accuracy and sensitivity. In this paper, the concept of Deep-Precognitive diagnosis is proposed, which has immense potential for future research and can be helpful in the development of next-generation CAD models. The challenges posed by the formulated Deep-Precognitive diagnosis task are difficult to be addressed using supervised learning models, as they require labeled data for learning to classify new data into one of the trained classes only. Since it is not possible to get the novel disease dataset prior to its outbreak, most supervised learning models would fail to detect the new disease class.

The present work is the first to address these challenges by proposing a biologically-inspired convolutional fuzzy classification model, wherein we visualize the proposed task as a class membership lookup problem. Unlike conventional models, the proposed DPD model can detect occurrences of new diseases at their preliminary stage of spread. Such a model has the capability to control the spread of novel diseases with an early warning of a future pandemic. The proposed model creates a new class for them in real-time. Experimental results on three CXR disease data- SARS-CoV-1, MERS-CoV, and COVID-19 demonstrate the feasibility and remarkable performance in identifying a new disease class. Further, the model’s classification ability is demonstrated in the binary and multi-class classification tasks. An ablation study is also performed to quantify the contributions of critical components of the model. Thus, the proposed model can be used as a baseline for future works. Besides, two other vital issues that obstruct the development of future diagnosis models are also addressed in this work: firstly, the requirement of a large labeled training medical dataset and, secondly, the need for model re-training when novel disease needs to be added to the CAD model for classification. The model learns on limited datasets and reduces heavy dependence on labeled dataset availability. The results also establish that modified fuzzy classifiers achieve accuracy comparable with SOTA models.

Deep-Precognitive diagnosis has immense potential applications in developing future-CAD models that will be powerful enough to detect new disease occurrences and learn and improve their novel disease classification ability to expand on several such new diseases in real-time.

Conflicts of Interest

The authors hereby declare that there is no conflict of interest.

Biographies

graphic file with name chhar-3153059.gif

Aviral Chharia is currently pursuing the bachelor’s degree with the Mechanical Engineering Department and the Computer Science and Engineering Department, Thapar Institute of Engineering and Technology, India. He was a recipient of Students Undergraduate Research Graduate Excellence (SURGE) Fellowship with the Indian Institute of Technology, Kanpur. He is also a MITACS Globalink Research Scholar with The University of British Columbia, Canada. He is also a Research Intern with the Computational Biology Department, School of Computer Science, Carnegie Mellon University, USA. His research interests include deep learning, computer vision, computational biology, and biomedical informatics.

graphic file with name upadh-3153059.gif

Rahul Upadhyay received the Ph.D. degree in biomedical signal processing and machine learning for brain computer interface design from the Indian Institute of Information Technology, Jabalpur. He is currently an Assistant Professor with the Electronics and Communication Engineering Department, Thapar Institute of Engineering and Technology, India. He completed his Postdoctoral Research with the Reilly Lab, Trinity College Institute of Neuroscience, Trinity College Dublin, Ireland. He has authored more than 25 research articles in peer-reviewed journals and reputed conference proceedings.

graphic file with name kumar-3153059.gif

Vinay Kumar received the Ph.D. degree from the Jaypee University of Information Technology, India. He is currently an Associate Professor with the Electronics and Communication Engineering Department, Thapar Institute of Engineering and Technology, India. He did his Postdoctoral Research with the Universidad Carlos III de Madrid, Spain, under the supervision of Prof. D. Fernando Díaz de María. He has published many articles in journals of international repute, and patents. His research interests include deep learning, image and video processing, and speech processing.

graphic file with name cheng-3153059.gif

Chao Cheng received the Ph.D. degree in computational biology and bioinformatics from the Computational and Molecular Biology Program, University of Southern California. He is currently an Associate Professor with the Department of Medicine, Baylor College of Medicine. He is also a member of the Dan L Duncan Comprehensive Cancer Center and the Institute (DLDCCC) for Clinical and Translation Research (ICTR), BCM. His research interests include cancer genomic, cancer systems biology, and cancer immunology with emphasis on translation studies.

graphic file with name zhang-3153059.gif

Jing Zhang received the Ph.D. degree from the University of Southern California, in 2012, under the supervision of Dr. Liang Chen. She completed her Postdoctoral Training with the Dr. Mark Gerstein’s Laboratory, Yale University, in 2020. During her Postdoctoral Research, she has developed several computational methods to integrate novel high-throughput sequencing assays to decipher the gene regulation “Grammar.” She is currently an Assistant Professor with the University of California at Irvine, Irvine. Her research interests include the areas of bioinformatics and computational biology, developing computational methods to predict the impact of genomic variations on genome function, and phenotype at a single-cell resolution.

graphic file with name wang-3153059.gif

Tianyang Wang received the Ph.D. degree in computer science from Southern Illinois University, in 2015. He is currently working with Austin Peay State University as an Assistant Professor. His research interests include machine learning, computer vision, and pattern recognition.

graphic file with name xu-3153059.gif

Min Xu (Member, IEEE) received the B.E. degree in computer science from Beihang University, the M.Sc. degree from the School of Computing, National University of Singapore, and the M.A. degree in applied mathematics and the Ph.D. degree in computational biology and bioinformatics from the University of Southern California (USC). He is currently an Assistant Professor with the Computational Biology Department, School of Computer Science, Carnegie Mellon University. He is also an affiliated Assistant Professor with the Computer Vision Department, MBZUAI. His career has centered on developing computational methods for the study of biological systems using imaging and omics data. He has published over 70 research papers in prestigious peer-reviewed conferences and journals, such as CVPR, ICCV, ISMB, MICCAI, PNAS, Bioinformatics, PLOS Computational Biology, Structure, and JSB.

Funding Statement

This work was supported in part by the U.S. National Institutes of Health (NIH) under Grant R01GM134020 and Grant P41GM103712, in part by the U.S. National Science Foundation (NSF) under Grant DBI-1949629 and Grant IIS-2007595, in part by the Mark Foundation for Cancer Research under Grant 19-044-ASP, and in part by AMD COVID-19 High-Performance Computing (HPC) Fund. The work of Jing Zhang was supported in part by U.S. NIH under Grant K01MH123896.

References

[1].Zhu N., Zhang D., Wang W., Li X., and Yang B., “A novel coronavirus from patients with pneumonia in China, 2019,” New England J. Med., vol. 382, pp. 727–733, Jan. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[2].Huang C., Wang Y., Li X., Ren L., Zhao J., Hu Y., and Zhang L., “Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China,” Lancet, vol. 395, no. 10223, pp. 497–506, May 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[3].Dong E., Du H., and Gardner L., “An interactive web-based dashboard to track COVID-19 in real time,” Lancet Infectious Diseases, vol. 20, no. 5, pp. 533–534, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[4].Fisher J. L. and Woolaston K., “UN report says up to 850,000 animal viruses could be caught by humans, unless we protect nature,” Conversation, Oct. 2020. Accessed: May 28, 2021. [Online]. Available: https://theconversation.com/un-report-says-up-to-850-000-animal-viruses-could-be-caught-by-humans-unless-we-protect-nature-148911
[5].Health Topics. Accessed: May 28, 2021. [Online]. Available: https://www.who.int/health-topics
[6].CSR. World Health Organization–Eastern Mediterranean Region. Accessed: May 28, 2021. [Online]. Available: https://www.emro.who.int/health-topics/mers-cov/mers-outbreaks.html
[7].WHO | Summary of Probable SARS Cases With Onset of Illness From 1 November 2002 to 31 July 2003, 2015. [Google Scholar]
[8].Corman V. M., Landt O., Kaiser M., and Molenkamp R., “Detection of 2019 novel coronavirus (2019-nCoV) by real-time RT-PCR,” Eurosurveillance, vol. 25, no. 3, Jan. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[9].Watson J., Whiting P. F., and Brush J. E., “Interpreting a covid-19 test result,” BMJ, vol. 369, p. m1808, May 2020. [DOI] [PubMed] [Google Scholar]
[10].Chharia A. and Upadhyay R., “Deep recurrent architecture based scene description generator for visually impaired,” in Proc. 12th Int. Congr. Ultra Modern Telecommun. Control Syst. Workshops (ICUMT), Oct. 2020, pp. 136–141. [Google Scholar]
[11].Senior A. W., Evans R., Jumper J., Kirkpatrick J., and Sifre L., “Improved protein structure prediction using potentials from deep learning,” Nature, vol. 577, no. 7792, pp. 706–710, 2020. [DOI] [PubMed] [Google Scholar]
[12].Gao S., Han R., Zeng X., Liu Z., Xu M., and Zhang F., “Macromolecules structural classification with a 3D dilated dense network in cryo-electron tomography,” IEEE/ACM Trans. Comput. Biol. Bioinf., vol. 19, no. 1, pp. 209–219, Jan. 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
[13].Du X., Wang H., Zhu Z., Zeng X., Chang Y.-W., Zhang J., and Xu M., “Active learning to classify macromolecular structures in situ for less supervision in cryo-electron tomography,” 2021, arXiv:2102.12040. [DOI] [PMC free article] [PubMed] [Google Scholar]
[14].Vinayakumar R., Alazab M., Soman K., Poornachandran P., Al-Nemrat A., and Venkatraman S., “Deep learning approach for intelligent intrusion detection system,” IEEE Access, vol. 7, pp. 41525–41550, 2019. [Google Scholar]
[15].Vinayakumar R., Soman K. P., and Poornachandran P., “Applying convolutional neural network for network intrusion detection,” in Proc. Int. Conf. Adv. Comput., Commun. Informat. (ICACCI), Sep. 2017, pp. 1222–1228. [Google Scholar]
[16].Vinayakumar R., Soman K. P., and Poornachandran P., “Evaluating effectiveness of shallow and deep networks to intrusion detection system,” in Proc. Int. Conf. Adv. Comput., Commun. Informat. (ICACCI), Sep. 2017, pp. 1282–1289. [Google Scholar]
[17].Vinayakumar R., Soman K. P., and Poornachandran P., “Evaluation of recurrent neural network and its variants for intrusion detection system (IDS),” Int. J. Inf. Syst. Model. Des., vol. 8, no. 3, pp. 43–63, 2017. [Google Scholar]
[18].Vinayakumar R., Soman K. P., and Poornachandran P., “A comparative analysis of deep learning approaches for network intrusion detection systems (N-IDSs): Deep learning for N-IDSs,” Int. J. Digit. Crime Forensics, vol. 11, no. 3, pp. 65–89, 2019. [Google Scholar]
[19].Sriram S., Vinayakumar R., Alazab M., and Kp S., “Network flow based IoT botnet attack detection using deep learning,” in Proc. IEEE Conf. Comput. Commun. Workshops (INFOCOM WKSHPS), Jul. 2020, pp. 189–194. [Google Scholar]
[20].Vinayakumar R., Alazab M., Srinivasan S., Pham Q.-V., Padannayil S. K., and Simran K., “A visualized botnet detection system based deep learning for the Internet of Things networks of smart cities,” IEEE Trans. Ind. Appl., vol. 56, no. 4, pp. 4436–4456, Jul. 2020. [Google Scholar]
[21].Le D.-N., Parvathy V. S., Gupta D., Khanna A., Rodrigues J. J. P. C., and Shankar K., “IoT enabled depthwise separable convolution neural network with deep support vector machine for COVID-19 diagnosis and classification,” Int. J. Mach. Learn. Cybern., vol. 12, pp. 3235–3248, Jan. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
[22].Dansana D., Kumar R., Bhattacharjee A., Hemanth D. J., Gupta D., Khanna A., and Castillo O., “Early diagnosis of COVID-19-affected patients based on X-ray and computed tomography images using deep learning algorithm,” Soft Comput., pp. 1–9, 2020, doi: 10.1007/s00500-020-05275-y. [DOI] [PMC free article] [PubMed] [Retracted]
[23].Arias-Londoño J. D., Gémez-García J. A., Moro-Velázquez L., and Godino-Llorente J. I., “Artificial intelligence applied to chest X-ray images for the automatic detection of COVID-19. A thoughtful evaluation approach,” IEEE Access, vol. 8, pp. 226811–226827, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[24].El-kenawy E.-S. M., Mirjalili S., Ibrahim A., Alrahmawy M., El-Said M., Zaki R. M., and Eid M. M., “Advanced meta-heuristics, convolutional neural networks, and feature selectors for efficient COVID-19 X-ray chest image classification,” IEEE Access, vol. 9, pp. 36019–36037, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
[25].Sakib S., Tazrin T., Fouda M. M., Fadlullah Z. M., and Guizani M., “DL-CRC: Deep learning-based chest radiograph classification for COVID-19 detection: A novel approach,” IEEE Access, vol. 8, pp. 171575–171589, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[26].Ahsan M. M., Ahad M. T., Soma F. A., Paul S., Chowdhury A., Luna S. A., Yazdan M. M. S., Rahman A., Siddique Z., and Huebner P., “Detecting SARS-CoV-2 from chest X-ray using artificial intelligence,” IEEE Access, vol. 9, pp. 35501–35513, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
[27].Chowdhury M. E., Rahman T., and Khandakar A., “Can AI help in screening viral and COVID-19 pneumonia?” IEEE Access, vol. 8, pp. 132665–132676, 2020. [Google Scholar]
[28].Kim Y. Y., Shin H. J., Kim M. J., and Lee M. J., “Comparison of effective radiation doses from X-ray, CT, and PET/CT in pediatric patients with neuroblastoma using a dose monitoring program,” Diagnostic Interventional Radiol., vol. 22, no. 4, pp. 390–394, Jul. 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
[29].Yanase J. and Triantaphyllou E., “The seven key challenges for the future of computer-aided diagnosis in medicine,” Int. J. Med. Informat., vol. 129, pp. 413–422, Sep. 2019. [DOI] [PubMed] [Google Scholar]
[30].Silva P., Luz E., Silva G., Moreira G., Silva R., Lucio D., and Menotti D., “COVID-19 detection in CT images with deep learning: A voting-based scheme and cross-datasets analysis,” Informat. Med. Unlocked, vol. 20, Mar. 2020, Art. no. 100427. [DOI] [PMC free article] [PubMed] [Google Scholar]
[31].Self W. H., Courtney D. M., McNaughton C. D., Wunderink R. G., and Kline J. A., “High discordance of chest X-ray and computed tomography for detection of pulmonary opacities in ED patients: Implications for diagnosing pneumonia,” Amer. J. Emergency Med., vol. 31, no. 2, pp. 401–405, Feb. 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
[32].Jacobi A., Chung M., Bernheim A., and Eber C., “Portable chest X-ray in coronavirus disease-19 (COVID-19): A pictorial review,” Clin. Imag., vol. 64, pp. 35–42, Aug. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[33].Joarder R. and Crundwell N., Chest X-Ray in Clinical Practice. London, U.K.: Springer, 2009. [Google Scholar]
[34].Lalmuanawma S., Hussain J., and Chhakchhuak L., “Applications of machine learning and artificial intelligence for covid-19 (SARS-CoV-2) pandemic: A review,” Chaos, Solitons Fractals, vol. 139, Oct. 2020, Art. no. 110059. [DOI] [PMC free article] [PubMed] [Google Scholar]
[35].Shortliffe E., Computer-Based Medical Consultations: MYCIN, 1st ed. Amsterdam, The Netherlands: Elsevier, 2012. [Google Scholar]
[36].Colubri A., Hartley M. A., Siakor M., Wolfman V., and Felix A., “Machine-learning prognostic models from the 2014–16 Ebola outbreak: Data-harmonization challenges, validation strategies, and mHealth applications,” EClinicalMedicine, vol. 11, pp. 54–64, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
[37].Chockanathan U., Souza A. M. D., Abidin A. Z., Schifitto G., and Wismüller A., “Automated diagnosis of HIV-associated neurocognitive disorders using large-scale Granger causality analysis of resting-state functional MRI,” Comput. Biol. Med., vol. 106, pp. 24–30, Mar. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
[38].Nápoles G., Grau I., Bello R., and Grau R., “Two-steps learning of fuzzy cognitive maps for prediction and knowledge discovery on the HIV-1 drug resistance,” Expert Syst. Appl., vol. 41, no. 3, pp. 821–830, Feb. 2014. [Google Scholar]
[39].Toğaçar M., Ergen B., and Cömert Z., “COVID-19 detection using deep learning models to exploit social mimic optimization and structured chest X-ray images using fuzzy color and stacking approaches,” Comput. Biol. Med., vol. 121, Jun. 2020, Art. no. 103805. [DOI] [PMC free article] [PubMed] [Google Scholar]
[40].Hripcsak G. and Albers D. J., “Next-generation phenotyping of electronic health records,” J. Amer. Med. Informat. Assoc. Jamia, vol. 20, no. 1, pp. 117–121, Sep. 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
[41].Hardon A., Hodgkin C., and Fresle D., How to Investigate the Use of Medicines by Consumers. Amsterdam, The Netherlands: Univ. of Amsterdam, 2004, p. 89. [Google Scholar]
[42].Oh Y., Park S., and Ye J. C., “Deep learning COVID-19 features on CXR using limited training data sets,” IEEE Trans. Med. Imag., vol. 39, no. 8, pp. 2688–2700, Aug. 2020. [DOI] [PubMed] [Google Scholar]
[43].Sankhwar S., Gupta D., Ramya K. C., Sheeba Rani S., Shankar K., and Lakshmanaprabu S. K., “Improved grey wolf optimization-based feature subset selection with fuzzy neural classifier for financial crisis prediction,” Soft Comput., vol. 24, no. 1, pp. 101–110, Jan. 2020. [Google Scholar]
[44].Gupta D. and Ahlawat A. K., “Usability determination using multistage fuzzy system,” Proc. Comput. Sci., vol. 78, pp. 263–270, Jan. 2016. [Google Scholar]
[45].Nayyar A., Garg S., Gupta D., and Khanna A., “Evolutionary computation,” in Advances in Swarm Intelligence for Optimizing Problems in Computer Science, 1st ed. Boca Raton, FL, USA: CRC Press, 2018, pp. 1–26. [Google Scholar]
[46].Simpson P. K., “Fuzzy min-max neural networks. I. Classification,” IEEE Trans. Neural Netw., vol. 3, no. 5, pp. 776–786, Sep. 1992. [DOI] [PubMed] [Google Scholar]
[47].Alhroob E., Mohammed M. F., Lim C. P., and Tao H., “A critical review on selected fuzzy min-max neural networks and their significance and challenges in pattern classification,” IEEE Access, vol. 7, pp. 56129–56146, 2019. [Google Scholar]
[48].Nandedkar A. V. and Biswas P. K., “A general reflex fuzzy min-max neural network,” Eng. Lett., vol. 14, no. 1, pp. 195–205, 2007. [Google Scholar]
[49].Mozannar H. and Sontag D., “Consistent estimators for learning to defer to an expert,” 2020, arXiv:2006.01862. [Google Scholar]
[50].Shi S., Malhi I., Tran K., Ng A. Y., and Rajpurkar P., “CheXseen: Unseen disease detection for deep learning interpretation of chest X-rays,” 2021, arXiv:2103.04590. [Google Scholar]
[51].Krizhevsky A., Sutskever I., and Hinton G. E., “ImageNet classification with deep convolutional neural networks,” Commun. ACM, vol. 60, no. 2, pp. 84–90, Jun. 2012. [Google Scholar]
[52].Simonyan K. and Zisserman A., “Very deep convolutional networks for large-scale image recognition,” 2014, arXiv:1409.1556. [Google Scholar]
[53].Russakovsky O., Deng J., Su H., Krause J., Satheesh S., Ma S., Huang Z., Karpathy A., Khosla A., Bernstein M., Berg A. C., and Fei-Fei L., “ImageNet large scale visual recognition challenge,” Int. J. Comput. Vis., vol. 115, no. 3, pp. 211–252, Dec. 2015. [Google Scholar]
[54].Raghu M., Zhang C., Kleinberg J., and Bengio S., “Transfusion: Understanding transfer learning for medical imaging,” 2019, arXiv:1902.07208. [Google Scholar]
[55].Alpern B. and Carter L., “The hyperbox,” in Proc. Vis., 2002, pp. 133–139. [Google Scholar]
[56].Gabrys B. and Bargiela A., “General fuzzy min-max neural network for clustering and classification,” IEEE Trans. Neural Netw., vol. 11, no. 3, pp. 769–783, May 2000. [DOI] [PubMed] [Google Scholar]
[57].Li K., Fang Y., Li W., Pan C., Qin P., Zhong Y., Liu X., Huang M., Liao Y., and Li S., “CT image visual quantitative evaluation and clinical classification of coronavirus disease (COVID-19),” Eur. Radiol., vol. 30, no. 8, pp. 4407–4416, Aug. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[58].Shi H., Han X., Jiang N., Cao Y., Alwalid O., Gu J., Fan Y., and Zheng C., “Radiological findings from 81 patients with COVID-19 pneumonia in Wuhan, China: A descriptive study,” Lancet Infectious Diseases, vol. 20, no. 4, pp. 425–434, Apr. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[59].Chan M. S. M., Chan I. Y. F., Fung K. H., Poon E., Yam L. Y. C., and Lau K. Y., “High-resolution CT findings in patients with severe acute respiratory syndrome: A pattern-based approach,” Amer. J. Roentgenol., vol. 182, no. 1, pp. 49–56, Jan. 2004. [DOI] [PubMed] [Google Scholar]
[60].Wong K. T., Antonio G. E., Hui D. S. C., Lee N., Yuen E. H. Y., Wu A., Leung C. B., Rainer T. H., Cameron P., Chung S. S. C., Sung J. J. Y., and Ahuja A. T., “Thin-section CT of severe acute respiratory syndrome: Evaluation of 73 patients exposed to or with the disease,” Radiology, vol. 228, no. 2, pp. 395–400, Aug. 2003. [DOI] [PubMed] [Google Scholar]
[61].Ajlan A. M., Ahyad R. A., Jamjoom L. G., Alharthy A., and Madani T. A., “Middle east respiratory syndrome coronavirus (MERS-CoV) infection: Chest CT findings,” Amer. J. Roentgenol., vol. 203, no. 4, pp. 782–787, Oct. 2014. [DOI] [PubMed] [Google Scholar]
[62].Das K. M., Lee E. Y., Enani M. A., AlJawder S. E., Singh R., Bashir S., Al-Nakshbandi N., AlDossari K., and Larsson S. G., “CT correlation with outcomes in 15 patients with acute middle east respiratory syndrome coronavirus,” Amer. J. Roentgenol., vol. 204, no. 4, pp. 736–742, Apr. 2015. [DOI] [PubMed] [Google Scholar]
[63].Yager R. R., “Pythagorean fuzzy subsets,” in Proc. Joint IFSA World Congr. NAFIPS Annu. Meeting (IFSA/NAFIPS), Jun. 2013, pp. 57–61. [Google Scholar]
[64].Paul Cohen J., Morrison P., Dao L., Roth K., Duong T. Q., and Ghassemi M., “COVID-19 image data collection: Prospective predictions are the future,” 2020, arXiv:2006.11988. [Google Scholar]
[65].Kermany D., “Labeled optical coherence tomography (OCT) and Chest X-ray images for classification,” Mendeley, Univ. California San Diego, La Jolla, CA, USA, Tech. Rep., 2018, doi: 10.17632/rscbjbr9sj.2. [DOI] [Google Scholar]
[66].Chollet F.. (2015). Keras. [Online]. Available: https://github.com/fchollet/keras [Google Scholar]
[67].Abadi M.et al. , “TensorFlow: Large-scale machine learning on heterogeneous distributed systems,” 2016, arXiv:1603.04467. [Google Scholar]
[68].Paul N. S., Roberts H., Butany J., Chung T., Gold W., Mehta S., Konen E., Rao A., Provost Y., Hong H. H., Zelovitsky L., and Weisbrod G. L., “Radiologic pattern of disease in patients with severe acute respiratory syndrome: The Toronto experience,” RadioGraphics, vol. 24, no. 2, pp. 553–563, Mar. 2004. [DOI] [PubMed] [Google Scholar]
[69].He K., Zhang X., Ren S., and Sun J., “Deep residual learning for image recognition,” 2015, arXiv:1512.03385. [Google Scholar]
[70].Howard A. G., Zhu M., Chen B., Kalenichenko D., Wang W., Weyand T., Andreetto M., and Adam H., “MobileNets: Efficient convolutional neural networks for mobile vision applications,” 2017, arXiv:1704.04861. [Google Scholar]
[71].Sahinbas K.. Transfer Learning Based Convolutional Neural Network for COVID-19 Detection With X-Ray Images. Accessed: May 30, 2021. [Online]. Available: https://www.ozgurcatak.org/files/papers/covid19-deep-learning.pdf [Google Scholar]
[72].Panwar H., Gupta P. K., Siddiqui M. K., Morales-Menendez R., Bhardwaj P., and Singh V., “A deep learning and grad-CAM based color visualization approach for fast detection of COVID-19 cases using chest X-ray and CT-scan images,” Chaos, Solitons Fractals, vol. 140, Nov. 2020, Art. no. 110190. [DOI] [PMC free article] [PubMed] [Google Scholar]
[73].El-Din Hemdan E., Shouman M. A., and Esmail Karar M., “COVIDX-net: A framework of deep learning classifiers to diagnose COVID-19 in X-ray images,” 2020, arXiv:2003.11055. [Google Scholar]
[74].Medhi K., Jamil M., and Hussain I., “Automatic detection of COVID-19 infection from chest X-ray using deep learning,” bioRxiv, May 2020, doi: 10.1101/2020.05.10.20097063. [DOI]
[75].Waheed A., Goyal M., Gupta D., Khanna A., Al-Turjman F., and Pinheiro P. R., “CovidGAN: Data augmentation using auxiliary classifier GAN for improved covid-19 detection,” IEEE Access, vol. 8, pp. 91916–91923, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[76].Ahishali M., Degerli A., Yamac M., Kiranyaz S., Chowdhury M. E. H., Hameed K., Hamid T., Mazhar R., and Gabbouj M., “Advance warning methodologies for COVID-19 using chest X-ray images,” IEEE Access, vol. 9, pp. 41052–41065, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
[77].Vaid S., Kalantar R., and Bhandari M., “Deep learning COVID-19 detection bias: Accuracy through artificial intelligence,” Int. Orthopaedics, vol. 44, no. 8, pp. 1539–1542, Aug. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[78].Lv D., Qi W., Li Y., Sun L., and Wang Y., “A cascade network for detecting COVID-19 using chest x-rays,” 2020, arXiv:2005.01468. Accessed: Aug. 27, 2021. [Online]. Available:. [Google Scholar]
[79].Civit-Masot J., Luna-Perejón F., Domínguez Morales M., and Civit A., “Deep learning system for COVID-19 diagnosis aid using X-ray pulmonary images,” Appl. Sci., vol. 10, no. 13, p. 4640, Jul. 2020. [Google Scholar]
[80].Ozturk T., Talo M., Yildirim E. A., Baloglu U. B., Yildirim O., and Rajendra Acharya U., “Automated detection of COVID-19 cases using deep neural networks with X-ray images,” Comput. Biol. Med., vol. 121, Jun. 2020, Art. no. 103792. [DOI] [PMC free article] [PubMed] [Google Scholar]
[81].Qjidaa M., Ben-fares A., Mechbal Y., Amakdouf H., Maaroufi M., Alami B., and Qjidaa H., “Development of a clinical decision support system for the early detection of COVID-19 using deep learning based on chest radiographic images,” in Proc. Int. Conf. Intell. Syst. Comput. Vis. (ISCV), Jun. 2020, pp. 1–6. [Google Scholar]
[82].Ravi V., Alazab M., Srinivasan S., Arunachalam A., and Soman K. P., “Adversarial defense: DGA-based botnets and DNS homographs detection through integrated deep learning,” IEEE Trans. Eng. Manag., early access, Mar. 12, 2021, doi: 10.1109/TEM.2021.3059664. [DOI]
[83].Papernot N., McDaniel P., Wu X., Jha S., and Swami A., “Distillation as a defense to adversarial perturbations against deep neural networks,” 2015, arXiv:1511.04508. [Google Scholar]
[84].Ma L. and Liang L., “Increasing-margin adversarial (IMA) training to improve adversarial robustness of neural networks,” 2020, arXiv:2005.09147. Accessed: May 30, 2021. [Online]. Available:. [DOI] [PMC free article] [PubMed] [Google Scholar]
[85].Goodfellow I. J., Shlens J., and Szegedy C., “Explaining and harnessing adversarial examples,” 2014, arXiv:1412.6572. [Google Scholar]
[86].van der Maaten L. and Hinton G., “Visualizing data using t-SNE,” J. Mach. Learn. Res., vol. 9, pp. 2579–2605, Nov. 2008. [Google Scholar]

[ref1] [1].Zhu N., Zhang D., Wang W., Li X., and Yang B., “A novel coronavirus from patients with pneumonia in China, 2019,” New England J. Med., vol. 382, pp. 727–733, Jan. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref2] [2].Huang C., Wang Y., Li X., Ren L., Zhao J., Hu Y., and Zhang L., “Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China,” Lancet, vol. 395, no. 10223, pp. 497–506, May 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref3] [3].Dong E., Du H., and Gardner L., “An interactive web-based dashboard to track COVID-19 in real time,” Lancet Infectious Diseases, vol. 20, no. 5, pp. 533–534, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref4] [4].Fisher J. L. and Woolaston K., “UN report says up to 850,000 animal viruses could be caught by humans, unless we protect nature,” Conversation, Oct. 2020. Accessed: May 28, 2021. [Online]. Available: https://theconversation.com/un-report-says-up-to-850-000-animal-viruses-could-be-caught-by-humans-unless-we-protect-nature-148911

[ref5] [5].Health Topics. Accessed: May 28, 2021. [Online]. Available: https://www.who.int/health-topics

[ref6] [6].CSR. World Health Organization–Eastern Mediterranean Region. Accessed: May 28, 2021. [Online]. Available: https://www.emro.who.int/health-topics/mers-cov/mers-outbreaks.html

[ref7] [7].WHO | Summary of Probable SARS Cases With Onset of Illness From 1 November 2002 to 31 July 2003, 2015. [Google Scholar]

[ref8] [8].Corman V. M., Landt O., Kaiser M., and Molenkamp R., “Detection of 2019 novel coronavirus (2019-nCoV) by real-time RT-PCR,” Eurosurveillance, vol. 25, no. 3, Jan. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref9] [9].Watson J., Whiting P. F., and Brush J. E., “Interpreting a covid-19 test result,” BMJ, vol. 369, p. m1808, May 2020. [DOI] [PubMed] [Google Scholar]

[ref10] [10].Chharia A. and Upadhyay R., “Deep recurrent architecture based scene description generator for visually impaired,” in Proc. 12th Int. Congr. Ultra Modern Telecommun. Control Syst. Workshops (ICUMT), Oct. 2020, pp. 136–141. [Google Scholar]

[ref11] [11].Senior A. W., Evans R., Jumper J., Kirkpatrick J., and Sifre L., “Improved protein structure prediction using potentials from deep learning,” Nature, vol. 577, no. 7792, pp. 706–710, 2020. [DOI] [PubMed] [Google Scholar]

[ref12] [12].Gao S., Han R., Zeng X., Liu Z., Xu M., and Zhang F., “Macromolecules structural classification with a 3D dilated dense network in cryo-electron tomography,” IEEE/ACM Trans. Comput. Biol. Bioinf., vol. 19, no. 1, pp. 209–219, Jan. 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref13] [13].Du X., Wang H., Zhu Z., Zeng X., Chang Y.-W., Zhang J., and Xu M., “Active learning to classify macromolecular structures in situ for less supervision in cryo-electron tomography,” 2021, arXiv:2102.12040. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref14] [14].Vinayakumar R., Alazab M., Soman K., Poornachandran P., Al-Nemrat A., and Venkatraman S., “Deep learning approach for intelligent intrusion detection system,” IEEE Access, vol. 7, pp. 41525–41550, 2019. [Google Scholar]

[ref15] [15].Vinayakumar R., Soman K. P., and Poornachandran P., “Applying convolutional neural network for network intrusion detection,” in Proc. Int. Conf. Adv. Comput., Commun. Informat. (ICACCI), Sep. 2017, pp. 1222–1228. [Google Scholar]

[ref16] [16].Vinayakumar R., Soman K. P., and Poornachandran P., “Evaluating effectiveness of shallow and deep networks to intrusion detection system,” in Proc. Int. Conf. Adv. Comput., Commun. Informat. (ICACCI), Sep. 2017, pp. 1282–1289. [Google Scholar]

[ref17] [17].Vinayakumar R., Soman K. P., and Poornachandran P., “Evaluation of recurrent neural network and its variants for intrusion detection system (IDS),” Int. J. Inf. Syst. Model. Des., vol. 8, no. 3, pp. 43–63, 2017. [Google Scholar]

[ref18] [18].Vinayakumar R., Soman K. P., and Poornachandran P., “A comparative analysis of deep learning approaches for network intrusion detection systems (N-IDSs): Deep learning for N-IDSs,” Int. J. Digit. Crime Forensics, vol. 11, no. 3, pp. 65–89, 2019. [Google Scholar]

[ref19] [19].Sriram S., Vinayakumar R., Alazab M., and Kp S., “Network flow based IoT botnet attack detection using deep learning,” in Proc. IEEE Conf. Comput. Commun. Workshops (INFOCOM WKSHPS), Jul. 2020, pp. 189–194. [Google Scholar]

[ref20] [20].Vinayakumar R., Alazab M., Srinivasan S., Pham Q.-V., Padannayil S. K., and Simran K., “A visualized botnet detection system based deep learning for the Internet of Things networks of smart cities,” IEEE Trans. Ind. Appl., vol. 56, no. 4, pp. 4436–4456, Jul. 2020. [Google Scholar]

[ref21] [21].Le D.-N., Parvathy V. S., Gupta D., Khanna A., Rodrigues J. J. P. C., and Shankar K., “IoT enabled depthwise separable convolution neural network with deep support vector machine for COVID-19 diagnosis and classification,” Int. J. Mach. Learn. Cybern., vol. 12, pp. 3235–3248, Jan. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref22] [22].Dansana D., Kumar R., Bhattacharjee A., Hemanth D. J., Gupta D., Khanna A., and Castillo O., “Early diagnosis of COVID-19-affected patients based on X-ray and computed tomography images using deep learning algorithm,” Soft Comput., pp. 1–9, 2020, doi: 10.1007/s00500-020-05275-y. [DOI] [PMC free article] [PubMed] [Retracted]

[ref23] [23].Arias-Londoño J. D., Gémez-García J. A., Moro-Velázquez L., and Godino-Llorente J. I., “Artificial intelligence applied to chest X-ray images for the automatic detection of COVID-19. A thoughtful evaluation approach,” IEEE Access, vol. 8, pp. 226811–226827, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref24] [24].El-kenawy E.-S. M., Mirjalili S., Ibrahim A., Alrahmawy M., El-Said M., Zaki R. M., and Eid M. M., “Advanced meta-heuristics, convolutional neural networks, and feature selectors for efficient COVID-19 X-ray chest image classification,” IEEE Access, vol. 9, pp. 36019–36037, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref25] [25].Sakib S., Tazrin T., Fouda M. M., Fadlullah Z. M., and Guizani M., “DL-CRC: Deep learning-based chest radiograph classification for COVID-19 detection: A novel approach,” IEEE Access, vol. 8, pp. 171575–171589, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref26] [26].Ahsan M. M., Ahad M. T., Soma F. A., Paul S., Chowdhury A., Luna S. A., Yazdan M. M. S., Rahman A., Siddique Z., and Huebner P., “Detecting SARS-CoV-2 from chest X-ray using artificial intelligence,” IEEE Access, vol. 9, pp. 35501–35513, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref27] [27].Chowdhury M. E., Rahman T., and Khandakar A., “Can AI help in screening viral and COVID-19 pneumonia?” IEEE Access, vol. 8, pp. 132665–132676, 2020. [Google Scholar]

[ref28] [28].Kim Y. Y., Shin H. J., Kim M. J., and Lee M. J., “Comparison of effective radiation doses from X-ray, CT, and PET/CT in pediatric patients with neuroblastoma using a dose monitoring program,” Diagnostic Interventional Radiol., vol. 22, no. 4, pp. 390–394, Jul. 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref29] [29].Yanase J. and Triantaphyllou E., “The seven key challenges for the future of computer-aided diagnosis in medicine,” Int. J. Med. Informat., vol. 129, pp. 413–422, Sep. 2019. [DOI] [PubMed] [Google Scholar]

[ref30] [30].Silva P., Luz E., Silva G., Moreira G., Silva R., Lucio D., and Menotti D., “COVID-19 detection in CT images with deep learning: A voting-based scheme and cross-datasets analysis,” Informat. Med. Unlocked, vol. 20, Mar. 2020, Art. no. 100427. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref31] [31].Self W. H., Courtney D. M., McNaughton C. D., Wunderink R. G., and Kline J. A., “High discordance of chest X-ray and computed tomography for detection of pulmonary opacities in ED patients: Implications for diagnosing pneumonia,” Amer. J. Emergency Med., vol. 31, no. 2, pp. 401–405, Feb. 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref32] [32].Jacobi A., Chung M., Bernheim A., and Eber C., “Portable chest X-ray in coronavirus disease-19 (COVID-19): A pictorial review,” Clin. Imag., vol. 64, pp. 35–42, Aug. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref33] [33].Joarder R. and Crundwell N., Chest X-Ray in Clinical Practice. London, U.K.: Springer, 2009. [Google Scholar]

[ref34] [34].Lalmuanawma S., Hussain J., and Chhakchhuak L., “Applications of machine learning and artificial intelligence for covid-19 (SARS-CoV-2) pandemic: A review,” Chaos, Solitons Fractals, vol. 139, Oct. 2020, Art. no. 110059. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref35] [35].Shortliffe E., Computer-Based Medical Consultations: MYCIN, 1st ed. Amsterdam, The Netherlands: Elsevier, 2012. [Google Scholar]

[ref36] [36].Colubri A., Hartley M. A., Siakor M., Wolfman V., and Felix A., “Machine-learning prognostic models from the 2014–16 Ebola outbreak: Data-harmonization challenges, validation strategies, and mHealth applications,” EClinicalMedicine, vol. 11, pp. 54–64, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref37] [37].Chockanathan U., Souza A. M. D., Abidin A. Z., Schifitto G., and Wismüller A., “Automated diagnosis of HIV-associated neurocognitive disorders using large-scale Granger causality analysis of resting-state functional MRI,” Comput. Biol. Med., vol. 106, pp. 24–30, Mar. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref38] [38].Nápoles G., Grau I., Bello R., and Grau R., “Two-steps learning of fuzzy cognitive maps for prediction and knowledge discovery on the HIV-1 drug resistance,” Expert Syst. Appl., vol. 41, no. 3, pp. 821–830, Feb. 2014. [Google Scholar]

[ref39] [39].Toğaçar M., Ergen B., and Cömert Z., “COVID-19 detection using deep learning models to exploit social mimic optimization and structured chest X-ray images using fuzzy color and stacking approaches,” Comput. Biol. Med., vol. 121, Jun. 2020, Art. no. 103805. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref40] [40].Hripcsak G. and Albers D. J., “Next-generation phenotyping of electronic health records,” J. Amer. Med. Informat. Assoc. Jamia, vol. 20, no. 1, pp. 117–121, Sep. 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref41] [41].Hardon A., Hodgkin C., and Fresle D., How to Investigate the Use of Medicines by Consumers. Amsterdam, The Netherlands: Univ. of Amsterdam, 2004, p. 89. [Google Scholar]

[ref42] [42].Oh Y., Park S., and Ye J. C., “Deep learning COVID-19 features on CXR using limited training data sets,” IEEE Trans. Med. Imag., vol. 39, no. 8, pp. 2688–2700, Aug. 2020. [DOI] [PubMed] [Google Scholar]

[ref43] [43].Sankhwar S., Gupta D., Ramya K. C., Sheeba Rani S., Shankar K., and Lakshmanaprabu S. K., “Improved grey wolf optimization-based feature subset selection with fuzzy neural classifier for financial crisis prediction,” Soft Comput., vol. 24, no. 1, pp. 101–110, Jan. 2020. [Google Scholar]

[ref44] [44].Gupta D. and Ahlawat A. K., “Usability determination using multistage fuzzy system,” Proc. Comput. Sci., vol. 78, pp. 263–270, Jan. 2016. [Google Scholar]

[ref45] [45].Nayyar A., Garg S., Gupta D., and Khanna A., “Evolutionary computation,” in Advances in Swarm Intelligence for Optimizing Problems in Computer Science, 1st ed. Boca Raton, FL, USA: CRC Press, 2018, pp. 1–26. [Google Scholar]

[ref46] [46].Simpson P. K., “Fuzzy min-max neural networks. I. Classification,” IEEE Trans. Neural Netw., vol. 3, no. 5, pp. 776–786, Sep. 1992. [DOI] [PubMed] [Google Scholar]

[ref47] [47].Alhroob E., Mohammed M. F., Lim C. P., and Tao H., “A critical review on selected fuzzy min-max neural networks and their significance and challenges in pattern classification,” IEEE Access, vol. 7, pp. 56129–56146, 2019. [Google Scholar]

[ref48] [48].Nandedkar A. V. and Biswas P. K., “A general reflex fuzzy min-max neural network,” Eng. Lett., vol. 14, no. 1, pp. 195–205, 2007. [Google Scholar]

[ref49] [49].Mozannar H. and Sontag D., “Consistent estimators for learning to defer to an expert,” 2020, arXiv:2006.01862. [Google Scholar]

[ref50] [50].Shi S., Malhi I., Tran K., Ng A. Y., and Rajpurkar P., “CheXseen: Unseen disease detection for deep learning interpretation of chest X-rays,” 2021, arXiv:2103.04590. [Google Scholar]

[ref51] [51].Krizhevsky A., Sutskever I., and Hinton G. E., “ImageNet classification with deep convolutional neural networks,” Commun. ACM, vol. 60, no. 2, pp. 84–90, Jun. 2012. [Google Scholar]

[ref52] [52].Simonyan K. and Zisserman A., “Very deep convolutional networks for large-scale image recognition,” 2014, arXiv:1409.1556. [Google Scholar]

[ref53] [53].Russakovsky O., Deng J., Su H., Krause J., Satheesh S., Ma S., Huang Z., Karpathy A., Khosla A., Bernstein M., Berg A. C., and Fei-Fei L., “ImageNet large scale visual recognition challenge,” Int. J. Comput. Vis., vol. 115, no. 3, pp. 211–252, Dec. 2015. [Google Scholar]

[ref54] [54].Raghu M., Zhang C., Kleinberg J., and Bengio S., “Transfusion: Understanding transfer learning for medical imaging,” 2019, arXiv:1902.07208. [Google Scholar]

[ref55] [55].Alpern B. and Carter L., “The hyperbox,” in Proc. Vis., 2002, pp. 133–139. [Google Scholar]

[ref56] [56].Gabrys B. and Bargiela A., “General fuzzy min-max neural network for clustering and classification,” IEEE Trans. Neural Netw., vol. 11, no. 3, pp. 769–783, May 2000. [DOI] [PubMed] [Google Scholar]

[ref57] [57].Li K., Fang Y., Li W., Pan C., Qin P., Zhong Y., Liu X., Huang M., Liao Y., and Li S., “CT image visual quantitative evaluation and clinical classification of coronavirus disease (COVID-19),” Eur. Radiol., vol. 30, no. 8, pp. 4407–4416, Aug. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref58] [58].Shi H., Han X., Jiang N., Cao Y., Alwalid O., Gu J., Fan Y., and Zheng C., “Radiological findings from 81 patients with COVID-19 pneumonia in Wuhan, China: A descriptive study,” Lancet Infectious Diseases, vol. 20, no. 4, pp. 425–434, Apr. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref59] [59].Chan M. S. M., Chan I. Y. F., Fung K. H., Poon E., Yam L. Y. C., and Lau K. Y., “High-resolution CT findings in patients with severe acute respiratory syndrome: A pattern-based approach,” Amer. J. Roentgenol., vol. 182, no. 1, pp. 49–56, Jan. 2004. [DOI] [PubMed] [Google Scholar]

[ref60] [60].Wong K. T., Antonio G. E., Hui D. S. C., Lee N., Yuen E. H. Y., Wu A., Leung C. B., Rainer T. H., Cameron P., Chung S. S. C., Sung J. J. Y., and Ahuja A. T., “Thin-section CT of severe acute respiratory syndrome: Evaluation of 73 patients exposed to or with the disease,” Radiology, vol. 228, no. 2, pp. 395–400, Aug. 2003. [DOI] [PubMed] [Google Scholar]

[ref61] [61].Ajlan A. M., Ahyad R. A., Jamjoom L. G., Alharthy A., and Madani T. A., “Middle east respiratory syndrome coronavirus (MERS-CoV) infection: Chest CT findings,” Amer. J. Roentgenol., vol. 203, no. 4, pp. 782–787, Oct. 2014. [DOI] [PubMed] [Google Scholar]

[ref62] [62].Das K. M., Lee E. Y., Enani M. A., AlJawder S. E., Singh R., Bashir S., Al-Nakshbandi N., AlDossari K., and Larsson S. G., “CT correlation with outcomes in 15 patients with acute middle east respiratory syndrome coronavirus,” Amer. J. Roentgenol., vol. 204, no. 4, pp. 736–742, Apr. 2015. [DOI] [PubMed] [Google Scholar]

[ref63] [63].Yager R. R., “Pythagorean fuzzy subsets,” in Proc. Joint IFSA World Congr. NAFIPS Annu. Meeting (IFSA/NAFIPS), Jun. 2013, pp. 57–61. [Google Scholar]

[ref64] [64].Paul Cohen J., Morrison P., Dao L., Roth K., Duong T. Q., and Ghassemi M., “COVID-19 image data collection: Prospective predictions are the future,” 2020, arXiv:2006.11988. [Google Scholar]

[ref65] [65].Kermany D., “Labeled optical coherence tomography (OCT) and Chest X-ray images for classification,” Mendeley, Univ. California San Diego, La Jolla, CA, USA, Tech. Rep., 2018, doi: 10.17632/rscbjbr9sj.2. [DOI] [Google Scholar]

[ref66] [66].Chollet F.. (2015). Keras. [Online]. Available: https://github.com/fchollet/keras [Google Scholar]

[ref67] [67].Abadi M.et al. , “TensorFlow: Large-scale machine learning on heterogeneous distributed systems,” 2016, arXiv:1603.04467. [Google Scholar]

[ref68] [68].Paul N. S., Roberts H., Butany J., Chung T., Gold W., Mehta S., Konen E., Rao A., Provost Y., Hong H. H., Zelovitsky L., and Weisbrod G. L., “Radiologic pattern of disease in patients with severe acute respiratory syndrome: The Toronto experience,” RadioGraphics, vol. 24, no. 2, pp. 553–563, Mar. 2004. [DOI] [PubMed] [Google Scholar]

[ref69] [69].He K., Zhang X., Ren S., and Sun J., “Deep residual learning for image recognition,” 2015, arXiv:1512.03385. [Google Scholar]

[ref70] [70].Howard A. G., Zhu M., Chen B., Kalenichenko D., Wang W., Weyand T., Andreetto M., and Adam H., “MobileNets: Efficient convolutional neural networks for mobile vision applications,” 2017, arXiv:1704.04861. [Google Scholar]

[ref71] [71].Sahinbas K.. Transfer Learning Based Convolutional Neural Network for COVID-19 Detection With X-Ray Images. Accessed: May 30, 2021. [Online]. Available: https://www.ozgurcatak.org/files/papers/covid19-deep-learning.pdf [Google Scholar]

[ref72] [72].Panwar H., Gupta P. K., Siddiqui M. K., Morales-Menendez R., Bhardwaj P., and Singh V., “A deep learning and grad-CAM based color visualization approach for fast detection of COVID-19 cases using chest X-ray and CT-scan images,” Chaos, Solitons Fractals, vol. 140, Nov. 2020, Art. no. 110190. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref73] [73].El-Din Hemdan E., Shouman M. A., and Esmail Karar M., “COVIDX-net: A framework of deep learning classifiers to diagnose COVID-19 in X-ray images,” 2020, arXiv:2003.11055. [Google Scholar]

[ref74] [74].Medhi K., Jamil M., and Hussain I., “Automatic detection of COVID-19 infection from chest X-ray using deep learning,” bioRxiv, May 2020, doi: 10.1101/2020.05.10.20097063. [DOI]

[ref75] [75].Waheed A., Goyal M., Gupta D., Khanna A., Al-Turjman F., and Pinheiro P. R., “CovidGAN: Data augmentation using auxiliary classifier GAN for improved covid-19 detection,” IEEE Access, vol. 8, pp. 91916–91923, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref76] [76].Ahishali M., Degerli A., Yamac M., Kiranyaz S., Chowdhury M. E. H., Hameed K., Hamid T., Mazhar R., and Gabbouj M., “Advance warning methodologies for COVID-19 using chest X-ray images,” IEEE Access, vol. 9, pp. 41052–41065, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref77] [77].Vaid S., Kalantar R., and Bhandari M., “Deep learning COVID-19 detection bias: Accuracy through artificial intelligence,” Int. Orthopaedics, vol. 44, no. 8, pp. 1539–1542, Aug. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref78] [78].Lv D., Qi W., Li Y., Sun L., and Wang Y., “A cascade network for detecting COVID-19 using chest x-rays,” 2020, arXiv:2005.01468. Accessed: Aug. 27, 2021. [Online]. Available:. [Google Scholar]

[ref79] [79].Civit-Masot J., Luna-Perejón F., Domínguez Morales M., and Civit A., “Deep learning system for COVID-19 diagnosis aid using X-ray pulmonary images,” Appl. Sci., vol. 10, no. 13, p. 4640, Jul. 2020. [Google Scholar]

[ref80] [80].Ozturk T., Talo M., Yildirim E. A., Baloglu U. B., Yildirim O., and Rajendra Acharya U., “Automated detection of COVID-19 cases using deep neural networks with X-ray images,” Comput. Biol. Med., vol. 121, Jun. 2020, Art. no. 103792. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref81] [81].Qjidaa M., Ben-fares A., Mechbal Y., Amakdouf H., Maaroufi M., Alami B., and Qjidaa H., “Development of a clinical decision support system for the early detection of COVID-19 using deep learning based on chest radiographic images,” in Proc. Int. Conf. Intell. Syst. Comput. Vis. (ISCV), Jun. 2020, pp. 1–6. [Google Scholar]

[ref82] [82].Ravi V., Alazab M., Srinivasan S., Arunachalam A., and Soman K. P., “Adversarial defense: DGA-based botnets and DNS homographs detection through integrated deep learning,” IEEE Trans. Eng. Manag., early access, Mar. 12, 2021, doi: 10.1109/TEM.2021.3059664. [DOI]

[ref83] [83].Papernot N., McDaniel P., Wu X., Jha S., and Swami A., “Distillation as a defense to adversarial perturbations against deep neural networks,” 2015, arXiv:1511.04508. [Google Scholar]

[ref84] [84].Ma L. and Liang L., “Increasing-margin adversarial (IMA) training to improve adversarial robustness of neural networks,” 2020, arXiv:2005.09147. Accessed: May 30, 2021. [Online]. Available:. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref85] [85].Goodfellow I. J., Shlens J., and Szegedy C., “Explaining and harnessing adversarial examples,” 2014, arXiv:1412.6572. [Google Scholar]

[ref86] [86].van der Maaten L. and Hinton G., “Visualizing data using t-SNE,” J. Mach. Learn. Res., vol. 9, pp. 2579–2605, Nov. 2008. [Google Scholar]

PERMALINK

Deep-Precognitive Diagnosis: Preventing Future Pandemics by Novel Disease Detection With Biologically-Inspired Conv-Fuzzy Network

Aviral Chharia

Rahul Upadhyay

Vinay Kumar

Chao Cheng

Jing Zhang

Tianyang Wang

Min Xu

Abstract

I. Introduction

TABLE 1. Increased Rate of Spread of Numerous Novel Zoonotic Viruses in Recent Disease Outbreaks, Suggestive of the Approaching . Data Taken From [5]–[7].

A. Background

FIGURE 1.

B. Motivation

C. Contributions

II. Related Work

A. Manual Detection of Novel Diseases

B. Deep Learning Breakthrough in CAD

C. Challenges in Next-Generation Diagnosis

D. Fuzzy Logic in Computer-Aided Diagnosis

E. Deep Learning for Novel Disease Detection

III. Proposed Methodology

A. Problem Formulation

FIGURE 2.

B. Disease Feature Extraction and Mapping

FIGURE 3.

TABLE 2. Architecture of the Modified VGG-16 CNN for Extracting 512-Dimensional CXR Feature Vector. Here, Represents Stage, is the Number of Stacked Layers in the Operator, Represents the Weight/ Height/ Channel and GMP is the Global Max Pooling Layer.

FIGURE 5.

C. Fuzzy Classification

FIGURE 6.

1). Point Hyperbox ( ) Creation

2). Class Membership Computation

FIGURE 4.

3). Significance of Inter-Node Connections

4). Training Learning Classifier

TABLE 3. Comparative Analysis of Visually Similar and Distinguishing Features of COVID-19, SARS-CoV-1 and MERS Respiratory Diseases on CXR Scans. Data Taken From [57]–[62].

D. Biologically-Inspired Reflex Section

1). Reflex Section Architecture

2). Inter-Node Connections and Training

3). Final Membership Computation

4). Disease Visual Feature Overlap and Containment

FIGURE 7.

5). Hyperbox Isolation Condition

6). Hyperbox Containment Condition

7). Hyperbox Overlap Test

Case 1:

Case 2:

Case 3:

Case 4:

8). Hyperbox Contraction Test

Case 1:

Case 2:

Case 3:

Case 4:

E. Novel Disease Detection System

1). De Novo Disease Detection

2). Negation Operation on Intuitionistic Membership Grades

3). Online Architecture Update for Class

4). Midway Labeled Data Introduction

Algorithm 1 Detecting and Classifying Novel Disease Occurrences

IV. Experiments and Results

A. Experimental Settings

1). Dataset

TABLE 4. Details of the CXR Radiograph Dataset Used to Assess the Proposed Method. The Proposed Model is Trained on Limited Dataset.

2). Implementation

3). Compared Methods

4). Evaluation Metrics

B. Exp 1: Evaluation of Novel Disease Detection Ability

FIGURE 8.

FIGURE 9.

TABLE 5. Performance of the Proposed Model on COVID-19, SARS-CoV-1 and MERS-CoV Datasets for Novel Disease Detection Experiment Along With Hyperparameters Used. The Model was Pre-Trained to Classify Normal and Bacterial Pneumonia Classes.

C. Exp 2: Evaluation of Classification Ability

TABLE 6. Comparison of Binary Classification Results on CXR Scans With Various ML Classifiers. Here, , for Proposed Method. Feature Vectors are Obtained From CXR Feature Extraction Network.

TABLE 7. Comparison of Multi-Class Classification Results on CXR Scans With Various ML Classifiers. Here, , for Proposed Method. Feature Vectors are Obtained From CXR Feature Extraction Network.

FIGURE 11.

FIGURE 10.

D. Ablation Studies

1). Global Max-Pooling vs. Global Average Pooling

2). Varying CNN Models and Curse of Dimensionality