Highlights

- A high-performance pneumonia detection system is proposed.
- Transfer learning is used to obtain the feature extractor.
- A novel graph-based feature reconstruction method is proposed.
- The proposed feature reconstruction is efficient yet transplantable to other scenarios.
Keywords: COVID-19, Chest X-ray images, Transfer learning, Graph, Feature reconstruction
Abstract
Pneumonia is a global disease that causes high mortality in children. The situation has been worsened by the outbreak of the novel coronavirus disease COVID-19, which has killed more than 983,907 people so far. People infected by the virus show symptoms such as fever and coughing, as well as pneumonia as the infection progresses. There is a public consensus that timely detection benefits treatment and therefore helps contain the spread of COVID-19. X-ray, an expedient imaging technique, has been widely used for the detection of pneumonia caused by COVID-19 and other viruses. To facilitate the diagnosis of pneumonia, we developed a deep learning framework, CGNet, for a binary classification task that classifies chest X-ray images into normal and pneumonia. CGNet has three components: feature extraction, graph-based feature reconstruction and classification. We first use transfer learning to train state-of-the-art convolutional neural networks (CNNs) for binary classification; the trained CNNs then produce features for the following two components. Next, we combine features through a graph to reconstruct them. Finally, a shallow neural network named GNet, a one-layer graph neural network that takes the combined features as input, classifies chest X-ray images into normal and pneumonia. Our model achieved the best accuracy of 0.9872, sensitivity of 1 and specificity of 0.9795 on a public pneumonia dataset of 5,856 chest X-ray images. To evaluate the performance of the proposed method on detecting pneumonia caused by COVID-19, we also tested it on a public COVID-19 CT dataset, where it achieved the highest performance, with an accuracy of 0.99, specificity of 1 and sensitivity of 0.98.
1. Introduction
Pneumonia is a common yet serious lung disease caused by viruses and bacteria. Though it is treatable in most situations, timely detection still plays a key role in diagnosis and treatment. Since the outbreak of COVID-19, viral nucleic acid techniques (VNATs) and imaging techniques have been widely used to quickly diagnose pneumonia caused by COVID-19. It was reported that VNATs can be highly sensitive (Behzadi, Ranjbar & Alavian, 2014); their false negative rate in practice, however, is relatively high when compared to imaging-based detection methods (Fang et al., 2020). As pointed out by Fang et al. (2020), a VNAT using real-time polymerase chain reaction (RT-PCR) achieved a sensitivity of 71% on a 51-case dataset, whereas chest computed tomography (CT) showed a sensitivity of 98%. Chest X-ray imaging has also been extensively deployed for the detection of pneumonia caused by COVID-19 (Adhikari, 2020; Islam, Wijewickrema, Collins & O'Leary, 2020; Rajpurkar et al., 2017). Compared to VNATs, chest X-ray imaging is more accurate and convenient, which makes it the most popular technique for diagnosing pneumonia (Chen et al., 2019). Compared to CT, X-ray is also more popular because of its low cost and convenience. However, manual interpretation of the digitalized images is time-consuming and remains a challenging job for radiologists due to image complexity and human factors. Therefore, automatic systems that help radiologists interpret images are in demand and would greatly help slow the spread of COVID-19.
Deep learning (DL), a fast-developing branch of machine learning (ML), has shown great power in image classification and detection. Compared to classical ML methods, which rely heavily on manually crafted features, deep convolutional neural networks (CNNs), the most typical DL models, have achieved remarkable success on numerous image-based tasks such as detection, classification and segmentation (He, Zhang, Ren & Sun, 2016; Ronneberger, Fischer & Brox, 2015; Zhao, Zheng, Xu & Wu, 2019). DL models, which are usually trained on millions of images, learn more abstract features than traditional ML models, which focus on more straightforward but less representative features. DL models have therefore shown substantial advantages over traditional ML models in accuracy, flexibility and robustness; in some areas, they have even outperformed human experts by a large margin (He, Zhang, Ren & Sun, 2015, 2016). Given this superiority, DL techniques have been embedded in many computer-aided diagnosis (CAD) systems. For DL systems, how effectively useful information is embedded into the system determines its final performance, and there is considerable research on information embedding (Li, Zhang, Qin, Zhang & Shao, 2019; Shen et al., 2021; Wen, Zhang, Zhang, Fei & Wang, 2020; Zhang et al., 2019, 2020; Zhang, Liu, Shen, Shen & Shao, 2018).
In this research, we propose a new DL model named CGNet, featuring a novel feature reconstruction method, on which a new high-performance pneumonia detection system is built. The CGNet framework has three modules: feature extraction, graph-based feature reconstruction and classification. For feature extraction, we transferred state-of-the-art networks and trained them on pneumonia datasets; high-level features, which serve as coarse features for further classification, can then be acquired through the trained networks. Based on the extracted features, we propose to integrate graph representations between individual images to improve the accuracy of the subsequent classifier, GNet, which has the same architecture as an artificial neural network (ANN) but outperforms it by a large margin. Contrary to a traditional ANN, which analyses images individually, the proposed GNet exploits the underlying relationships between images, analysing multiple images simultaneously to achieve better classification performance. The architecture of GNet is deliberately shallow to avoid overfitting and unnecessary complexity in the whole system. The features of each image are taken as a node in a graph, while edges connect each node to its top k neighbours with the shortest distances. To validate the models designed under the proposed framework, we evaluated their performance on a public X-ray pneumonia dataset as well as a public CT-image COVID-19 dataset. As shown in the experiments, the developed system produced promising results on a public pneumonia dataset with more than 5000 images and reached an accuracy of 0.99 on the public COVID-19 dataset. Like pneumonia caused by other bacteria and viruses, which inflames the air sacs, or alveoli, in the lungs, COVID-19-caused pneumonia shows similar symptoms (Koo et al., 2018; Li et al., 2020). Therefore, we believe the developed system could be helpful for diagnosing pneumonia caused by COVID-19 and other pathogens in the future.
The paper is arranged as follows: Section 2 briefly reviews related work on CAD systems for the detection of pneumonia. Our proposed framework is presented in detail in Section 3, followed by experiments in Section 4. A discussion is given in Section 5, and we conclude the paper in Section 6.
2. Related work
Feature reconstruction has been proven effective in improving the performance of feed-forward deep neural networks (Chung, Park & Jung, 2019). In a feature reconstruction problem, the target feature matrix $F$ can be decomposed into an informative component $F_{\text{inf}}$ and a trivial component $F_{\text{tri}}$:

$$F = F_{\text{inf}} + F_{\text{tri}} \tag{1}$$
Principal Component Analysis (PCA) is a widely deployed technique for feature dimension reduction and reconstruction (Malagón-Borja & Fuentes, 2009). Given a group of features $F_i$ ($i = 1, \ldots, N$, where $N$ is the number of features in the group), the mean feature $\bar{F}$ can be expressed as:

$$\bar{F} = \frac{1}{N} \sum_{i=1}^{N} F_i \tag{2}$$

The covariance matrix $C$ is then given by:

$$C = \frac{1}{N} \sum_{i=1}^{N} \left( F_i - \bar{F} \right) \left( F_i - \bar{F} \right)^{T} \tag{3}$$
After calculating the eigenvalues and eigenvectors of $C$, the features $F_i$ can be reconstructed by selecting the first $k$ eigenvectors, those corresponding to the $k$ largest eigenvalues.

DL has experienced fast development and wide adoption in different areas, especially in the past few years, and many CAD systems for image analysis have used DL techniques because of their excellent performance. It is therefore natural to combine feature reconstruction with DL in the search for better-performing image analysis systems. Since the outbreak of COVID-19, the computer science community has been working hard to develop CAD systems for the detection of pneumonia caused by COVID-19, which in turn helps detect COVID-19.
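To make the PCA baseline concrete, the following is a minimal NumPy sketch of reconstruction from the $k$ leading components, following Eqs. (2) and (3); the function name and the use of `numpy.linalg.eigh` are our choices, not from the paper.

```python
import numpy as np

def pca_reconstruct(features, k):
    """Reconstruct features from their k leading principal components.

    features: (N, M) array with one feature vector per row.
    Returns the rank-k reconstruction with the same shape.
    """
    mean = features.mean(axis=0)                  # Eq. (2): mean feature
    centered = features - mean
    cov = centered.T @ centered / len(features)   # Eq. (3): covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)        # eigenvalues in ascending order
    top_k = eigvecs[:, np.argsort(eigvals)[::-1][:k]]  # k largest eigenvalues
    return centered @ top_k @ top_k.T + mean      # project onto and back from the basis
```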
In many of those works, transfer learning is prevalently used because of its advantages. Training a deep CNN from scratch usually takes a long time, while the performance of the trained CNN can still be far from satisfactory. Transfer learning, a technique that reuses existing solutions for new tasks, can therefore be used to build CNN models more effectively. In transfer learning, base networks are networks trained on different datasets, usually ImageNet. The architectures of the base networks are adjusted to fit the specific image analysis task, and the adjusted networks are then trained on the datasets of interest. The base networks can also be used as feature extractors that provide extracted features to classifiers. Given these advantages, transfer learning has been applied to the detection of pneumonia. For example, a deep transfer learning model named ChestNet was proposed for the detection of multiple thorax diseases including pneumonia (Wang & Xia, 2018). The authors used the ResNet architecture as the backbone and achieved an average per-class area under the curve (AUC) of 0.7810 on X-ray images; the model was implemented under the CAFFE framework (Jia et al., 2014) and trained on a high-performance configuration. Another similar transfer-learning-based work can be found in Rajpurkar et al. (2017): based on DenseNet, a model named CheXNet was proposed for the detection of pneumonia in X-ray images, and its performance even surpassed the average performance of radiologists. Since the outbreak of COVID-19, there have also been considerable works on the detection of pneumonia caused by it. To validate the performance of AI on the detection of COVID-19, the authors of Chowdhury et al. (2020) explored the performance of state-of-the-art CNNs on the detection task. In their work, they transferred AlexNet, SqueezeNet, ResNet-18 and DenseNet201 (He et al., 2016; Huang, Liu, Van Der Maaten & Weinberger, 2017; Iandola et al., 2016; Krizhevsky, Sutskever & Hinton, 2012), amongst which SqueezeNet showed the best performance on an augmented dataset. Before augmentation, the original dataset contained 190 COVID-19, 1345 viral pneumonia, and 1341 normal X-ray images; 130 of the 190 COVID-19 X-ray images were randomly selected for the training set. To balance the samples, each category was then augmented to around 2600 images using rotation, scaling and translation. The reported highest accuracy, sensitivity and specificity were 98.3%, 96.7%, and 100% respectively. However, only 60 COVID-19-positive X-ray images were analysed in the test set. In another work (Joaquin, 2020), an Inception CNN (Szegedy et al., 2015) was transferred to a dataset comprising 1119 computed tomography (CT) scans. The proposed model showed promising results on the internal and external datasets, with accuracies of 89.5% and 79.3% respectively, which remain to be improved to meet practical requirements. Inception has also been used in Wang et al. (2020), where 453 confirmed COVID-19 images were analysed and 217 images were partitioned into the training set. The authors claimed that deep learning is of great significance for extracting graphical features for the diagnosis of COVID-19, although the proposed model showed an overall accuracy of only 73.1% on the external dataset.
Some other valuable works in the area can be found at (Abbas, Abdelsamea & Gaber, 2020; Apostolopoulos & Mpesiana, 2020; Bukhari, Bukhari, Syed & SHAH, 2020).
One common problem in the detection of pneumonia by CAD systems is the lack of large-scale public datasets. While many works report high performance, the numbers of images in their test sets are too small to form convincing results. Also, many works simply transfer state-of-the-art CNNs to the classification task without further structural optimization, while the reported performance relies heavily on pre-determined parameters and hardware. To address these issues, we propose the CGNet framework for detecting pneumonia on a large-scale public X-ray dataset; an evaluation on a public CT dataset of COVID-19-caused pneumonia also supports the proposed method. In the proposed framework, we first transfer state-of-the-art CNNs for feature extraction instead of direct classification. Graph representations of the extracted features are then constructed based on feature similarity, measured by the Euclidean distance between features: each feature is taken as a node in a graph, and edges connect nodes that are found to be neighbours according to Euclidean distance. The combined features are finally classified by our GNet, a simple graph neural network. Experiments on a public chest X-ray image dataset show that the system implemented under the CGNet framework surpasses the majority of the state-of-the-art methods, with extremely high sensitivity.
3. Methodology
In traditional ML classification tasks, features are first extracted from the data and then classified by classifiers. Our proposed CGNet framework likewise comprises three components: feature extraction, feature reconstruction and final classification. However, unlike traditional ML methods, which rely heavily on manually crafted features, our framework extracts high-level abstract features by transferring state-of-the-art CNNs in the first step. Graph representations are then built from the features' Euclidean distances. Finally, the classifier is trained on features embedded with the graph representation. The data flow of the proposed model under the CGNet framework is shown in Fig. 1.
In Fig. 1, the procedure for obtaining the trained CNNs is shown on the left. The dark green layers in the transferred and trained CNNs are new top layers that replace the original top layers. The orange and blue arrows show the flows of the training set and the test set respectively. When implementing our models for pneumonia detection under the CGNet framework, we choose the CNN that performs best on the test set as the feature extractor. GNet, which takes features combined with graph representations as input, finally classifies images into normal and pneumonia. Each component is detailed below.
3.1. Feature extraction by transferred CNNs
Unlike traditional ML methods, which rely on manually designed algorithms for ad hoc tasks, we use deep-learning-based algorithms that generalize better. Feature extraction plays a key role in the classification task, as it directly determines the performance of the subsequent classifiers. We implement feature extraction by deploying the transfer learning technique: state-of-the-art networks are first transferred to the binary classification task by replacing their top layers with new layers. After training on the training set, the CNNs can provide preliminary classification results as well as features. These CNNs were originally trained on ImageNet (Deng et al., 2009) to give 1000-category classification results; their general architecture is shown in Fig. 2. After transferring the state-of-the-art networks, we choose the CNN that gives the best result on the test set as the feature extractor for the following GNet.
To transfer the CNNs pre-trained on ImageNet, we first removed their top layers and then added new layers: one dropout layer, one transitional fully-connected layer with 256 channels, and a final 2-channel fully-connected layer for classification. The dropout layer is added to prevent overfitting during training. The information in the features would suffer significant loss if their dimension shrank too rapidly, so the transitional fully-connected layer is placed on top of the dropout layer to prevent heavy information loss. The detailed architecture of the transferred CNNs is shown in Fig. 3, where FC256 and FC2 stand for the two fully-connected layers with 256 and 2 channels respectively. The connection between the final pooling layer and the Softmax layer is replaced by a connection between the final pooling layer and the newly added dropout layer. After training the transferred networks on the pneumonia dataset for a limited number of epochs, the parameters of the CNNs are fine-tuned towards values that better represent the dataset. The details of feature acquisition under our framework can be found in Algorithm 1, which includes two stages, network transferring and feature extraction; the extracted features are then analysed for the underlying graph representation.
Algorithm 1.
Stage 1: Network transferring
Step 1.1: Load state-of-the-art networks pre-trained on ImageNet;
Step 1.2: Remove the Softmax layer and classification layer;
Step 1.3: Add new layers: dropout, FC256, FC2, a new Softmax layer and a new classification layer;
Step 1.4: Train the new networks on the training set of the pneumonia dataset with predefined parameters;
Step 1.5: Save the trained networks and parameters;
Stage 2: Feature extraction
Step 2.1: Load the trained networks;
Step 2.2: Input the dataset to the trained networks for feature extraction;
Step 2.3: Extract the features produced by the fully-connected layer FC256.
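As an illustration of Stage 1, the following is a minimal sketch of the top-layer replacement in PyTorch; the paper itself was implemented with MATLAB's Deep Learning Toolbox, and the dropout rate and the ReLU between the FC layers are our assumptions, as the paper does not specify them.

```python
import torch.nn as nn
from torchvision import models

# Load a backbone pre-trained on ImageNet (Step 1.1); DenseNet201 is one of
# the networks the paper transfers.
backbone = models.densenet201(weights="IMAGENET1K_V1")
in_features = backbone.classifier.in_features

# Steps 1.2-1.3: replace the original 1000-way top layer with
# dropout -> FC256 -> FC2, as in Fig. 3.
backbone.classifier = nn.Sequential(
    nn.Dropout(p=0.5),            # dropout rate assumed, not given in the paper
    nn.Linear(in_features, 256),  # transitional FC256 layer
    nn.ReLU(),                    # assumed activation between the FC layers
    nn.Linear(256, 2),            # FC2: normal vs. pneumonia
)
# After fine-tuning (Step 1.4), the 256-dim features are read from the FC256
# output (Step 2.3), e.g. via a forward hook on backbone.classifier[1].
```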
3.2. Graph embedded feature reconstruction
We use graph information extracted from the features to reconstruct the features; the rationale is that an individual feature is enhanced when reconstructed from its neighbouring features. Features here refer to the features extracted by the trained CNNs. To build graphs of features, each feature extracted from each image is taken as a node of a graph. For efficient computation, the features are partitioned into batches. Given the features $F \in \mathbb{R}^{D \times M}$, where $D$ is the number of features in the dataset and $M$ is the dimension of the features, and letting the batch size be $N$, the number of batches $n$ can be denoted as:
$$n = \left\lceil \frac{D}{N} \right\rceil \tag{4}$$

where $\lceil \cdot \rceil$ is the ceiling operation. Then $F$ can be rewritten as:

$$F = [F_1, F_2, \ldots, F_n] \tag{5}$$

$$F_i = [f_1, f_2, \ldots, f_N] \tag{6}$$

where $F_i \in \mathbb{R}^{N \times M}$ and $f_j \in \mathbb{R}^{1 \times M}$. Specifically, $f_j$ is a feature extracted by the previous component. For each $F_i$, a corresponding graph $G_i$ that represents the underlying relationship between nodes by the pair $(V_i, E_i)$ can be built. $V_i$ stands for the nodes, comprising each feature $f_j$ in the batch, while $E_i$ is the set of edges between nodes. Edges exist between each node and its $k$ nearest neighbours, i.e. those with the $k$ smallest Euclidean distances to the node. $E_i$ is represented in the form of an adjacency matrix $A_i \in \mathbb{R}^{N \times N}$: given a node $f_j$ and one of its neighbours $f_{j'}$, the value at position $(j, j')$ of $A_i$ is set to a positive number indicating that the two nodes are related. Hence, the key to building $G_i$ is to calculate the adjacency matrix $A_i$ (Liang & Bose, 1996). The procedure of graph generation is given in Algorithm 2.
Algorithm 2.
Divide the features into batches;
For each batch of features, let each feature in the batch be a node;
Calculate the Euclidean distance from each node to the other nodes in the graph;
Find the k nearest neighbours of each node;
Assign a positive value at the positions that correspond to the k neighbours.
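As a concrete sketch of the first step of Algorithm 2, the batching of Eq. (4) can be written as follows; the handling of a final, smaller batch is our assumption, since the paper does not state how a remainder is treated.

```python
import math

def make_batches(features, batch_size):
    """Split D features into n = ceil(D / N) batches (Eq. (4)).

    features: (D, M) array; returns a list of (<=N, M) arrays.
    The last batch may be smaller than N if N does not divide D.
    """
    n = math.ceil(len(features) / batch_size)
    return [features[i * batch_size:(i + 1) * batch_size] for i in range(n)]
```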
When building the graph $G_i$ for a batch of features $F_i$, we first calculate the distance from each feature to the other features in the batch, which yields a distance matrix $\text{Distance}$. After sorting each row of $\text{Distance}$ in ascending order, the corresponding index matrix $\text{Index}$, which records the indices of the $k$ nearest features in the batch $F_i$ before sorting, can be produced. For each row of $A_i$, the value at a position is set to 1 when the feature at that position is one of the $k$ nearest neighbours.
The pseudocode for the acquisition of $A_i$ can be found in Algorithm 3. At the initialization stage, the four variables are initialized as zero matrices, where $S$ denotes the batch size ($S = N$):

$$A_i = \mathbf{0}_{S \times S} \tag{7}$$

$$\text{Distance} = \mathbf{0}_{S \times S} \tag{8}$$

$$\text{Sorted\_Distance} = \mathbf{0}_{S \times S} \tag{9}$$

$$\text{Index} = \mathbf{0}_{S \times S} \tag{10}$$
Algorithm 3.
Step 1: Initialization (see Eqs. (7)-(10))
Initialize the adjacency matrix Ai;
Initialize the distance matrix Distance;
Initialize the variable Sorted_Distance for the sorted distances;
Initialize the index variable Index;
Step 2: Distance calculation
for a = 1:S
  for b = 1:S
    if a != b
      Distance(a, b) = d(f_a, f_b); % Euclidean distance, Eq. (11)
    end
  end
end
Step 3: Output Ai
for a = 1:S
  [Sorted_Distance(a, :), Index(a, :)] = sort(Distance(a, :)); % ascending order
  for c = 1:k
    Ai(a, Index(a, c)) = 1; % mark the c-th nearest neighbour
  end
end
then output Ai.
Specifically, the entries of $\text{Distance}$ are calculated according to:

$$\text{Distance}(a, b) = \left\| f_a - f_b \right\|_2 = \sqrt{\sum_{m=1}^{M} \left( f_{a,m} - f_{b,m} \right)^2} \tag{11}$$
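A vectorized NumPy sketch of Algorithm 3 is given below. The self-distance is left at zero, so each node counts itself among its k nearest neighbours, matching the choice described in the Discussion; the vectorization details are ours.

```python
import numpy as np

def knn_adjacency(batch, k):
    """Build the k-nearest-neighbour adjacency matrix A_i for one batch.

    batch: (S, M) array of features; returns a binary (S, S) matrix.
    """
    # Step 2: pairwise Euclidean distances, Eq. (11); the diagonal stays 0,
    # so each node is its own nearest neighbour, as in the paper's Discussion.
    diff = batch[:, None, :] - batch[None, :, :]
    distance = np.sqrt((diff ** 2).sum(axis=-1))
    # Step 3: sort each row in ascending order and mark the k nearest.
    index = np.argsort(distance, axis=1)
    adjacency = np.zeros_like(distance)
    rows = np.arange(len(batch))[:, None]
    adjacency[rows, index[:, :k]] = 1.0
    return adjacency
```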
After calculating $A_i$, each feature $f_j$ in the batch $F_i$ is reconstructed according to:

$$f'_j = \hat{A}_i(j, :) \, F_i \tag{12}$$

where $\hat{A}_i(j, :)$ is the $j$th row of $\hat{A}_i$, the normalized adjacency matrix of $A_i$. Therefore, the reconstructed feature batch $F'_i$ can be denoted as:

$$F'_i = \hat{A}_i F_i \tag{13}$$
To normalize $A_i$, we first calculate a degree matrix $D$, where:

$$D(j, j) = \sum_{l=1}^{N} \tilde{A}_i(j, l), \quad \tilde{A}_i = A_i + I \tag{14}$$

That is to say:

$$D = \operatorname{diag}\left( D(1,1), D(2,2), \ldots, D(N,N) \right) \tag{15}$$

We then normalize $A_i$ by:

$$\hat{A}_i = D^{-\frac{1}{2}} \left( A_i + I \right) D^{-\frac{1}{2}} \tag{16}$$

where $I$ is the identity matrix. After multiplying by $F_i$, we have the reconstructed feature batch $F'_i$ for classification. For the pneumonia detection task, the reconstructed feature batches are input to the proposed GNet in the developed system.
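Continuing the sketch above, the normalization and reconstruction of Eqs. (12)-(16) can be written as follows; the symmetric $D^{-1/2}(A_i + I)D^{-1/2}$ form is our reading of Eq. (16).

```python
import numpy as np

def reconstruct_batch(batch, adjacency):
    """Reconstruct one batch of features from its graph (Eqs. (12)-(16)).

    batch: (S, M) features; adjacency: binary (S, S) matrix from knn_adjacency.
    """
    a_tilde = adjacency + np.eye(len(batch))   # add self-loops via I, Eq. (16)
    degree = a_tilde.sum(axis=1)               # Eq. (14): row sums of A_i + I
    d_inv_sqrt = np.diag(1.0 / np.sqrt(degree))
    a_hat = d_inv_sqrt @ a_tilde @ d_inv_sqrt  # Eq. (16): normalized adjacency
    return a_hat @ batch                       # Eq. (13): F'_i = A_hat @ F_i
```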
3.3. GNet for classification
We name our classifier GNet when implementing the system for the detection of pneumonia, because the proposed GNet is a single-layer graph neural network whose inputs are the nodes of the graph together with the graph representation. In more general cases, traditional classifiers such as an ANN (artificial neural network) or SVM (support vector machine) can be used for classification on the extracted features.
The feature reconstruction procedure and the architecture of GNet are shown in Fig. 4. In the feature reconstruction, "features" means the features extracted by the trained CNNs. GNet has the same structure as a traditional ANN but differs in its input: ANNs usually take the extracted features without any post-processing. The main reason we do not introduce more hidden layers is that extra hidden layers would introduce a much larger number of parameters, which could trigger overfitting. Also, since the reconstruction already yields highly representative features, only one hidden layer is introduced. The number of input channels is 256.
Given that the expected output for batch $F_i$ is $FE_i$ and the real output of GNet is $FO_i$, $FO_i$ is computed as:

$$H_i = \sigma\left( F'_i W_h + b_h \right) \tag{17}$$

$$FO_i = \delta\left( H_i W + b \right) \tag{18}$$

where

$$W_h \in \mathbb{R}^{M \times h}, \quad b_h \in \mathbb{R}^{h} \tag{19}$$

$$W \in \mathbb{R}^{h \times 2}, \quad b \in \mathbb{R}^{2} \tag{20}$$

Here $H_i$ is the hidden-layer output, $\sigma(\cdot)$ is the hidden-layer activation, $M = 256$ is the input dimension and $h$ is the hidden width. $\delta(\cdot)$ is the Softmax activation function, which can be expanded as:

$$\delta(z)_c = \frac{e^{z_c}}{\sum_{c'=1}^{2} e^{z_{c'}}} \tag{21}$$
The error $\Delta E_i$ between $FE_i$ and $FO_i$ is measured by the cross-entropy:

$$\Delta E_i = -\sum_{j=1}^{N} \sum_{c=1}^{2} FE_i(j, c) \log FO_i(j, c) \tag{22}$$

The weights $W$ and bias $b$ are updated according to the stochastic gradient descent with momentum (SGDM) algorithm:

$$W \leftarrow W - \eta \frac{\partial \Delta E_i}{\partial W} + \theta \, \Delta W \tag{23}$$

$$b \leftarrow b - \eta \frac{\partial \Delta E_i}{\partial b} + \theta \, \Delta b \tag{24}$$
where $\eta$ is the learning rate, $\theta$ is the momentum rate, and $\Delta W$ and $\Delta b$ denote the updates from the previous iteration. $W_h$ and $b_h$ are updated analogously, with gradients obtained via the chain rule. By iteratively updating $W$, $W_h$, $b$, and $b_h$, we obtain a trained network for the binary classification task. Thereafter, we input each feature batch for classification; by aggregating the predicted categories of all batches, we obtain the classification results for the test set. The pseudocode of the proposed framework can be found in Algorithm 4.
Algorithm 4.
Step 1: Load CNNs pre-trained on ImageNet;
Step 2: Remove the top layers and add new top layers for the classification task;
Step 3: Train the modified CNNs on the training dataset;
Step 4: Obtain features from the fully-connected layer FC256 of the trained CNNs;
Step 5: Divide the features into batches of the same size;
Step 6: Build a graph in each batch of features;
Step 7: Reconstruct the features according to the graph information;
Step 8: Train the neural network with the reconstructed features;
Step 9: Acquire features for the test set by repeating Steps 4 to 7 on the test set;
Step 10: Test the trained neural network with the reconstructed test features.
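A minimal PyTorch sketch of GNet and its SGDM training setup is given below; the hidden width, the ReLU hidden activation and the momentum rate are assumptions, as the paper fixes only the 256-dim input, the single hidden layer, the 2-way Softmax output and the 10−4 initial learning rate.

```python
import torch
import torch.nn as nn

class GNet(nn.Module):
    """One-hidden-layer classifier over graph-reconstructed features."""

    def __init__(self, in_dim=256, hidden=256):
        super().__init__()
        self.hidden = nn.Linear(in_dim, hidden)  # Eq. (17): W_h, b_h
        self.out = nn.Linear(hidden, 2)          # Eq. (18): W, b

    def forward(self, x):
        # Returns logits; the Softmax of Eq. (21) is applied inside the loss.
        return self.out(torch.relu(self.hidden(x)))

model = GNet()
criterion = nn.CrossEntropyLoss()  # cross-entropy error, cf. Eq. (22)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-4,
                            momentum=0.9)  # SGDM, Eqs. (23)-(24); momentum assumed

# One training step on a reconstructed batch (Step 8 of Algorithm 4):
# logits = model(torch.as_tensor(batch_features, dtype=torch.float32))
# loss = criterion(logits, labels); loss.backward(); optimizer.step()
```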
3.4. Evaluation metrics
To measure the performance of the proposed model, we use common metrics, namely Sensitivity, Specificity, Accuracy, Precision and F1 score. To calculate these metrics, True Positives (TP), True Negatives (TN), False Positives (FP) and False Negatives (FN) are introduced.
The higher TP and TN are, the better the classifier detects true pneumonia and normal images; the lower FP and FN are, the fewer normal images are misclassified as pneumonia and vice versa. The sensitivity is related to TP by:

$$\text{Sensitivity} = \frac{TP}{TP + FN} \tag{25}$$

The specificity, which reflects the capability of the classifier to recognize normal images, is expressed through TN and FP as:

$$\text{Specificity} = \frac{TN}{TN + FP} \tag{26}$$

The overall performance of the classifier is measured by the accuracy:

$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \tag{27}$$

The precision measures the percentage of real pneumonia images out of all predicted pneumonia images:

$$\text{Precision} = \frac{TP}{TP + FP} \tag{28}$$

The F1 score summarizes the classification ability of the classifier:

$$F1 = \frac{2 \times \text{Precision} \times \text{Sensitivity}}{\text{Precision} + \text{Sensitivity}} \tag{29}$$
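For completeness, Eqs. (25)-(29) can be computed directly from the confusion-matrix counts, as in this small sketch:

```python
def metrics(tp, tn, fp, fn):
    """Evaluation metrics of Eqs. (25)-(29) from the confusion-matrix counts."""
    sensitivity = tp / (tp + fn)                # Eq. (25)
    specificity = tn / (tn + fp)                # Eq. (26)
    accuracy = (tp + tn) / (tp + tn + fp + fn)  # Eq. (27)
    precision = tp / (tp + fp)                  # Eq. (28)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)  # Eq. (29)
    return sensitivity, specificity, accuracy, precision, f1
```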
4. Experiment design
In this section, we first describe the datasets used in this research (Section 4.1) and the experiment settings (Section 4.2). The performance of the transferred networks (Section 4.3) is given before we explore the batch size N and the number of neighbours k for GNet in Section 4.4. The effectiveness of feature reconstruction and a comparison of feature reconstruction methods are given in Sections 4.5 and 4.6 respectively. We conclude the section by comparing our proposed method with the state-of-the-art methods in Section 4.7.
4.1. Datasets
In this research, we validated our proposed feature reconstruction method on two public datasets from two different modalities. We used one cohort with confirmed pneumonia caused by bacteria, which we name dataset1. Dataset1, which is publicly available at (Chest X-Ray Images (Pneumonia), 2020), contains 5856 normal and pneumonia X-ray images in total. To provide a fair comparison platform for different systems, the training, validation and test sets have been partitioned beforehand. Examples of normal and pneumonia samples are shown in Fig. 5, and details of dataset1 are given in Table 1.
Table 1.
Set | Normal | Pneumonia | Total Number |
---|---|---|---|
Training | 1341 | 3875 | 5216 |
Validation | 8 | 8 | 16 |
Testing | 234 | 390 | 624 |
Overall | 1583 | 4273 | 5856 |
We also examined our proposed method on another public CT-based pneumonia dataset, in which the pneumonia was caused by COVID-19; we name it dataset2 (Zhao, Zhang, He & Xie, 2020). Some sample images are shown in Fig. 6. The sizes of the three-channel RGB images vary from 720 × 541 × 3 to 725 × 551 × 3.
Compared to images acquired with X-ray instruments, CT lung images offer advantages in resolution and fine-grained detail. The detailed composition of dataset2 is shown in Table 2.
Table 2.
Set | Normal | Pneumonia | Total Number |
---|---|---|---|
Training | 234 | 191 | 425 |
Validation | 58 | 60 | 118 |
Testing | 105 | 98 | 203 |
Overall | 397 | 349 | 746 |
4.2. Experiment settings
The experiments were run on a personal laptop with 16 GB RAM and a GTX 1050 GPU. As we have to train the transferred networks and then the following ANN and GNet in sequence, we use two different sets of parameters. The Deep Learning Toolbox provided by MathWorks is used as the framework. The transferred networks are trained for 20 epochs with an initial learning rate of 10−4. The batch size is 8 to prevent the system from crashing due to memory limits. We use stochastic gradient descent with momentum (SGDM) as the optimization method, and the learning rate is halved every 5 epochs to ensure convergence. When training the transferred networks, the images in the training set are shuffled at every epoch. The detailed settings, which remained unchanged when training the networks on both datasets, are shown in Table 3.
Table 3.
Parameters | Value |
---|---|
Maximum training epoch | 20 |
Initial learning rate | 10−4 |
Batch size | 8 |
Learning rate drop period | 5 |
Learning rate drop rate | 0.5 |
Optimization method | SGDM |
Shuffle of the train set | Each epoch |
When constructing graphs for reconstruction, the size N of each batch and the number of neighbours k within each batch should be carefully chosen so that the best performance of GNet can be achieved. When training the ANN with features extracted from the transferred networks, we increased the number of training epochs to 40 while keeping the other parameters unchanged. We use the same parameters throughout the experiments unless otherwise specified. We then vary N and k to explore the relationship between N, k and the performance of the network; details are presented in the following sections.
4.3. Performance of the transferred networks
To find the best-performing state-of-the-art network to transfer to our classification task, we trained networks including AlexNet (Krizhevsky, Sutskever & Hinton, 2012), GoogLeNet (Szegedy et al., 2015), SqueezeNet (Iandola et al., 2016), VGG16 (Simonyan & Zisserman, 2014), XceptionNet (Chollet, 2017), ResNet18 (He et al., 2016), ResNet101, InceptionV3 (Szegedy, Vanhoucke, Ioffe, Shlens & Wojna, 2016) and DenseNet201 (Huang et al., 2017). All of these networks are pre-trained on ImageNet for the 1000-category classification task. To make a fair comparison amongst the networks, top layers including FC256 and FC2 are inserted between the dropout layer and the Softmax layer, and the original Softmax and classification layers are replaced by new ones. The numbers of learnable parameters of the networks are listed in Table 4.
Table 4.
Model name | Number of parameters(Millions) | Number of Layers |
---|---|---|
AlexNet | 6.10 | 25 |
GoogLeNet | 0.70 | 144 |
SqueezeNet | 0.12 | 68 |
VGG16 | 13.84 | 41 |
XceptionNet | 2.29 | 170 |
ResNet18 | 1.17 | 71 |
ResNet101 | 4.45 | 347 |
DenseNet201 | 1.98 | 708 |
After training with the dataset1, the results of different networks are organized in Table 5 while ROC curves are shown in Fig. 7 .
Table 5.
Model name | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
AlexNet | 0.9974 | 0.5000 | 0.6648 | 0.9915 | 0.9747 | 0.8109 |
GoogLeNet | 0.9949 | 0.4744 | 0.6398 | 0.9823 | 0.9680 | 0.7997
SqueezeNet | 0.9974 | 0.4402 | 0.6095 | 0.9904 | 0.9730 | 0.7885 |
VGG16 | 1.0000 | 0.3462 | 0.5143 | 1.0000 | 0.9627 | 0.7548 |
XceptionNet | 0.9872 | 0.6453 | 0.7744 | 0.9679 | 0.9539 | 0.8590 |
ResNet18 | 0.9974 | 0.5598 | 0.7158 | 0.9924 | 0.9788 | 0.8333 |
ResNet101 | 1.0000 | 0.5385 | 0.7000 | 1.0000 | 0.9674 | 0.8269 |
DenseNet201 | 1.0000 | 0.4487 | 0.6195 | 1.0000 | 0.9705 | 0.7933 |
On dataset1, XceptionNet, surprisingly, performed best among all the transferred networks, though it was neither the deepest nor the newest. The reasons could be twofold. One is the architectural novelty of XceptionNet: besides depthwise separable convolution, which greatly lowers the number of trainable parameters, it uses the shortcut connection technique also deployed in ResNet and DenseNet. Another possible reason is the input size. The input size of XceptionNet is 299 × 299 × 3, while the input sizes of the other networks are no greater than 227 × 227 × 3; since the original images are larger than 1000 × 1000 × 3, XceptionNet suffered much less information loss, allowing more representative features to be extracted. Interestingly, performance could even worsen as the networks went deeper, which also required longer training times. Given this, we chose XceptionNet as our feature extractor for this classification task, though its standalone performance is still far from satisfactory.
Similarly, we adapted and trained the same state-of-the-art networks on dataset2. The results on the validation set and the test set are shown in Tables 6 and 7 respectively. The ROC curves are shown in Fig. 8 .
Table 6.
Model name | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
AlexNet | 0.7586 | 0.7333 | 0.7458 | 0.7586 | 0.7957 | 0.7458 |
GoogLeNet | 0.6552 | 0.4667 | 0.5185 | 0.5833 | 0.6293 | 0.5593 |
SqueezeNet | 0.5517 | 0.7167 | 0.6667 | 0.6232 | 0.6293 | 0.6356 |
VGG16 | 0.6724 | 0.7500 | 0.7258 | 0.7031 | 0.7888 | 0.7119 |
XceptionNet | 0.6897 | 0.6333 | 0.6552 | 0.6786 | 0.6718 | 0.6610 |
ResNet18 | 0.7069 | 0.7333 | 0.7273 | 0.7213 | 0.8491 | 0.7203 |
ResNet101 | 0.7759 | 0.6167 | 0.6727 | 0.7400 | 0.7787 | 0.6949 |
DenseNet201 | 0.8276 | 0.7000 | 0.7500 | 0.8077 | 0.8678 | 0.7627 |
Table 7.
Model name | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
AlexNet | 0.7429 | 0.6735 | 0.6911 | 0.7097 | 0.7920 | 0.7094 |
GoogLeNet | 0.7619 | 0.6837 | 0.7053 | 0.7283 | 0.7913 | 0.7241 |
SqueezeNet | 0.7143 | 0.6531 | 0.6667 | 0.6809 | 0.6909 | 0.6847 |
VGG16 | 0.8095 | 0.6939 | 0.7312 | 0.7727 | 0.8331 | 0.7537 |
XceptionNet | 0.6381 | 0.7041 | 0.6732 | 0.6449 | 0.7030 | 0.6700 |
ResNet18 | 0.7619 | 0.6837 | 0.7053 | 0.7283 | 0.7883 | 0.7241 |
ResNet101 | 0.7714 | 0.6429 | 0.6811 | 0.7241 | 0.7749 | 0.7094 |
DenseNet201 | 0.8571 | 0.6735 | 0.7374 | 0.8148 | 0.8597 | 0.7685 |
On both the validation set and the test set, DenseNet201 showed the best performance amongst all the state-of-the-art networks. As can be seen from Table 4, the number of learnable parameters in DenseNet201 is not the largest, but its number of layers is much greater than that of the other state-of-the-art networks. DenseNet201 therefore likely benefits from the high density of connections between layers, which allows more representative features to be learnt. Due to the imaging modality difference between X-ray and CT, XceptionNet, which performed best on dataset1, turned out to perform poorly on dataset2. We therefore used DenseNet201 as the feature extractor when implementing GNet on dataset2.
4.4. The influence of batch size N and number of neighbours k
For dataset1, after extracting features with XceptionNet, we reconstruct the features according to the generated graphs. The batch size N and the number of neighbours k are two parameters that affect the performance of the proposed GNet, so we explored different combinations of N and k to optimize it. Table 8 shows the results of GNet when N is 32 and k varies from 4 to 28. In Fig. 9, the ROC curves of GNet with different configurations are shown. As can be seen, all of the GNets achieved an AUC over 0.96, with AUCs as high as 0.99. Interestingly, the performance of GNet at k = 24 appears to have saturated, matching that at k = 28.
Table 8.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
4 | 1.0000 | 0.4701 | 0.6395 | 1.0000 | 0.9602 | 0.8013 |
8 | 1.0000 | 0.7650 | 0.8668 | 1.0000 | 0.9892 | 0.9119 |
12 | 1.0000 | 0.5812 | 0.7351 | 1.0000 | 0.9925 | 0.8429 |
16 | 0.9949 | 0.9231 | 0.9558 | 0.9908 | 0.9977 | 0.9679 |
24 | 1.0000 | 0.9103 | 0.9530 | 1.0000 | 0.9999 | 0.9663 |
28 | 1.0000 | 0.9103 | 0.9530 | 1.0000 | 0.9999 | 0.9663 |
From Table 8, we can conclude that the performance of GNet increases with the number of neighbours considered. To explore the influence of the batch size N on the performance of GNet, we examined the performance when N is 64, 96, 128 and 156 while k varies. Tables 9, 10, 11 and 12 show the performance of GNet when N is 64, 96, 128 and 156 respectively; the corresponding ROC curves are shown in Figs. 10, 11, 12 and 13.
Table 9.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
8 | 1.0000 | 0.6325 | 0.7749 | 1.0000 | 0.9837 | 0.8622 |
16 | 1.0000 | 0.4487 | 0.6195 | 1.0000 | 0.9872 | 0.7933 |
24 | 0.9923 | 0.8205 | 0.8951 | 0.9846 | 0.9978 | 0.9279 |
28 | 0.9974 | 0.7735 | 0.8702 | 0.9945 | 0.9979 | 0.9135 |
32 | 0.9974 | 0.7735 | 0.8702 | 0.9945 | 0.9979 | 0.9135 |
48 | 0.9872 | 0.9744 | 0.9764 | 0.9785 | 0.9985 | 0.9824 |
56 | 0.9436 | 1.0000 | 0.9551 | 0.9141 | 0.9998 | 0.9647 |
Table 10.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
12 | 0.9974 | 0.6795 | 0.8071 | 0.9938 | 0.9842 | 0.8782 |
24 | 1.0000 | 0.4615 | 0.6316 | 1.0000 | 0.9793 | 0.7981 |
36 | 0.9974 | 0.7094 | 0.8279 | 0.9940 | 0.9932 | 0.8894 |
48 | 0.9846 | 0.8462 | 0.9041 | 0.9706 | 0.9936 | 0.9327 |
60 | 1.0000 | 0.6923 | 0.8182 | 1.0000 | 0.9964 | 0.8846 |
72 | 1.0000 | 0.7179 | 0.8358 | 1.0000 | 0.9974 | 0.8942 |
84 | 0.9487 | 1.0000 | 0.9590 | 0.9213 | 0.9985 | 0.9679 |
Table 11.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
16 | 1.0000 | 0.5385 | 0.7000 | 1.0000 | 0.9911 | 0.8269 |
32 | 1.0000 | 0.4573 | 0.6276 | 1.0000 | 0.9898 | 0.7965 |
48 | 0.9949 | 0.8376 | 0.9074 | 0.9899 | 0.9976 | 0.9359 |
64 | 0.9897 | 0.9487 | 0.9652 | 0.9823 | 0.9987 | 0.9744 |
80 | 0.9846 | 0.9872 | 0.9809 | 0.9747 | 0.9987 | 0.9856 |
96 | 0.9795 | 1.0000 | 0.9872 | 0.9669 | 0.9989 | 0.9872 |
112 | 0.9436 | 1.0000 | 0.9551 | 0.9141 | 0.9989 | 0.9647 |
Table 12.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
16 | 0.9974 | 0.6325 | 0.7728 | 0.9933 | 0.9795 | 0.8606 |
32 | 0.9949 | 0.7179 | 0.8317 | 0.9882 | 0.9892 | 0.8910 |
48 | 0.9974 | 0.5256 | 0.6872 | 0.9919 | 0.9910 | 0.8205 |
64 | 0.9923 | 0.8034 | 0.8847 | 0.9843 | 0.9915 | 0.9215 |
80 | 0.9692 | 0.9402 | 0.9442 | 0.9483 | 0.9924 | 0.9583 |
96 | 0.9462 | 0.9915 | 0.9631 | 0.9170 | 0.9939 | 0.9631 |
112 | 0.9282 | 1.0000 | 0.9435 | 0.8931 | 0.9969 | 0.9551 |
128 | 0.8974 | 1.0000 | 0.9213 | 0.8540 | 0.9971 | 0.9359 |
144 | 0.9256 | 1.0000 | 0.9416 | 0.8897 | 0.9969 | 0.9535 |
A graph showing the relationship between the batch size N and the number of neighbours k on dataset1 is given in Fig. 14. As can be seen, the accuracy increases with k when N is small; when N is relatively large, the accuracy appears to saturate even as k increases. It is noticeable, however, that the accuracy improves as the k/N ratio increases.
Following a similar pattern, we chose the trained network that showed the best performance on dataset2, DenseNet201 in this scenario, as the feature extractor. We then varied k and N to find the best configuration of the proposed GNet, where N ∈ {32, 64, 96} and k increases from one quarter of N to a number close to N in integer multiples of 4. Tables 13 and 14 show the results of GNet on the validation set and the test set of dataset2 when N = 32; the ROC curves are shown in Fig. 15.
Table 13.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
4 | 0.6724 | 0.8833 | 0.8030 | 0.7361 | 0.8744 | 0.7797 |
8 | 0.8966 | 0.7833 | 0.8319 | 0.8868 | 0.9080 | 0.8390 |
16 | 0.8793 | 1.0000 | 0.9449 | 0.8955 | 0.9980 | 0.9407 |
20 | 1.0000 | 0.7500 | 0.8571 | 1.0000 | 0.9862 | 0.8729
24 | 0.6724 | 1.0000 | 0.8633 | 0.7595 | 0.9971 | 0.8390 |
28 | 0.5862 | 1.0000 | 0.8333 | 0.7143 | 0.9980 | 0.7966 |
Table 14.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
4 | 0.8190 | 0.8980 | 0.8585 | 0.8224 | 0.9115 | 0.8571 |
8 | 0.9810 | 0.8367 | 0.9011 | 0.9762 | 0.9709 | 0.9113 |
16 | 0.9810 | 0.9286 | 0.9529 | 0.9785 | 0.9805 | 0.9557 |
20 | 1.0000 | 0.8776 | 0.9348 | 1.0000 | 0.9862 | 0.9409 |
24 | 0.7905 | 0.9796 | 0.8889 | 0.8136 | 0.9853 | 0.8818 |
28 | 0.9810 | 0.9796 | 0.9796 | 0.9796 | 0.9850 | 0.9803 |
We also evaluated the performance of GNet when N is 64, with k varying from 8 to 56 in integer multiples of 4. The results on the validation set and the test set of dataset2 can be seen in Tables 15 and 16, while the ROC curves are shown in Fig. 16.
Table 15.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
8 | 0.9138 | 0.5833 | 0.7000 | 0.8750 | 0.8805 | 0.7458 |
16 | 0.7759 | 0.8833 | 0.8413 | 0.8030 | 0.8914 | 0.8305 |
24 | 0.9655 | 0.5667 | 0.7083 | 0.9444 | 0.8874 | 0.7627 |
28 | 0.7759 | 0.8333 | 0.8130 | 0.7937 | 0.9190 | 0.8051 |
32 | 0.6897 | 0.9667 | 0.8529 | 0.7632 | 0.9287 | 0.8305 |
48 | 0.9310 | 1.0000 | 0.9677 | 0.9375 | 0.9994 | 0.9661 |
56 | 0.9828 | 0.9000 | 0.9391 | 0.9818 | 0.9966 | 0.9407 |
Table 16.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
8 | 0.9905 | 0.6122 | 0.7547 | 0.9836 | 0.9560 | 0.8079 |
16 | 0.9619 | 0.8571 | 0.9032 | 0.9545 | 0.9365 | 0.9113 |
24 | 0.9905 | 0.6633 | 0.7927 | 0.9848 | 0.9218 | 0.8325 |
28 | 0.9905 | 0.8776 | 0.9297 | 0.9885 | 0.9325 | 0.9360 |
32 | 0.9810 | 0.8878 | 0.9305 | 0.9775 | 0.9457 | 0.9360 |
48 | 0.9905 | 0.8367 | 0.9061 | 0.9880 | 0.9565 | 0.9163 |
56 | 1.0000 | 0.6531 | 0.7901 | 1.0000 | 0.9565 | 0.8325 |
Due to the limited number of images in the validation set of dataset2, the maximum N is chosen to be 96 instead of the 128 used on dataset1. The detailed results of GNet on the validation set and the test set when N = 96 are shown in Tables 17 and 18, while the corresponding ROC curves are shown in Fig. 17.
Table 17.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
12 | 0.8276 | 0.7000 | 0.7500 | 0.8077 | 0.8236 | 0.7627 |
16 | 0.7414 | 0.6667 | 0.6957 | 0.7273 | 0.8336 | 0.7934 |
24 | 0.7586 | 0.5500 | 0.6168 | 0.7021 | 0.7667 | 0.6525 |
32 | 0.7586 | 0.6833 | 0.7130 | 0.7455 | 0.8034 | 0.7203 |
48 | 0.7241 | 0.6822 | 0.7009 | 0.7193 | 0.8167 | 0.7034 |
64 | 0.4138 | 0.9667 | 0.7632 | 0.6304 | 0.8037 | 0.6949 |
80 | 0.8621 | 0.6167 | 0.7048 | 0.8222 | 0.8609 | 0.7373 |
Table 18.
k | Specificity | Sensitivity | F1 | Precision | AUC | Accuracy |
---|---|---|---|---|---|---|
12 | 0.9810 | 0.8469 | 0.9071 | 0.9765 | 0.9680 | 0.9163 |
16 | 0.9524 | 0.7347 | 0.8229 | 0.9351 | 0.9352 | 0.8473 |
24 | 0.9810 | 0.7653 | 0.8571 | 0.9740 | 0.9736 | 0.8768 |
32 | 0.9810 | 0.9184 | 0.9474 | 0.9783 | 0.9857 | 0.9507 |
48 | 1.0000 | 0.8878 | 0.9405 | 1.0000 | 0.9979 | 0.9458 |
64 | 0.9333 | 0.9898 | 0.9604 | 0.9327 | 0.9935 | 0.9606 |
80 | 1.0000 | 0.9796 | 0.9897 | 1.0000 | 0.9948 | 0.9901 |
Similarly, we drew graphs representing the relationship between the performance and the k/N ratio on the validation set and the test set of dataset2, as shown in Fig. 18.
As can be seen, the best configurations of GNet on the validation set and the test set are quite different, and the performance of GNet fluctuates when k and N are not optimal. Also, due to the limited number of images, a larger N does not guarantee high performance. However, a rough conclusion can be drawn from Fig. 18: when k/N is greater than or close to 0.8, an overall higher performance is observed. A similar conclusion can be drawn from Fig. 14.
4.5. Effectiveness of feature reconstruction
To evaluate the proposed method on the detection of pneumonia, we trained XceptionNet directly on dataset1 and used it as the feature extractor, with an ANN of the same structure as our GNet as the classifier. To validate the effectiveness of feature reconstruction on dataset1, we examined the performance of the ANN when the input consists of the features originally extracted by XceptionNet, without reconstruction. In addition, SVM and Decision Tree, two classical supervised classifiers, were introduced for comparison with our proposed model. To avoid confusion, we name the ANN trained on the original features ANNraw. The two networks were trained with the same training parameters, and their classification results are given in Table 19.
Table 19.
Model name | Specificity | Sensitivity | F1 | Precision | Accuracy |
---|---|---|---|---|---|
ANNraw | 0.9385 | 0.7949 | 0.8378 | 0.8857 | 0.8846 |
SVM | 0.9821 | 0.5897 | 0.7282 | 0.9517 | 0.8349 |
Decision Tree | 0.9769 | 0.5470 | 0.6900 | 0.9343 | 0.8157 |
GNet (N = 32, k = 16) | 0.9949 | 0.9231 | 0.9558 | 0.9908 | 0.9679 |
GNet (N = 64, k = 48) | 0.9872 | 0.9744 | 0.9764 | 0.9785 | 0.9824 |
GNet (N = 96, k = 84) | 0.9487 | 1.0000 | 0.9590 | 0.9213 | 0.9679 |
GNet (N = 128, k = 96) | 0.9846 | 0.9915 | 0.9831 | 0.9748 | 0.9872 |
GNet (N = 156, k = 96) | 0.9462 | 0.9915 | 0.9631 | 0.9170 | 0.9631 |
The parameters in parentheses are those used to train GNet. As can be seen, GNet, trained with the reconstructed features, performed much better than the network trained with the original features extracted by XceptionNet: although the batch size N and the number of neighbours k vary, the accuracy shows a consistent gain over ANNraw. Regarding computational cost, training GNet takes longer because of the graph construction and graph-feature combination steps during feature reconstruction. However, GNet becomes more advantageous in the prediction phase, as the network classifies multiple images simultaneously. Considering the computational capability of our laptop, the cost is still affordable, so we believe the proposed feature reconstruction method is transplantable to other machines as well, and that it is effective in improving the performance of the ANN.
On dataset2, DenseNet201 showed the best performance amongst all the state-of-the-art networks; hence, the inputs of the ANN and GNet are the features extracted by DenseNet201. The same ANN and GNet architectures are employed in the comparison. As can be seen in Table 20, the performance of GNet is significantly higher than that of the ANN and the SVM.
Table 20.
Model name | Specificity | Sensitivity | F1 | Precision | Accuracy |
---|---|---|---|---|---|
ANNraw | 0.7810 | 0.6633 | 0.6989 | 0.7386 | 0.7241 |
SVM | 0.8857 | 0.7347 | 0.7912 | 0.8571 | 0.8128 |
Decision Tree | 0.8286 | 0.6224 | 0.6893 | 0.7722 | 0.7291 |
GNet (N = 32, k = 28) | 0.9810 | 0.9796 | 0.9796 | 0.9796 | 0.9803 |
GNet (N = 64, k = 28) | 0.9905 | 0.8776 | 0.9297 | 0.9885 | 0.9360 |
GNet (N = 96, k = 80) | 1.0000 | 0.9796 | 0.9897 | 1.0000 | 0.9901 |
4.6. Comparison of feature reconstruction methods
In this section, we compare our proposed feature reconstruction method with the PCA-based feature reconstruction method. The eigenvalues and eigenvectors are calculated and selected in descending order of eigenvalue. For both datasets, we ensure that the selected components explain more than 98% of the total variance; the feature dimensions for the two datasets are thereby reduced from 256 to 152 and 131 respectively. We then trained classifiers including ANN, SVM and Decision Tree on the features reconstructed by PCA. We also reconstructed the features from both datasets with our graph-based reconstruction method; for simplicity, we chose N to be 32 and kept k at 16.
As can be seen from Tables 19 and 21, with PCA the performance of the ANN decreased slightly while the performance of the SVM and Decision Tree increased. A contrasting conclusion can be drawn from Table 22: with our method, the SVM and Decision Tree performed even worse, while the performance of the ANN improved significantly. We may therefore roughly conclude that neither the PCA-based reconstruction method nor our method significantly improves the performance of the SVM and Decision Tree, though our method does raise their specificity to 1.
Table 21.
Model name | Specificity | Sensitivity | F1 | Precision | Accuracy |
---|---|---|---|---|---|
ANN | 0.9385 | 0.7778 | 0.8273 | 0.8835 | 0.8782 |
SVM | 0.9821 | 0.5940 | 0.7316 | 0.9521 | 0.8365 |
Decision Tree | 0.9846 | 0.6026 | 0.7402 | 0.9592 | 0.8413 |
Table 22.
Model name | Specificity | Sensitivity | F1 | Precision | Accuracy |
---|---|---|---|---|---|
ANN | 0.9949 | 0.9231 | 0.9558 | 0.9908 | 0.9679 |
SVM | 1.0000 | 0.5427 | 0.7036 | 1.0000 | 0.8285 |
Decision Tree | 1.0000 | 0.4701 | 0.6395 | 1.0000 | 0.8013 |
On dataset2, we repeated the same procedures and present the results in Tables 23 and 24 respectively. Note that the input size of the ANN was adjusted to 131 accordingly. As can be seen, the performance of the SVM remains unchanged when PCA is used, while our method harmed its performance; for the Decision Tree, both methods improved performance. Based on the results from the two datasets, we are convinced that our feature reconstruction method shows a great advantage in improving the performance of the ANN, while how to improve the performance of the SVM and Decision Tree with our method remains to be explored.
Table 23.
Model name | Specificity | Sensitivity | F1 | Precision | Accuracy |
---|---|---|---|---|---|
ANN | 0.8095 | 0.5510 | 0.6279 | 0.7297 | 0.6847 |
SVM | 0.8857 | 0.7347 | 0.7912 | 0.8571 | 0.8128 |
Decision Tree | 0.9333 | 0.5816 | 0.7037 | 0.8906 | 0.7635 |
Table 24.
Model name | Specificity | Sensitivity | F1 | Precision | Accuracy |
---|---|---|---|---|---|
ANN | 0.9810 | 0.9286 | 0.9529 | 0.9785 | 0.9557 |
SVM | 1.0000 | 0.2347 | 0.3802 | 1.0000 | 0.6305 |
Decision Tree | 0.9429 | 0.6122 | 0.7317 | 0.9091 | 0.7833 |
4.7. Comparison of state-of-the-art methods
Considerable work has focused on designing high-accuracy systems for the detection of pneumonia. To provide a fair comparison, we compare our method with methods that have been validated on the same datasets. For pneumonia caused by common bacteria and viruses, the performance of the different methods is shown in Table 25.
Table 25.
Model name | Specificity | Sensitivity | Precision | Accuracy |
---|---|---|---|---|
Liang (Liang & Zheng, 2019) | 0.9549 | 0.9670 | 0.8910 | 0.9050
Wang (Wang & Xia, 2018) | 0.9949 | 0.9017 | 0.9906 | 0.9599
Rajpurkar (Rajpurkar et al., 2017) | 0.9795 | 0.9359 | 0.9648 | 0.9631
Kermany (Kermany et al., 2018) | 0.9872 | 0.9615 | 0.9783 | 0.9776
Islam (Islam et al., 2020) | 0.9918 | 0.9880 | 0.9918 | 0.9899
Our method | 0.9846 | 0.9915 | 0.9748 | 0.9872
Sensitivity measures the ability of a method to correctly diagnose sick people as patients. Compared to high specificity, high sensitivity is of even greater significance, because a healthy person misdiagnosed as sick faces much lower risk than a patient misdiagnosed as healthy. As can be seen, our method surpasses the majority of the state-of-the-art methods and achieves the highest sensitivity, showing a stronger capability for detecting pneumonia images.
To provide a fair comparison on chest CT images, we also compared our method with the state-of-the-art methods validated on the same dataset. Detailed results can be seen in Table 26.
Table 26.
Model name | Specificity | Sensitivity | Precision | Accuracy |
---|---|---|---|---|
Zhao (Zhao et al., 2020) | – | – | – | 0.89 |
Wu (Wu et al., 2020) | 0.93 | 0.95 | – | 0.93 |
Jaiswal (Jaiswal, Gianchandani, Singh, Kumar & Kaur, 2020) | 0.96 | – | 0.96 | 0.96 |
Loey (Loey, Manogaran & Khalifa, 2020) | 0.88 | 0.78 | 0.85 | 0.83
He (He et al., 2020) | – | – | – | 0.86
Our method | 1.00 | 0.98 | 1.00 | 0.99
5. Discussion
In this paper, we proposed a novel classifier for the detection of pneumonia based on X-ray and CT images. The proposed feature reconstruction method greatly improves the performance of a neural network with a simple architecture. Before images are input to the models, preprocessing is applied to adapt them. As the sizes of the images vary, they are resized to meet the input size requirements of the different networks; since the images are not square and their original height and width exceed 1000 pixels, resizing may introduce distortion. Though XceptionNet performed best amongst all CNNs on dataset1, this could be because its larger input size allows more features to be extracted. The images in dataset1 are one-channel grey-scale images and are therefore converted to three-channel RGB images. For the CT images in dataset2, DenseNet201, which likely benefits from the rich connections between its layers, showed the best performance amongst all the networks. Apart from the procedures mentioned above, no other data augmentation is involved. For similarity measurement in the graph generation process, many other methods could be applied; however, Euclidean distance is the most straightforward way to quantify the difference between features. When reconstructing a feature, it can be reconstructed from the top k nearest neighbours found, but the quality of the reconstruction could be significantly affected by the distribution of features within the same batch. We therefore include the original feature as one of the top k nearest neighbours, mitigating the influence of the feature distribution. Graph construction and graph-feature combination are the two most important procedures in feature reconstruction, yet they bring no significant computational cost in either the training or the prediction period. In general, a validation set is used to validate the performance of classifiers; however, the validation set in dataset1 is too small to be useful, so we suggest repartitioning the dataset to form a genuinely useful validation set.
6. Conclusion
In this paper, we proposed a novel deep learning model for the detection of pneumonia that integrates graph knowledge for feature reconstruction. We hypothesise that features become more representative when reconstructed from their neighbours. Based on this, we proposed a graph-based reconstruction method that turns out to be simple yet effective. If the reconstruction is considered as a component of the following GNet, then GNet becomes a simple graph neural network with only one layer. Experiments on a large dataset demonstrated the high performance of the proposed feature reconstruction method and the proposed model, which surpassed the state-of-the-art methods. This conclusion is also supported by the results on a public COVID-19 CT-image dataset, where we achieved 99% accuracy.
However, some limitations of this research remain and are left as future work. The sizes of the datasets are still too small to produce fully convincing results. The construction of the graph and the reconstruction of the features are vital parts of our method, but several details remain to be explored. For example, we chose a batch size N smaller than the input dimension of the ANN throughout the experiments; what if we chose an even larger N, greater than the input dimension? Also, the design of the transitional fully-connected layer and the optimization of the ANN architecture remain to be further explored, although the ANN already showed superiority over other classifiers such as SVM and Decision Tree. These details will be addressed in our future work.
CRediT authorship contribution statement
Xiang Yu: Conceptualization, Methodology, Software, Validation, Data curation. Shui-Hua Wang: Resources, Investigation, Project administration, Funding acquisition. Yu-Dong Zhang: Resources, Formal analysis, Investigation, Data curation, Writing - review & editing, Supervision, Project administration, Funding acquisition.
Acknowledgement
Xiang Yu holds a CSC scholarship with the University of Leicester. The paper was partially supported by Royal Society International Exchanges Cost Share Award, UK (RP202G0230); Hope Foundation for Cancer Research, UK (RM60G0680); Medical Research Council Confidence in Concept Award, UK (MC_PC_17171); British Heart Foundation Accelerator Award, UK; Fundamental Research Funds for the Central Universities (CDLS-2020-03); Key Laboratory of Child Development and Learning Science (Southeast University), Ministry of Education; Guangxi Key Laboratory of Trusted Software (kx201901).
References
- Abbas, A., Abdelsamea, M. M., & Gaber, M. M. (2020). Classification of COVID-19 in chest X-ray images using DeTraC deep convolutional neural network. arXiv preprint arXiv:2003.13815.
- Adhikari, N. C. D. (2020). Infection severity detection of CoVID19 from X-rays and CT scans using artificial intelligence. International Journal of Computer (IJC), 38, 73–92.
- Apostolopoulos, I. D., & Mpesiana, T. A. (2020). Covid-19: Automatic detection from x-ray images utilizing transfer learning with convolutional neural networks. Physical and Engineering Sciences in Medicine, 43, 635–640. doi:10.1007/s13246-020-00865-4.
- Behzadi, P., Ranjbar, R., & Alavian, S. M. (2014). Nucleic acid-based approaches for detection of viral hepatitis. Jundishapur Journal of Microbiology, 8, e17449. doi:10.5812/jjm.17449.
- Bukhari, S. U. K., Bukhari, S. S. K., Syed, A., & Shah, S. S. H. (2020). The diagnostic evaluation of convolutional neural network (CNN) for the assessment of chest X-ray of patients infected with COVID-19. medRxiv: 2020.03.26.20044610.
- Chen, X., Chen, Y., Liu, H., Goldmacher, G., Roberts, C., & Maria, D. (2019). PIN92 Pediatric bacterial pneumonia classification through chest X-rays using transfer learning. Value in Health, 22, S209–S210.
- Chest X-Ray Images (Pneumonia) (2020). Available: https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia.
- Chollet, F. (2017). Xception: Deep learning with depthwise separable convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1251–1258).
- Chowdhury, M. E., Rahman, T., Khandakar, A., Mazhar, R., Kadir, M. A., Mahbub, Z. B., et al. (2020). Can AI help in screening viral and COVID-19 pneumonia? arXiv preprint arXiv:2003.13145.
- Chung, H., Park, J. G., & Jung, H. Y. (2019). Rank-weighted reconstruction feature for a robust deep neural network-based acoustic model. ETRI Journal, 41, 235–241.
- Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., & Fei-Fei, L. (2009). ImageNet: A large-scale hierarchical image database. In IEEE conference on computer vision and pattern recognition (pp. 248–255).
- Fang, Y., Zhang, H., Xie, J., Lin, M., Ying, L., & Pang, P. (2020). Sensitivity of chest CT for COVID-19: Comparison to RT-PCR. Radiology. doi:10.1148/radiol.2020200432.
- He, K., Zhang, X., Ren, S., & Sun, J. (2015). Delving deep into rectifiers: Surpassing human-level performance on ImageNet classification. In Proceedings of the IEEE international conference on computer vision (pp. 1026–1034).
- He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770–778).
- He, X., Yang, X., Zhang, S., Zhao, J., Zhang, Y., Xing, E., et al. (2020). Sample-efficient deep learning for COVID-19 diagnosis based on CT scans. medRxiv. doi:10.1101/2020.04.13.20063941.
- Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4700–4708).
- Iandola, F. N., Han, S., Moskewicz, M. W., Ashraf, K., Dally, W. J., & Keutzer, K. (2016). SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size. arXiv preprint arXiv:1602.07360.
- Islam, K. T., Wijewickrema, S., Collins, A., & O'Leary, S. (2020). A deep transfer learning framework for pneumonia detection from chest X-ray images (pp. 286–293). doi:10.5220/0008927002860293.
- Jaiswal, A., Gianchandani, N., Singh, D., Kumar, V., & Kaur, M. (2020). Classification of the COVID-19 infected patients using DenseNet201 based deep transfer learning. Journal of Biomolecular Structure and Dynamics, 1–8. doi:10.1080/07391102.2020.1788642.
- Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., et al. (2014). Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the 22nd ACM international conference on multimedia (pp. 675–678). doi:10.1145/2647868.2654889.
- Joaquin, A. (2020). Using deep learning to detect pneumonia caused by NCOV-19 from X-ray images. Available: https://towardsdatascience.com/using-deep-learning-to-detect-ncov-19-from-x-ray-images-1a89701d1acd.
- Kermany, D. S., Goldbaum, M., Cai, W., Valentim, C. C., Liang, H., Baxter, S. L., et al. (2018). Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell, 172, 1122–1131.e9. doi:10.1016/j.cell.2018.02.010.
- Koo, H. J., Lim, S., Choe, J., Choi, S.-H., Sung, H., & Do, K.-H. (2018). Radiographic and CT features of viral pneumonia. Radiographics, 38, 719–739. doi:10.1148/rg.2018170048.
- Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097–1105).
- Li, K., Wu, J., Wu, F., Guo, D., Chen, L., & Fang, Z. (2020). The clinical and chest CT features associated with severe and critical COVID-19 pneumonia. Investigative Radiology, 55. doi:10.1097/RLI.0000000000000672.
- Li, Z., Zhang, Z., Qin, J., Zhang, Z., & Shao, L. (2019). Discriminative fisher embedding dictionary learning algorithm for object recognition. IEEE Transactions on Neural Networks and Learning Systems. doi:10.1109/TNNLS.2019.2910146.
- Liang, G., & Zheng, L. (2019). A transfer learning method with deep residual network for pediatric pneumonia diagnosis. Computer Methods and Programs in Biomedicine. doi:10.1016/j.cmpb.2019.06.023.
- Liang, P., & Bose, N. (1996). Neural network fundamentals with graphs, algorithms, and applications. McGraw-Hill.
- Loey, M., Manogaran, G., & Khalifa, N. E. M. (2020). A deep transfer learning model with classical data augmentation and CGAN to detect COVID-19 from chest CT radiography digital images. Preprints 2020040252.
- Malagón-Borja, L., & Fuentes, O. (2009). Object detection using image reconstruction with PCA. Image and Vision Computing, 27, 2–9.
- Rajpurkar, P., Irvin, J., Zhu, K., Yang, B., Mehta, H., Duan, T., et al. (2017). CheXNet: Radiologist-level pneumonia detection on chest X-rays with deep learning. arXiv preprint arXiv:1711.05225.
- Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional networks for biomedical image segmentation. In International conference on medical image computing and computer-assisted intervention (pp. 234–241).
- Shen, H. T., Zhu, X., Zhang, Z., Wang, S.-H., Chen, Y., Xu, X., et al. (2021). Heterogeneous data fusion for predicting mild cognitive impairment conversion. Information Fusion, 66, 54–63.
- Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.
- Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., et al. (2015). Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1–9).
- Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., & Wojna, Z. (2016). Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2818–2826).
- Wang, H., & Xia, Y. (2018). ChestNet: A deep neural network for classification of thoracic diseases on chest radiography. arXiv preprint arXiv:1807.03058.
- Wang, S., Kang, B., Ma, J., Zeng, X., Xiao, M., Guo, J., et al. (2020). A deep learning algorithm using CT images to screen for corona virus disease (COVID-19). medRxiv. doi:10.1101/2020.02.14.20023028.
- Wen, J., Zhang, Z., Zhang, Z., Fei, L., & Wang, M. (2020). Generalized incomplete multiview clustering with flexible locality structure diffusion. IEEE Transactions on Cybernetics. doi:10.1109/TCYB.2020.2987164.
- Wu, Y.-H., Gao, S.-H., Mei, J., Xu, J., Fan, D.-P., Zhao, C.-W., et al. (2020). JCS: An explainable COVID-19 diagnosis system by joint classification and segmentation. arXiv preprint arXiv:2004.07054.
- Zhang, Z., Lai, Z., Huang, Z., Wong, W. K., Xie, G.-S., & Liu, L. (2019). Scalable supervised asymmetric hashing with semantic and latent factor embedding. IEEE Transactions on Image Processing, 28, 4803–4818. doi:10.1109/TIP.2019.2912290.
- Zhang, Z., Liu, L., Luo, Y., Huang, Z., Shen, F., & Shen, H. T. (2020). Inductive structure consistent hashing via flexible semantic calibration. IEEE Transactions on Neural Networks and Learning Systems. doi:10.1109/TNNLS.2020.3018790.
- Zhang, Z., Liu, L., Shen, F., Shen, H. T., & Shao, L. (2018). Binary multi-view clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41, 1774–1782. doi:10.1109/TPAMI.2018.2847335.
- Zhao, J., Zhang, Y., He, X., & Xie, P. (2020). COVID-CT-Dataset: A CT scan dataset about COVID-19. arXiv preprint arXiv:2003.13865.
- Zhao, Z.-Q., Zheng, P., Xu, S.-T., & Wu, X. (2019). Object detection with deep learning: A review. IEEE Transactions on Neural Networks and Learning Systems. doi:10.1109/TNNLS.2018.2876865.