The importance of standardisation – COVID-19 CT & Radiograph Image Data Stock for deep learning purpose

Krzysztof Misztal; Agnieszka Pocha; Martyna Durak-Kozica; Michał Wątor; Aleksandra Kubica-Misztal; Marcin Hartel

doi:10.1016/j.compbiomed.2020.104092

. 2020 Oct 28;127:104092. doi: 10.1016/j.compbiomed.2020.104092

The importance of standardisation – COVID-19 CT & Radiograph Image Data Stock for deep learning purpose

Krzysztof Misztal ^a,^b,^∗, Agnieszka Pocha ^b, Martyna Durak-Kozica ^b, Michał Wątor ^b, Aleksandra Kubica-Misztal ^b, Marcin Hartel ^c

PMCID: PMC7591316 PMID: 33161334

Abstract

With the number of affected individuals still growing world-wide, the research on COVID-19 is continuously expanding. The deep learning community concentrates their efforts on exploring if neural networks can potentially support the diagnosis using CT and radiograph images of patients’ lungs.

The two most popular publicly available datasets for COVID-19 classification are COVID-CT and COVID-19 Image Data Collection. In this work, we propose a new dataset which we call COVID-19 CT & Radiograph Image Data Stock. It contains both CT and radiograph samples of COVID-19 lung findings and combines them with additional data to ensure a sufficient number of diverse COVID-19-negative samples. Moreover, it is supplemented with a carefully defined split.

The aim of COVID-19 CT & Radiograph Image Data Stock is to create a public pool of CT and radiograph images of lungs to increase the efficiency of distinguishing COVID-19 disease from other types of pneumonia and from healthy chest. We hope that the creation of this dataset would allow standardisation of the approach taken for training deep neural networks for COVID-19 classification and eventually for building more reliable models.

Keywords: COVID-19 classification dataset, CT, Radiograph

Highlights

•
The article aims to give a public pool of CT & X-ray lungs images to increase the efficiency of detect COVID-19 disease.
•
Binary and multiclass classifiers were trained revealing that precise labels can improve the system performance.
•
Models trained on COVID-19 CT & X-ray Image Data Stock are more robust than models trained on other investigated databases.

1. Introduction

At the end of 2019, a new coronavirus SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus 2) appeared in Wuhan, which then triggered a global pandemic. SARS-CoV-2-induced pneumonia has been termed COVID-19 (Coronavirus Disease 2019). The main symptoms of COVID-19 are high fever, dry cough, shortness of breath, muscle pain, diarrhea, myalgia, nasal obstruction and runny nose [1]. As of July 15, 2020, a total of 13,690,108 confirmed cases with COVID-19 pneumonia have been reported globally, including 586,265 deaths (4.28%).

The current diagnostic method for COVID-19 is real time reverse transcription – polymerase chain reaction (RT-PCR) [2]. The main limitation of this method is the insufficient amount and quality of the clinical material from which the nucleic acids are isolated [3]. This can result in false negative Results.

Lung CT and radiograph scans are gradually recognised as an alternative for COVID-19 diagnosis. The lungs of people infected with COVID-19 are characterised by consolidation, ground-glass opacification, bilateral involvement, peripheral and diffuse distribution. Lung CT scans can be used to diagnose COVID-19 in patients in acute and convalescent periods of disease [1]. Only patients with severe or permanent lung damage will show changes in CT after recovery, which makes it impossible to determine the percentage of the population that has undergone the disease based on lung scans [4]. The favorable aspects of CT scanners are their availability in many hospitals and the short amount of time required to obtain the Results estimated to be around 15 min. The use of CT for initial diagnostics might significantly increase testing capabilities. On the other hand, the imaging costs are relatively high, which may limit the use of CT for COVID-19 diagnostics. Moreover, the use of CT for COVID-19 diagnostics requires thorough cleaning of the equipment between examinations and a large surface of contact increases the risk of infection, compared to the RT-PCR method performed in sterile conditions [5]. Despite the large number of publications indicating high sensitivity and specificity of CT, the radiologists’ position from American College of Radiology (ACR) advises against putting lung CT on the first line of COVID-19 diagnostics [6].

The advantages of radiograph scans for COVID-19 diagnostics include greater availability of radiographs, lower radiation doses to which the patient is subjected and a short scanning time.

Recently, both CT and radiograph scans have been shown to enable training models which achieve promising Results in the COVID-19 classification task [7,8].

Considering the advantages and disadvantages of both methods we decided to create a database containing both CT and radiograph images.

Currently, the most popular datasets for COVID-19 classification are COVID-CT [9] and COVID-19 Image Data Collection [10]. These datasets contain images of CT and radiograph chest scans of individuals affected with COVID-19 as well as of patients not affected with COVID-19.

COVID-19 Image Data Collection contains images of both CT and radiograph scans. The number of CT scans is insufficient for training deep models. The number of radiograph images is higher but there is not enough negative samples. Moreover, this dataset does not define a data split.

COVID-CT concentrates on CT scans and defines a data split. However, it provides only a rough categorisation of samples into COVID-19-positive and negative cases, where negative cases can be images of healthy individuals or patients with a different disease.

Training neural networks on these datasets requires including samples from additional data sources such as common bacterial pneumonia [11] or lung nodule analysis [12,13].

Apostolopoulos and Mpesiana [14] used a MobileNet v2 [15] pre-trained on ImageNet [16] for fine-tuning on two datasets which were created using samples from COVID-19 Image Data Collection [10], COVID-19 X-ray collection available on kaggle [17], and a dataset containing radiograph scans of common bacterial pneumonia [11]. They achieved sensitivity of 98% and specificity of 96% on the dataset which included both common bacterial pneumonia and viral pneumonia cases as distractors for the COVID-19 class, and sensitivity of 99% and specificity of 97% on the dataset which included only common bacterial pneumonia cases.

Zhao et al. [9] pre-trained a DenseNet [18] on ChestX-ray14 [19] and fine-tuned it on COVID-CT. They achieve AUC of 0.82.

He et al. [7] used models pre-trained on ImageNet, which were further pre-trained using contrastive self-learning [20] first on LUNA dataset [12,13] and then on COVID-CT, followed by fine-tuning on COVID-CT. This methodology allowed them to achieve AUC of 0.94 with DenseNet-169.

The huge variety of scenarios in which the models are evaluated prevents any comparison between them. As a result, it is difficult to tell which design choices contribute to improved performance of some models and to use this knowledge to build incrementally more reliable solutions.

In this work, we propose COVID-19 CT & Radiograph Image Data Stock, which combines data from multiple sources into a single dataset. The advantages of COVID-19 CT & Radiograph Image Data Stock include:

•
a large number of both CT and radiograph scans of COVID-19 class
•
a large number of negative samples in both modi
•
the exact class of the negative samples is known
•
the source of each sample is known
•
a data split is defined.

Using COVID-19 CT & Radiograph Image Data Stock does not require employing any additional data sources. We hope that this dataset will allow for better understanding of the influence of individual choices on the final performance of COVID-19 classification models.

To give a better insight into benefits of using COVID-19 CT & Radiograph Image Data Stock for training neural networks, we compare the performance of several popular architectures pre-trained on ImageNet [16] when trained on COVID-CT, COVID-19 Image Data Collection and COVID-19 CT & Radiograph Image Data Stock in multiple scenarios and show that models trained on COVID-19 CT & Radiograph Image Data Stock achieve better Results both in case of CT and radiograph data.

Our main contributions are as follows

1.
We build a rich and self-contained database for COVID-19 classification,
2.
We train several neural networks using COVID-19 CT & Radiograph Image Data Stock to provide baseline benchmarks,
3.
We compare the models trained on COVID-19 CT & Radiograph Image Data Stock with the models trained on COVID-19 Image Data Collection and COVID-CT and show that models trained on COVID-19 CT & Radiograph Image Data Stock are more robust,
4.
We show that using a precise class information helps to improve the model's ability to distinguish between COVID-19-positive and negative samples.

The rest of this work is organised as follows: in section 2 we shortly characterize COVID-CT and COVID-19 Image Data Collection, and describe in detail how was COVID-19 CT & Radiograph Image Data Stock created. In section 3, we describe the evaluation of models trained on each of the datasets and in section 4, we present the Results. In section 5, we conclude the paper.

2. COVID-19 CT & Radiograph Image Data Stock

In this section, we briefly characterize COVID-CT [9] and COVID-19 Image Data Collection [10] shortly discussing their strong and weak points and describe in detail how the proposed COVID-19 CT & Radiograph Image Data Stock was created.

COVID-CT. COVID-CT [9] is a dataset containing images derived from over 750 preprints on COVID-19. The images present chest CT scans in axial plane and are in png format. The task is to classify images as belonging to COVID-19-positive or negative class. This dataset has a defined split which allows for comparison between the models and reproducibility of the Results. However, it provides only a rough categorisation of samples into COVID-19-positive and negative cases, where negative cases can be images of healthy individuals or patients with a different disease.

COVID-19 Image Data Collection. COVID-19 Image Data Collection [10] is a dataset containing images of patients with COVID-19, patients with COVID-19 and acute respiratory distress syndrome (ARDS), and images of patients without COVID-19 but with other diseases. The images present CT and radiograph scans of lungs and are in jpg or png format. Each image is accompanied with additional data which describes image characteristic (such as view or modality) and patient characteristic (such as age or survival), however, most of these features are present only for some of the samples. The possible tasks include binary classification of COVID-19-positive and negative patients, and multi-class classification of an exact disease. This dataset does not provide a data split.

COVID-19 CT & Radiograph Image Data Stock. The aim of COVID-19 CT & Radiograph Image Data Stock is to create a public pool of CT and radiograph images of lungs to increase the efficiency of distinguishing COVID-19 from other types of pneumonia and from healthy lungs. We hope this can help to prepare a “ground” for distinguishing between newly discovered and already known viruses and bacteria strains causing pneumonia in order to improve diagnostics in the event of subsequent pandemics. For this reason we included COVID-19-negative samples of several classes which include healthy chest (negative control) and various types of pneumonia (bacterial, fungal, viral).

The images which constitute COVID-19 CT & Radiograph Image Data Stock were compiled from public sources. Most of the images come from websites with image collections and about 150 images were collected from online publications. The list of all sources is presented in Table 1 .

Table 1.

Sources of images in COVID-19 CT & Radiograph Image Data Stock.

diagnosis	CT	radiograph
healthy	Radiopaedia	[11]
COVID-19	Radiopaedia	SIRM, COVID-19 Resourse site for Imaging and Radiology, EURORAD, Radiopaedia, Radiology Assistant, Cases RSNA, APP Fig. 1, RAD2share, Yxppt, Fig. 1 COVID-19 Chest X-ray Dataset Initiative^a [[21], [22], [23], [24], [25], [26], [27], [28], [29], [30], [31], [32], [33], [34], [35], [36], [37], [38], [39], [40], [41], [42], [43], [44], [45], [46]],
bacterial pneumonia	Radiopaedia [[47], [48], [49], [50]],	Radiopaedia [11],
viral pneumonia	Radiopaedia [51,52],	[11,53,54]
fungal pneumonia	Radiopaedia [52],	wikipedia, Radiopaedia
ARDS		wikipedia, Radiopaedia

	CT			radiograph
	coronal	axial	total	sagittal	coronal	total
healthy chest	417	1270	1687	0	1583	1583
COVID-19	2399	5671	8070	10	323	333
bacterial pneumonia	58	249	307	5	2801	2806
viral pneumonia	5	39	44	0	1509	1509
fungal pneumonia	215	1068	1283	3	12	15
ARDS	0	0	0	0	4	4

Attribute	Description
patient ID	internal identifier
file name	name of the file including extension
type of image	radiograph or CT
section of image	sagittal, axial or coronal
diagnosis	healthy chest, COVID-19, bacterial pneumonia, viral pneumonia, fungal pneumonia, or ARDS
presence of marks	presence of marks marked by radiologist
group train/valid/test	belonging to the train, valid or test group
lung presence	the entire surface of the lungs are visible, invisible lungs or only part of the lungs are visible
origin	URL of the paper or website where the image came from
suitable/not suitable for diagnosis	information if image is suitable or not for diagnosis

	CT		radiograph
	minimum	maximum	minimum	maximum
width/height	107/85	2354/2313	156/157	4300/4298

	COVID-19-positive	COVID-19-negative	total
full dataset CT	43	1	44
full dataset radiograph	253	63	316
after cleaning radiograph	233	56	289

	COVID-19-positive	COVID-19-negative	total
dataset radiograph	333	5917	6250
dataset CT full	8051	3228	11,279
CT after cleaning	3980	2016	5996

binary	COVID-19	COVID-19	total
binary	positive	negative	total
train	231	4143	4374
validation	53	887	940
test	49	887	936

multiclass	COVID-19	bacterial	healthy	viral	total
multiclass	positive	pneumonia	chest	pneumonia	total
train	231	1966	1108	1059	4364
validation	53	420	238	225	932
test	49	420	237	225	935

	Precision	Recall	F1 score	Accuracy	AUC
COVID-CT
ResNet-18	0.57	0.77	0.65	0.61	0.61
ResNet-50	0.87	0.76	0.81	0.83	0.83
DenseNet-169	0.8	0.67	0.73	0.76	0.76
WideResNet-50	0.85	0.80	0.82	0.83	0.83
DenseNet-121⁺	0.67	0.04	0.08	0.53	0.51
COVID-19 CT & Radiograph Image Data Stock
ResNet-18	0.86	0.71	0.78	0.74	0.75
ResNet-50	0.73	0.59	0.65	0.59	0.59
DenseNet-169	0.78	0.72	0.75	0.69	0.67
WideResNet-50	0.77	0.99	0.87	0.81	0.72
DenseNet-121⁺	0.69	0.67	0.68	0.59	0.55

	Precision	Recall	F1 score	Accuracy	AUC
COVID-19 Image Data Collection
ResNet-18	0.6	0.75	0.67	0.62	0.63
ResNet-50	0.60	0.75	0.67	0.62	0.63
DenseNet-169	0.64	1	0.78	0.72	0.72
WideResNet-50	0.62	0.94	0.75	0.69	0.69
DenseNet-121⁺	0.5	0.88	0.64	0.5	0.5
COVID-19 CT & Radiograph Image Data Stock
ResNet-18	0.64	1.00	0.78	0.97	0.98
ResNet-50	0.55	1.00	0.71	0.96	0.98
DenseNet-169	0.71	0.96	0.82	0.98	0.97
WideResNet-50	0.73	0.96	0.83	0.98	0.97
DenseNet-121⁺	0.36	0.94	0.52	0.91	0.92

	Precision	Recall	F1 score	Accuracy	Binary	Multiclass
	Precision	Recall	F1 score	Accuracy	AUC	AUC
CT
ResNet-18	0.82	0.83	0.82	0.73	0.72	0.85
ResNet-50	0.80	0.63	0.70	0.59	0.65	0.74
DenseNet-169	0.78	0.77	0.77	0.69	0.65	0.82
WideResNet-50	0.74	1.00	0.85	0.64	0.99	0.87
DenseNet-121⁺	0.76	0.82	0.79	0.70	0.64	0.83
Radiograph
ResNet-18	0.70	1.00	0.82	0.65	0.99	0.90
ResNet-50	0.67	1.00	0.80	0.62	0.99	0.89
DenseNet-169	0.74	1.00	0.85	0.58	0.99	0.86
WideResNet-50	0.74	1.00	0.85	0.64	0.99	0.87
DenseNet-121⁺	0.47	0.94	0.63	0.62	0.94	0.85

	COVID-19-positive	COVID-19-negative	total
train	191	234	425
validation	60	58	118
test	98	105	203

binary	COVID-19-positive	COVID-19-negative	total
train	2749	1522	4271
validation	626	270	896
test	605	320	925

multiclass	COVID-19-positive	fungal pneumonia	healthy chest	total
train	2749	653	773	4175
validation	626	127	130	883
test	605	139	151	895

	lr	random rotation	brightness	horizontal flip	center crop	contrast
COVID-19 CT & Radiograph Image Data Stock multiclass Radiograph
ResNet-18	0.001		+	+
ResNet-50	0.001
DenseNet-169	0.001		+	+
WideResNet-50	0.00001	+		+		+
DenseNet-121⁺	0.001	+
COVID-CT
ResNet-18	0.001		+	+	+	+
ResNet-50	0.0001		+			+
DenseNet-169	0.0001	+
WideResNet-50	0.0001		+	+		+
DenseNet-121⁺	0.00001	+

PERMALINK

The importance of standardisation – COVID-19 CT & Radiograph Image Data Stock for deep learning purpose

Krzysztof Misztal

Agnieszka Pocha

Martyna Durak-Kozica

Michał Wątor

Aleksandra Kubica-Misztal

Marcin Hartel

Abstract

Highlights

1. Introduction

2. COVID-19 CT & Radiograph Image Data Stock

Table 1.

Table 2.

Fig. 1.

Fig. 2.

Table 3.

Table 4.

Fig. 3.

Fig. 4.

Table 5.

3. Evaluation methodology

3.1. Data preparation

Table 6.

Table 7.

Table 8.

Table 9.

Table 10.

Table 11.

3.2. Models

3.3. Metrics

4. Results

Table 12.

Table 13.

Table 14.

Fig. 5.

Table 15.

Table 16.

Table 17.

5. Conclusion

Declaration of competing interest

Acknowledgement

Footnotes

Appendix A.

Table 18.

Table 19.

Table 20.

Appendix B.

Fig. 6.

Fig. 7.

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases