Journal of Diabetes and Metabolic Disorders. 2024 Sep 20;23(2):2289–2314. doi: 10.1007/s40200-024-01497-1

Deep learning based binary classification of diabetic retinopathy images using transfer learning approach

Dimple Saproo¹, Aparna N Mahajan², Seema Narwal³
PMCID: PMC11599653  PMID: 39610484

Abstract

Objective

Diabetic retinopathy (DR) is a common complication of diabetes and a leading cause of blindness worldwide. Detecting diabetic retinopathy at an early stage is crucial for preventing vision loss. In this work, a deep learning-based binary classification of DR images is proposed to classify DR images as healthy or unhealthy. Twenty pre-trained networks have been fine-tuned using a transfer learning approach on a robust dataset of diabetic retinopathy images. The combined dataset has been collected from three established databases of diabetic patients, annotated by experienced ophthalmologists as healthy or unhealthy retinal images.

Method

This work improves the robustness of the models by pre-processing the DR images with a denoising algorithm, normalization, and data augmentation. Three robust datasets of diabetic retinopathy images, named DRD-EyePACS, IDRiD, and APTOS-2019, have been selected, and a combined diabetic retinopathy image dataset has been generated for the exhaustive experiments. The datasets have been divided into training, testing, and validation sets, and classification accuracy, sensitivity, specificity, precision, F1-score, and ROC-AUC are used to assess network performance. The present work selects 20 different pre-trained networks from three categories: series, DAG, and lightweight.

Results

This study uses data augmentation and normalization of the pre-processed data to mitigate overfitting. From the exhaustive experiments, the three best pre-trained networks have been selected based on the best classification accuracy in each category. It is concluded that the trained ResNet101 model, from the DAG category, identifies diabetic retinopathy accurately from retinal images in all cases, achieving 97.33% accuracy.

Conclusion

Based on the experimental results, the proposed ResNet101 model helps healthcare professionals detect retinal disease early and provides practical solutions to diabetes patients. It also gives patients and experts a second opinion for the early detection of diabetic retinopathy.

Keywords: Series, DAG, Lightweight, Pre-trained networks, Classification accuracy

Introduction

The most common eye conditions that cause blindness or reduced vision are diabetic retinopathy, cataracts, glaucoma, and age-related macular degeneration (AMD) [1]. Diabetes is common all over the world and is ranked seventh among deadly diseases according to a World Health Organization (WHO) report [2–5]. The number of cases and the incidence of diabetes have been increasing over the past few decades, with an estimated 422 million people living with the disease [3–6]. Approximately 62 million Indians aged 25–75 suffer from diabetic retinopathy, a figure expected to rise to 102 million by 2030 [15]. Diabetic retinopathy is common among people with diabetes: high blood sugar damages the retina [6], and the blood vessels leak, swell, and fail to pass blood through the retina, causing abnormalities. Diabetic retinopathy damages the retina as a complication of diabetes mellitus and is the leading cause of blindness [7–10]. Diabetic retinopathy has been classified into two stages: (a) proliferative diabetic retinopathy (PDR) and (b) non-proliferative diabetic retinopathy (NPDR). NPDR is further divided into three classes: (a) mild, (b) moderate, and (c) severe [11–16]. The advanced form of diabetic retinopathy is PDR, caused by the development of aberrant blood vessels or swelling in the retina [14]. The first stage of DR is mild NPDR, during which the patient does not notice any changes in vision or eye condition [15]. The blood vessel walls of the retina dilate and leak, leading to microaneurysms, i.e., small lumps protruding from the vessel walls [16–19]. Generally, the minute red spots or yellow circles that appear in the eye characterize mild, moderate, or severe NPDR. Moderate NPDR is distinguished from mild NPDR by more significant damage to the retina's blood vessels: the vessels develop microscopic balloons or swellings with microaneurysms, and fluid leaks from the retina or bleeds [20–24]. Severe NPDR is a serious condition characterized by even greater damage to the retinal blood vessels than mild and moderate NPDR [24–26].

The exhaustive literature survey shows that deep learning-based pre-trained networks with a transfer learning approach have been widely used for classifying diabetic retinopathy images [1–42]. Fundus retina images are widely used to detect the abnormalities of diabetic retinopathy. In this work, a robust dataset of DR images has been prepared to classify images into binary classes [27]. PDR and NPDR are considered unhealthy eye conditions, and non-diabetic retinopathy is considered a healthy eye fundus image [28]. In unhealthy eye images, the retinal vessels become irregular in shape, size, and diameter, whereas in healthy eye images they remain regular [29–33]. Three benchmark datasets of diabetic retinopathy images have been considered in this work for the experiments. Examples of healthy and diabetic (unhealthy) retinal images are shown in Fig. 1.

Fig. 1 Examples of healthy and diabetic (unhealthy) retinal fundus images

In the present work, the raw DR images have been pre-processed by (a) resizing and (b) removing Gaussian and salt-and-pepper noise with a suitable filtering method [30–33]. The quality of the DR images has been enhanced by selecting an optimal filtering algorithm [33–37]. The optimal filtering algorithm has been chosen from a pool of filtering categories in terms of smoothing homogeneous regions while preserving the edges of the DR images [38]. This work uses the structure and edge preservation index to evaluate the performance of the filtering algorithms on DR images [37–40]. The pre-trained networks have been divided into three categories: (a) series, (b) DAG (residual and inception modules), and (c) lightweight networks [35–42]. Twenty pre-trained networks have been selected from a robust pool of these three categories and fine-tuned using the transfer learning approach [40–42]. Classification accuracy, sensitivity, specificity, precision, F1-score, and ROC-AUC are used to evaluate network performance [37–42].

Table 1 presents an exhaustive literature review on the classification of diabetic retinopathy images using deep learning-based pre-trained networks and the transfer learning approach.

Table 1.

An exhaustive literature review on the classification of diabetic retinopathy images using deep learning-based pre-trained networks and the transfer learning approach

Investigator(s) | Year | Pre-processing Method | Dataset (Classes) | No. of Images | DL-Based Pre-Trained Model | Performance
Gulshan et al. [1] | 2016 | Resizing and normalization | EyePACS & MESSIDOR-2 | – | DCNN | Sen. 97.5%
Chandrakumar et al. [2] | 2016 | Resizing and contrast enhancement | EyePACS, DRIVE | – | DCNN | Acc. 94%
Zhou, L. et al. [3] | 2017 | Contrast enhancement | Three datasets | – | Self-designed architecture | AUC 0.928
Dutta, S. et al. [4] | 2018 | Contrast enhancement | EyePACS (5 class) | 50,000 | VGG16 | Acc. 78.3%
Junjun, P. et al. [5] | 2018 | Contrast enhancement | EyePACS (5 class) | 35,126 | ResNet18 | Acc. 78.4%
Kassani, S. H. et al. [6] | 2019 | Resizing and normalization | APTOS 2019 (5 class) | 3662 | Xception | Acc. 83.09%
Challa, U. K. et al. [7] | 2019 | Gaussian filters | Kaggle dataset (5 class) | 33,000 | Pre-trained networks | Acc. 86.64%
Qummar, S. et al. [8] | 2019 | Scaling and resizing | Kaggle dataset | 5608 | 5 pre-trained networks | Sp. and F1-score
Bhardwaj, C. et al. [9] | 2020 | Contrast enhancement | MESSIDOR (4 class) | 1200 | QIV model (Inception-V3) | Acc. 93.33%
Saxena, G. et al. [10] | 2020 | Resizing and augmentation | EyePACS | 88,702 | InceptionResNet, ResNet and Inception | AUC 0.927
Yusaku Katada et al. [11] | 2020 | DA | EyePACS | 35,126 (3508 selected) | Inception-v3 | Sen. 81.5% and 90.8%
Ali Usman et al. [12] | 2020 | Resizing, data augmentation | 7 online datasets | 2680 | Inception-v3, ResNet50 and AlexNet | Acc. 85.2%
Wejdan L. Alyoubi et al. [13] | 2021 | CLAHE, cropping, and DA | DDR and APTOS-2019 | 47,870 | EfficientNetB0 | Acc. 89%
Bhardwaj, C. et al. [14] | 2021 | Contrast enhancement | MESSIDOR (4 class) | 1200 | QEIRV-2 model (Inception-V3) | Acc. 93.3%
Chen, P. N. et al. [15] | 2021 | Resizing and grayscale | EyePACS | 88,702 | NASNet-Large | Acc. 81.60% & 92.5%
San-Li Yi et al. [16] | 2021 | Resizing and augmentation | APTOS 2019 (5 class) | 3662 | RA-EfficientNet | Acc. 93.55%
Z. Khan et al. [17] | 2021 | Scaling and resizing | EyePACS | 88,702 | VGG16 and VGG-NiN | Sp. 91%
Sraddha Das et al. [18] | 2021 | Adaptive histogram equalization | DIARETDB1 | – | CNN | Acc. 98.7%
AbdelMaksoud et al. [19] | 2022 | Resizing and augmentation | 4 datasets | 39,301 | E-DenseNet | Acc. 91.3%
Kobat, S. G. et al. [20] | 2022 | Resizing | NDRD and APTOS 2019 | 2355 and 3662 | DenseNet201 | Acc. 87.43% & 84.90%
Mungloo-Dilmohamud, Z. et al. [21] | 2022 | Rescaling and augmentation | APTOS 2019 (5 class) | 3662 | VGG16, ResNet50, DenseNet169 | Acc. 82%
Al-Omaisi Asia et al. [22] | 2022 | Cropping, resizing, and augmentation | XHO dataset (5 class) | 1607 | ResNet50, ResNet101 and VGG16 | Acc. 80.88%
Sambit S. Mondal et al. [23] | 2022 | CLAHE and DA | APTOS 2019 (5 class) | 3662 | ResNeXt and DenseNet | Acc. 86.08%
Yasashvini, R. et al. [24] | 2022 | Wiener filter | APTOS 2019 | 3662 | ResNet & DenseNet | Acc. 96.22%
Dayana, A. M. et al. [25] | 2022 | ADF | Local | – | AFU-NET | –
Oulhadj, M. et al. [26] | 2022 | Scaling and resizing | APTOS 2019 | 3662 | DenseNet-121, Xception, Inception-v3 & ResNet-50 | Acc. 85.28%
Jabbar, M. K. et al. [27] | 2022 | CLAHE, DA & resizing | EyePACS | 35,126 | VggNet | Acc. 96.6%
Menaouer, B. et al. [28] | 2022 | Scaling and resizing | APTOS-2019, Messidor-2 & local public DR | 5584 | VGG16 and VGG19 | Acc. 90.6%
Fayyaz, A. M. et al. [29] | 2023 | Resizing and augmentation | ODIR (4 class) | – | AlexNet and ResNet-101 with SVM | Acc. 93%
Dolly Das et al. [30] | 2023 | Resizing and augmentation | EyePACS (5 class) | 35,126 | 19 pre-trained networks | Acc. 79.11%
C. Mohanty et al. [31] | 2023 | Cropping and resizing | APTOS 2019 (5 class) | 3662 | DenseNet121 | Acc. 97.30%
Pradeep Kumar Jena et al. [32] | 2023 | CLAHE | APTOS and MESSIDOR | 3662 & 1200 | Self-designed architecture with SVM | Acc. 98.6% and 91.9%
Bhimavarapu, U. et al. [33] | 2023 | CLAHE and histogram equalization | APTOS and Kaggle | 3662 & 35,126 | 5 pre-trained networks | Acc. 98.32% & 98.71%
Islam, N. et al. [34] | 2023 | Gaussian filter | APTOS, IDRiD | 3662 & 516 | Xception | Acc. 99.04% (APTOS), 94.17% (IDRiD)
Sajid, M. et al. [35] | 2023 | DA and image enhancement | Public dataset | 32,800 | DR-NASNet | Acc. 96%
Alwakid, G. et al. [36] | 2023 | CLAHE & data augmentation | APTOS 2019 | 3662 | DenseNet-121 | Acc. 98.36%
Vijayan, M. et al. [37] | 2023 | Scaling | DDR, IDRiD, and APTOS | 13,673, 516 & 3662 | 6 pre-trained networks | Acc. 82.5%
Alwakid, G. et al. [38] | 2023 | CLAHE & DA | APTOS 2019 | 3662 | DenseNet-121 | –
Guefrachi, S. et al. [39] | 2024 | Resizing and augmentation | APTOS 2019 | 3662 | ResNet152-V2 | Acc. 100%
Sunkari, S. et al. [40] | 2024 | Contrast and brightness | APTOS and Kaggle | 3662 & 35,126 | 3 pre-trained networks | Acc. 93.51%
Macsik, P. et al. [41] | 2024 | CLAHE & DA | DDR and APTOS 2019 | 3662 | Xception & EfficientNetB4 | Acc. –
Shakibania et al. [42] | 2024 | CLAHE & DA | APTOS 2019 | 3662 | 4 pre-trained networks | Acc. 96.44%

T & V: training & validation; ODIR: Ocular Disease Intelligent Recognition; NDRD: New Diabetic Retinopathy Dataset; APTOS: Asia Pacific Tele-Ophthalmology Society; ADF: anisotropic diffusion filter; AFU-NET: attention-based fusion network; DA: data augmentation

From Table 1, it is observed that the APTOS 2019, Diabetic Retinopathy Dataset, IDRiD, and EyePACS datasets were used in most studies, with binary and multi-class classification performed by pre-trained networks tuned via transfer learning [35–42]. The highest reported accuracy is 100%, achieved by Guefrachi, S. et al. [39]. It is noted that the interpretability of pre-trained networks improves classification accuracy and the explainability of DR detection [30–42]. Only 3 of the 42 studies (7%) used pre-trained networks purely as feature extractors with separate classifiers; in those studies, SVM, KNN, and PNN classifiers were used to classify features extracted by CNN architectures [1–42]. Only 2 of the 42 studies (approximately 5%) used self-designed CNN architectures to classify diabetic retinopathy images, and only 6 of the 42 used binary classification [35–42]. Gaussian filtering, CLAHE, data augmentation, histogram equalization, scaling, and resizing have been widely used as pre-processing steps [37–42]. According to the literature, noise is removed from an image by convolving it with a Gaussian kernel while maintaining the underlying structures [7]; this is an easy way to reduce Gaussian noise without sacrificing performance [7, 28, 36–42].

In the present work, the pre-trained networks have been divided into three categories: series, DAG (directed acyclic graph), and lightweight. Selecting the right category of network for classifying medical images is crucial. This work provides a platform for classifying DR images into binary classes by selecting the optimal pre-trained network together with its category; accordingly, 20 pre-trained networks have been divided into the three categories and used for binary classification of DR images. Lightweight pre-trained networks provide effective, scalable, and readily available options for deploying deep learning models in resource-constrained situations, whereas DAG and series architectures help manage and optimize complex workflows with explicit dependencies.

In the present work, the optimal denoising algorithm (a Gaussian filter) and the image resizing have been chosen by optimizing the performance evaluation parameters while maintaining the aspect ratio of the DR images. The denoising and resizing methods have been selected from a robust pool of filtering and resizing algorithms. The structure and edge-preserving index (SEPI) [43–50] has been evaluated to measure the performance of the denoising filters, and the Gaussian filter performed outstandingly in DR image classification. The effect of the denoising algorithm has been evaluated based on the classification performance parameters used in this work, and its impact is reported.

Workflow adopted for classification of diabetic retinopathy images

The workflow adopted in this work for classifying diabetic retinopathy images using fine-tuned pre-trained networks with a transfer learning approach is presented in Fig. 2.

Fig. 2 The workflow adopted in this work for the classification of diabetic retinopathy images using fine-tuned pre-trained networks with the transfer learning approach

The exhaustive literature review shows that very few studies report binary classification of DR images, i.e., differentiating retinas into two classes: no sign of diabetic retinopathy and signs of diabetic retinopathy [1–42]. It is also concluded that the feature engineering step is eliminated by deep learning and pre-trained networks [44], and that hand-crafted feature engineering yields significantly poorer results than deep learning. The conventional feature extraction and selection process is replaced by a convolution-based network followed by ReLU, normalization, and max pooling [43–47].
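As an illustration of such a block, the following minimal PyTorch sketch (an assumed framework; the paper does not name one) chains convolution, ReLU, normalization, and max pooling; the channel sizes are hypothetical:

```python
import torch.nn as nn

# Illustrative convolutional block of the kind that replaces hand-crafted
# feature engineering: convolution -> ReLU -> normalization -> max pooling.
# The channel counts (3 in, 32 out) are placeholders, not the paper's values.
conv_block = nn.Sequential(
    nn.Conv2d(3, 32, kernel_size=3, padding=1),  # learn local features
    nn.ReLU(inplace=True),                       # non-linear activation
    nn.BatchNorm2d(32),                          # normalize activations
    nn.MaxPool2d(kernel_size=2),                 # down-sample spatially
)
```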

From Fig. 2, it is noted that the classification of DR images involves the following steps: (a) benchmark diabetic retinopathy dataset collection, (b) data pre-processing and preparation, (c) dataset splitting into training, validation, and testing sets, (d) data augmentation, (e) selection of a pre-trained network, (f) network training, (g) hyperparameter tuning using transfer learning, (h) evaluation parameters, (i) validation and interpretation of the trained model, and (j) development and deployment.

Original benchmark diabetic retinopathy dataset collection

The dataset for diabetic retinopathy images has been collected from three benchmark datasets consisting of images annotated by experienced ophthalmologists and marked as healthy or unhealthy (affected by diabetic disease) retinal images [51–53]. Sample images of diabetic retinopathy at different stages are shown in Fig. 3.

Fig. 3 Diabetic retinopathy stages: (a) normal retina, (b) mild diabetic retinopathy, (c) moderate diabetic retinopathy, (d) severe diabetic retinopathy, (e) proliferative diabetic retinopathy, (f) macular edema

The first dataset, the Diabetic Retinopathy Dataset (DRD-EyePACS) [51], is available on Kaggle and consists of 2750 DR images: 1000 belonging to the healthy class and 1750 to the unhealthy class. All images are 256 × 256 pixels.

The second dataset, IDRiD (Indian Diabetic Retinopathy Image Dataset) [52], consists of 516 DR images: 168 belonging to the healthy class and 348 to the unhealthy class (Table 2). All images are 4288 × 2848 pixels.

The third dataset, APTOS (Asia Pacific Tele-Ophthalmology Society)-2019 [53], contains 3662 DR images collected from many participants in rural India. The dataset was prepared by India's Aravind Eye Hospital [35–42]. The fundus images were captured over a long period under various settings and conditions. Medical professionals examined and labeled the samples according to the International Clinical Diabetic Retinopathy Disease Severity Scale (ICDRDSS). The dataset consists of 1805 healthy and 1857 unhealthy samples. The details of the diabetic retinopathy datasets are shown in Table 2.

Table 2.

The details of the diabetic retinopathy datasets

Class | DRD-EyePACS Dataset [51] | IDRiD Dataset [52] | APTOS-2019 Dataset [53] | Combined Dataset
Healthy (no diabetic retinopathy) | 1000 | 168 | 1805 | 2973
Unhealthy (diabetic retinopathy) | 1750 | 348 | 1857 | 3955
Total | 2750 | 516 | 3662 | 6928

As Table 2 indicates, four experiments were performed in this work. The DRD-EyePACS, IDRiD, and APTOS-2019 datasets were used for experiments 1, 2, and 3, respectively, while experiment 4 combined all three datasets into a single robust dataset. Because no single dataset used by medical professionals is robust enough on its own, all datasets were merged into two classes to form a collective dataset of 6928 DR images: 2973 in the healthy class and 3955 in the unhealthy class.

Data pre-processing and preparation module

In the present work, the data pre-processing and preparation module has been divided into four steps: (a) resizing of DR images, (b) image enhancement using a suitable denoising algorithm, (c) data augmentation, and (d) dataset splitting into training, validation, and testing sets.

Resizing of DR images

The exhaustive literature review concludes that resizing DR images is essential for the analysis and classification of diabetic retinopathy, since deep learning-based pre-trained networks require a standard input size. To meet this requirement, the images have been resized to 256 × 256 in this work while maintaining the aspect ratio.
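A minimal sketch of such aspect-ratio-preserving resizing, assuming OpenCV and zero-padding of the shorter side (the paper does not state its exact resizing implementation):

```python
import cv2
import numpy as np

def resize_keep_aspect(img: np.ndarray, size: int = 256) -> np.ndarray:
    """Resize a colour image to size x size, preserving the aspect ratio
    by scaling the longer side to `size` and padding the rest with black."""
    h, w = img.shape[:2]
    scale = size / max(h, w)
    resized = cv2.resize(img, (int(round(w * scale)), int(round(h * scale))))
    canvas = np.zeros((size, size, 3), dtype=resized.dtype)
    y0 = (size - resized.shape[0]) // 2
    x0 = (size - resized.shape[1]) // 2
    canvas[y0:y0 + resized.shape[0], x0:x0 + resized.shape[1]] = resized
    return canvas
```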

Image enhancement using suitable denoising algorithm

The present work has enhanced DR image quality in a pre-processing step. The exhaustive literature review shows that denoising, contrast adjustment, and sharpening are widely used to enhance the quality of DR images [1–42]. In the present work, an exhaustive pool of filtering and enhancement algorithms was drawn from the (a) linear, (b) non-linear, (c) edge-preserving, and (d) contrast-enhancement categories [7, 35–42]. The Gaussian filter outperformed the algorithms of all categories, so it was used to pre-process the DR images [7].
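In OpenCV, for example, the Gaussian filtering step can be sketched as follows (the kernel size and sigma here are assumptions; the paper does not report its filter parameters):

```python
import cv2

img = cv2.imread("fundus_image.png")          # hypothetical input file
# 5x5 Gaussian kernel; sigma=0 lets OpenCV derive it from the kernel size.
denoised = cv2.GaussianBlur(img, (5, 5), 0)
cv2.imwrite("fundus_image_denoised.png", denoised)
```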

Dataset splitting for training, validation and testing set

Dataset splitting is an essential step in deep learning model development to evaluate the performance of the networks. Initially, the dataset was divided into training and testing subsets, and then the training set was split into validation data and training data. A brief description of diabetic retinopathy datasets splitting for training, validation, and testing set is shown in Table 3.

Table 3.

A brief description of the diabetic retinopathy dataset splits for training and testing

Class | DRD-EyePACS (Train / Test) | IDRiD (Train / Test) | APTOS-2019 (Train / Test) | Combined (Train / Test)
Healthy (no diabetic retinopathy) | 800 / 200 | 134 / 50 | 1605 / 200 | 2505 / 450
Unhealthy (diabetic retinopathy) | 1550 / 200 | 298 / 50 | 1657 / 200 | 3487 / 450
Total DR images | 2350 / 400 | 416 / 100 | 3262 / 400 | 5992 / 900
Dataset total | 2750 | 516 | 3662 | 6928

Table 3 reveals that the training data contains the largest portion of the set; the network learns patterns and relationships from this data. The deep learning hyperparameters are fine-tuned, and training performance is evaluated, using the validation set, which helps prevent overfitting and measures the network's performance at each epoch. The trained model's final performance is calculated on the testing set, which acts as a neutral gauge of how the model behaves on unseen data. The dataset is typically divided into three subsets in ratios chosen according to the dataset's size and the problem's specific requirements.
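A minimal sketch of such a two-stage stratified split with scikit-learn (the 80/10/10 ratios and helper name are illustrative; the paper's actual counts are those of Table 3):

```python
from sklearn.model_selection import train_test_split

def split_dataset(paths, labels, val_frac=0.10, test_frac=0.10, seed=42):
    """Hold out a test set first, then carve a validation set out of the
    remaining training data, stratifying on the class labels each time."""
    train_val_p, test_p, train_val_y, test_y = train_test_split(
        paths, labels, test_size=test_frac, stratify=labels, random_state=seed)
    rel_val = val_frac / (1.0 - test_frac)   # fraction of the remainder
    train_p, val_p, train_y, val_y = train_test_split(
        train_val_p, train_val_y, test_size=rel_val,
        stratify=train_val_y, random_state=seed)
    return (train_p, train_y), (val_p, val_y), (test_p, test_y)
```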

Data augmentation

Balancing a dataset for diabetic retinopathy using data augmentation involves creating additional samples of the minority class (e.g., severe diabetic retinopathy) to match the number of samples in the majority class (e.g., no diabetic retinopathy) [33–42]. Data augmentation techniques generate new samples by transforming existing images while preserving their semantic content [40]. The DR image dataset has been randomly shuffled so that each sample type is represented in each subset [41]. To overcome class imbalance, data augmentation balanced the training and validation data and improved network performance [30–40]. In the present work, data augmentation has been applied to the training set of all three datasets, bringing the healthy and unhealthy classes to approximately 4000 images each per dataset. The combined dataset was then prepared by merging all classes into a robust dataset for classifying DR images. The multipliers used in the data augmentation operations for each dataset are shown in Table 4.

Table 4.

The multipliers used in the data augmentation operations for each dataset

Class | DRD-EyePACS (Train / Test) | IDRiD (Train / Test) | APTOS-2019 (Train / Test) | Combined (Train / Test)
Healthy (no diabetic retinopathy) | 800 × 5 = 4000 / 200 | 118 × 34 = 4012 / 50 | 1605 × 2.5 = 4012 / 200 | 4000 + 4012 + 4012 = 12,024 / 450
Unhealthy (diabetic retinopathy) | 1550 × 2.6 = 4030 / 200 | 298 × 13.5 = 4023 / 50 | 1657 × 2.40 = 3976 / 200 | 4030 + 4023 + 3976 = 12,029 / 450
Total | 8030 / 400 | 8035 / 100 | 7988 / 400 | 24,053 / 900

The dataset was randomly shuffled to ensure each subset contains a representative sample of the entire dataset, which reduces the risk of bias toward any particular subgroup. In classification tasks, preserving the class distribution between subsets is essential, particularly when working with imbalanced datasets. In the present work, the class subsets have been balanced using data augmentation, ensuring that each subset has an equal distribution of the diabetic retinopathy classes; a sketch of this augmentation-based balancing is given below.
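The following sketch illustrates this balancing with the rotation and flip operations of Table 5 (translation is omitted, and the helper name and target count are hypothetical):

```python
import itertools
import os
import random
from PIL import Image, ImageOps

# Rotation and flipping transforms of the kind listed in Table 5.
AUG_OPS = [
    lambda im: im.rotate(90, expand=True),  # 90-degree rotation
    lambda im: im.rotate(180),              # 180-degree rotation
    ImageOps.mirror,                        # horizontal flip
    ImageOps.flip,                          # vertical flip
]

def balance_class(image_paths, target_count, out_dir):
    """Generate augmented copies of a class until it reaches target_count
    images (e.g., roughly 4000 per class per dataset, as in Table 4)."""
    os.makedirs(out_dir, exist_ok=True)
    needed = target_count - len(image_paths)
    for i, path in enumerate(itertools.cycle(image_paths)):
        if i >= needed:
            break
        augmented = random.choice(AUG_OPS)(Image.open(path))
        augmented.save(os.path.join(out_dir, f"aug_{i}.png"))
```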

The number of augmentation transform operations used as a multiplier for different images is shown in Table 5.

Table 5.

The number of augmentation transform operations used as multipliers for different images

Dataset | Class | Rotation 90° | Rotation 180° | Flip H | Flip V | 90°H | 180°H | 90°V | 180°V | Translation | Multiplier | No. of Images | Total Images
DRD-EyePACS (Dataset 1) | Healthy | × | × | × | × | – | – | – | – | – | 5 | 800 | 4000
IDRiD (Dataset 2) | Healthy | – | – | – | – | – | – | – | – | −13 to 13 | 34 | 118 | 4012
APTOS-2019 (Dataset 3) | Healthy | × | × | × | × | × | × | – | – | 0.5 × (−1) | 2.5 | 1605 | 4012
DRD-EyePACS (Dataset 1) | Unhealthy | × | × | × | × | × | × | – | – | 0.6 × (−1) | 2.6 | 1550 | 4030
IDRiD (Dataset 2) | Unhealthy | – | – | – | – | – | – | – | – | −3 to 3 | 13.5 | 298 | 4023
APTOS-2019 (Dataset 3) | Unhealthy | × | × | × | × | × | × | – | – | 0.4 × (−1) | 2.4 | 1657 | 3976

V: vertical flip, H: horizontal flip

Splitting the diabetic retinopathy dataset into training, testing, and validation sets

The first step in developing a deep learning-based classification model is to divide the DR Dataset into training, testing, and validation sets to ensure the model performs efficiently when applied to new data. Table 6 provides a brief description of the dataset after data augmentation was applied to diabetic retinopathy images, with balanced training, validation, and testing sets, as well as the combined dataset.

Table 6.

A brief description of the dataset after data augmentation was applied to the diabetic retinopathy images, with balanced training, validation, and testing sets

[Table 6 appears as an image in the original publication.]

Table 6 shows that the diabetic retinopathy datasets are considered balanced when the training, validation, and testing sets contain the same number of samples in each class; the combined dataset has been generated accordingly. In deep learning applications, particularly classification, a balanced dataset is important to prevent bias toward the majority class and to guarantee that all classes are represented during training. The performance on the diabetic retinopathy datasets has been compared using classification accuracy, and the best combination of dataset and pre-trained network type has been selected. The performance of each pre-trained network has been calculated using the transfer learning approach, and the effect of denoising has been reported.

Selection of pre-trained network and classification module

Selecting a pre-trained network for diabetic retinopathy depends on the type of dataset, the size of the images, and the availability of pre-trained models, and is essential for classifying diabetic retinopathy images. In the present work, the pre-trained networks have been selected from the series, DAG, and lightweight categories; the number of parameters is the criterion used to divide the networks into these categories [44]. Table 7 gives a brief description of each category.

Table 7.

The pre-trained networks selected in each category

S. No. | Pre-Trained Network | Category | Input Image Size | Parameters (Million) | Network Depth
1 | AlexNet | Series | 227 × 227 × 3 | 61.0 | 8
2 | vgg16 | Series | 224 × 224 × 3 | 138 | 16
3 | vgg19 | Series | 224 × 224 × 3 | 144 | 19
4 | darknet19 | Series | 256 × 256 × 3 | 20.8 | 19
5 | darknet53 | Series | 256 × 256 × 3 | 41.6 | 53
6 | inceptionv3 | DAG | 299 × 299 × 3 | 23.9 | 48
7 | densenet201 | DAG | 224 × 224 × 3 | 20.0 | 201
8 | Resnet50 | DAG | 224 × 224 × 3 | 25.6 | 50
9 | Resnet101 | DAG | 224 × 224 × 3 | 44.6 | 101
10 | xception | DAG | 299 × 299 × 3 | 22.9 | 71
11 | inceptionresnetv2 | DAG | 299 × 299 × 3 | 55.9 | 164
12 | nasnetlarge | DAG | 331 × 331 × 3 | 88.9 | –
13 | SqueezeNet | Lightweight | 227 × 227 × 3 | 1.24 | 18
14 | mobilenetv2 | Lightweight | 224 × 224 × 3 | 3.5 | 53
15 | shufflenet | Lightweight | 224 × 224 × 3 | 1.4 | 50
16 | nasnetmobile | Lightweight | 224 × 224 × 3 | 5.3 | –
17 | efficientnetb0 | Lightweight | 224 × 224 × 3 | 5.3 | 82
18 | GoogleNet | Lightweight | 224 × 224 × 3 | 7.0 | 22
19 | googlenet-places365 | Lightweight | 224 × 224 × 3 | 7.0 | 22
20 | resnet18 | Lightweight | 224 × 224 × 3 | 11.7 | 18

Table 7 shows that 5, 7, and 8 pre-trained networks were selected in the series, DAG, and lightweight categories, respectively, for a total of 20. The depth of a pre-trained network is an important factor in classifying DR images.
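For illustration, a subset of these networks could be instantiated with ImageNet weights roughly as follows (PyTorch/torchvision is an assumed framework, and the grouping below simply mirrors Table 7):

```python
import torchvision.models as models

# A subset of the Table 7 networks, grouped by the paper's categories.
NETWORKS = {
    "series": {"alexnet": models.alexnet, "vgg16": models.vgg16,
               "vgg19": models.vgg19},
    "dag": {"inception_v3": models.inception_v3,
            "densenet201": models.densenet201,
            "resnet50": models.resnet50, "resnet101": models.resnet101},
    "lightweight": {"squeezenet": models.squeezenet1_0,
                    "mobilenet_v2": models.mobilenet_v2,
                    "shufflenet": models.shufflenet_v2_x1_0,
                    "googlenet": models.googlenet,
                    "resnet18": models.resnet18},
}

def load_pretrained(category: str, name: str):
    """Instantiate a network from a category with ImageNet weights."""
    return NETWORKS[category][name](weights="DEFAULT")
```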

Assessment parameters used in classification of DR images

The exhaustive literature survey shows that accuracy, sensitivity, specificity, log loss, precision, F-score, overlapping error, boundary-based evaluation, etc., are the performance metrics used by different researchers to evaluate diabetic retinopathy detection algorithms. The present work uses accuracy, sensitivity, specificity, precision, and F1-score to evaluate the classification of DR images. A sample confusion matrix for the classification of DR images is shown in Table 8.

Table 8.

A sample confusion matrix for the classification of DR images

Actual \ Predicted | Healthy (no diabetic retinopathy) | Unhealthy (diabetic retinopathy)
Healthy (no diabetic retinopathy) | TN | FP
Unhealthy (diabetic retinopathy) | FN | TP

True positives (TP) are DR images correctly detected as unhealthy; false negatives (FN) are DR images incorrectly classified as healthy; false positives (FP) are healthy images incorrectly classified as DR; and true negatives (TN) are healthy images correctly classified.

Accuracy is the ratio of the total number of correctly identified to the total number of images taken for the classification. The formula used for the accuracy is shown in Eq. 1.

$$\mathrm{Accuracy}=\frac{TP+TN}{TP+TN+FP+FN}\tag{1}$$

Sensitivity or recall determines the measure of the correctness of patients with DR. The formula used for Sensitivity is shown in Eq. 2.

$$\mathrm{Sensitivity}=\frac{TP}{TP+FN}\tag{2}$$

It is calculated by dividing the number of patients diagnosed with diabetic retinopathy by the total number of affected patients.

Specificity measures whether a person unaffected by DR disease is correctly identified as such. Its formula is shown in Eq. 3, and the formulas for precision and the F-score follow in Eqs. 4 and 5.

$$\mathrm{Specificity}=\frac{TN}{FP+TN}\tag{3}$$

Precision is the ratio of correctly classified DR images to all images classified as DR, so it penalizes healthy images wrongly detected as DR.

$$\mathrm{Precision}=\frac{TP}{FP+TP}\tag{4}$$

The F-Score, also known as the F1-Score, is a metric for how accurate a model is on a given dataset. It is used to evaluate binary classification systems.

$$\text{F-Score}=\frac{2\times \mathrm{Precision}\times \mathrm{Recall}}{\mathrm{Precision}+\mathrm{Recall}}\tag{5}$$

The F1 score is the harmonic mean of precision and recall.
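The five metrics follow directly from the confusion-matrix counts; a small Python sketch (using, as a check, the vgg19 counts later reported in Table 11b):

```python
def metrics_from_confusion(tp: int, tn: int, fp: int, fn: int):
    """Compute the metrics of Eqs. 1-5 from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)                    # Eq. 1
    sensitivity = tp / (tp + fn)                                  # Eq. 2 (recall)
    specificity = tn / (fp + tn)                                  # Eq. 3
    precision = tp / (fp + tp)                                    # Eq. 4
    f1 = 2 * precision * sensitivity / (precision + sensitivity)  # Eq. 5
    return accuracy, sensitivity, specificity, precision, f1

# vgg19 on the pre-processed, augmented DRD-EyePACS test set (Table 11b):
# TN=192, FP=8, FN=22, TP=178 -> accuracy 0.925, as reported.
print(metrics_from_confusion(tp=178, tn=192, fp=8, fn=22))
```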

Implementation details and number of experiments

The selected hyperparameters used to implement experiments are shown in Table 9.

Table 9.

The selected hyperparameters for the implementation of experiments

Hyperparameter | Details
Learning rate | 10⁻⁴
Mini-batch size | 32
Maximum epochs | 30
Optimizer | Adam

The machine used for the implementation of experiments

Component | Details
GPU | NVIDIA GeForce RTX 4060, 8 GB, 3072 CUDA cores
Processor | 12th Gen Intel Core i7
Operating system | Windows 11 Home
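A minimal PyTorch sketch of the transfer-learning setup with these hyperparameters (the framework and the binary-head replacement are assumptions; the paper reports only the values in Table 9):

```python
import torch
import torch.nn as nn
import torchvision.models as models

device = "cuda" if torch.cuda.is_available() else "cpu"

# ResNet101 with ImageNet weights; replace the classifier head for the
# two classes (healthy vs. unhealthy).
model = models.resnet101(weights="DEFAULT")
model.fc = nn.Linear(model.fc.in_features, 2)
model = model.to(device)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)  # Table 9
criterion = nn.CrossEntropyLoss()

def train(model, train_loader, epochs=30):   # mini-batches of 32 (Table 9)
    model.train()
    for _ in range(epochs):
        for images, labels in train_loader:
            images, labels = images.to(device), labels.to(device)
            optimizer.zero_grad()
            loss = criterion(model(images), labels)
            loss.backward()
            optimizer.step()
```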

In the present work, four experiments have been performed for each category of pre-trained network. The experiments carried out to classify DR images are listed in Table 10.

Table 10.

Experiments performed in the classification of DR images

Experiment | Details | Datasets
Experiment 1 | Original DR images without augmentation | DRD-EyePACS, IDRiD, APTOS-2019, and combined dataset
Experiment 2 | Original DR images with augmentation | DRD-EyePACS, IDRiD, APTOS-2019, and combined dataset
Experiment 3 | Pre-processed DR images without augmentation | DRD-EyePACS, IDRiD, APTOS-2019, and combined dataset
Experiment 4 | Pre-processed DR images with augmentation | DRD-EyePACS, IDRiD, APTOS-2019, and combined dataset

Performance evaluation of experiments

In the present work, 20 pre-trained networks have been used to evaluate performance on the three DR image datasets and on their combination, based on the healthy and unhealthy classes. The pre-trained networks have been divided into the series, DAG, and lightweight categories, and accuracy, sensitivity, specificity, precision, and F1-score have been calculated. Original and pre-processed DR images, with and without augmentation, have been used to measure the performance of each pre-trained network category. The performance in each case is as follows:

Performance evaluation metrics using DRD-EyePACS dataset

Several factors, such as the choice of pre-trained network, data pre-processing, fine-tuning with optimal hyperparameters, validation data, and evaluation metrics, are taken into consideration while evaluating the performance of the series, DAG, and lightweight network architecture categories on the DRD-EyePACS Dataset.

Performance of series-based network architectures using DRD-EyePACS dataset

Table 11 shows the performance of series-based networks with and without augmentation using original and pre-processed DR images of the DRD-EyePACS Dataset.

Table 11.

The performance of series-based networks with and without augmentation using original and pre-processed DR images of the DRD-EyePACS Dataset

(a) Using original DRD-EyePACS Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
AlexNet | [136 64; 76 124] | 65 | 0.62 | 0.68 | 0.66 | 0.64 | [176 24; 36 164] | 85 | 0.82 | 0.88 | 0.87 | 0.85
vgg16 | [140 60; 70 130] | 67.5 | 0.65 | 0.70 | 0.68 | 0.67 | [180 20; 36 164] | 86 | 0.82 | 0.90 | 0.89 | 0.85
vgg19 | [152 48; 60 140] | 73 | 0.70 | 0.76 | 0.74 | 0.72 | [186 14; 28 172] | 89.5 | 0.86 | 0.93 | 0.92 | 0.89
darknet19 | [130 70; 66 134] | 66 | 0.67 | 0.65 | 0.66 | 0.66 | [182 18; 36 164] | 86.5 | 0.82 | 0.91 | 0.90 | 0.86
darknet53 | [144 56; 60 140] | 71 | 0.70 | 0.72 | 0.71 | 0.71 | [176 24; 28 172] | 87 | 0.86 | 0.88 | 0.88 | 0.87

(b) Using pre-processed DRD-EyePACS Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
AlexNet | [140 60; 64 136] | 69 | 0.68 | 0.70 | 0.69 | 0.69 | [178 22; 30 170] | 87 | 0.85 | 0.89 | 0.89 | 0.87
vgg16 | [150 50; 66 134] | 71 | 0.67 | 0.75 | 0.73 | 0.70 | [182 18; 32 168] | 87.5 | 0.84 | 0.91 | 0.90 | 0.87
vgg19* | [152 48; 52 148] | 75 | 0.74 | 0.76 | 0.76 | 0.75 | [192 8; 22 178] | 92.5 | 0.89 | 0.96 | 0.96 | 0.92
darknet19 | [144 56; 68 132] | 69 | 0.66 | 0.72 | 0.70 | 0.68 | [176 24; 36 164] | 85 | 0.82 | 0.88 | 0.87 | 0.85
darknet53 | [156 44; 64 136] | 73 | 0.68 | 0.78 | 0.76 | 0.72 | [186 14; 28 172] | 89.5 | 0.86 | 0.93 | 0.92 | 0.89

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

Table 11 shows that the vgg19 network achieved the highest classification accuracy in the series category on the DRD-EyePACS Dataset: 92.5% after augmentation. The vgg19-based network also achieved a sensitivity of 0.89, specificity of 0.96, precision of 0.96, and F1-score of 0.92, with individual class accuracy (ICA) of 192 correctly classified healthy images and 178 correctly classified unhealthy images. The best result is marked with an asterisk in Table 11.

Performance of DAG-based network architectures using DRD-EyePACS dataset

Table 12 shows the performance of DAG-based networks with and without augmentation using original and pre-processed DR images.

Table 12.

The performance of DAG-based networks with and without augmentation using original and pre-processed DR images of the DRD-EyePACS Dataset

(a) Using original DRD-EyePACS Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
inceptionv3 | [138 62; 68 132] | 67.5 | 0.66 | 0.69 | 0.68 | 0.67 | [180 20; 34 166] | 86.5 | 0.83 | 0.90 | 0.89 | 0.86
densenet201 | [152 48; 74 126] | 69.5 | 0.63 | 0.76 | 0.72 | 0.67 | [184 16; 34 166] | 87.5 | 0.83 | 0.92 | 0.91 | 0.87
Resnet50 | [152 48; 62 138] | 72.5 | 0.69 | 0.76 | 0.74 | 0.72 | [182 18; 28 172] | 88.5 | 0.86 | 0.91 | 0.91 | 0.88
Resnet101 | [154 46; 56 144] | 74.5 | 0.72 | 0.77 | 0.76 | 0.74 | [190 10; 24 176] | 91.5 | 0.88 | 0.95 | 0.95 | 0.91
xception | [152 48; 68 132] | 71 | 0.66 | 0.76 | 0.73 | 0.69 | [184 16; 36 164] | 87 | 0.82 | 0.92 | 0.91 | 0.86
inceptionresnetv2 | [150 50; 74 126] | 69 | 0.63 | 0.75 | 0.72 | 0.67 | [180 20; 40 160] | 85 | 0.80 | 0.90 | 0.89 | 0.84
nasnetlarge | [148 52; 74 126] | 68.5 | 0.63 | 0.74 | 0.71 | 0.67 | [188 12; 34 166] | 88.5 | 0.83 | 0.94 | 0.93 | 0.88

(b) Using pre-processed DRD-EyePACS Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
inceptionv3 | [152 48; 62 138] | 72.5 | 0.69 | 0.76 | 0.74 | 0.72 | [184 16; 30 170] | 88.5 | 0.85 | 0.92 | 0.91 | 0.88
densenet201 | [154 46; 60 140] | 73.5 | 0.70 | 0.77 | 0.75 | 0.73 | [184 16; 28 172] | 89 | 0.86 | 0.92 | 0.91 | 0.89
Resnet50 | [158 42; 58 142] | 75 | 0.71 | 0.79 | 0.77 | 0.74 | [188 12; 22 178] | 91.5 | 0.89 | 0.94 | 0.94 | 0.91
Resnet101* | [162 38; 52 148] | 77.5 | 0.74 | 0.81 | 0.80 | 0.77 | [194 6; 20 180] | 93.5 | 0.90 | 0.97 | 0.97 | 0.93
xception | [150 50; 62 138] | 72 | 0.69 | 0.75 | 0.73 | 0.71 | [182 18; 24 176] | 89.5 | 0.88 | 0.91 | 0.91 | 0.89
inceptionresnetv2 | [148 52; 68 132] | 70 | 0.66 | 0.74 | 0.72 | 0.69 | [184 16; 28 172] | 89 | 0.86 | 0.92 | 0.91 | 0.89
nasnetlarge | [142 58; 64 136] | 69.5 | 0.68 | 0.71 | 0.70 | 0.69 | [176 24; 30 170] | 86.5 | 0.85 | 0.88 | 0.88 | 0.86

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

From Table 12, it is concluded that the Resnet101 network achieved the highest classification accuracy in the DAG category on the DRD-EyePACS Dataset: 93.5% after augmentation. The Resnet101-based network also achieved a sensitivity of 0.90, specificity of 0.97, precision of 0.97, and F1-score of 0.93, with individual class accuracy (ICA) of 194 correctly classified healthy images and 180 correctly classified unhealthy images. The best result is marked with an asterisk in Table 12.

Performance of lightweight-based network architectures using DRD-EyePACS dataset

Table 13 shows the performance of lightweight networks with and without augmentation using original and pre-processed DR images of the DRD-EyePACS Dataset.

Table 13.

The performance of lightweight networks with and without augmentation using original and pre-processed DR images of the DRD-EyePACS Dataset

(a) Using original DRD-EyePACS Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
SqueezeNet | [142 58; 64 136] | 69.5 | 0.68 | 0.71 | 0.70 | 0.69 | [178 22; 38 162] | 85 | 0.81 | 0.89 | 0.88 | 0.84
mobilenetv2 | [154 46; 66 134] | 72 | 0.67 | 0.77 | 0.74 | 0.71 | [180 20; 30 170] | 87.5 | 0.85 | 0.90 | 0.89 | 0.87
shufflenet | [158 42; 56 144] | 75.5 | 0.72 | 0.79 | 0.77 | 0.75 | [188 12; 24 176] | 91 | 0.88 | 0.94 | 0.94 | 0.91
nasnetmobile | [152 48; 64 136] | 72 | 0.68 | 0.76 | 0.74 | 0.71 | [180 20; 26 174] | 88.5 | 0.87 | 0.90 | 0.90 | 0.88
efficientnetb0 | [140 60; 72 128] | 67 | 0.64 | 0.70 | 0.68 | 0.66 | [176 24; 36 164] | 85 | 0.82 | 0.88 | 0.87 | 0.85
GoogleNet | [138 62; 80 120] | 64.5 | 0.60 | 0.69 | 0.66 | 0.63 | [170 30; 36 164] | 83.5 | 0.82 | 0.85 | 0.85 | 0.83
googlenet-places365 | [150 50; 66 134] | 71 | 0.67 | 0.75 | 0.73 | 0.70 | [182 18; 34 166] | 87 | 0.83 | 0.91 | 0.90 | 0.86
resnet18 | [142 58; 70 130] | 68 | 0.65 | 0.71 | 0.69 | 0.67 | [176 24; 14 186] | 90.5 | 0.93 | 0.88 | 0.89 | 0.91

(b) Using pre-processed DRD-EyePACS Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
SqueezeNet | [150 50; 66 134] | 71 | 0.67 | 0.75 | 0.73 | 0.70 | [180 20; 30 170] | 87.5 | 0.85 | 0.90 | 0.89 | 0.87
mobilenetv2 | [158 42; 62 138] | 74 | 0.69 | 0.79 | 0.77 | 0.73 | [186 14; 24 176] | 90.5 | 0.88 | 0.93 | 0.93 | 0.90
shufflenet* | [160 40; 54 146] | 76.5 | 0.73 | 0.80 | 0.78 | 0.76 | [192 8; 14 186] | 94.5 | 0.93 | 0.96 | 0.96 | 0.94
nasnetmobile | [158 42; 62 138] | 74 | 0.69 | 0.79 | 0.77 | 0.73 | [184 16; 22 178] | 90.5 | 0.89 | 0.92 | 0.92 | 0.90
efficientnetb0 | [150 50; 66 134] | 71 | 0.67 | 0.75 | 0.73 | 0.70 | [178 22; 32 168] | 86.5 | 0.84 | 0.89 | 0.88 | 0.86
GoogleNet | [142 58; 70 130] | 68 | 0.65 | 0.71 | 0.69 | 0.67 | [180 20; 36 164] | 86 | 0.82 | 0.90 | 0.89 | 0.85
googlenet-places365 | [152 48; 62 138] | 72.5 | 0.69 | 0.76 | 0.74 | 0.72 | [182 18; 24 176] | 89.5 | 0.88 | 0.91 | 0.91 | 0.89
resnet18 | [152 48; 72 128] | 70 | 0.64 | 0.76 | 0.73 | 0.68 | [176 24; 30 170] | 86.5 | 0.85 | 0.88 | 0.88 | 0.86

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

From Table 13, it is concluded that the shufflenet network achieved the highest classification accuracy in the lightweight category on the DRD-EyePACS Dataset: 94.5% after augmentation. The shufflenet-based network also achieved a sensitivity of 0.93, specificity of 0.96, precision of 0.96, and F1-score of 0.94, with individual class accuracy (ICA) of 192 correctly classified healthy images and 186 correctly classified unhealthy images. The best result is marked with an asterisk in Table 13.

Performance evaluation metrics using IDRiD Dataset

Several factors, such as the choice of pre-trained network, data pre-processing, fine-tuning with optimal hyperparameters, validation data, and evaluation metrics, are taken into consideration while evaluating the performance of the series, DAG, and lightweight network architecture categories on the IDRiD Dataset.

Performance of series-based network architectures using IDRiD dataset

The performance of series-based networks with and without augmentation using original and pre-processed DR images of the IDRiD Dataset is shown in Table 14.

Table 14.

The performance of series-based networks with and without augmentation using original and pre-processed DR images of the IDRiD Dataset

(a) Using original IDRiD Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
AlexNet | [42 8; 28 22] | 64 | 0.44 | 0.84 | 0.73 | 0.55 | [46 4; 11 39] | 85 | 0.78 | 0.92 | 0.91 | 0.84
vgg16 | [43 7; 26 24] | 67 | 0.48 | 0.86 | 0.77 | 0.59 | [46 4; 10 40] | 86 | 0.80 | 0.92 | 0.91 | 0.85
vgg19 | [42 8; 20 30] | 72 | 0.60 | 0.84 | 0.79 | 0.68 | [47 3; 7 43] | 90 | 0.86 | 0.94 | 0.93 | 0.90
darknet19 | [39 11; 27 23] | 62 | 0.46 | 0.78 | 0.68 | 0.55 | [44 6; 10 40] | 84 | 0.80 | 0.88 | 0.87 | 0.83
darknet53 | [41 9; 20 30] | 71 | 0.60 | 0.82 | 0.77 | 0.67 | [46 4; 9 41] | 87 | 0.82 | 0.92 | 0.91 | 0.86

(b) Using pre-processed IDRiD Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
AlexNet | [45 5; 22 28] | 68 | 0.56 | 0.90 | 0.85 | 0.67 | [47 3; 10 40] | 87 | 0.80 | 0.94 | 0.93 | 0.86
vgg16 | [46 4; 25 25] | 71 | 0.50 | 0.92 | 0.86 | 0.63 | [47 3; 9 41] | 88 | 0.82 | 0.94 | 0.93 | 0.87
vgg19* | [45 5; 22 28] | 73 | 0.56 | 0.90 | 0.85 | 0.67 | [48 2; 6 44] | 92 | 0.88 | 0.96 | 0.96 | 0.92
darknet19 | [38 12; 22 28] | 66 | 0.56 | 0.76 | 0.70 | 0.62 | [46 4; 8 42] | 88 | 0.84 | 0.92 | 0.91 | 0.88
darknet53 | [46 4; 24 26] | 72 | 0.52 | 0.92 | 0.87 | 0.65 | [45 5; 6 44] | 89 | 0.88 | 0.90 | 0.90 | 0.89

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

Table 14 shows that the vgg19 network achieved the highest classification accuracy in the series category on the IDRiD Dataset: 92% after augmentation. The vgg19-based network also achieved a sensitivity of 0.88, specificity of 0.96, precision of 0.96, and F1-score of 0.92, with individual class accuracy (ICA) of 48 correctly classified healthy images and 44 correctly classified unhealthy images. The best result is marked with an asterisk in Table 14.

Performance of DAG-based network architectures using IDRiD dataset

The performance of DAG-based networks with and without augmentation using original and pre-processed DR images of the IDRiD Dataset is shown in Table 15.

Table 15.

The performance of DAG-based networks with and without augmentation using original and pre-processed DR images of the IDRiD Dataset

(a) Using original IDRiD Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
inceptionv3 | [40 10; 23 27] | 67 | 0.54 | 0.80 | 0.73 | 0.62 | [46 4; 8 42] | 88 | 0.84 | 0.92 | 0.91 | 0.88
densenet201 | [42 8; 25 25] | 69 | 0.50 | 0.84 | 0.76 | 0.60 | [45 5; 8 42] | 87 | 0.84 | 0.90 | 0.89 | 0.87
Resnet50 | [41 9; 21 29] | 70 | 0.58 | 0.82 | 0.76 | 0.66 | [47 3; 9 41] | 88 | 0.82 | 0.94 | 0.93 | 0.87
Resnet101 | [44 6; 22 28] | 72 | 0.56 | 0.88 | 0.82 | 0.67 | [47 3; 7 43] | 90 | 0.86 | 0.94 | 0.93 | 0.90
xception | [43 7; 24 26] | 69 | 0.52 | 0.86 | 0.79 | 0.63 | [42 8; 7 43] | 85 | 0.86 | 0.84 | 0.84 | 0.85
inceptionresnetv2 | [38 12; 21 29] | 67 | 0.58 | 0.76 | 0.71 | 0.64 | [45 5; 11 39] | 84 | 0.78 | 0.90 | 0.89 | 0.83
nasnetlarge | [37 13; 19 31] | 68 | 0.62 | 0.74 | 0.70 | 0.66 | [46 4; 10 40] | 86 | 0.80 | 0.92 | 0.91 | 0.85

(b) Using pre-processed IDRiD Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
inceptionv3 | [41 9; 23 27] | 68 | 0.54 | 0.82 | 0.75 | 0.63 | [46 4; 7 43] | 89 | 0.86 | 0.92 | 0.91 | 0.89
densenet201 | [43 7; 22 28] | 71 | 0.56 | 0.86 | 0.80 | 0.66 | [47 3; 7 43] | 90 | 0.86 | 0.94 | 0.93 | 0.90
Resnet50 | [43 7; 21 29] | 72 | 0.58 | 0.86 | 0.81 | 0.67 | [45 5; 5 45] | 90 | 0.90 | 0.90 | 0.90 | 0.90
Resnet101* | [44 6; 21 29] | 73 | 0.58 | 0.88 | 0.83 | 0.68 | [47 3; 5 45] | 92 | 0.90 | 0.94 | 0.94 | 0.92
xception | [44 6; 24 26] | 70 | 0.52 | 0.88 | 0.81 | 0.63 | [47 3; 9 41] | 88 | 0.82 | 0.94 | 0.93 | 0.87
inceptionresnetv2 | [40 10; 21 29] | 69 | 0.58 | 0.80 | 0.74 | 0.65 | [43 7; 7 43] | 86 | 0.86 | 0.86 | 0.86 | 0.86
nasnetlarge | [39 11; 19 31] | 70 | 0.62 | 0.78 | 0.74 | 0.67 | [47 3; 9 41] | 88 | 0.82 | 0.94 | 0.93 | 0.87

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

From Table 15, it is concluded that the Resnet101 network achieved the highest classification accuracy in the DAG category on the IDRiD Dataset: 92% after augmentation. The Resnet101-based network also achieved a sensitivity of 0.90, specificity of 0.94, precision of 0.94, and F1-score of 0.92, with individual class accuracy (ICA) of 47 correctly classified healthy images and 45 correctly classified unhealthy images. The best result is marked with an asterisk in Table 15.

Performance of lightweight-based network architectures using IDRiD dataset

Table 16 shows the performance of lightweight networks with and without augmentation using original and pre-processed DR images of the IDRiD Dataset.

Table 16.

The performance of lightweight pre-trained networks with and without augmentation using original and pre-processed DR images of the IDRiD Dataset

(a) Using original IDRiD Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
SqueezeNet | [41 9; 23 27] | 68 | 0.54 | 0.82 | 0.75 | 0.63 | [44 6; 11 39] | 83 | 0.78 | 0.88 | 0.87 | 0.82
mobilenetv2 | [44 6; 21 29] | 73 | 0.58 | 0.88 | 0.83 | 0.68 | [45 5; 6 44] | 89 | 0.88 | 0.90 | 0.90 | 0.89
shufflenet | [44 6; 20 30] | 74 | 0.60 | 0.88 | 0.83 | 0.70 | [46 4; 6 44] | 90 | 0.88 | 0.92 | 0.92 | 0.90
nasnetmobile | [40 10; 21 29] | 69 | 0.58 | 0.80 | 0.74 | 0.65 | [45 5; 7 43] | 88 | 0.86 | 0.90 | 0.90 | 0.88
efficientnetb0 | [38 12; 23 27] | 65 | 0.54 | 0.76 | 0.69 | 0.61 | [41 9; 9 41] | 82 | 0.82 | 0.82 | 0.82 | 0.82
GoogleNet | [39 11; 24 26] | 65 | 0.52 | 0.78 | 0.70 | 0.60 | [43 7; 10 40] | 83 | 0.80 | 0.86 | 0.85 | 0.82
googlenet-places365 | [39 11; 22 28] | 67 | 0.56 | 0.78 | 0.72 | 0.63 | [45 5; 9 41] | 86 | 0.82 | 0.90 | 0.89 | 0.85
resnet18 | [38 12; 22 28] | 66 | 0.56 | 0.76 | 0.70 | 0.62 | [44 6; 10 40] | 84 | 0.80 | 0.88 | 0.87 | 0.83

(b) Using pre-processed IDRiD Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
SqueezeNet | [40 10; 20 30] | 70 | 0.60 | 0.80 | 0.75 | 0.67 | [45 5; 9 41] | 86 | 0.82 | 0.90 | 0.89 | 0.85
mobilenetv2 | [44 6; 20 30] | 74 | 0.60 | 0.88 | 0.83 | 0.70 | [45 5; 5 45] | 90 | 0.90 | 0.90 | 0.90 | 0.90
shufflenet* | [45 5; 20 30] | 75 | 0.60 | 0.90 | 0.86 | 0.71 | [46 4; 5 45] | 91 | 0.90 | 0.92 | 0.92 | 0.91
nasnetmobile | [40 10; 19 31] | 71 | 0.62 | 0.80 | 0.76 | 0.68 | [46 4; 7 43] | 89 | 0.86 | 0.92 | 0.91 | 0.89
efficientnetb0 | [39 11; 23 27] | 66 | 0.54 | 0.78 | 0.71 | 0.61 | [44 6; 10 40] | 84 | 0.80 | 0.88 | 0.87 | 0.83
GoogleNet | [40 10; 22 28] | 68 | 0.56 | 0.80 | 0.74 | 0.64 | [43 7; 8 42] | 85 | 0.84 | 0.86 | 0.86 | 0.85
googlenet-places365 | [40 10; 20 30] | 70 | 0.60 | 0.80 | 0.75 | 0.67 | [45 5; 7 43] | 88 | 0.86 | 0.90 | 0.90 | 0.88
resnet18 | [40 10; 22 28] | 68 | 0.56 | 0.80 | 0.74 | 0.64 | [44 6; 9 41] | 85 | 0.82 | 0.88 | 0.87 | 0.85

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

From Table 16, it is concluded that the shufflenet network achieved the highest classification accuracy in the lightweight category on the IDRiD Dataset: 91% after augmentation. The shufflenet-based network also achieved a sensitivity of 0.90, specificity of 0.92, precision of 0.92, and F1-score of 0.91, with individual class accuracy (ICA) of 46 correctly classified healthy images and 45 correctly classified unhealthy images. The best result is marked with an asterisk in Table 16.

Performance evaluation metrics using the APTOS-2019 Dataset

Several factors, such as the choice of pre-trained network, data pre-processing, fine-tuning with optimal hyperparameters, validation data, and evaluation metrics, are taken into consideration while evaluating the performance of the series, DAG, and lightweight network architecture categories on the APTOS-2019 Dataset.

Performance of series-based network architectures using APTOS-2019 Dataset

The performance of series-based networks with and without augmentation using original and pre-processed DR images of the APTOS-2019 Dataset is shown in Table 17.

Table 17.

The performance of series-based networks with and without augmentation using original and pre-processed DR images of the APTOS-2019 Dataset

(a) Using original APTOS-2019 Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
AlexNet | [140 60; 76 124] | 66 | 0.62 | 0.70 | 0.67 | 0.65 | [172 28; 32 168] | 85 | 0.84 | 0.86 | 0.86 | 0.85
vgg16 | [141 59; 67 133] | 68.5 | 0.67 | 0.71 | 0.69 | 0.68 | [180 20; 30 170] | 87.5 | 0.85 | 0.90 | 0.89 | 0.87
vgg19 | [152 48; 62 138] | 72.5 | 0.69 | 0.76 | 0.74 | 0.72 | [188 12; 22 178] | 91.5 | 0.89 | 0.94 | 0.94 | 0.91
darknet19 | [142 58; 70 130] | 68 | 0.65 | 0.71 | 0.69 | 0.67 | [180 20; 32 168] | 87 | 0.84 | 0.90 | 0.89 | 0.87
darknet53 | [141 59; 73 127] | 67 | 0.64 | 0.71 | 0.68 | 0.66 | [177 23; 31 169] | 86.5 | 0.85 | 0.89 | 0.88 | 0.86

(b) Using pre-processed APTOS-2019 Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
AlexNet | [145 55; 75 125] | 67.5 | 0.63 | 0.73 | 0.69 | 0.66 | [179 21; 33 167] | 86.5 | 0.84 | 0.90 | 0.89 | 0.86
vgg16 | [145 55; 65 135] | 70 | 0.68 | 0.73 | 0.71 | 0.69 | [184 16; 30 170] | 88.5 | 0.85 | 0.92 | 0.91 | 0.88
vgg19* | [153 47; 61 139] | 73 | 0.70 | 0.77 | 0.75 | 0.72 | [191 9; 21 179] | 92.5 | 0.90 | 0.96 | 0.95 | 0.92
darknet19 | [145 55; 67 133] | 69.5 | 0.67 | 0.73 | 0.71 | 0.69 | [182 18; 30 170] | 88 | 0.85 | 0.91 | 0.90 | 0.88
darknet53 | [143 57; 71 129] | 68 | 0.65 | 0.72 | 0.69 | 0.67 | [180 20; 30 170] | 87.5 | 0.85 | 0.90 | 0.89 | 0.87

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

From Table 17, it is concluded that the vgg19 network achieved the highest classification accuracy in the series category on the APTOS-2019 Dataset: 92.5% after augmentation. The vgg19-based network also achieved a sensitivity of 0.90, specificity of 0.96, precision of 0.95, and F1-score of 0.92, with individual class accuracy (ICA) of 191 correctly classified healthy images and 179 correctly classified unhealthy images. The best result is marked with an asterisk in Table 17.

Performance of DAG-based network architectures using APTOS-2019 dataset

Table 18 shows the performance of DAG-based networks with and without augmentation using original and pre-processed DR images of the APTOS-2019 Dataset.

Table 18.

The performance of DAG-based networks with and without augmentation using original and pre-processed DR images of the APTOS-2019 Dataset

(a) Using original APTOS-2019 Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
inceptionv3 | [147 53; 61 139] | 71.5 | 0.70 | 0.74 | 0.72 | 0.71 | [181 19; 37 163] | 86 | 0.82 | 0.91 | 0.90 | 0.85
densenet201 | [145 55; 63 137] | 70.5 | 0.69 | 0.73 | 0.71 | 0.70 | [185 15; 33 167] | 88 | 0.84 | 0.93 | 0.92 | 0.87
Resnet50 | [146 54; 60 140] | 71.5 | 0.70 | 0.73 | 0.72 | 0.71 | [185 15; 25 175] | 90 | 0.88 | 0.93 | 0.92 | 0.90
Resnet101 | [150 50; 54 146] | 74 | 0.73 | 0.75 | 0.74 | 0.74 | [189 11; 23 177] | 91.5 | 0.89 | 0.95 | 0.94 | 0.91
xception | [144 56; 64 136] | 70 | 0.68 | 0.72 | 0.71 | 0.69 | [181 19; 35 165] | 86.5 | 0.83 | 0.91 | 0.90 | 0.86
inceptionresnetv2 | [140 60; 72 128] | 67 | 0.64 | 0.70 | 0.68 | 0.66 | [170 30; 36 164] | 83.5 | 0.82 | 0.85 | 0.85 | 0.83
nasnetlarge | [145 55; 63 137] | 70.5 | 0.69 | 0.73 | 0.71 | 0.70 | [182 18; 32 168] | 87.5 | 0.84 | 0.91 | 0.90 | 0.87

(b) Using pre-processed APTOS-2019 Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
inceptionv3 | [146 54; 56 144] | 72.5 | 0.72 | 0.73 | 0.73 | 0.72 | [185 15; 35 165] | 87.5 | 0.83 | 0.93 | 0.92 | 0.87
densenet201 | [146 54; 60 140] | 71.5 | 0.70 | 0.73 | 0.72 | 0.71 | [187 13; 31 169] | 89 | 0.85 | 0.94 | 0.93 | 0.88
Resnet50 | [151 49; 59 141] | 73 | 0.71 | 0.76 | 0.74 | 0.72 | [187 13; 23 177] | 91 | 0.89 | 0.94 | 0.93 | 0.91
Resnet101* | [152 48; 54 146] | 74.5 | 0.73 | 0.76 | 0.75 | 0.74 | [193 7; 17 183] | 94 | 0.92 | 0.97 | 0.96 | 0.94
xception | [146 54; 62 138] | 71 | 0.69 | 0.73 | 0.72 | 0.70 | [185 15; 33 167] | 88 | 0.84 | 0.93 | 0.92 | 0.87
inceptionresnetv2 | [141 59; 69 131] | 68 | 0.66 | 0.71 | 0.69 | 0.67 | [175 25; 35 165] | 85 | 0.83 | 0.88 | 0.87 | 0.85
nasnetlarge | [147 53; 59 141] | 72 | 0.71 | 0.74 | 0.73 | 0.72 | [187 13; 29 171] | 89.5 | 0.86 | 0.94 | 0.93 | 0.89

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

From Table 18, it is concluded that the Resnet101 network achieved the highest classification accuracy in the DAG category on the APTOS-2019 Dataset: 94% after augmentation. The Resnet101-based network also achieved a sensitivity of 0.92, specificity of 0.97, precision of 0.96, and F1-score of 0.94, with individual class accuracy (ICA) of 193 correctly classified healthy images and 183 correctly classified unhealthy images. The best result is marked with an asterisk in Table 18.

Performance of lightweight-based network architectures using APTOS-2019 dataset

Table 19 shows the performance of lightweight networks with and without augmentation using original and pre-processed DR images of the APTOS-2019 Dataset.

Table 19.

The performance of lightweight networks with and without augmentation using original and pre-processed DR images of the APTOS-2019 Dataset

(a) Using original APTOS-2019 Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
SqueezeNet | [143 57; 61 139] | 70.5 | 0.70 | 0.72 | 0.71 | 0.70 | [180 20; 34 166] | 86.5 | 0.83 | 0.90 | 0.89 | 0.86
mobilenetv2 | [145 55; 61 139] | 71 | 0.70 | 0.73 | 0.72 | 0.71 | [176 24; 28 172] | 87 | 0.86 | 0.88 | 0.88 | 0.87
shufflenet | [149 51; 59 141] | 72.5 | 0.71 | 0.75 | 0.73 | 0.72 | [187 13; 21 179] | 91.5 | 0.90 | 0.94 | 0.93 | 0.91
nasnetmobile | [143 57; 63 137] | 70 | 0.69 | 0.72 | 0.71 | 0.70 | [182 18; 36 164] | 86.5 | 0.82 | 0.91 | 0.90 | 0.86
efficientnetb0 | [135 65; 71 129] | 66 | 0.65 | 0.68 | 0.66 | 0.65 | [170 30; 36 164] | 83.5 | 0.82 | 0.85 | 0.85 | 0.83
GoogleNet | [130 70; 72 128] | 64.5 | 0.64 | 0.65 | 0.65 | 0.64 | [165 35; 38 162] | 81.75 | 0.81 | 0.83 | 0.82 | 0.82
googlenet-places365 | [141 59; 65 135] | 69 | 0.68 | 0.71 | 0.70 | 0.69 | [175 25; 33 167] | 85.5 | 0.84 | 0.88 | 0.87 | 0.85
resnet18 | [136 64; 72 128] | 66 | 0.64 | 0.68 | 0.67 | 0.65 | [174 26; 38 162] | 84 | 0.81 | 0.87 | 0.86 | 0.84

(b) Using pre-processed APTOS-2019 Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
SqueezeNet | [148 52; 60 140] | 72 | 0.70 | 0.74 | 0.73 | 0.71 | [183 17; 33 167] | 87.5 | 0.84 | 0.92 | 0.91 | 0.87
mobilenetv2 | [149 51; 59 141] | 72.5 | 0.71 | 0.75 | 0.73 | 0.72 | [180 20; 22 178] | 89.5 | 0.89 | 0.90 | 0.90 | 0.89
shufflenet* | [153 47; 59 141] | 73.5 | 0.71 | 0.77 | 0.75 | 0.73 | [191 9; 21 179] | 92.5 | 0.90 | 0.96 | 0.95 | 0.92
nasnetmobile | [145 55; 61 139] | 71 | 0.70 | 0.73 | 0.72 | 0.71 | [185 15; 29 171] | 89 | 0.86 | 0.93 | 0.92 | 0.89
efficientnetb0 | [137 63; 69 131] | 67 | 0.66 | 0.69 | 0.68 | 0.66 | [175 25; 33 167] | 85.5 | 0.84 | 0.88 | 0.87 | 0.85
GoogleNet | [135 65; 71 129] | 66 | 0.65 | 0.68 | 0.66 | 0.65 | [172 28; 32 168] | 85 | 0.84 | 0.86 | 0.86 | 0.85
googlenet-places365 | [143 57; 63 137] | 70 | 0.69 | 0.72 | 0.71 | 0.70 | [180 20; 30 170] | 87.5 | 0.85 | 0.90 | 0.89 | 0.87
resnet18 | [139 61; 71 129] | 67 | 0.65 | 0.70 | 0.68 | 0.66 | [175 25; 33 167] | 85.5 | 0.84 | 0.88 | 0.87 | 0.85

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

From Table 19, it is concluded that the shufflenet network achieved the highest classification accuracy in the lightweight category on the APTOS-2019 Dataset: 92.5% after augmentation. The shufflenet-based network also achieved a sensitivity of 0.90, specificity of 0.96, precision of 0.95, and F1-score of 0.92, with individual class accuracy (ICA) of 191 correctly classified healthy images and 179 correctly classified unhealthy images. The best result is marked with an asterisk in Table 19.

Performance evaluation metrics using the combined dataset

Several factors, such as the choice of pre-trained network, data pre-processing, fine-tuning with optimal hyperparameters, validation data, and evaluation metrics, are taken into consideration while evaluating the performance of the series, DAG, and lightweight network architecture categories on the Combined Dataset.

Performance of series-based network architectures using combined dataset

The performance of series-based networks with and without augmentation using original and pre-processed DR images of the Combined Dataset is shown in Table 20.

Table 20.

The performance of series-based networks with and without augmentation using original and pre-processed DR images of the Combined Dataset

(a) Using original Combined Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
AlexNet | [318 132; 180 270] | 65.33 | 0.60 | 0.71 | 0.67 | 0.63 | [415 35; 46 404] | 91 | 0.90 | 0.92 | 0.92 | 0.91
vgg16 | [324 126; 163 287] | 67.88 | 0.64 | 0.72 | 0.69 | 0.67 | [409 41; 49 401] | 90 | 0.89 | 0.91 | 0.91 | 0.90
vgg19 | [346 104; 142 308] | 72.66 | 0.68 | 0.77 | 0.75 | 0.71 | [428 22; 26 424] | 94.66 | 0.94 | 0.95 | 0.95 | 0.95
darknet19 | [311 139; 163 287] | 66.44 | 0.64 | 0.69 | 0.67 | 0.66 | [421 29; 43 407] | 92 | 0.90 | 0.94 | 0.93 | 0.92
darknet53 | [326 124; 153 297] | 69.22 | 0.66 | 0.72 | 0.71 | 0.68 | [410 40; 49 401] | 90.11 | 0.89 | 0.91 | 0.91 | 0.90

(b) Using pre-processed Combined Dataset images

Network | CM without augmentation | ACC % | Sen | Sp | Pr | F1 | CM after augmentation | ACC % | Sen | Sp | Pr | F1
AlexNet | [330 120; 161 289] | 68.77 | 0.64 | 0.73 | 0.71 | 0.67 | [419 31; 41 409] | 92 | 0.91 | 0.93 | 0.93 | 0.92
vgg16 | [341 109; 156 294] | 70.55 | 0.65 | 0.76 | 0.73 | 0.69 | [413 37; 43 407] | 91.11 | 0.90 | 0.92 | 0.92 | 0.91
vgg19* | [350 100; 135 315] | 73.88 | 0.70 | 0.78 | 0.76 | 0.73 | [435 15; 19 431] | 96.22 | 0.96 | 0.97 | 0.97 | 0.96
darknet19 | [327 123; 157 293] | 68.88 | 0.65 | 0.73 | 0.70 | 0.68 | [425 25; 38 412] | 93 | 0.92 | 0.94 | 0.94 | 0.93
darknet53 | [345 105; 159 291] | 70.66 | 0.65 | 0.77 | 0.73 | 0.69 | [421 29; 52 398] | 91 | 0.88 | 0.94 | 0.93 | 0.91

Confusion matrices (CM) are given as [TN FP; FN TP] (cf. Table 8); * best result

Table 20 shows that the vgg19 network in the series category achieved the highest classification accuracy on the Combined Dataset: 96.22% after augmentation of the pre-processed images. The vgg19-based network also achieved a sensitivity of 0.96, a specificity of 0.97, a precision of 0.97, and an F1-score of 0.96. The individual class accuracy (ICA) is 435 correctly classified healthy images and 431 correctly classified unhealthy images, achieved using the series-based network vgg19. The best result is marked with an asterisk.

Performance of DAG-based network architectures using combined dataset

Table 21 shows the performance of DAG-based networks with and without augmentation, using original and pre-processed DR images of the Combined Dataset.

Table 21.

The performance of DAG-based networks with and without augmentation, using original and pre-processed DR images of the Combined Dataset. Confusion matrices (CM) are written as [healthy correct, healthy misclassified; unhealthy misclassified, unhealthy correct]. The best result is marked with an asterisk (*).

(a) Using original Combined Dataset images

Network Name | Without Augmentation: CM, ACC %, Sen, Sp, Pr, F1 | After Augmentation: CM, ACC %, Sen, Sp, Pr, F1
inceptionv3 | [325 125; 152 298], 69.2, 0.66, 0.72, 0.70, 0.68 | [415 35; 59 391], 89.55, 0.87, 0.92, 0.92, 0.89
densenet201 | [339 111; 162 288], 69.6, 0.64, 0.75, 0.72, 0.68 | [414 36; 62 388], 89.11, 0.86, 0.92, 0.92, 0.89
Resnet50 | [339 111; 143 307], 71.7, 0.68, 0.75, 0.73, 0.71 | [414 36; 52 398], 90.22, 0.88, 0.92, 0.92, 0.90
Resnet101 | [348 102; 132 318], 74, 0.71, 0.77, 0.76, 0.73 | [426 24; 34 416], 93.55, 0.92, 0.95, 0.95, 0.93
xception | [339 111; 156 294], 70.3, 0.65, 0.75, 0.73, 0.69 | [417 33; 45 405], 91.33, 0.90, 0.93, 0.92, 0.91
inceptionresnetv2 | [328 122; 167 283], 67.8, 0.63, 0.73, 0.70, 0.66 | [406 44; 61 389], 88.33, 0.86, 0.90, 0.90, 0.88
nasnetlarge | [330 120; 156 294], 69.3, 0.65, 0.73, 0.71, 0.68 | [416 34; 55 395], 90.11, 0.88, 0.92, 0.92, 0.90

(b) Using pre-processed Combined Dataset images

Network Name | Without Augmentation: CM, ACC %, Sen, Sp, Pr, F1 | After Augmentation: CM, ACC %, Sen, Sp, Pr, F1
inceptionv3 | [339 111; 141 309], 72, 0.69, 0.75, 0.74, 0.71 | [415 35; 72 378], 90.88, 0.89, 0.93, 0.93, 0.88
densenet201 | [343 107; 142 308], 72.3, 0.68, 0.76, 0.74, 0.71 | [418 32; 66 384], 91, 0.89, 0.93, 0.93, 0.89
Resnet50 | [352 98; 138 312], 73.7, 0.69, 0.78, 0.76, 0.73 | [420 30; 50 400], 91.44, 0.90, 0.93, 0.93, 0.91
Resnet101 * | [358 92; 127 323], 75.6, 0.72, 0.80, 0.78, 0.75 | [440 10; 14 436], 97.33, 0.97, 0.98, 0.98, 0.97
xception | [340 110; 148 302], 71.3, 0.67, 0.76, 0.73, 0.70 | [414 36; 66 384], 92.44, 0.91, 0.94, 0.93, 0.88
inceptionresnetv2 | [329 121; 158 292], 69, 0.65, 0.73, 0.71, 0.68 | [402 48; 70 380], 89.77, 0.89, 0.91, 0.91, 0.87
nasnetlarge | [328 122; 142 308], 70.6, 0.68, 0.73, 0.72, 0.70 | [410 40; 68 382], 91.44, 0.89, 0.94, 0.94, 0.88

From Table 21, it is concluded that the Resnet101 network in the DAG category achieved the highest classification accuracy on the Combined Dataset: 97.33% after augmentation of the pre-processed images. The Resnet101-based network also achieved a sensitivity of 0.97, a specificity of 0.98, a precision of 0.98, and an F1-score of 0.97. The individual class accuracy (ICA) is 440 correctly classified healthy images and 436 correctly classified unhealthy images, achieved using the DAG-based network Resnet101. The best result is marked with an asterisk.

Performance of Lightweight-based network architectures using combined dataset

Table 22 shows the performance of Lightweight networks with and without augmentation, using original and pre-processed DR images of the Combined Dataset.

Table 22.

The performance of Lightweight networks with and without augmentation, using original and pre-processed DR images of the Combined Dataset. Confusion matrices (CM) are written as [healthy correct, healthy misclassified; unhealthy misclassified, unhealthy correct]. The best result is marked with an asterisk (*).

(a) Using original Combined Dataset images

Network Name | Without Augmentation: CM, ACC %, Sen, Sp, Pr, F1 | After Augmentation: CM, ACC %, Sen, Sp, Pr, F1
SqueezeNet | [326 124; 148 302], 69.7, 0.67, 0.72, 0.71, 0.69 | [402 48; 83 367], 85.44, 0.82, 0.89, 0.88, 0.85
mobilenetv2 | [343 107; 148 302], 71.6, 0.67, 0.76, 0.74, 0.70 | [401 49; 64 386], 87.44, 0.86, 0.89, 0.89, 0.87
shufflenet | [351 99; 135 315], 74, 0.70, 0.78, 0.76, 0.73 | [429 21; 39 411], 93.33, 0.91, 0.95, 0.95, 0.93
nasnetmobile | [335 115; 148 302], 70.7, 0.67, 0.74, 0.72, 0.70 | [407 43; 69 381], 87.55, 0.85, 0.90, 0.90, 0.87
efficientnetb0 | [313 137; 166 284], 66.3, 0.63, 0.70, 0.67, 0.65 | [387 63; 81 369], 84, 0.82, 0.86, 0.85, 0.84
GoogleNet | [307 143; 176 274], 64.5, 0.61, 0.68, 0.66, 0.63 | [378 72; 84 366], 82.66, 0.81, 0.84, 0.84, 0.82
googlenet-places365 | [330 120; 153 297], 69.6, 0.66, 0.73, 0.71, 0.69 | [402 48; 76 374], 86.22, 0.83, 0.89, 0.89, 0.86
resnet18 | [316 134; 164 286], 66.8, 0.64, 0.70, 0.68, 0.66 | [394 56; 62 388], 86.88, 0.86, 0.88, 0.87, 0.87

(b) Using pre-processed Combined Dataset images

Network Name | Without Augmentation: CM, ACC %, Sen, Sp, Pr, F1 | After Augmentation: CM, ACC %, Sen, Sp, Pr, F1
SqueezeNet | [338 112; 146 304], 71.3, 0.68, 0.75, 0.73, 0.70 | [408 42; 72 378], 87.33, 0.84, 0.91, 0.90, 0.87
mobilenetv2 | [351 99; 141 309], 73.3, 0.69, 0.78, 0.76, 0.72 | [411 39; 51 399], 90, 0.89, 0.91, 0.91, 0.90
shufflenet * | [358 92; 133 317], 75, 0.70, 0.80, 0.78, 0.74 | [439 11; 19 431], 96.66, 0.96, 0.98, 0.98, 0.97
nasnetmobile | [343 107; 142 308], 72.3, 0.68, 0.76, 0.74, 0.71 | [415 35; 58 392], 89.66, 0.87, 0.92, 0.92, 0.89
efficientnetb0 | [326 124; 158 292], 68.6, 0.65, 0.72, 0.70, 0.67 | [397 53; 75 375], 85.77, 0.83, 0.88, 0.88, 0.85
GoogleNet | [317 133; 163 287], 67.1, 0.64, 0.70, 0.68, 0.66 | [395 55; 76 374], 85.44, 0.83, 0.88, 0.87, 0.85
googlenet-places365 | [335 115; 145 305], 71.1, 0.68, 0.74, 0.73, 0.70 | [407 43; 61 389], 88.44, 0.86, 0.90, 0.90, 0.88
resnet18 | [331 119; 165 285], 68.4, 0.63, 0.74, 0.71, 0.67 | [396 54; 58 392], 87.55, 0.87, 0.88, 0.88, 0.88

From Table 22, it is concluded that the shufflenet network in the Lightweight category achieved the highest classification accuracy on the Combined Dataset: 96.66% after augmentation of the pre-processed images. The shufflenet-based network also achieved a sensitivity of 0.96, a specificity of 0.98, a precision of 0.98, and an F1-score of 0.97. The individual class accuracy (ICA) is 439 correctly classified healthy images and 431 correctly classified unhealthy images, achieved using the Lightweight network shufflenet. The best result is marked with an asterisk.

Results and discussion

When building a model with limited resources, CNN-based image analysis relies on transfer learning (TL), in which feature weights learned on large image datasets are transferred to training on smaller datasets. This drastically reduces the number of images required in the target domain. Typically, the model is initialized with pre-trained weights from ImageNet, a sizable dataset of natural images, and then trained on the smaller target dataset, either for feature extraction or for fine-tuning, depending on the size and similarity of the target domain. However, if the source and target domains differ substantially, the expected performance gains may not materialize.

Several pre-trained models have been used to classify diabetic retinopathy images; they show promising yet varying results and require comparatively little computational time. To study this systematically, 20 different pre-trained models were divided into three categories: series, DAG, and Lightweight. Three benchmark datasets were collected for this work, and a combined two-class dataset was prepared. The best results were achieved with weights transferred from ImageNet to the prepared datasets. Our research showed that the deep learning method based on the ResNet101 network effectively distinguishes between normal retinal and DR images, and the ResNet101-based pre-trained network achieved the highest classification accuracy on the combined dataset. The amount of training data, the choice of hyper-parameters, the pre-trained network category, and the specific dataset all influence the accuracy of the compared deep learning models. The present work uses Accuracy, Sensitivity, Specificity, Precision, and F1-score to quantify DR classification performance, and the best-performing network was selected per category on the basis of classification accuracy. Table 23 shows the comparative analysis of the selected networks across categories, and Fig. 4 shows the ROC-AUC curve for the three categories using the combined dataset.
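As an illustration of this transfer-learning workflow, the sketch below loads ResNet101 with ImageNet weights and replaces its classification head for the two DR classes. PyTorch is used here only as an assumed framework, and the optimizer and learning rate are placeholders rather than the study's actual hyper-parameters:

```python
import torch
import torch.nn as nn
from torchvision import models

# Load ResNet101 pre-trained on ImageNet (requires torchvision >= 0.13)
# and swap the 1000-way classifier for a 2-way healthy/unhealthy head.
model = models.resnet101(weights=models.ResNet101_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, 2)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)  # placeholder
criterion = nn.CrossEntropyLoss()

def train_step(images: torch.Tensor, labels: torch.Tensor) -> float:
    """One fine-tuning step over a mini-batch of fundus images."""
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```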

Table 23.

The comparison analysis of the best-performing network from each category across the four datasets. Confusion matrices (CM) are written as [healthy correct, healthy misclassified; unhealthy misclassified, unhealthy correct]. The best overall result is marked with an asterisk (*).

Category | Network Name | Dataset | CM | ACC % | Sen | Sp | Pr | F1
Series | vgg19 | EyePACS Dataset | [192 8; 22 178] | 92.5 | 0.89 | 0.96 | 0.96 | 0.92
Series | vgg19 | IDRiD Dataset | [48 2; 6 44] | 92 | 0.88 | 0.96 | 0.96 | 0.92
Series | vgg19 | APTOS-2019 | [191 9; 21 179] | 92.5 | 0.90 | 0.96 | 0.95 | 0.92
Series | vgg19 | Combined Dataset | [435 15; 19 431] | 96.22 | 0.96 | 0.97 | 0.97 | 0.96
DAG | Resnet101 | EyePACS Dataset | [194 6; 20 180] | 93.5 | 0.90 | 0.97 | 0.97 | 0.93
DAG | Resnet101 | IDRiD Dataset | [46 4; 5 45] | 91 | 0.90 | 0.92 | 0.92 | 0.91
DAG | Resnet101 | APTOS-2019 | [193 7; 17 183] | 94 | 0.92 | 0.97 | 0.96 | 0.94
DAG | Resnet101 | Combined Dataset * | [440 10; 14 436] | 97.33 | 0.97 | 0.98 | 0.98 | 0.97
Lightweight | shufflenet | EyePACS Dataset | [192 8; 14 186] | 94.5 | 0.93 | 0.96 | 0.96 | 0.94
Lightweight | shufflenet | IDRiD Dataset | [46 4; 5 45] | 91 | 0.90 | 0.92 | 0.92 | 0.91
Lightweight | shufflenet | APTOS-2019 | [191 9; 21 179] | 92.5 | 0.90 | 0.96 | 0.95 | 0.92
Lightweight | shufflenet | Combined Dataset | [439 11; 19 431] | 96.66 | 0.96 | 0.98 | 0.98 | 0.97

Fig. 4.

The ROC-AUC curve using the combined dataset for the three categories.
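A ROC curve of this kind can be generated from each model's predicted probability for the unhealthy (DR) class. The scikit-learn sketch below uses placeholder labels and scores to show the mechanics:

```python
import numpy as np
from sklearn.metrics import roc_curve, roc_auc_score
import matplotlib.pyplot as plt

# Placeholder test labels (1 = unhealthy/DR) and predicted probabilities;
# in practice these come from the fine-tuned network's softmax output.
y_true  = np.array([0, 0, 1, 1, 1, 0, 1, 0])
y_score = np.array([0.1, 0.3, 0.8, 0.9, 0.6, 0.2, 0.7, 0.4])

fpr, tpr, _ = roc_curve(y_true, y_score)
auc = roc_auc_score(y_true, y_score)

plt.plot(fpr, tpr, label=f"AUC = {auc:.2f}")
plt.plot([0, 1], [0, 1], linestyle="--")        # chance diagonal
plt.xlabel("False positive rate (1 - Specificity)")
plt.ylabel("True positive rate (Sensitivity)")
plt.legend()
plt.show()
```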

The final results of the pre-trained networks, with classification accuracy on the four datasets, are shown in Table 24.

Table 24.

The final classification accuracy (%) of each pre-trained network on the four datasets.

S.No | Network Name | EyePACS Dataset | IDRiD Dataset | APTOS-2019 | Combined Dataset
1 | AlexNet | 87 | 87 | 86.5 | 92
2 | vgg16 | 87.5 | 88 | 88.5 | 91.11
3 | vgg19 | 92.5 | 92 | 92.5 | 96.22
4 | darknet19 | 85 | 88 | 88 | 93
5 | darknet53 | 89.5 | 89 | 87.5 | 91
6 | inceptionv3 | 88.5 | 89 | 87.5 | 90.88
7 | densenet201 | 89 | 90 | 89 | 91
8 | Resnet50 | 91.5 | 90 | 91 | 91.44
9 | Resnet101 | 93.5 | 92 | 94 | 97.33
10 | xception | 89.5 | 88 | 88 | 92.44
11 | inceptionresnetv2 | 89 | 86 | 85 | 89.77
12 | nasnetlarge | 86.5 | 88 | 89.5 | 91.44
13 | SqueezeNet | 87.5 | 86 | 87.5 | 87.33
14 | mobilenetv2 | 90.5 | 90 | 89.5 | 90
15 | shufflenet | 94.5 | 91 | 92.5 | 96.66
16 | nasnetmobile | 90.5 | 89 | 89 | 89.66
17 | efficientnetb0 | 86.5 | 84 | 85.5 | 85.77
18 | GoogleNet | 86 | 85 | 85 | 85.44
19 | googlenet-places365 | 89.5 | 88 | 87.5 | 88.44
20 | resnet18 | 86.5 | 85 | 85.5 | 87.55

Tables 23 & 24 and Fig. 4 concluded that the combined dataset achieved the highest accuracy in all three categories: series, DAG, and Lightweight. It is observed that Vgg19, ResNet101, and shufflenet pre-trained networks achieved the highest accuracy of 96.22%, 97.33%, and 96.66% in series, DAG, and Lightweight categories. It is also noted that ResNet101 achieved the highest category in all cases. It is concluded that the ResNet101 pre-trained network in the DAG category is optimal for diabetic retinopathy disease detection in the early stage. The best result is marked in grey.

Conclusion

The exhaustive experiments concluded that the ResNet101-based pre-trained network in the DAG category achieved the highest accuracy on the combined dataset drawn from DRD-EyePACS, IDRiD, and APTOS-2019. The ResNet101 network strikes a balance between computational efficiency, depth, and accuracy. High accuracy, robustness, and efficiency were achieved using ResNet101 in the DAG category, making it a powerful method for diagnosing and classifying diabetic retinopathy. It can enhance early-stage diagnosis and treatment for patients and can support real-time clinical practice. It is also noted that implementing and training ResNet101 requires considerably more computation than the other networks, as well as high-quality labeled data. In this work, the data were balanced using the augmentation technique, and the balanced data were used for robust model training. An accuracy of 97.33% was achieved using ResNet101. The proposed method can be used in routine clinical practice.

Funding

This research received no external funding.

Data availability

The corresponding author is authorized to access the data, which will be shared upon reasonable request.

Declarations

Ethics approval and consent to participate

Not Applicable.

Consent for publication

Not applicable.

Conflicts of interest

There is no conflict of interest among the authors.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1. Gulshan V, Peng L, Coram M, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. 2016;316(22):2402–10. 10.1001/jama.2016.17216.
2. Chandrakumar T, Kathirvel R. Classifying diabetic retinopathy using deep learning architecture. Int J Eng Res. 2016;5:19–24.
3. Zhou L, Zhao Y, Yang J, Yu Q, Xu X. Deep multiple instance learning for automatic detection of diabetic retinopathy in retinal images. IET Image Proc. 2018;12(4):563–71. 10.1049/iet-ipr.2017.0636.
4. Dutta S, Manideep BCS, Basha SM, Caytiles RD, Iyengar NCSN. Classification of diabetic retinopathy images by using deep learning models. Int J Grid Distrib Comput. 2018;11(1):89–106. 10.14257/ijgdc.2018.11.1.09.
5. Junjun P, Zhifan Y, Dong S, Hong Q. Diabetic retinopathy detection based on deep convolutional neural networks for localization of discriminative regions. Proceedings of the 8th International Conference on Virtual Reality and Visualization, ICVRV. 2018;46–52. 10.1109/ICVRV.2018.00016.
6. Kassani SH, Kassani PH, Khazaeinezhad R, Wesolowski MJ, Schneider KA, Deters R. Diabetic retinopathy classification using a modified xception architecture. 2019 IEEE 19th International Symposium on Signal Processing and Information Technology, ISSPIT. 2019. 10.1109/ISSPIT47144.2019.9001846.
7. Challa UK, Yellamraju P, Bhatt JS. A multi-class deep all-CNN for detection of diabetic retinopathy using retinal fundus images. Lecture Notes in Computer Science, vol 11941 LNCS. 2019;191–199. 10.1007/978-3-030-34869-4_21.
8. Qummar S, Khan FG, Shah S, Khan A, Shamshirband S, Rehman ZU, Khan IA, Jadoon W. A deep learning ensemble approach for diabetic retinopathy detection. IEEE Access. 2019;7:150530–9. 10.1109/ACCESS.2019.2947484.
9. Bhardwaj C, Jain S, Sood M. Diabetic retinopathy severity grading employing quadrant-based Inception-V3 convolution neural network architecture. Int J Imaging Syst Technol. 2021;31(2):592–608. 10.1002/ima.22510.
10. Saxena G, Verma DK, Paraye A, Rajan A, Rawat A. Improved and robust deep learning agent for preliminary detection of diabetic retinopathy using public datasets. Intelligence-Based Med. 2020;3–4. 10.1016/j.ibmed.2020.100022.
11. Katada Y, Ozawa N, Masayoshi K, Ofuji Y, Tsubota K, Kurihara T. Automatic screening for diabetic retinopathy in interracial fundus images using artificial intelligence. Intelligence-Based Med. 2020;3–4. 10.1016/j.ibmed.2020.100024.
12. Usman A, Muhammad A, Martinez-Enriquez AM, Muhammad A. Classification of diabetic retinopathy and retinal vein occlusion in human eye fundus images by transfer learning. In: Arai K, Kapoor S, Bhatia R, editors. Advances in Information and Communication, FICC 2020. Adv Intell Syst Comput. 2020;1130:642–53. Springer, Cham. 10.1007/978-3-030-39442-4_47.
13. Alyoubi WL, Abulkhair MF, Shalash WM. Diabetic retinopathy fundus image classification and lesions localization system using deep learning. Sensors. 2021;21(11). 10.3390/s21113704.
14. Bhardwaj C, Jain S, Sood M. Deep learning-based diabetic retinopathy severity grading system employing quadrant ensemble model. J Digit Imaging. 2021;34(2):440–57. 10.1007/s10278-021-00418-5.
15. Chen PN, Lee CC, Liang CM, Pao SI, Huang KH, Lin KF. General deep learning model for detecting diabetic retinopathy. BMC Bioinformatics. 2021;22. 10.1186/s12859-021-04005-x.
16. Yi SL, Yang XL, Wang TW, She FR, Xiong X, He JF. Diabetic retinopathy diagnosis based on RA-EfficientNet. Appl Sci (Switzerland). 2021;11(22):11035. 10.3390/app112211035.
17. Khan Z, Khan FG, Khan A, Rehman ZU, Shah S, Qummar S, Ali F, Pack S. Diabetic retinopathy detection using VGG-NIN, a deep learning architecture. IEEE Access. 2021;9:61408–16. 10.1109/ACCESS.2021.3074422.
18. Das S, Kharbanda K, M S, Raman R, D ED. Deep learning architecture based on segmented fundus image features for classification of diabetic retinopathy. Biomed Signal Process Control. 2021;68:102600. 10.1016/j.bspc.2021.102600.
19. AbdelMaksoud E, Barakat S, Elmogy M. A computer-aided diagnosis system for detecting various diabetic retinopathy grades based on a hybrid deep learning technique. Med Biol Eng Comput. 2022;60(7):2015–38. 10.1007/s11517-022-02564-6.
20. Kobat SG, Baygin N, Yusufoglu E, Baygin M, Barua PD, Dogan S, Yaman O, Celiker U, Yildirim H, Tan RS, Tuncer T, Islam N, Acharya UR. Automated diabetic retinopathy detection using horizontal and vertical patch division-based pre-trained DenseNET with digital fundus images. Diagnostics. 2022;12(8):1975. 10.3390/diagnostics12081975.
21. Mungloo-Dilmohamud Z, Khan MHM, Jhumka K, Beedassy BN, Mungloo NZ, Peña-Reyes C. Balancing data through data augmentation improves the generality of transfer learning for diabetic retinopathy classification. Appl Sci (Switzerland). 2022;12(11):5363. 10.3390/app12115363.
22. Asia AO, Zhu CZ, Althubiti SA, Al-Alimi D, Xiao YL, Ouyang PB, Al-Qaness MAA. Detection of diabetic retinopathy in retinal fundus images using CNN classification models. Electronics (Switzerland). 2022;11(17):2740. 10.3390/electronics11172740.
23. Mondal SS, Mandal N, Singh KK, Singh A, Izonin I. EDLDR: an ensemble deep learning technique for detection and classification of diabetic retinopathy. Diagnostics. 2023;13(1):124. 10.3390/diagnostics13010124.
24. Yasashvini R, Raja Sarobin VM, Panjanathan R, Graceline S, Anbarasi JL. Diabetic retinopathy classification using CNN and hybrid deep convolutional neural networks. Symmetry. 2022;14(9):1932. 10.3390/sym14091932.
25. Dayana AM, Emmanuel WRS. Deep learning enabled optimized feature selection and classification for grading diabetic retinopathy severity in the fundus image. Neural Comput Appl. 2022;34(21):18663–83. 10.1007/s00521-022-07471-3.
26. Oulhadj M, Riffi J, Chaimae K, Mahraz AM, Ahmed B, Yahyaouy A, Fouad C, Meriem A, Idriss BA, Tairi H. Diabetic retinopathy prediction based on deep learning and deformable registration. Multimedia Tools Appl. 2022;81(20):28709–27. 10.1007/s11042-022-12968-z.
27. Jabbar MK, Yan J, Xu H, Rehman ZU, Jabbar A. Transfer learning-based model for diabetic retinopathy diagnosis using retinal images. Brain Sci. 2022;12(5):535. 10.3390/brainsci12050535.
28. Menaouer B, Dermane Z, el Houda Kebir N, Matta N. Diabetic retinopathy classification using hybrid deep learning approach. SN Comput Sci. 2022;3(5). 10.1007/s42979-022-01240-8.
29. Fayyaz AM, Sharif MI, Azam S, Karim A, El-Den J. Analysis of diabetic retinopathy (DR) based on the deep learning. Information (Switzerland). 2023;14(1):30. 10.3390/info14010030.
30. Das D, Biswas SK, Bandyopadhyay S. Detection of diabetic retinopathy using convolutional neural networks for feature extraction and classification (DRFEC). Multimedia Tools Appl. 2023;82(19):29943–30001. 10.1007/s11042-022-14165-4.
31. Mohanty C, Mahapatra S, Acharya B, Kokkoras F, Gerogiannis VC, Karamitsos I, Kanavos A. Using deep learning architectures for detection and classification of diabetic retinopathy. Sensors. 2023;23(12):5726. 10.3390/s23125726.
32. Jena PK, Khuntia B, Palai C, Nayak M, Mishra TK, Mohanty SN. A novel approach for diabetic retinopathy screening using asymmetric deep learning features. Big Data Cogn Comput. 2023;7(1):25. 10.3390/bdcc7010025.
33. Bhimavarapu U, Chintalapudi N, Battineni G. Automatic detection and classification of diabetic retinopathy using the improved pooling function in the convolution neural network. Diagnostics. 2023;13(15):2606. 10.3390/diagnostics13152606.
34. Islam N, Jony MMH, Hasan E, Sutradhar S, Rahman A, Islam MM. Toward lightweight diabetic retinopathy classification: a knowledge distillation approach for resource-constrained settings. Appl Sci. 2023;13(22):12397. 10.3390/app132212397.
35. Sajid MZ, Hamid MF, Youssef A, Yasmin J, Perumal G, Qureshi I, Naqi SM, Abbas Q. DR-NASNet: automated system to detect and classify diabetic retinopathy severity using improved pretrained NASNet model. Diagnostics. 2023;13(16):2645. 10.3390/diagnostics13162645.
36. Alwakid G, Gouda W, Humayun M. Enhancement of diabetic retinopathy prognostication using deep learning, CLAHE, and ESRGAN. Diagnostics. 2023. 10.3390/diagnostics.
37. Vijayan M, Venkatakrishnan S. A regression-based approach to diabetic retinopathy diagnosis using EfficientNet. Diagnostics. 2023;13(4):774. 10.3390/diagnostics13040774.
38. Alwakid G, Gouda W, Humayun M, Jhanjhi NZ. Deep learning-enhanced diabetic retinopathy image classification. Digital Health. 2023;9. 10.1177/20552076231194942.
39. Guefrachi S, Echtioui A, Hamam H. Automated diabetic retinopathy screening using deep learning. Multimedia Tools Appl. 2024. 10.1007/s11042-024-18149-4.
40. Sunkari S, Sangam A, P VS, Manikandan S, Raman R, Rajalakshmi R, S T. A refined ResNet18 architecture with Swish activation function for diabetic retinopathy classification. Biomed Signal Process Control. 2024;88:105630. 10.1016/j.bspc.2023.105630.
41. Macsik P, Pavlovicova J, Kajan S, Goga J, Kurilova V. Image preprocessing-based ensemble deep learning classification of diabetic retinopathy. IET Image Proc. 2024;18(3):807–28. 10.1049/ipr2.12987.
42. Shakibania H, Raoufi S, Pourafkham B, Khotanlou H, Mansoorizadeh M. Dual branch deep learning network for detection and stage grading of diabetic retinopathy. Biomed Signal Process Control (pre-print). 2024.
43. Yadav N, Dass R, Virmani J. Despeckling filters applied to thyroid ultrasound images: a comparative analysis. Multimedia Tools Appl. 2022. 10.1007/s11042-022-11965-6.
44. Yadav N, Dass R, Virmani J. Deep learning-based CAD system design for thyroid tumor characterization using ultrasound images. Multimedia Tools Appl. 2023. 10.1007/s11042-023-17137-4.
45. Yadav N, Dass R, Virmani J. A systematic review of machine learning based thyroid tumor characterisation using ultrasonographic images. J Ultrasound. 2024. 10.1007/s40477-023-00850-z.
46. Dass R, Yadav N. Image quality assessment parameters for despeckling filters. Procedia Comput Sci. 2020;167:2382–92. 10.1016/j.procs.2020.03.291.
47. Yadav N, Dass R, Virmani J. Machine learning based CAD system for thyroid tumor characterization using ultrasound images. Int J Med Eng Inform. 2022. 10.1504/IJMEI.2022.10049164.
48. Yadav N, Dass R, Virmani J. Assessment of encoder-decoder based segmentation models for thyroid ultrasound images. Med Biol Eng Comput. 2023. 10.1007/s11517-023-02849-4.
49. Yadav N, Dass R, Virmani J. Texture analysis of liver ultrasound images. Emergent Converging Technologies and Biomedical Systems, Lect Notes Electr Eng. 2022;841:575–85. 10.1007/978-981-16-8774-7_48.
50. Yadav N, Dass R, Virmani J. Objective assessment of segmentation models for thyroid ultrasound images. J Ultrasound. 2022. 10.1007/s40477-022-00726-8.
51. https://www.kaggle.com/datasets/sachinkumar413/diabetic-retinopathy-dataset. Accessed February 2024.
52. Porwal P, Pachade S, Kamble R, Kokare M, Deshmukh G, Sahasrabuddhe V, Meriaudeau F. Indian diabetic retinopathy image dataset (IDRiD): a database for diabetic retinopathy screening research. Data. 2018;3(25):1–8. 10.21227/H25W98.
53. https://www.kaggle.com/datasets/sovitrath/diabetic-retinopathy-224x224-2019-data?resource=download. Accessed February 2024.
