Table 1.
Summary of the different DL systems for detecting referable diabetic retinopathy, glaucoma suspect, age-related macular degeneration and retinopathy of prematurity from fundus photographs
| DL systems | Year | Test data sets | Test images (n) | CNN | AUC | Sensitivity (%) | Specificity (%) |
| Referable diabetic retinopathy | |||||||
| Abràmoff et al 14 | 2016 | Messidor-2 | 1748 | AlexNet/VGG | 0.98 | 96.80 | 87.00 |
| Gulshan et al 12 | 2016 | Messidor-2 | 1748 | Inception-V3 | 0.99 | 87 | 98.50 |
| | | | | | | 96.10 | 93.90 |
| | | EyePACS-1 | 9963 | | 0.991 | 90.30 | 98.10 |
| | | | | | | 97.50 | 93.40 |
| Gargeya and Leng 15 | 2017 | Kaggle images | 75 137 | Customised CNN | 0.97 | NA | NA |
| | | E-Ophtha | 463 | | 0.96 | NA | NA |
| | | Messidor-2 | 1748 | | 0.94 | NA | NA |
| Ting et al 11 | 2017 | SiDRP 14–15 | 71 896 | VGG-19 | 0.936 | 90.50 | 91.60 |
| | | Guangdong | 15 798 | | 0.949 | 98.70 | 81.60 |
| | | SIMES | 3052 | | 0.889 | 97.10 | 82.00 |
| | | SINDI | 4512 | | 0.917 | 99.3 | 73.3 |
| | | SCES | 1936 | | 0.919 | 100 | 76.30 |
| | | BES | 1052 | | 0.929 | 94.40 | 88.50 |
| | | AFEDS | 1968 | | 0.98 | 98.80 | 86.50 |
| | | RVEEH | 2302 | | 0.983 | 98.90 | 92.20 |
| | | Mexican | 1172 | | 0.95 | 91.80 | 84.80 |
| | | CUHK | 1254 | | 0.948 | 99.3 | 83.10 |
| | | HKU | 7706 | | 0.964 | 100 | 81.30 |
| Abràmoff et al 28 | 2018 | 10 primary care practice sites in the USA | 892 patients | AlexNet/VGG | NA | 87.2 | 90.7 |
| Glaucoma suspect* | |||||||
| Ting et al 11 | 2017 | SiDRP 14–15 | 71 896 | VGG-19 | 0.942 | 96.40 | 93.20 |
| Li et al 16 | 2018 | Guangdong | 48 116 | | 0.986 | 95.60 | 92.00 |
| Age-related macular degeneration | |||||||
| Ting et al 11 | 2017 | SiDRP 14–15 | 35 948 | VGG-19 | 0.932 | 93.20 | 88.70 |
| Burlina et al 17 | 2017 | AREDS | 120 656 | AlexNet, OverFeat | 0.940–0.96 | NA | NA |
| Grassmann et al 18 | 2018 | AREDS | 120 656 | AlexNet, GoogLeNet, VGG, Inception-V3, ResNet, Inception-ResNet-V2 | NA | 84.20 | 94.30 |
| Retinopathy of prematurity | |||||||
| Brown et al 19 | 2018 | i-ROP | 100 | Inception-V1 and U-Net | NA | 100 | 94 |
Diagnostic performance is not directly comparable between the DL systems because the individual studies used different data sets.
*Definition of glaucoma suspect: (1) Ting et al 11—vertical cup to disc ratio of 0.8 or greater, and any glaucomatous disc changes; (2) Li et al 16—vertical cup to disc ratio of 0.7 or greater, and any glaucomatous disc changes.
AFEDS, African American Eye Disease Study; AREDS, Age-Related Eye Disease Study; AUC, area under the receiver operating characteristic curve; BES, Beijing Eye Study; CNN, convolutional neural network; CUHK, Chinese University of Hong Kong; DL, deep learning; HKU, University of Hong Kong; NA, not available; RVEEH, Royal Victorian Eye and Ear Hospital; SCES, Singapore Chinese Eye Study; SiDRP 14–15, Singapore Integrated Diabetic Retinopathy Screening Programme; SIMES, Singapore Malay Eye Study; SINDI, Singapore Indian Eye Study.
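
For readers less familiar with the sensitivity and specificity columns in Table 1, the following minimal sketch shows how the two percentages are derived from confusion-matrix counts under their standard definitions. The function name and the counts are illustrative assumptions and are not taken from any of the studies listed.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) as percentages.

    tp/fn: diseased eyes flagged / missed by the DL system.
    tn/fp: non-diseased eyes correctly passed / wrongly flagged.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one diseased and one non-diseased case")
    sensitivity = 100.0 * tp / (tp + fn)  # share of diseased cases detected
    specificity = 100.0 * tn / (tn + fp)  # share of non-diseased cases cleared
    return sensitivity, specificity


# Hypothetical counts for illustration only (not data from Table 1).
sens, spec = sensitivity_specificity(tp=90, fn=10, tn=180, fp=20)
print(f"sensitivity={sens:.1f}%, specificity={spec:.1f}%")
```

Both values depend on the decision threshold applied to the DL system's output score, so a single system can have more than one sensitivity and specificity pair on the same data set.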