Skip to main content
. Author manuscript; available in PMC: 2022 Oct 1.
Published in final edited form as: Ophthalmol Retina. 2021 Feb 6;5(10):1027–1035. doi: 10.1016/j.oret.2020.12.013

Table 2.

Area Under the Receiver Operating Characteristics Curve (AUROC), Area Under the Precision-Recall Curve (AUPRC), Sensitivity, and Specificity of CNNs on test sets from North American and Nepal. The best performing models trained on images from North American, Nepal, and combined datasets were evaluated on two independent test sets from North America and Nepal.

Model Training Set Test set AUROC AUPRC Sensitivity Specificity

North America North America 0.99 0.98 94% 96%
North America Nepal 0.96 0.88 52% 99%
Nepal North America 0.62 0.36 44% 69%
Nepal Nepal 0.97 0.91 73% 99%
North America + Nepal North America 0.99 0.98 98% 96%
North America + Nepal Nepal 0.98 0.92 82% 99%

AUROC, AUPRC, sensitivity, and specificity of all 3 models evaluated on the North American and Nepali test sets are reported. The model trained on both datasets resulted in slightly increased performance compared to models trained on individual datasets, most notably seen in sensitivity. For example, sensitivity increased from 94% when trained on North American data alone to 98% and from 73% when trained on Nepali data alone to 82%.