. Author manuscript; available in PMC: 2022 Oct 1.

Published in final edited form as: Ophthalmol Retina. 2021 Feb 6;5(10):1027–1035. doi: 10.1016/j.oret.2020.12.013

Table 2.

Area Under the Receiver Operating Characteristics Curve (AUROC), Area Under the Precision-Recall Curve (AUPRC), Sensitivity, and Specificity of CNNs on test sets from North American and Nepal. The best performing models trained on images from North American, Nepal, and combined datasets were evaluated on two independent test sets from North America and Nepal.

Model Training Set	Test set	AUROC	AUPRC	Sensitivity	Specificity

North America	North America	0.99	0.98	94%	96%
North America	Nepal	0.96	0.88	52%	99%
Nepal	North America	0.62	0.36	44%	69%
Nepal	Nepal	0.97	0.91	73%	99%
North America + Nepal	North America	0.99	0.98	98%	96%
North America + Nepal	Nepal	0.98	0.92	82%	99%

AUROC, AUPRC, sensitivity, and specificity of all 3 models evaluated on the North American and Nepali test sets are reported. The model trained on both datasets resulted in slightly increased performance compared to models trained on individual datasets, most notably seen in sensitivity. For example, sensitivity increased from 94% when trained on North American data alone to 98% and from 73% when trained on Nepali data alone to 82%.