. 2023 Jun 13;6:112. doi: 10.1038/s41746-023-00857-0

Table 3.

Comparison of G-RISK performance versus image-measured VCDR as glaucoma detection proxy.

Data	#images	AUC G-RISK [95% CI]	AUC VCDR 1 [95% CI]	AUC VCDR 2 [95% CI]
BMES	6927	0.967 [0.956–0.979]	0.958 [0.940–0.976]	NA
PAPILA (suspect referable)	488	0.769 [0.722–0.815]	0.748 [0.699–0.798]	0.743 [0.691–0.795]
PAPILA (suspect non-referable)	488	0.882 [0.840–0.923]	0.789 [0.728–0.851]	0.782 [0.716–0.847]
REFUGE1 test	400	0.986 [0.974–0.999]	0.946 [0.907–0.984]	NA
REFUGE1 all	1200	0.952 [0.925–0.979]	0.929 [0.902–0.956]	NA
REFUGE2 test	400	0.867 [NA]	0.757 [0.693–0.815]	NA
RIM-ONE r3	159	0.934 [0.889–0.978]	0.810 [0.723–0.897]	NA

VCDR was either retrieved from the cup and disc segmentation ground truth available (PAPILA, REFUGE data, RIM-ONE r3), or directly provided by the data set owners (BMES). For PAPILA, G-RISK results are compared against two independent human experts who segmented disc and cup. Best performance (AUC) per row is highlighted in bold text.