. 2024 Nov 5;14:26802. doi: 10.1038/s41598-024-78515-y

Table 1.

Results of the retrieval task. Given a fundus image, the correct OCT volume must be selected from a set of candidates. Top-1, top-5, and top-10 accuracy are shown for the hold-out test set, along with the upper and lower limits for a 95% confidence interval (CI). Here, Baseline refers to the encoders initialized with ImageNet/Kinetics-pre-trained weights, CLIP and CLOOB to the encoders obtained using the CLIP- and CLOOB-based pre-training, respectively. The best performance for each setup is indicated in bold.

Retrieval task	Test set	Model	Accuracy score
Retrieval task	Test set	Model	Top-1	Top-5	Top-10
Fundus to OCT	One sample/patient	Baseline	0.005 (0.000, 0.040)	0.022 (0.005, 0.062)	0.041 (0.016, 0.092)
		CLIP	0.781 (0.704, 0.848)	0.959 (0.908, 0.984)	0.980 (0.938, 0.995)
		CLOOB	0.800 (0.720, 0.861)	0.947 (0.898, 0.979)	0.974 (0.927, 0.992)
	All samples/patient	Baseline	0.000 (0.000, 0.001)	0.001 (0.000, 0.002)	0.002 (0.001, 0.003)
		CLIP	0.097 (0.090, 0.104)	0.310 (0.299, 0.321)	0.457 (0.445, 0.468)
		CLOOB	0.109 (0.102, 0.117)	0.333 (0.322, 0.344)	0.482 (0.470, 0.494)
OCT to fundus	One sample/patient	Baseline	0.005 (0.000, 0.040)	0.025 (0.005, 0.062)	0.050 (0.021, 0.102)
		CLIP	0.768 (0.689, 0.836)	0.949 (0.898, 0.979)	0.978 (0.938, 0.995)
		CLOOB	0.799 (0.720, 0.861)	0.945 (0.889, 0.975)	0.973 (0.927, 0.992)
	All samples/patient	Baseline	0.000 (0.000, 0.001)	0.001 (0.000, 0.001)	0.001 (0.001, 0.003)
		CLIP	0.093 (0.086, 0.100)	0.293 (0.283, 0.304)	0.437 (0.425, 0.448)
		CLOOB	0.103 (0.096, 0.110)	0.334 (0.323, 0.345)	0.484 (0.473, 0.496)