2024 Nov 5;14:26802. doi: 10.1038/s41598-024-78515-y

Table 1.

Results of the retrieval task. Given a fundus image, the correct OCT volume must be selected from a set of candidates (fundus to OCT), and vice versa (OCT to fundus). Top-1, top-5, and top-10 accuracy on the hold-out test set are shown, with the lower and upper limits of the 95% confidence interval (CI) in parentheses. Baseline refers to the encoders initialized with ImageNet/Kinetics-pre-trained weights; CLIP and CLOOB refer to the encoders obtained using CLIP- and CLOOB-based pre-training, respectively. The best performance for each setup is indicated in bold.

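| Retrieval task | Test set | Model | Top-1 accuracy (95% CI) | Top-5 accuracy (95% CI) | Top-10 accuracy (95% CI) |
|---|---|---|---|---|---|
| Fundus to OCT | One sample/patient | Baseline | 0.005 (0.000, 0.040) | 0.022 (0.005, 0.062) | 0.041 (0.016, 0.092) |
| Fundus to OCT | One sample/patient | CLIP | 0.781 (0.704, 0.848) | **0.959** (0.908, 0.984) | **0.980** (0.938, 0.995) |
| Fundus to OCT | One sample/patient | CLOOB | **0.800** (0.720, 0.861) | 0.947 (0.898, 0.979) | 0.974 (0.927, 0.992) |
| Fundus to OCT | All samples/patient | Baseline | 0.000 (0.000, 0.001) | 0.001 (0.000, 0.002) | 0.002 (0.001, 0.003) |
| Fundus to OCT | All samples/patient | CLIP | 0.097 (0.090, 0.104) | 0.310 (0.299, 0.321) | 0.457 (0.445, 0.468) |
| Fundus to OCT | All samples/patient | CLOOB | **0.109** (0.102, 0.117) | **0.333** (0.322, 0.344) | **0.482** (0.470, 0.494) |
| OCT to fundus | One sample/patient | Baseline | 0.005 (0.000, 0.040) | 0.025 (0.005, 0.062) | 0.050 (0.021, 0.102) |
| OCT to fundus | One sample/patient | CLIP | 0.768 (0.689, 0.836) | **0.949** (0.898, 0.979) | **0.978** (0.938, 0.995) |
| OCT to fundus | One sample/patient | CLOOB | **0.799** (0.720, 0.861) | 0.945 (0.889, 0.975) | 0.973 (0.927, 0.992) |
| OCT to fundus | All samples/patient | Baseline | 0.000 (0.000, 0.001) | 0.001 (0.000, 0.001) | 0.001 (0.001, 0.003) |
| OCT to fundus | All samples/patient | CLIP | 0.093 (0.086, 0.100) | 0.293 (0.283, 0.304) | 0.437 (0.425, 0.448) |
| OCT to fundus | All samples/patient | CLOOB | **0.103** (0.096, 0.110) | **0.334** (0.323, 0.345) | **0.484** (0.473, 0.496) |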
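
To make the top-k accuracy metric in Table 1 concrete, the following is a minimal sketch of how such a retrieval score could be computed from paired embeddings. It is an illustration under stated assumptions, not the authors' evaluation code: the function names, the use of cosine similarity for ranking, and the toy embeddings are all hypothetical, and the confidence intervals reported in the table are not reproduced here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k_accuracy(query_embeddings, candidate_embeddings, k):
    """Fraction of queries whose matching candidate ranks in the top k.

    Assumption: query i is paired with candidate i.
    """
    hits = 0
    for i, query in enumerate(query_embeddings):
        scores = [cosine_similarity(query, c) for c in candidate_embeddings]
        # Candidate indices sorted by descending similarity to the query.
        ranking = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
        if i in ranking[:k]:
            hits += 1
    return hits / len(query_embeddings)


# Toy embeddings, made up for illustration (not data from the study).
fundus = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
oct_volumes = [[0.9, 0.1], [0.8, 0.2], [0.7, 0.7]]

top1 = top_k_accuracy(fundus, oct_volumes, k=1)
top2 = top_k_accuracy(fundus, oct_volumes, k=2)
print(f"top-1: {top1:.3f}, top-2: {top2:.3f}")
```

Swapping the two arguments gives the reverse direction (OCT to fundus). In this toy example the second fundus embedding ranks its paired OCT embedding second, so it counts as a miss at k=1 but as a hit at k=2.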