Skip to main content
. 2024 Nov 5;14:26802. doi: 10.1038/s41598-024-78515-y

Figure 1.

Figure 1

The proposed CLIP/CLOOB framework: contrastive pre-training of the encoders (hx,hy) of the two retinal imaging modalities (fundus images—x and OCT volumes—y), followed by using the pre-trained encoders for downstream predictive tasks.