Author manuscript; available in PMC: 2024 Aug 14.
Published in final edited form as: Proc Conf Empir Methods Nat Lang Process. 2022 Dec;2022:3876–3887. doi: 10.18653/v1/2022.emnlp-main.256

Figure 3:

The workflow of MedCLIP. The knowledge extraction module extracts medical entities from raw medical reports. A semantic similarity matrix is then built by comparing the medical entities (from text) with the raw labels (from images), which makes it possible to pair any two separately sampled images and texts. The extracted image and text embeddings are paired so that their similarities match the semantic similarity matrix.
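The pairing step in the caption can be illustrated with a minimal sketch. It assumes that both the entities extracted from reports and the raw image labels have already been mapped to multi-hot vectors over a shared label set, that the comparison is a cosine similarity, and that the embeddings are matched to the matrix with a soft-target cross-entropy. The function names, the toy label set, and the temperature value are hypothetical and chosen only for illustration; they are not taken from the caption.

```python
import numpy as np

def semantic_similarity(img_labels, txt_labels, eps=1e-8):
    """Cosine similarity between every image label vector and every text
    entity vector. Inputs are multi-hot arrays of shape (n_img, n_classes)
    and (n_txt, n_classes); the result has shape (n_img, n_txt)."""
    img = np.asarray(img_labels, dtype=float)
    txt = np.asarray(txt_labels, dtype=float)
    img = img / (np.linalg.norm(img, axis=1, keepdims=True) + eps)
    txt = txt / (np.linalg.norm(txt, axis=1, keepdims=True) + eps)
    return img @ txt.T

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def semantic_matching_loss(img_emb, txt_emb, sim, temperature=0.07):
    """Soft-target cross-entropy (an assumed loss form): the softmax over
    embedding similarities is pushed toward the softmax over the semantic
    similarity matrix, row by row."""
    img_emb = img_emb / np.linalg.norm(img_emb, axis=1, keepdims=True)
    txt_emb = txt_emb / np.linalg.norm(txt_emb, axis=1, keepdims=True)
    logits = img_emb @ txt_emb.T / temperature
    targets = softmax(sim, axis=1)
    log_probs = np.log(softmax(logits, axis=1) + 1e-12)
    return float(-(targets * log_probs).sum(axis=1).mean())

# Toy data over a hypothetical 3-class label set.
# Two images and three texts, sampled separately (no one-to-one pairing).
img_labels = [[1, 0, 0], [0, 1, 1]]
txt_labels = [[1, 0, 0], [0, 1, 0], [0, 0, 0]]  # last report has no entities
sim = semantic_similarity(img_labels, txt_labels)

# Random stand-ins for encoder outputs (4-dimensional embeddings).
rng = np.random.default_rng(0)
img_emb = rng.normal(size=(2, 4))
txt_emb = rng.normal(size=(3, 4))
loss = semantic_matching_loss(img_emb, txt_emb, sim)
```

Because the matrix is computed from labels and entities rather than from original image-report pairs, the number of images and texts in a batch need not be equal, which is what allows separately sampled images and texts to be paired.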