Author manuscript; available in PMC: 2024 Aug 14.
Published in final edited form as: Proc Conf Empir Methods Nat Lang Process. 2022 Dec;2022:3876–3887. doi: 10.18653/v1/2022.emnlp-main.256

Table 3:

Statistics of the datasets used. Pos.%: positive sample ratio.

Pretrain          # Images    # Reports   # Classes
MIMIC-CXR         377,111     201,063     −
CheXpert          223,415     −           14

Evaluation        # Train (Pos.%)   # Test (Pos.%)   # Classes
CheXpert-5x200    1,000 (−)         1,000 (−)        5
MIMIC-5x200       1,000 (−)         1,000 (−)        5
COVID             2,162 (19%)       3,000 (49%)      2
RSNA              8,486 (50%)       3,538 (50%)      2