Skip to main content
. 2022 Jun 18;9:350. doi: 10.1038/s41597-022-01441-z

Table 2.

Statistics of each phase of data collection and the subset of the MIMIC-CXR dataset from which images were sampled.

Dataset Phase 1(P1) Phase 2(P2) Phase 3(P3) MIMIC-CXR filtered (M)
# cases 295 250 2,507 194,495
# cases studies with eye-tracking data 285 240 2,507 0
# MIMIC-CXR images 59 50 2,507
# subjects 58 50 2,110 60,018
% female 63.8 54.0 50.7 53.9
% male 36.2 46.0 49.1 45.7
% test set 15.3 14.0 20.2 1.4
% Normal Radiograph (P1, P2, P3) & No Finding (M) 18.0 24.4 22.8 32.9
% Abnormal mediastinal contour (P2,P3) & Wide mediastinum (P1) 2.7 5.6 2.7
% Acute fracture (P2,P3) & Fracture (P1, M) 5.1 2.8 1.0 1.9
% Airway wall thickening (P1) 7.1
% Atelectasis (P1,P2,P3,M) 41.4 27.6 25.8 20.5
% Cardiomegaly (M) 19.8
% Consolidation (P1,P2,P3,M) 28.5 28.8 25.9 4.7
% Enlarged cardiac silhouette (P1,P2,P3) 28.1 28.4 21.8
% Enlarged Cardiomediastinum (M) 3.2
% Enlarged hilum (P2,P3) 2.8 1.9
% Groundglass opacity (P1,P2,P3) 9.2 18.8 12.6
% Hiatal hernia (P2,P3) 0.0 0.9
% High lung volume/emphysema (P2,P3) & Emphysema (P1) 3.1 3.2 2.9
% Interstitial lung disease (P2,P3) & Fibrosis (P1) 1.7 1.2 1.0
% Lung nodule or mass (P2,P3) & Lung Lesion (M) 1.6 5.1 2.7
% Lung Opacity (M) 22.8
% Mass (P1) 0.7
% Nodule (P1) 4.7
% Other (P1,P2,P3) 13.9 8.8 6.0
% Pleural abnormality (P2,P3) 30.0 29.5
% Pleural Effusion (P1,M) 31.2 24.2
% Pleural thickening (P1) 2.0
% Pleural Other (M) 0.9
% Pneumonia (M) 7.2
% Pneumothorax (P1,P2,P3,M) 4.7 4.4 2.9 4.6
% Pulmonary edema (P1,P2,P3) & Edema (M) 13.9 13.6 13.7 12.1
% Quality issue (P1) 3.4
% Support devices (P1,P2,P3,M) 36.9 34.8 44.8 29.3

The dataset where each label was present is shown inside parentheses. “Normal radiograph” represents CXRs for which no other label was selected. Table cells are left blank for labels that were not present in that dataset. For how the labels of the different datasets are related, check Fig. 5.