Skip to main content
. 2022 Sep 1;12:14851. doi: 10.1038/s41598-022-19045-3

Table 2.

Comparison of the verification performance on two different subsets of the ChestX-ray14 dataset that either contain foreign material or not (first two rows). Furthermore, we show the verification results for the CheXpert dataset and the COVID-19 Image Data Collection (last two rows). We present the AUC (together with the lower and upper bounds of the 95% confidence intervals from 10,000 bootstrap runs), the accuracy, the specificity, the recall, the precision, and the F1-score.

Dataset Subset AUC + 95 % CI Accuracy (TP+TNP+N) Specificity (TNN) Recall (TPP) Precision (TPTP+FP) F1-score
ChestX-ray14 w/ foreign material 0.99700.99380.9993 0.9796(672686) 0.9854(338343) 0.9738(334343) 0.9853(334339) 0.9795
w/o foreign material 0.99720.99090.9999 0.9862(430436) 0.9908(216218) 0.9817(214218) 0.9907(214216) 0.9862
CheXpert 0.98700.98550.9884 0.9440(15,56216,486) 0.9629(7,9378,243) 0.9250(7,6258,243) 0.9614(7,6257,931) 0.9429
COVID-19 0.97630.96960.9825 0.9180(1,4211,548) 0.9780(757774) 0.8579(664774) 0.9750(664681) 0.9127