Skip to main content
. Author manuscript; available in PMC: 2025 Oct 15.
Published in final edited form as: IEEE Access. 2025 Jul 28;13:133351–133369. doi: 10.1109/access.2025.3593420

TABLE 5.

Tables with the ood detection results using multiple ood scores. mls stands for Maximum Logit score. In the scores defined for vaeabmil and daeabmil, max and mean indicate the Maximum aggregator and the Mean aggregator, respectively.

(a) ood detection results for the different ood scores in the (cam16, panda) experiment.
Model OoD/Entropy/auc OoD/MLS/auc OoD/LOGPXMAX/auc OoD/LOGPXMEAN/auc OoD/RECERRMAX/auc OoD/RECERRMEAN/auc
abmil 0.954 ± 0.013 0.935 ± 0.031 - . - -
clam 0.830 ±0.199 0.826 ± 0.203 - - - -
daeabmil 0.963 ±0.011 0.968 ± 0.007 - - 0.457 ± 0.031 0.992 ± 0.000
dftdmil 0.950 ± 0.035 0.961 ±0.015 - - - -
dsmil 0.944 ± 0.022 0.923 ± 0.019 - - - -
transmil 0.969 ± 0.006 0.960 ± 0.013 - - - -
vaeabmil 0.979 ± 0.007 0.970 ±0.011 0.680 ± 0.109 1.000 ± 0.000 - -
(b) ood detection results for the different ood scores in the (cam16, bcell) experiment.
Model OoD/Entropy/auc OoD/MLS/auc OoD/LOGPXMAX/auc OoD/LOGPXMEAN/auc OoD/RECERRMAX/auc OoD/RECERRMEAN/auc

abmil 0.899 ± 0.028 0.867 ± 0.046 - - - -
clam 0.726 ± 0.071 0.722 ± 0.070 - - - -
daeabmil 0.891 ± 0.027 0.916 ± 0.027 - - 0.848 ± 0.006 0.970 ± 0.008
dftdmil 0.803 ± 0.049 0.790 ± 0.053 - - - -
dsmil 0.878 ± 0.010 0.864 ± 0.003 - - - -
transmil 0.877 ± 0.040 0.836 ± 0.072 - - - -
vaeabmil 0.882 ± 0.035 0.877 ± 0.022 0.828 ± 0.027 0.959 ± 0.003 - -
(c) ood detection results for the different ood scores in the (panda, cam16) experiment.
Model OoD/Entropy/auc OoD/MLS/auc OoD/LOGPXMAX/auc OoD/LOGPXMEAN/auc OoD/RECERRMAX/auc OoD/RECERRMEAN/auc

abmil 0.959 ± 0.007 0.956 ± 0.005 - . - -
clam 0.697 ± 0.052 0.654 ± 0.045 - - - -
daeabmil 0.333 ±0.156 0.334 ±0.156 - - 0.999 ± 0.000 1.000 ±0.000
dftdmil 0.944 ± 0.030 0.949 ± 0.025 - - - -
dsmil 0.793 ± 0.031 0.833 ± 0.032 - - - -
transmil 0.911 ±0.022 0.888 ± 0.033 - - - -
vaeabmil 0.965 ± 0.016 0.982 ± 0.013 1.000 ±0.000 1.000 ± 0.000 - -
(d) ood detection results for the different ood scores in the (panda, artif) experiment.
Model OoD/Entropy/auc OoD/MLS/auc OoD/LOGPXMAX/auc OoD/LOGPXMEAN/auc OoD/RECERRMAX/auc OoD/RECERRMEAN/auc

abmil 0.755 ± 0.025 0.771 ± 0.027 - - - -
clam 0.679 ± 0.028 0.646 ± 0.029 - - - -
daeabmil 0.327 ± 0.076 0.344 ± 0.088 - - 0.998 ± 0.001 0.999 ± 0.000
dftdmil 0.689 ± 0.029 0.704 ± 0.036 - - - -
dsmil 0.541 ± 0.039 0.566 ± 0.037 - - - -
transmil 0.698 ± 0.041 0.688 ± 0.049 - - - -
vaeabmil 0.738 ± 0.026 0.771 ± 0.036 1.000 ±0.000 0.999 ± 0.000 - -