TABLE 5.
Tables with the ood detection results using multiple ood scores. mls stands for Maximum Logit score. In the scores defined for vaeabmil and daeabmil, max and mean indicate the Maximum aggregator and the Mean aggregator, respectively.
| (a) ood detection results for the different ood scores in the (cam16, panda) experiment. | ||||||
|---|---|---|---|---|---|---|
| Model | OoD/Entropy/auc | OoD/MLS/auc | OoD/LOGPXMAX/auc | OoD/LOGPXMEAN/auc | OoD/RECERRMAX/auc | OoD/RECERRMEAN/auc |
| abmil | 0.954 ± 0.013 | 0.935 ± 0.031 | - | . | - | - |
| clam | 0.830 ±0.199 | 0.826 ± 0.203 | - | - | - | - |
| daeabmil | 0.963 ±0.011 | 0.968 ± 0.007 | - | - | 0.457 ± 0.031 | 0.992 ± 0.000 |
| dftdmil | 0.950 ± 0.035 | 0.961 ±0.015 | - | - | - | - |
| dsmil | 0.944 ± 0.022 | 0.923 ± 0.019 | - | - | - | - |
| transmil | 0.969 ± 0.006 | 0.960 ± 0.013 | - | - | - | - |
| vaeabmil | 0.979 ± 0.007 | 0.970 ±0.011 | 0.680 ± 0.109 | 1.000 ± 0.000 | - | - |
| (b) ood detection results for the different ood scores in the (cam16, bcell) experiment. | ||||||
| Model | OoD/Entropy/auc | OoD/MLS/auc | OoD/LOGPXMAX/auc | OoD/LOGPXMEAN/auc | OoD/RECERRMAX/auc | OoD/RECERRMEAN/auc |
|
| ||||||
| abmil | 0.899 ± 0.028 | 0.867 ± 0.046 | - | - | - | - |
| clam | 0.726 ± 0.071 | 0.722 ± 0.070 | - | - | - | - |
| daeabmil | 0.891 ± 0.027 | 0.916 ± 0.027 | - | - | 0.848 ± 0.006 | 0.970 ± 0.008 |
| dftdmil | 0.803 ± 0.049 | 0.790 ± 0.053 | - | - | - | - |
| dsmil | 0.878 ± 0.010 | 0.864 ± 0.003 | - | - | - | - |
| transmil | 0.877 ± 0.040 | 0.836 ± 0.072 | - | - | - | - |
| vaeabmil | 0.882 ± 0.035 | 0.877 ± 0.022 | 0.828 ± 0.027 | 0.959 ± 0.003 | - | - |
| (c) ood detection results for the different ood scores in the (panda, cam16) experiment. | ||||||
| Model | OoD/Entropy/auc | OoD/MLS/auc | OoD/LOGPXMAX/auc | OoD/LOGPXMEAN/auc | OoD/RECERRMAX/auc | OoD/RECERRMEAN/auc |
|
| ||||||
| abmil | 0.959 ± 0.007 | 0.956 ± 0.005 | - | . | - | - |
| clam | 0.697 ± 0.052 | 0.654 ± 0.045 | - | - | - | - |
| daeabmil | 0.333 ±0.156 | 0.334 ±0.156 | - | - | 0.999 ± 0.000 | 1.000 ±0.000 |
| dftdmil | 0.944 ± 0.030 | 0.949 ± 0.025 | - | - | - | - |
| dsmil | 0.793 ± 0.031 | 0.833 ± 0.032 | - | - | - | - |
| transmil | 0.911 ±0.022 | 0.888 ± 0.033 | - | - | - | - |
| vaeabmil | 0.965 ± 0.016 | 0.982 ± 0.013 | 1.000 ±0.000 | 1.000 ± 0.000 | - | - |
| (d) ood detection results for the different ood scores in the (panda, artif) experiment. | ||||||
| Model | OoD/Entropy/auc | OoD/MLS/auc | OoD/LOGPXMAX/auc | OoD/LOGPXMEAN/auc | OoD/RECERRMAX/auc | OoD/RECERRMEAN/auc |
|
| ||||||
| abmil | 0.755 ± 0.025 | 0.771 ± 0.027 | - | - | - | - |
| clam | 0.679 ± 0.028 | 0.646 ± 0.029 | - | - | - | - |
| daeabmil | 0.327 ± 0.076 | 0.344 ± 0.088 | - | - | 0.998 ± 0.001 | 0.999 ± 0.000 |
| dftdmil | 0.689 ± 0.029 | 0.704 ± 0.036 | - | - | - | - |
| dsmil | 0.541 ± 0.039 | 0.566 ± 0.037 | - | - | - | - |
| transmil | 0.698 ± 0.041 | 0.688 ± 0.049 | - | - | - | - |
| vaeabmil | 0.738 ± 0.026 | 0.771 ± 0.036 | 1.000 ±0.000 | 0.999 ± 0.000 | - | - |