Table 3.
MNIST-based dataset with a different number of training bags
| Number of Training bags | 50 | 100 | 150 | 200 | 250 | 300 |
|---|---|---|---|---|---|---|
| Max-pooling | 0.531 ± 0.063 | 0.701 ± 0.092 | 0.940 ± 0.003 | 0.957 ± 0.001 | 0.970 ± 0.001 | 0.972 ± 0.001 |
| Mean-pooling | 0.611 ± 0.053 | 0.627 ± 0.083 | 0.925 ± 0.007 | 0.964 ± 0.004 | 0.969 ± 0.001 | 0.970 ± 0.001 |
| Attention [8] | 0.727 ± 0.043 | 0.901 ± 0.005 | 0.955 ± 0.006 | 0.970 ± 0.002 | 0.969 ± 0.001 | 0.976 ± 0.001 |
| Gated Attention [8] | 0.733 ± 0.041 | 0.906 ± 0.008 | 0.945 ± 0.001 | 0.974 ± 0.002 | 0.977 ± 0.001 | 0.975 ± 0.002 |
| TGA-MIL (ours) | 0.753 ± 0.034 | 0.900 ± 0.020 | 0.950 ± 0.001 | 0.975 ± 0.001 | 0.980 ± 0.002 | 0.983 ± 0.002 |
Experiments were repeated five times, with the average AUC (±standard error) provided. The best results for different numbers of training bags are highlighted in bold