. 2023 Apr 4:1–16. Online ahead of print. doi: 10.1007/s10489-023-04458-y

Table 3.

MNIST-based dataset with a different number of training bags

Number of Training bags	50	100	150	200	250	300
Max-pooling	0.531 ± 0.063	0.701 ± 0.092	0.940 ± 0.003	0.957 ± 0.001	0.970 ± 0.001	0.972 ± 0.001
Mean-pooling	0.611 ± 0.053	0.627 ± 0.083	0.925 ± 0.007	0.964 ± 0.004	0.969 ± 0.001	0.970 ± 0.001
Attention [8]	0.727 ± 0.043	0.901 ± 0.005	0.955 ± 0.006	0.970 ± 0.002	0.969 ± 0.001	0.976 ± 0.001
Gated Attention [8]	0.733 ± 0.041	0.906 ± 0.008	0.945 ± 0.001	0.974 ± 0.002	0.977 ± 0.001	0.975 ± 0.002
TGA-MIL (ours)	0.753 ± 0.034	0.900 ± 0.020	0.950 ± 0.001	0.975 ± 0.001	0.980 ± 0.002	0.983 ± 0.002

Experiments were repeated five times, with the average AUC (±standard error) provided. The best results for different numbers of training bags are highlighted in bold