Table 2:
Dice scores for each training method and site for test sets at each site. Statistical significance with p < 0.005 is determined by the paired t-test and is indicated by an asterisk, and best results are indicated by bold text.
| A | 0.67 ± 0.06 | 0.69 ± 0.08 | 0.69 ± 0.07 | 0.63 ± 0.05 | 0.69 ± 0.08 | |
| B | 0.71 ± 0.06 | 0.69 ± 0.08 | 0.75 ± 0.07 | 0.64 ± 0.06 | 0.73 ± 0.07 | |
| A | 0.72 ± 0.07 | 0.73 ± 0.07 | 0.74 ± 0.08 | 0.63 ± 0.06 | 0.73 ± 0.07 | |
| B | 0.73 ± 0.05 | 0.72 ± 0.07 | 0.73 ± 0.08 | 0.66 ± 0.06 | 0.72 ± 0.07 | |
| FWA | 0.71 ± 0.06 | 0.72 ± 0.06 | 0.71 ± 0.08 | 0.63 ± 0.06 | 0.72 ± 0.06 | |
| FGA | 0.77 ± 0.07* | 0.76 ± 0.08* | 0.78 ± 0.07 | 0.65 ± 0.07 | 0.76 ± 0.08* |