Table 2.
Summary of quantitative evaluation of liver tumor detection. Mean and standard deviation of recall, precision and F1-Score, and median FPC are given for algorithms and evaluated against each rater, as well as of the mean per test case (). The last row shows the mean pair-wise comparison of raters .
Recall | Precision | F1-Score | FPC | |||||
---|---|---|---|---|---|---|---|---|
0.699 ± 0.305 | 0.726 ± 0.294 | 0.533 ± 0.256 | 0.387 ± 0.185 | 0.549 ± 0.210 | 0.463 ± 0.181 | 3 | 6 | |
0.680 ± 0.300 | 0.736 ± 0.308 | 0.590 ± 0.270 | 0.436 ± 0.248 | 0.595 ± 0.251 | 0.497 ± 0.233 | 3 | 6 | |
0.751 ± 0.277 | 0.781 ± 0.288 | 0.574 ± 0.249 | 0.430 ± 0.237 | 0.623 ± 0.228 | 0.510 ± 0.216 | 2 | 6 | |
0.710 ± 0.268 | 0.748 ± 0.269 | 0.566 ± 0.225 | 0.418 ± 0.205 | 0.589 ± 0.201 | 0.490 ± 0.192 | 2.3 | 6 | |
0.804 ± 0.175 | 0.787 ± 0.1805 | 0.761 ± 0.165 | 1 |