2022 Jul 18;12:12262. doi: 10.1038/s41598-022-16388-9

Table 2.

Summary of the quantitative evaluation of liver tumor detection. Mean and standard deviation of recall, precision and F1-Score, and median FPC, are given for algorithms A1 and A2 evaluated against each rater (R1, R2, R3), as well as against the per-test-case mean over raters ($\overline{R_i}$). The last row shows the mean pair-wise comparison between raters, $\overline{(R_i, R_j)}$.

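| Rater | Recall (A1) | Recall (A2) | Precision (A1) | Precision (A2) | F1-Score (A1) | F1-Score (A2) | FPC (A1) | FPC (A2) |
|---|---|---|---|---|---|---|---|---|
| R1 | 0.699 ± 0.305 | 0.726 ± 0.294 | 0.533 ± 0.256 | 0.387 ± 0.185 | 0.549 ± 0.210 | 0.463 ± 0.181 | 3 | 6 |
| R2 | 0.680 ± 0.300 | 0.736 ± 0.308 | 0.590 ± 0.270 | 0.436 ± 0.248 | 0.595 ± 0.251 | 0.497 ± 0.233 | 3 | 6 |
| R3 | 0.751 ± 0.277 | 0.781 ± 0.288 | 0.574 ± 0.249 | 0.430 ± 0.237 | 0.623 ± 0.228 | 0.510 ± 0.216 | 2 | 6 |
| $\overline{R_i}$ | 0.710 ± 0.268 | 0.748 ± 0.269 | 0.566 ± 0.225 | 0.418 ± 0.205 | 0.589 ± 0.201 | 0.490 ± 0.192 | 2.3 | 6 |
| $\overline{(R_i, R_j)}$ (one value per metric, not split by algorithm) | 0.804 ± 0.175 | | 0.787 ± 0.1805 | | 0.761 ± 0.165 | | 1 | |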
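
The table reports each metric as a mean ± standard deviation. The sketch below shows one way such a summary could be computed, assuming each metric is calculated per test case and then averaged. It uses the standard definitions of recall, precision and F1. The per-case counts, the function names, the zero-division convention and the choice of sample standard deviation are illustrative assumptions, not taken from the paper, and the rule for matching a detection to a reference lesion is not covered here.

```python
from statistics import mean, stdev


def detection_scores(tp, fp, fn):
    """Return (recall, precision, f1) for one test case from lesion counts.

    Assumption: a metric with a zero denominator is reported as 0.0.
    """
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return recall, precision, f1


def summarize(cases):
    """Aggregate per-case scores into {metric: (mean, sample std)}.

    `cases` is a list of (tp, fp, fn) tuples, one per test case.
    """
    rows = [detection_scores(*case) for case in cases]
    names = ("recall", "precision", "f1")
    return {name: (mean(col), stdev(col)) for name, col in zip(names, zip(*rows))}


# Hypothetical counts for three test cases (not data from the study).
cases = [(3, 1, 1), (2, 2, 0), (1, 0, 3)]
summary = summarize(cases)

for name, (m, s) in summary.items():
    print(f"{name}: {m:.3f} ± {s:.3f}")

# The mean of per-case F1 scores generally differs from the F1 computed
# from the mean recall and mean precision.
mean_recall = summary["recall"][0]
mean_precision = summary["precision"][0]
f1_of_means = 2 * mean_precision * mean_recall / (mean_precision + mean_recall)
print(f"F1 of mean recall/precision: {f1_of_means:.3f}")
```

The reported values show the same pattern: for A1 against R1, combining the mean recall (0.699) and mean precision (0.533) in the F1 formula gives roughly 0.605, not the reported 0.549, so the table's F1 column is not simply derived from its mean recall and mean precision columns.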