Table 3.
Performance of 18 single feature-based correspondence metrics obtained from radiologist-outlined (AUCR) and computer-segmented (AUCC) lesions, respectively. The value after “±” is the standard error (se) associated with each AUC. The two-tailed p-value and 95% C.I. of ΔAUC were calculated by ROCKIT. The “Sig. level” column represents the significance level of individual tests adjusted with Holm t test (overall significant level αT=0.05) and the tests with asterisks ( *) indicate significant difference using the adjusted significance level. The features have the same convention as Table 2.
Feature | AUCR±se | AUCC±se | p value | Sig. level | 95% C.I. of ΔAUC |
---|---|---|---|---|---|
FI,1 | 0.65±0.03 | 0.56±0.03 | 0.04 | 0.0045 | [0.004, 0.20] |
FI,2 | 0.53±0.03 | 0.54±0.03 | 0.76 | — | [−0.07,0.05] |
0.78±0.03 | 0.66±0.03 | 0.001 | 0.0031 | [0.05, 0.19] | |
FII,1 | 0.71±0.03 | 0.65±0.03 | 0.06 | — | [−0.01,0.13] |
FII,2 | 0.68±0.03 | 0.67±0.03 | 0.66 | — | [−0.05,0.09] |
FII,3 | 0.69±0.03 | 0.67±0.03 | 0.48 | — | [−0.04,0.09] |
FII,4 | 0.70±0.03 | 0.66±0.03 | 0.20 | — | [−0.02,0.11] |
FII,5 | 0.57±0.03 | 0.54±0.03 | 0.30 | — | [−0.03,0.10] |
FII,6 | 0.61±0.03 | 0.56±0.03 | 0.01 | 0.0042 | [0.01, 0.10] |
FII,7 | 0.61±0.03 | 0.53±0.03 | 0.009 | 0.0038 | [0.02, 0.15] |
FII,8 | 0.58±0.03 | 0.56±0.03 | 0.44 | — | [−0.03,0.07] |
0.69±0.03 | 0.61±0.03 | 0.002 | 0.0036 | [0.03, 0.14] | |
0.65±0.03 | 0.57±0.03 | 4×10−4 | 0.0029 | [0.04, 0.13] | |
0.66±0.03 | 0.55±0.03 | <10−5 | 0.0028 | [0.06, 0.15] | |
FII,12 | 0.62±0.03 | 0.59±0.03 | 0.34 | — | [−0.03,0.09] |
FII,13 | 0.58±0.03 | 0.57±0.03 | 0.90 | — | [−0.05,0.05] |
0.59±0.03 | 0.50±0.03 | 0.001 | 0.0031 | [0.05, 0.18] | |
FIII,1 | 0.81±0.02 | 0.81±0.02 | 0.73 | — | [−0.01,0.01] |