Table 2. Performance characteristics in the evaluation of the TRUE ULCERATIVE image set.
Randomized VC setting | Evaluator | Accuracy for the first reading | Accuracy for the second reading | Difference in accuracy (%) [95 % CI] |
WLI | 1 | 56.4 [50.2, 62.4] | 44.0 [38.0, 50.2] | – 12.4 [– 18.4, – 6.2]1 |
2 | 34.8 [29.2, 40.9] | 29.2 [23.9, 35.1] | – 5.6 [– 11.0, – 0.2]1 | |
3 | 38.8 [33.0, 45.0] | 38.0 [32.2, 44.2] | – 0.8 [– 4.4, 2.8] | |
Globally2 | 43.3 | 37.1 | – 6.3 [– 12.9, 0.3] | |
FICE 1 | 4 | 49.6 [43.5, 55.8] | 33.6 [28.0, 39.7] | – 16.0 [– 22.8, – 9.0]1 |
5 | 52.8 [46.6, 58.9] | 59.2 [53.0, 65.1] | 6.4 [0.0, 12.7]1 | |
6 | 32.1 [26.6, 38.2] | 80.7 [75.4, 85.1] | 48.6 [41.4, 54.9]1 | |
Globally2 | 44.9 | 57.8 | 12.9 [– 24.1, 50.0] | |
FICE 2 | 7 | 54.4 [48.2, 60.5] | 69.2 [63.2, 74.6] | 14.8 [8.7, 20.7]1 |
8 | 54.8 [48.6, 60.9] | 44.4 [38.4, 50.6] | – 10.4 [– 17.5, – 3.1]1 | |
9 | 90.0 [85.7, 93.1] | 91.6 [87.5, 94.4] | 1.6 [– 2.6, 5.9] | |
Globally2 | 66.4 | 68.4 | 2.0 [– 12.3, 16.3] | |
FICE 3 | 10 | 53.2 [47.0, 59.3] | 79.2 [73.7, 83.8] | 26.0 [19.4, 32.2]1 |
11 | 51.2 [45.0, 57.3] | 41.6 [35.7, 47.8] | – 9.6 [– 16.7, – 2.4]1 | |
12 | 44.8 [38.8, 51.0] | 46.4 [40.3, 52.6] | 1.6 [– 5.5, 8.7] | |
Globally2 | 49.7 | 55.7 | 6.0 [– 14.6, 26.6] | |
Blue Mode | 13 | 12.8 [9.2, 17.5] | 58.4 [52.2, 64.3] | 45.6 [39.0, 51.6]1 |
14 | 62.4 [56.3, 68.2] | 77.2 [71.6, 82.0] | 14.8 [9.9, 19.7]1 | |
15 | 68.0 [62.0, 73.5] | 75.2 [69.5, 80.1] | 7.2 [1.5, 12.8]1 | |
Globally2 | 47.7 | 70.3 | 22.5 [– 0.5, 45.5]3 |
VC, virtual chromoendoscopy; WLI, white light image; CI, confidence interval
statistically significant
stratified McNemar test
p = 0.07