Table 4. Visual accuracy scores in the ILD CT segmentation.
| Original CT (automated) | Converted CT (automated) | Reference standard (original, manual) | P | Post-hoc test | |
|---|---|---|---|---|---|
| Overall (Group 2 to 7) | 6.35 ± 2.49 | 7.64 ± 1.94 | 8.54 ± 1.60 | < 0.001* | RS > Conv > ORIG† |
| Group 2 | 6.04 ± 2.57 | 7.52 ± 2.13 | 8.79 ± 1.42 | < 0.001* | RS > Conv > ORIG† |
| Group 3 | 6.81 ± 2.32 | 7.54 ± 2.16 | 8.59 ± 1.34 | < 0.001* | RS > Conv > ORIG† |
| Group 4 | 6.78 ± 2.07 | 7.56 ± 1.84 | 8.39 ± 1.53 | < 0.001* | RS > Conv > ORIG† |
| Group 5 | 7.11 ± 1.96 | 7.72 ± 1.77 | 8.25 ± 1.79 | < 0.001* | RS > Conv > ORIG† |
| Group 6 | 6.79 ± 2.31 | 7.93 ± 1.58 | 8.39 ± 2.00 | < 0.001* | RS > Conv > ORIG† |
| Group 7 | 4.74 ± 2.80 | 7.56 ± 2.09 | 8.77 ± 1.39 | < 0.001* | RS > Conv > ORIG† |
Data are presented as mean ± standard deviation of the average of the visual scores of five readers of each CT slice. Visual accuracy scores are defined as the degree of agreement with the readers’ subjective segmentation of ILD CT patterns: 1 = agreement from 0 to 9%, 2 = 10% to 19%, 3 = 20% to 29%, 4 = 30% to 39%, 5 = 40% to 49%, 6 = 50% to 59%, 7 = 60% to 69%, 8 = 70% to 79%, 9 = 80% to 89%, and 10 = 90% to 100%.
*P-value was calculated using a repeated-measures analysis, †Indicates that the post-hoc tests among RS, Conv, and ORIG are statistically significant.
ILD = interstitial lung disease, CT = computed tomography, RS = reference standard, Conv = Converted, ORIG = original