Table 4.
Comparison of expert reader and AI reliability for repeated measurements, assessed with the intraclass correlation coefficient (ICC) using a two-way mixed-effects model with absolute agreement and single measures.
| Anatomical location | Expert reader: ICC (95% CI, p) | Expert reader: rating | AI analysis: ICC (95% CI, p) | AI analysis: rating |
|---|---|---|---|---|
| SoV | 0.89 (0.71–0.96, p<0.001) | Good | 0.78 (0.53–0.91, p<0.001) | Good |
| STJ | 0.78 (0.57–0.90, p<0.001) | Good | 0.69 (0.40–0.85, p<0.001) | Moderate |
| Mid Asc | 0.96 (0.92–0.98, p<0.001) | Excellent | 0.88 (0.74–0.95, p<0.001) | Good |
| Mid Arch | 0.79 (0.57–0.91, p<0.001) | Good | 0.74 (0.49–0.88, p<0.001) | Moderate |
| Isthmus | 0.87 (0.69–0.94, p<0.001) | Good | 0.79 (0.58–0.91, p<0.001) | Good |
| Mid Desc | 0.84 (0.68–0.93, p<0.001) | Good | 0.66 (0.35–0.84, p<0.001) | Moderate |
| Hiatus | 0.84 (0.66–0.93, p<0.001) | Good | 0.57 (0.22–0.79, p=0.002) | Moderate |
CI, confidence interval; ICC, intraclass correlation coefficient; STJ, sinotubular junction; SoV, sinus of Valsalva.
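
For readers who wish to reproduce this type of analysis, the following is a minimal sketch of the single-measures, absolute-agreement ICC from a two-way model, computed from the usual ANOVA mean squares as ICC = (MSR − MSE) / (MSR + (k − 1)·MSE + k·(MSC − MSE)/n), where n is the number of subjects and k the number of repeated measurements. It is not the authors' code: the function names `icc_a1` and `rating` and the example values are hypothetical, and the rating cut-offs (below 0.50 poor, 0.50 to 0.75 moderate, 0.75 to 0.90 good, above 0.90 excellent) are an assumption that happens to be consistent with the ratings shown in the table. Confidence intervals and p-values are not computed here.

```python
def icc_a1(ratings):
    """Single-measures, absolute-agreement ICC from a two-way ANOVA.

    ratings: list of n rows (subjects), each with k repeated measurements.
    """
    n = len(ratings)
    k = len(ratings[0])
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Sums of squares for subjects (rows), repeats (columns), and residual error.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in ratings for v in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))

    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


def rating(icc):
    """Map an ICC value to a qualitative label (assumed cut-offs, see text)."""
    if icc < 0.50:
        return "Poor"
    if icc < 0.75:
        return "Moderate"
    if icc <= 0.90:
        return "Good"
    return "Excellent"


# Hypothetical diameters (mm): three subjects, two repeated measurements each,
# with the second measurement systematically 1 mm larger.
example = [[30.0, 31.0], [31.0, 32.0], [32.0, 33.0]]
value = icc_a1(example)
print(round(value, 3), rating(value))  # 0.667 Moderate
```

In the hypothetical example the two measurements are perfectly correlated, yet the ICC is only about 0.67, because the absolute-agreement definition penalises the systematic 1 mm offset between repeats.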