2022 Dec 7;22:1285. doi: 10.1186/s12885-022-10366-0

Table 5.

The agreement of treatment response assessment

Comparison	Testing dataset	Validation cohort
R1 vs. reference standard	0.48 (0.21–0.74)	0.63 (0.43–0.84)
R2 vs. reference standard	0.30 (0.11–0.40)	0.45 (0.20–0.69)
Automated segmentation vs. reference standard	0.51 (0.23–0.79)	0.60 (0.34–0.84)
R1 vs. R2	0.58 (0.33–0.84)	0.55 (0.32–0.78)
R1 vs. automated segmentation	0.85 (0.70–1.00)	0.74 (0.53–0.96)
R2 vs. automated segmentation	0.46 (0.20–0.72)	0.50 (0.24–0.75)

R1: an attending radiologist with 8 years of reading experience; R2: a fellow radiologist with 4 years of reading experience.