Table 2.
Performance of the combined ACR workflow on the MR-SIM and MRL testing sets based on CQC1 (the final results are in bold).
| CQC1 | Major-error group | Minor-error group | |||
|---|---|---|---|---|---|
| initial DLAS contours | after ACM-ACR | after DL-ACR | initial DLAS contours | after DL-ACR | |
| MR-SIM | |||||
| numbers of contours improved to minor-error group | N/A | 44% (177/401)* | 36% (145/401) | N/A | N/A |
| numbers of contours improved to acceptable group | N/A | 5% (22/401) | 30% (119/401) | N/A | 43% (320/750) |
| DSC | 0.33 (±0.15) | 0.53 (±0.17) | 0.63 (±0.23) | 0.71 (±0.10) | 0.79 (±0.12) |
| MDA (mm) | 7.78 (±3.02) | 6.15 (±3.62) | 5.33 (±4.27) | 3.65 (±1.55) | 3.22 (±2.24) |
| HD 95% (mm) | 20.27 (±8.25) | 17.36 (±10.24) | 15.59 (±12.19) | 13.66 (±6.58) | 12.29 (±8.85) |
| sDSC | 0.28 (±0.15) | 0.41 (±0.19) | 0.51 (±0.25) | 0.55 (±0.15) | 0.65 (±0.20) |
| APL (mm) | 83.26 (±38.42) | 68.70 (±42.88) | 57.44 (±45.20) | 83.44 (±60.80) | 71.79 (±69.53) |
| MRL | |||||
| numbers of contours improved to minor-error group numbers of contours | N/A | 42% (202/485)† | 35% (172/485) | N/A | N/A |
| numbers of contours improved to acceptable group | N/A | 8% (38/485) | 28% (136/485) | N/A | 36% (267/732) |
| DSC | 0.34 (±0.16) | 0.54 (±0.19) | 0.62 (±0.25) | 0.70 (±0.10) | 0.77 (±0.14) |
| MDA (mm) | 9.64 (±5.36) | 8.11 (±6.91) | 7.37 (±8.23) | 4.39 (±1.69) |
4.13 (±2.95)
p = 0.009 |
| HD 95% (mm) | 26.18 (±14.59) | 24.05 (±18.49) | 22.18 (±21.68) | 17.34 (±7.21) |
16.43 (±11.56)
p = 0.017 |
| sDSC | 0.29 (±0.14) | 0.43 (±0.19) | 0.51 (±0.25) | 0.57 (±0.14) | 0.65 (±0.20) |
| APL (mm) | 101.59 (±47.66) | 81.61 (±46.35) | 69.30 (±51.01) | 83.43 (±67.05) | 69.75 (±72.28) |
Note: paired t test p values <<0.001 after each ACR step if not listed in the table (see Supplementary materials for the exact p values).
among these 177 contours that were improved to minor-error group by ACM-ACR, 87 (49%) were further improved to acceptable contours by DL-ACR.
among these 202 contours that were improved to minor-error group by ACM-ACR, 88 (44%) were further improved to acceptable contours by DL-ACR.