Table 6. Pairwise comparison results for each perceptual evaluation question.
Each cell reports the Bonferroni-corrected -value and the corresponding effect size , with magnitude interpreted as small, medium, or large.
Image consistency | Inappr. Reduction | Text alignment | Overall quality | |||||
---|---|---|---|---|---|---|---|---|
-value | Effect size | -value | Effect size | -value | Effect size | -value | Effect size | |
SLD-Weak vs SLD-Max | <0.001 | 0.513 (large) | 0.003 | 0.209 (small) | 1.000 | 0.022 (small) | <0.001 | 0.419 (medium) |
SLD-Weak vs. ESD | 0.002 | 0.264 (small) | 0.074 | 0.137 (small) | 1.000 | 0.010 (small) | <0.001 | 0.297 (small) |
SLD-Weak vs. Ours | <0.001 | 0.990 (large) | <0.001 | 0.633 (large) | <0.001 | 0.719 (large) | <0.001 | 0.643 (large) |
SLD-Max vs. ESD | <0.001 | 0.602 (large) | 1.000 | 0.109 (small) | 1.000 | 0.048 (small) | <0.001 | 0.531 (large) |
SLD-Max vs. Ours | <0.001 | 0.990 (large) | <0.001 | 0.472 (medium) | <0.001 | 0.661 (large) | <0.001 | 0.419 (medium) |
ESD vs. Ours | <0.001 | 0.990 (large) | <0.001 | 0.615 (large) | <0.001 | 0.723 (large) | <0.001 | 0.798 (large) |