Table 3.
Inter-rater reliability among expert raters across evaluation components.
| Component | ICC(2,1) | 95% CI lower | 95% CI upper | Interpretation |
|---|---|---|---|---|
| Accuracy | 0.737 | 0.653 | 0.809 | Good |
| Clarity | 0.689 | 0.593 | 0.771 | Moderate to good |
| Completeness | 0.857 | 0.804 | 0.898 | Excellent |
| No misleading info | 0.909 | 0.874 | 0.936 | Excellent |
| Relevance | 0.697 | 0.604 | 0.777 | Moderate to good |
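
For readers unfamiliar with the statistic, the point estimates in the ICC column can be illustrated with a minimal sketch of ICC(2,1) as defined by Shrout and Fleiss: two-way random effects, absolute agreement, single rater. The source does not state which software or procedure produced the values in the table, so the function `icc_2_1` below is a hypothetical helper and not the authors' method. The example matrix is the worked example from Shrout and Fleiss (1979), not data from this study, and the confidence intervals are not reproduced.

```python
import numpy as np


def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is an (n_subjects, k_raters) array with no missing values.
    Hypothetical helper for illustration only.
    """
    x = np.asarray(ratings, dtype=float)
    n, k = x.shape
    grand_mean = x.mean()

    # Sums of squares from a two-way ANOVA without replication.
    ss_subjects = k * ((x.mean(axis=1) - grand_mean) ** 2).sum()
    ss_raters = n * ((x.mean(axis=0) - grand_mean) ** 2).sum()
    ss_total = ((x - grand_mean) ** 2).sum()
    ss_error = ss_total - ss_subjects - ss_raters

    # Mean squares for subjects (rows), raters (columns), and residual error.
    ms_subjects = ss_subjects / (n - 1)
    ms_raters = ss_raters / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_subjects - ms_error) / (
        ms_subjects + (k - 1) * ms_error + k * (ms_raters - ms_error) / n
    )


# Worked example from Shrout and Fleiss (1979): 6 targets rated by 4 judges.
example = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(example), 2))  # approximately 0.29
```

Because ICC(2,1) measures absolute agreement, systematic differences between raters (the `ms_raters` term) lower the coefficient, which is why the example yields a low value even though the judges rank the targets similarly.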