Table 2. Inter-rater reliability for each criterion at the final review of the 1900 test items.
| Criterion | Inter-rater reliability (κ) at final review |
| --- | --- |
| Simplicity | 0.96 |
| Objectivity | 0.99 |
| Positivity | 0.94 |
| Clarity | 0.99 |
| Parallelism | 0.98 |
| Brevity | 1.00 |
| Relevance | 0.99 |
| Order | 1.00 |
| Independence | 1.00 |
| Appropriateness | 0.95 |