Table 2.
Trial | Percentage rate mean (range) | Q score | Prior group discussion and practice | Outcome of post hoc analysis | Dx prompts on rating form | Rating categories included in analysis | Raters rating each subject | Subjects included in analysis |
---|---|---|---|---|---|---|---|---|
Mist 2011 | 96.40% | 8.5 | * | 6 | 2 | 56 | ||
Sung Phase III | [80%] (77.8–84.4) | 6.7 | * | * | 4 | 4 | 65 | |
Zhang 2008a | [73%] (64.3–85.7) | 8.7 | * | * | * | 9 | 3 | 42 |
Grant Primary | 70% (25–100) | 8.5 | * | 3 | 2 | 27 | ||
MacPherson Secondary | [68.0%] (56–80) | 8 | * | * | 3 | 2 | 48 | |
MacPherson Primary | [65.4%] (47–75) | 8 | * | * | 3 | 2 | 87 | |
Zhang 2004 less stringenta | 64.8% (46.7–84.7) | 9.2 | * | * | * | [10] | 3 | 39 |
Tang | 59% (0–50) | 5 | * | * | * | 5 | 2 | 22 |
Sung Phase I | [57.4%] (46.2–70.6) | 6.7 | * | 4 | 4 | 39 | ||
Grant combined primary and secondary | 41% (range n/r) | 8.5 | * | n/r | 2 | 27 | ||
Zhang 2005 | 31.7% (27.5–35) | 7.3 | * | * | [7] | 3 | 40 | |
Zhang 2004 more stringenta | 28.2% (26.5–33.3) | 9.2 | * | * | * | [10] | 3 | 39 |
Popplewell pattern agreement | 23% (range n/r) | 5 | * | 18 | 2 for 16 | 35 | ||
3 for 19 | ||||||||
Popplewell weighted agreementb | 19% (range n/r) | 5 | * | 18 | 2 for 16 | 35 | ||
3 for 19 | ||||||||
Mean: | 56.7% | |||||||
Median: | 64.8% |
[] Values calculated from tabular data in publication (E.J.); median between ranks occurs between Zhang and Tang; Dx, diagnosis; * indicates that the trial had the feature named at the head of the column.
Mean rates reported in O'Brien 2009a28 for agreement by “at least two” raters may not be comparable to pairwise rates, and are omitted.
Outcomes weighted by number of pairs agreeing per dx category.
Outcome weighted by similarity of severity scores.