Table 3.
Triage | Level 1 | Level 2 | Level 3 | Level 4 | Level 5 |
---|---|---|---|---|---|
Gold standard (rater 1) | 10 (5.0) | 69 (34.2) | 83 (41.1) | 38 (18.8) | 2 (1.0) |
Rater 2 | 15 (7.4) | 68 (33.74) | 74 (36.6) | 40 (19.8) | 5 (2.5) |
Rater 3 | 15 (7.4) | 69 (34.2) | 76 (37.6) | 37 (18.3) | 5 (2.5) |
Rater 4 | 4 (2.0) | 67 (33.2) | 84 (41.6) | 44 (21.8) | 3 (1.5) |
GPT3.5 | 100 (49.5) | 97 (48.0) | 5 (2.5) | 0 (0.0) | 0 (0.0) |
GPT4 | 12 (5.9) | 55 (27.2) | 71 (35.1) | 59 (29.2) | 5 (2.5) |