Skip to main content
. 2020 Nov 30;3:543405. doi: 10.3389/frai.2020.543405

TABLE 3.

Safety and appropriateness of triage recommendations for doctors and the Babylon Triage and Diagnostic System (Babylon AI) against a range of acceptable recommendations provided by GPs.

GP-1 GP-2 GP-3
Safety (%) Appr. (%) Safety (%) Appr. (%) Safety (%) Appr. (%)
(95% CI) (95% CI) (95% CI)
Doctor A 97.9 89.4 91.5 83.0 95.7 89.4
Doctor B 79.5 75.6 60.3 59.0 75.6 74.4
Doctor C 97.9 89.6 93.8 89.6 95.8 93.8
Doctor D 80.4 76.5 64.7 62.8 86.3 84.3
Doctor E 84.3 78.6 70.0 67.1 80.0 78.6
Doctor F 92.2 86.3 74.5 68.6 92.2 84.3
Doctor G 92.2 88.2 72.6 70.6 84.3 80.4
Doctor average 89.2 83.5 75.3 71.5 87.1 83.6
(82.5–95.9) (78.1–88.8) (64.4–86.3) (62.1–80.9) (80.4–93.8) (78.0–89.2)
Babylon AI 90.0 74.0 81.0 75.0 90.0 81.0

The AI powered System gives safer triage recommendations than the doctors on average, at the expense of a slightly lower appropriateness.