Skip to main content
. 2024 Aug 30;70(9):e20240523. doi: 10.1590/1806-9282.20240523

Table 2. Emergency doctor and YOLOv8 model performance comparison based on the test datasets.

Emergency doctor
Without artificial intelligence
Total sets True positive True negative False positive False negative Sensitivity (95%CI) Specificity (95%CI) Accuracy (95%CI)
All 1,000 655 230 71 44 93.7% 75.2% 88.0%
Fracture 400 225 118 38 19 92.2% 75.6% 85.8%
Not fracture 600 430 112 33 25 94.5% 77.2% 90.3%
With artificial intelligence
Total sets True positive True negative False positive False negative Sensitivity (95%CI) Specificity (95%CI) Accuracy (95%CI)
All 1,000 688 261 30 21 97.0% 89.7% 94.9%
Fracture 400 239 132 21 8 96.8% 86.3% 92.8%
Not fracture 600 449 129 9 13 97.2% 93.5% 96.3%