2023 Mar 22;14:1025749. doi: 10.3389/fendo.2023.1025749

Table 3.

Performance of the deep learning model versus radiologists in classifying compression fractures in the human and deep learning collaboration dataset.

| Dataset | AUC | Accuracy (%) | Sensitivity (%) | Specificity (%) | Precision (%) | F1 Score (%) | PPV (%) | NPV (%) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Deep learning model | | 76.4 | 64.7 | 82.3 | 65.0 | 64.8 | 64.7 | 82.3 |
| Acute | 0.780 | 69.3 | 67.5 | 71.4 | 73.0 | 70.1 | 73.0 | 65.8 |
| Chronic | 0.809 | 71.3 | 68.4 | 73.1 | 60.9 | 64.5 | 60.9 | 79.1 |
| Pathologic | 0.734 | 88.7 | 30.8 | 94.2 | 33.3 | 32.0 | 33.3 | 93.5 |
| Trainee radiologist | | 70.7 | 56.0 | 78.0 | 55.7 | 55.8 | 56.0 | 78.0 |
| Acute | 0.573 | 62.0 | 65.0 | 58.6 | 64.2 | 64.6 | 64.2 | 59.4 |
| Chronic | 0.618 | 64.0 | 52.6 | 71.0 | 52.6 | 52.6 | 52.6 | 71.0 |
| Pathologic | 0.541 | 86.0 | 15.4 | 92.7 | 16.7 | 16.0 | 16.7 | 92.0 |
| Competent radiologist | | 76.9 | 65.3 | 82.7 | 69.9 | 67.5 | 65.3 | 82.7 |
| Acute | 0.701 | 69.3 | 58.8 | 81.4 | 78.3 | 67.1 | 78.3 | 63.3 |
| Chronic | 0.782 | 78.0 | 78.9 | 77.4 | 68.2 | 73.2 | 68.2 | 85.7 |
| Pathologic | 0.665 | 83.3 | 46.2 | 86.9 | 25.0 | 32.4 | 25.0 | 94.4 |
| Expert radiologist | | 78.2 | 67.3 | 83.7 | 67.4 | 67.4 | 67.3 | 83.7 |
| Acute | 0.707 | 70.7 | 70.0 | 71.4 | 73.7 | 71.8 | 73.7 | 67.6 |
| Chronic | 0.732 | 74.0 | 70.2 | 76.3 | 64.5 | 67.2 | 64.5 | 80.7 |
| Pathologic | 0.667 | 90.0 | 38.5 | 94.9 | 41.7 | 40.0 | 41.7 | 94.2 |
| Deep learning and trainee | | 77.8 | 66.7 | 83.3 | 67.9 | 67.3 | 66.7 | 83.3 |
| Acute | 0.722 | 72.0 | 70.0 | 74.3 | 75.7 | 72.7 | 75.7 | 68.4 |
| Chronic | 0.744 | 75.3 | 70.2 | 78.5 | 66.7 | 68.4 | 66.7 | 81.1 |
| Pathologic | 0.610 | 86.0 | 30.8 | 91.2 | 25.0 | 27.6 | 25.0 | 93.3 |
| Deep learning and competent radiologist | | 81.6 | 72.7 | 86.0 | 73.4 | 73.1 | 72.2 | 86.3 |
| Acute | 0.767 | 76.7 | 76.3 | 77.1 | 79.2 | 77.7 | 79.2 | 74.0 |
| Chronic | 0.779 | 79.3 | 71.9 | 83.9 | 73.2 | 72.6 | 73.2 | 83.0 |
| Pathologic | 0.729 | 88.7 | 53.8 | 92.0 | 38.9 | 45.2 | 38.9 | 95.5 |
| Deep learning and expert radiologist | | 85.3 | 77.6 | 89.1 | 77.4 | 77.5 | 77.6 | 89.1 |
| Acute | 0.801 | 80.0 | 80.5 | 79.5 | 80.5 | 80.5 | 80.5 | 79.5 |
| Chronic | 0.825 | 83.3 | 78.9 | 86.0 | 77.6 | 78.3 | 77.6 | 87.0 |
| Pathologic | 0.751 | 92.7 | 53.8 | 96.4 | 58.3 | 56.0 | 58.3 | 95.7 |
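
As a reading aid for the F1 Score column, the sketch below recomputes F1 as the harmonic mean of precision and sensitivity for the three per-class rows of the deep learning model, using the percentages reported in Table 3. The function name `f1_score` and the `rows` list are illustrative and not taken from the article. The inputs are already rounded to one decimal place, so the recomputed values can differ from the reported ones by about 0.1 percentage points.

```python
def f1_score(precision_pct, sensitivity_pct):
    """Harmonic mean of precision and sensitivity, both given in percent."""
    total = precision_pct + sensitivity_pct
    if total == 0:
        return 0.0  # convention when both inputs are zero
    return 2 * precision_pct * sensitivity_pct / total


# (class, sensitivity %, precision %, reported F1 %) copied from the
# "Deep learning model" per-class rows of Table 3.
rows = [
    ("Acute", 67.5, 73.0, 70.1),
    ("Chronic", 68.4, 60.9, 64.5),
    ("Pathologic", 30.8, 33.3, 32.0),
]

for label, sensitivity, precision, reported in rows:
    computed = f1_score(precision, sensitivity)
    print(f"{label}: computed F1 = {computed:.2f}, reported F1 = {reported}")
```

The same check can be repeated for any other per-class row by appending its sensitivity, precision, and reported F1 to `rows`. In the per-class rows the Precision and PPV columns carry identical values, so either column can serve as the precision input.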