Skip to main content
. 2024 Jan 16;37(1):268–279. doi: 10.1007/s10278-023-00909-7

Table 2.

Performance of the lung graph–based machine learning models in the identification of f-ILD on the testing set

Evaluation level Method AUC Accuracy Sensitivity Specificity PPV NPV
Scan-level Split 1 0.983 0.918 0.9 0.939 0.947 0.886
Split 2 0.996 0.984 0.973 1 1 0.963
Split 3 1 1 1 1 1 1
Split 4 0.965 0.908 0.897 0.923 0.946 0.857
Split 5 0.913 0.841 0.941 0.743 0.78 0.929
Mean 0.971 ± 0.032 0.930 ± 0.057 0.942 ± 0.040 0.921 ± 0.094 0.935 ± 0.081 0.927 ± 0.051
Patient-level Split 1 0.969 0.881 0.84 0.941 0.955 0.8
Split 2 0.99 0.976 0.958 1 1 0.944
Split 3 1 1 1 1 1 1
Split 4 0.949 0.854 0.833 0.882 0.909 0.789
Split 5 0.958 0.878 0.917 0.824 0.88 0.875
Mean 0.973 ± 0.019 0.918 ± 0.059 0.910 ± 0.065 0.929 ± 0.068 0.949 ± 0.048 0.882 ± 0.081

All results are shown as mean values and standard deviations over the five random splits. Evaluation results (except AUC) of the proposed method were calculated by using the standard classification decision threshold of 0.5

AUC area under the curve, PPV positive predict value, NPV negative predict value