Table 2.
Performance of LR and RF (top: accuracy, middle: AUC, bottom: Brier score): mean performance μ, standard deviation σ, and confidence interval for the mean (estimated via the bootstrap BCa method [38]) on the 243 datasets.
| Accuracy | μ | σ | BCa confidence interval |
|---|---|---|---|
| Logistic regression | 0.826 | 0.135 | [0.808, 0.842] |
| Random forest | 0.854 | 0.134 | [0.837, 0.870] |
| Difference (RF - LR) | 0.029 | 0.067 | [0.021, 0.038] |
| AUC | | | |
| Logistic regression | 0.826 | 0.149 | [0.807, 0.844] |
| Random forest | 0.867 | 0.147 | [0.847, 0.884] |
| Difference (RF - LR) | 0.041 | 0.088 | [0.031, 0.054] |
| Brier score | | | |
| Logistic regression | 0.129 | 0.091 | [0.117, 0.140] |
| Random forest | 0.102 | 0.080 | [0.092, 0.112] |
| Difference (RF - LR) | -0.0269 | 0.054 | [-0.034, -0.021] |
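
For readers unfamiliar with the BCa (bias-corrected and accelerated) bootstrap used for the intervals in Table 2, the following is a minimal sketch of how such an interval for a mean can be computed. It is not the authors' code: the function name `bca_ci_mean`, the number of resamples, the seeds, and the per-dataset differences `diffs` are all hypothetical, and the data are synthetic values generated only so that the example runs.

```python
import random
from statistics import NormalDist


def bca_ci_mean(values, n_resamples=2000, confidence=0.95, seed=0):
    """Return a BCa bootstrap confidence interval (low, high) for the mean.

    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = random.Random(seed)
    norm = NormalDist()
    n = len(values)
    theta_hat = sum(values) / n

    # Bootstrap replicates of the mean (resampling with replacement), sorted.
    reps = sorted(sum(rng.choices(values, k=n)) / n for _ in range(n_resamples))

    # Bias correction z0: based on the share of replicates below the estimate.
    # The share is clamped away from 0 and 1 so the normal quantile is finite.
    below = sum(r < theta_hat for r in reps)
    share = min(max(below / n_resamples, 1 / (n_resamples + 1)),
                n_resamples / (n_resamples + 1))
    z0 = norm.inv_cdf(share)

    # Acceleration a: estimated from leave-one-out (jackknife) means.
    total = sum(values)
    jack = [(total - v) / (n - 1) for v in values]
    jack_mean = sum(jack) / n
    num = sum((jack_mean - j) ** 3 for j in jack)
    den = 6.0 * sum((jack_mean - j) ** 2 for j in jack) ** 1.5
    a = num / den if den > 0 else 0.0

    def adjusted_quantile(level):
        # Shift the nominal percentile level using z0 and a, then read off
        # the corresponding sorted bootstrap replicate.
        z = norm.inv_cdf(level)
        adj = norm.cdf(z0 + (z0 + z) / (1 - a * (z0 + z)))
        idx = min(max(int(adj * n_resamples), 0), n_resamples - 1)
        return reps[idx]

    alpha = 1 - confidence
    return adjusted_quantile(alpha / 2), adjusted_quantile(1 - alpha / 2)


# Synthetic stand-in for 243 per-dataset differences (NOT the study's data).
data_rng = random.Random(42)
diffs = [data_rng.gauss(0.03, 0.07) for _ in range(243)]

low, high = bca_ci_mean(diffs)
print(f"mean = {sum(diffs) / len(diffs):.3f}, BCa 95% CI = [{low:.3f}, {high:.3f}]")
```

Compared with a plain percentile bootstrap, BCa shifts the percentile levels to account for bias and skewness in the bootstrap distribution, which is why the intervals need not be symmetric around μ.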