Sensors. 2020 Aug 27;20(17):4833. doi: 10.3390/s20174833

Table 6.

Classification performance on the test dataset. Models include the constant model (outcome bias), the baseline models (k-nearest neighbors (KNN), logistic regression, naïve Bayes, and decision tree), and the black-box models (support vector machine (SVM), neural network, AdaBoost, and random forest).

Reported Focus (Yes/No)      Liberal (95% Training)          Conservative (10% Training)
Model                        AUC   F1    Precision  Recall   AUC   F1    Precision  Recall
Constant                     0.46  0.64  0.56       0.75     0.50  0.64  0.56       0.75
K-Nearest Neighbors (k = 3)  0.87  0.80  0.80       0.80     0.81  0.79  0.79       0.79
Logistic Regression          0.54  0.64  0.56       0.75     0.52  0.64  0.69       0.75
Naïve Bayes                  0.68  0.64  0.56       0.75     0.66  0.66  0.68       0.74
Decision Tree (depth = 4)    0.73  0.75  0.81       0.80     0.71  0.74  0.74       0.77
Support Vector Machine       0.51  0.61  0.61       0.61     0.53  0.64  0.62       0.67
Neural Network               0.54  0.64  0.56       0.75     0.52  0.64  0.68       0.75
AdaBoost                     0.86  0.90  0.90       0.90     0.72  0.78  0.78       0.78
Random Forest                0.96  0.90  0.90       0.91     0.85  0.81  0.81       0.82
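
For orientation, the sketch below shows one way a comparison of this kind could be assembled: each model is trained on either 95% (liberal) or 10% (conservative) of the data and scored on the held-out remainder with AUC, F1, precision, and recall. This is not the authors' code; the use of scikit-learn, the synthetic dataset from make_classification, and all hyperparameters beyond k = 3 and tree depth 4 are assumptions standing in for the paper's sensor features and "reported focus" labels.

```python
# Minimal sketch (assumed setup, not the authors' pipeline): compare the model
# families from Table 6 under a 95% and a 10% training split.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.dummy import DummyClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier
from sklearn.svm import SVC
from sklearn.neural_network import MLPClassifier
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.metrics import roc_auc_score, f1_score, precision_score, recall_score

# Placeholder data standing in for the sensor features and the binary
# "reported focus" outcome used in the paper.
X, y = make_classification(n_samples=500, n_features=20, random_state=0)

models = {
    "Constant": DummyClassifier(strategy="most_frequent"),
    "K-Nearest Neighbors (k = 3)": KNeighborsClassifier(n_neighbors=3),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Naive Bayes": GaussianNB(),
    "Decision Tree (depth = 4)": DecisionTreeClassifier(max_depth=4, random_state=0),
    "Support Vector Machine": SVC(probability=True, random_state=0),
    "Neural Network": MLPClassifier(max_iter=1000, random_state=0),
    "AdaBoost": AdaBoostClassifier(random_state=0),
    "Random Forest": RandomForestClassifier(random_state=0),
}

# "Liberal" split trains on 95% of the data; "conservative" split on 10%.
for label, train_frac in [("Liberal (95% training)", 0.95),
                          ("Conservative (10% training)", 0.10)]:
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, train_size=train_frac, stratify=y, random_state=0)
    print(label)
    for name, model in models.items():
        model.fit(X_tr, y_tr)
        pred = model.predict(X_te)
        # AUC is computed from predicted probabilities of the positive class.
        proba = model.predict_proba(X_te)[:, 1]
        print(f"  {name:28s} "
              f"AUC={roc_auc_score(y_te, proba):.2f}  "
              f"F1={f1_score(y_te, pred):.2f}  "
              f"Precision={precision_score(y_te, pred, zero_division=0):.2f}  "
              f"Recall={recall_score(y_te, pred):.2f}")
```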