Skip to main content
. 2025 Jul 28;16:13. doi: 10.1186/s13326-025-00334-5

Table 5.

Weighted average F1 score for input scenario Q (oversampling)

Label No Overs. SMOTE B. SMOTE ADASYN Pseudo. Keywo.
XGBoost 0.52 0.55 0.54 0.52 0.52 0.52
Naïve Bayes 0.42 0.58 0.59 0.42 0.42 0.42
Perceptron 0.56 0.55 0.55 0.56 0.56 0.56
MLP 0.37 0.36 0.38 0.35 0.37 0.35
Decision Tree 0.46 0.37 0.39 0.45 0.46 0.46
SVM 0.42 0.52 0.28 0.43 0.42 0.42
BERT 0.08 0.09 0.00

Note that we only applied keyword-based and pseudolabeling oversampling for the BERT-based classifier