. 2025 Jul 28;16:13. doi: 10.1186/s13326-025-00334-5

Table 5.

Weighted average F1 score for input scenario Q (oversampling)

Label	No Overs.	SMOTE	B. SMOTE	ADASYN	Pseudo.	Keywo.
XGBoost	0.52	0.55	0.54	0.52	0.52	0.52
Naïve Bayes	0.42	0.58	0.59	0.42	0.42	0.42
Perceptron	0.56	0.55	0.55	0.56	0.56	0.56
MLP	0.37	0.36	0.38	0.35	0.37	0.35
Decision Tree	0.46	0.37	0.39	0.45	0.46	0.46
SVM	0.42	0.52	0.28	0.43	0.42	0.42
BERT	0.08	–	–	–	0.09	0.00

Note that we only applied keyword-based and pseudolabeling oversampling for the BERT-based classifier