Skip to main content
. 2023 Oct 16;23:217. doi: 10.1186/s12911-023-02315-z

Table 2.

Performance comparison of NLP classification models on the test dataset. The highest score values are shown in bold

NLP Classification Model Accuracy F1 Score Precision Recall
Keyword-based labelling 0.84 0.86 0.97 0.77
Word2vec and XGBoost 0.76 0.75 0.78 0.73
TF-IDF and Logistic Regression 0.88 0.87 0.94 0.81
BERT 0.90 0.89 0.96 0.83
BERTweet 0.90 0.90 0.95 0.85