Skip to main content
. 2024 Oct 22;14:24882. doi: 10.1038/s41598-024-76079-5

Table 3.

Comprehensive 10-fold cross-validated mean performance evaluation metrics with standard deviations for various models and embeddings, including time performance (training and inference times).

Word embedding Model Macro average precision Macro average recall Macro average F1-score Accuracy Training time (s) Inference time (s)
BoW RF 75% ± 1% 61% ± 2% 61% ± 2% 67% ± 1% 6.73 0.03
BoW NB 63% ± 1% 62% ± 2% 62% ± 2% 64% ± 1% 0.82 0.01
BoW SVM 76% ± 2% 76% ± 2% 76% ± 2% 77% ± 1% 4.35 0.30
BoW GBM 82% ± 1% 79% ± 1% 79% ± 1% 81% ± 1% 262.81 0.02
BoW LGBM 83% ± 1% 81% ± 1% 81% ± 1% 83% ± 1% 35.54 0.01
BoW XGBoost 77% ± 0% 75% ± 1% 75% ± 1% 78% ± 0% 86.50 0.02
BoW Catboost 80% ± 0% 74% ± 1% 75% ± 1% 77% ± 1% 536.16 0.02
BoW LGBM+KNN+MLP 82% ± 1% 81% ± 2% 81% ± 2% 84% ± 0% 1422.53 45.43
BoW RF+KNN+MLP 74% ± 1% 73% ± 2% 74% ± 1% 75% ± 1% 1212.43 40.54
BoW GBM+RF Stacking Classifier 78% ± 0% 76% ± 1% 76% ± 1% 78% ± 0% 1364.06 50.57
BoW GBM+RF Voting Classifier 78% ± 0% 69% ± 1% 70% ± 1% 72% ± 1% 1250.37 48.30
BoW Single Hidden Layer NN 71% ± 4% 69% ± 4% 69% ± 4% 72% ± 4% 646.34 24.53
BoW 3 Hidden Layers NN 68% ± 4% 67% ± 4% 67% ± 4% 70% ± 4% 904.64 25.35
TFIDF RF 69% ± 1% 60% ± 3% 60% ± 3% 66% ± 2% 6.75 0.04
TFIDF NB 65% ± 1% 56% ± 3% 56% ± 3% 61% ± 2% 0.89 0.02
TFIDF SVM 74% ± 2% 70% ± 3% 70% ± 3% 73% ± 2% 5.15 0.46
TFIDF GBM 80% ± 1% 77% ± 3% 77% ± 3% 79% ± 1% 273.51 0.05
TFIDF LGBM 80% ± 0% 77% ± 3% 78% ± 3% 80% ± 1% 36.12 0.02
TFIDF XGBoost 68% ± 1% 67% ± 1% 67% ± 1% 70% ± 1% 87.22 0.03
TFIDF Catboost 74% ± 1% 72% ± 1% 72% ± 1% 74% ± 1% 542.44 0.03
TFIDF LGBM+KNN+MLP 79% ± 0% 77% ± 1% 77% ± 1% 79% ± 0% 1532.37 46.24
TFIDF RF Bagging 76% ± 0% 48% ± 1% 43% ± 1% 56% ± 1% 764.34 35.34
TFIDF RF+KNN+MLP 75% ± 1% 71% ± 3% 72% ± 2% 74% ± 0% 1254.65 42.75
TFIDF GBM+RF Stacking Classifier 77% ± 0% 76% ± 1% 76% ± 1% 78% ± 0% 1352.53 52.65
TFIDF GBM+RF Voting Classifier 75% ± 0% 69% ± 2% 70% ± 1% 73% ± 0% 1283.23 50.75
TFIDF Single Hidden Layer NN 68% ± 3% 67% ± 4% 67% ± 4% 69% ± 2% 650.34 22.43
TFIDF 3 Hidden Layers NN 93% ± 3% 92% ± 4% 92% ± 3% 93% ± 4% 954.64 27.53
word2vec RF 51% ± 1% 49% ± 2% 49% ± 2% 56% ± 1% 6.82 0.05
word2vec NB 41% ± 1% 34% ± 1% 19% ± 1% 36% ± 1% 1.03 0.04
word2vec SVM 67% ± 0% 49% ± 2% 45% ± 3% 57% ± 1% 5.78 0.57
word2vec GBM 51% ± 1% 50% ± 2% 50% ± 2% 56% ± 1% 282.10 0.08
word2vec LGBM 53% ± 1% 49% ± 1% 47% ± 1% 57% ± 1% 37.41 0.03
word2vec XGBoost 55% ± 1% 50% ± 2% 48% ± 2% 56% ± 1% 88.60 0.03
word2vec Catboost 71% ± 0% 49% ± 1% 45% ± 1% 57% ± 1% 553.37 0.04
word2vec LGBM+KNN+MLP 53% ± 1% 46% ± 2% 41% ± 3% 53% ± 1% 1448.76 47.34
word2vec RF+KNN+MLP 55% ± 1% 49% ± 3% 43% ± 3% 57% ± 1% 1345.53 55.23
word2vec GBM+RF Stacking Classifier 46% ± 2% 45% ± 2% 45% ± 2% 51% ± 1% 1412.64 53.73
word2vec GBM+RF Voting Classifier 47% ± 1% 47% ± 1% 47% ± 1% 53% ± 1% 1350.23 28.78
word2vec Single Hidden Layer NN 36% ± 5% 45% ± 4% 40% ± 4% 53% ± 3% 704.65 28.34
word2vec 3 Hidden Layers NN 38% ± 4% 48% ± 4% 42% ± 4% 56% ± 3% 1034.89 33.38
BERT RF 53% ± 2% 51% ± 3% 49% ± 3% 58% ± 1% 805.43 45.53
BERT NB 50% ± 3% 51% ± 2% 50% ± 2% 53% ± 1% 1.12 0.07
BERT SVM 52% ± 2% 52% ± 2% 52% ± 2% 56% ± 1% 6.12 0.76
BERT GBM 56% ± 1% 54% ± 2% 54% ± 2% 60% ± 2% 291.11 0.11
BERT LGBM 58% ± 2% 57% ± 2% 57% ± 2% 61% ± 2% 37.66 0.03
BERT XGBoost 56% ± 2% 56% ± 2% 56% ± 2% 60% ± 2% 89.10 0.04
BERT Catboost 56% ± 2% 49% ± 2% 46% ± 2% 57% ± 2% 562.29 0.05
BERT LGBM+KNN+MLP 61% ± 1% 59% ± 2% 59% ± 2% 62% ± 1% 1623.65 65.34
BERT RF+KNN+MLP 60% ± 1% 58% ± 2% 57% ± 2% 62% ± 1% 1443.22 60.44
BERT GBM+RF Stacking Classifier 55% ± 2% 54% ± 2% 54% ± 2% 56% ± 1% 1523.75 70.23
BERT GBM+RF Voting Classifier 60% ± 1% 58% ± 2% 58% ± 2% 66% ± 0% 1452.45 67.85
BERT Single Hidden Layer NN 61% ± 3% 58% ± 4% 59% ± 4% 62% ± 2% 945.53 35.64
BERT 3 Hidden Layers NN 62% ± 4% 60% ± 4% 60% ± 4% 64% ± 4% 1305.39 45.49
SBERT RF 61% ± 0% 48% ± 3% 44% ± 4% 54% ± 2% 6.95 0.06
SBERT NB 53% ± 2% 40% ± 3% 32% ± 4% 44% ± 2% 1.14 0.09
SBERT SVM 56% ± 1% 57% ± 2% 56% ± 2% 58% ± 1% 6.28 0.81
SBERT GBM 55% ± 2% 51% ± 2% 49% ± 2% 55% ± 2% 302.43 0.15
SBERT LGBM 55% ± 2% 52% ± 2% 50% ± 2% 56% ± 2% 38.18 0.04
SBERT XGBoost 56% ± 2% 56% ± 2% 56% ± 2% 57% ± 2% 90.53 0.06
SBERT Catboost 69% ± 0% 48% ± 3% 42% ± 4% 53% ± 2% 577.54 0.05
SBERT LGBM+KNN+MLP 55% ± 2% 49% ± 2% 47% ± 2% 53% ± 2% 1734.23 68.64
SBERT RF+KNN+MLP 56% ± 1% 54% ± 2% 54% ± 2% 57% ± 1% 1522.42 63.43
SBERT GBM+RF Stacking Classifier 44% ± 2% 44% ± 2% 43% ± 2% 48% ± 1% 1623.86 72.57
SBERT GBM+RF Voting Classifier 53% ± 2% 50% ± 2% 50% ± 2% 54% ± 2% 1553.54 74.67
SBERT Single Hidden Layer NN 55% ± 3% 55% ± 4% 54% ± 3% 59% ± 3% 954.64 38.48
SBERT 3 Hidden Layers NN 56% ± 4% 57% ± 3% 57% ± 3% 59% ± 2% 1402.54 48.48
RoBERTa RF 62% ± 2% 62% ± 2% 62% ± 2% 64% ± 1% 7.23 0.08
RoBERTa NB 63% ± 1% 55% ± 2% 52% ± 3% 54% ± 2% 1.16 1.05
RoBERTa SVM 65% ± 1% 65% ± 1% 65% ± 1% 67% ± 0% 6.76 0.88
RoBERTa GBM 63% ± 2% 62% ± 3% 63% ± 2% 65% ± 2% 314.44 0.17
RoBERTa LGBM 64% ± 2% 63% ± 3% 64% ± 2% 66% ± 1% 38.89 0.05
RoBERTa XGBoost 65% ± 1% 63% ± 2% 63% ± 2% 66% ± 1% 91.14 0.07
RoBERTa Catboost 63% ± 2% 62% ± 3% 61% ± 4% 64% ± 2% 583.28 0.07
RoBERTa LGBM+KNN+MLP 66% ± 2% 66% ± 2% 65% ± 3% 66% ± 2% 1823.93 72.23
RoBERTa RF+KNN+MLP 58% ± 3% 55% ± 2% 55% ± 2% 60% ± 1% 1654.54 67.76
RoBERTa GBM+RF Stacking Classifier 61% ± 3% 61% ± 3% 61% ± 3% 63% ± 2% 1705.36 75.96
RoBERTa GBM+RF Voting Classifier 60% ± 2% 60% ± 2% 59% ± 3% 60% ± 2% 1653.78 73.47
RoBERTa Single Hidden Layer NN 84% ± 4% 84% ± 4% 84% ± 4% 84% ± 4% 1349.46 154.39
RoBERTa 3 Hidden Layers NN 84% ± 3% 83% ± 4% 83% ± 4% 84% ± 3% 1898.05 153.64
RoBERTa BiLSTM+3 Hidden Layers NN 84% ± 3% 84% ± 3% 84% ± 3% 85% ± 2% 3404.54 148.43
RoBERTa BilSTM+CNN 83% ± 2% 81% ± 4% 82% ± 3% 83% ± 2% 5328.73 178.64
RoBERTa Proposed TRABSA model 94% ± 1% 93% ± 2% 94% ± 1% 94% ± 1% 3675.21 147.14