Skip to main content
. 2025 Feb 28;11:e2738. doi: 10.7717/peerj-cs.2738

Table 7. Validation results for deep learning models on the 110 K review dataset.

Method Accuracy Std. dev. Class Precision Recall F1-score
RNN 0.957 0.0006 Positive 0.981 0.982 0.982
Neutral/Mixed 0.751 0.773 0.762
Negative 0.930 0.907 0.919
GRU 0.958 0.0002 Positive 0.978 0.985 0.981
Neutral/Mixed 0.774 0.751 0.763
Negative 0.930 0.907 0.918
LSTM 0.958 0.0009 Positive 0.979 0.984 0.982
Neutral/Mixed 0.766 0.754 0.760
Negative 0.931 0.909 0.920
BERT 0.975 0.0001 Positive 0.989 0.990 0.989
Neutral/Mixed 0.854 0.846 0.850
Negative 0.954 0.951 0.952