Skip to main content
. 2025 Feb 28;11:e2738. doi: 10.7717/peerj-cs.2738

Table 5. Validation results for deep learning models on the 21 K review dataset.

Method Accuracy Std. dev. Class Precision Recall F1-score
RNN 0.946 0.0017 Positive 0.970 0.981 0.975
Neutral/Mixed 0.688 0.621 0.652
Negative 0.930 0.914 0.922
GRU 0.950 0.0011 Positive 0.971 0.983 0.977
Neutral/Mixed 0.710 0.627 0.665
Negative 0.935 0.924 0.930
LSTM 0.945 0.0007 Positive 0.971 0.976 0.974
Neutral/Mixed 0.676 0.621 0.646
Negative 0.925 0.936 0.931
BERT 0.964 0.0027 Positive 0.975 0.993 0.984
Neutral/Mixed 0.834 0.679 0.748
Negative 0.955 0.946 0.951