Skip to main content
. 2023 Dec 2;23(23):9571. doi: 10.3390/s23239571

Table 2.

We compare the baseline results of three deep learning models (CNN-LSTM, ResNet, and DeepConvLSTM) with Accuracy (higher is better) and F1 score (higher is better) using three cross-validation methods (k-fold, leave-recordings-out, and leave-one-subject-out) with five sensors. The standard deviation is reported with the ± symbol. We embolden the model per column.

Model k-Fold Leave-Recordings-Out Leave-One-Subject-Out
Accuracy F1 Accuracy F1 Accuracy F1
CNN-LSTM 0.832 (±0.01) 0.830 (±0.02) 0.632 (±0.03) 0.630 (±0.02) 0.565 (±0.05) 0.556 (±0.04)
ResNet 0.864 (±0.01) 0.864 (±0.01) 0.600 (±0.02) 0.603 (±0.03) 0.531 (±0.05) 0.527 (±0.03)
DeepConvLSTM 0.735 (±0.02) 0.730 (±0.04) 0.563 (±0.05) 0.565 (±0.05) 0.525 (±0.06) 0.515 (±0.04)