Skip to main content
. 2024 Jan 30;24(3):901. doi: 10.3390/s24030901

Table 5.

Confusion matrices of speech emotion recognition (4 classes) with LittleBeats™ and smartphone audio data.

LittleBeats™ Data Smartphone Data
Ground Truth Labels NEU HAP SAD ANG NEU HAP SAD ANG
Neutral (NEU) 22
(0.786)
2
(0.071)
2
(0.071)
2
(0.071)
17
(0.607)
1
(0.036)
6
(0.214)
4
(0.143)
Happy (HAP) 0
(0)
24
(0.649)
2
(0.054)
11
(0.297)
1
(0.027)
25
(0.676)
2
(0.054)
9
(0.243)
Sad (SAD) 6
(0.154)
8
(0.205)
25
(0.641)
0
(0)
6
(0.158)
4
(0.105)
27
(0.711)
1
(0.026)
Angry (ANG) 0
(0)
6
(0.158)
4
(0.105)
28
(0.737)
1
(0.026)
12
(0.316)
3
(0.079)
22
(0.579)

Note: Rows represent the ground truth labels, and columns represent the predicted data. The proportions of a given ground truth label that were predicted as neutral, happy, sad, and angry, respectively, are shown in parentheses.