Skip to main content
. 2021 Mar 22;8(21):16035–16046. doi: 10.1109/JIOT.2021.3067605

TABLE V. Normalized Confusion Matrices (in [%]) of the Best Models in Each Task on the Test Set. Inline graphic Task: Late Fusion of Open Smile: ComParE Func. + SVM, Inline graphic:.01; DeepSelf: e2e CNN+RNN; OpenXBOW: ComParE BoAW + SVM, Inline graphic:.01, Inline graphic: 125; auDeep: RNN+SVM, Inline graphic:.1, Inline graphic: −60 dB. Inline graphic Task: Late Fusion of OpenSMILE: ComParE Func. +SVM, Inline graphic:.0001; auDeep: RNN+SVM, Inline graphic:.1, Inline graphic: Fused; DeepSpectrum: VGG19 +SVM, Inline graphic:.01; OpenXBOW: ComParE BoAW +SVM, Inline graphic: 1.0, Inline graphic: 250; DeepSELF: e2e CNN, Channel: [3, 6], Kernel Size: [16, 8], Stride Size: [16, 8], Learning Rate:.0001. A Task: OpenSMILE: ComParE Func. +SVM, Inline graphic:.0001. (a) Inline graphic Task (UAR = 44.3%, Chance Level: 33.3%). (b) Inline graphic Task (UAR = 44.4%, Chance Level: 33.3%). (c) Inline graphic Task (UAR = 55.3%, Chance Level: 33.3%).

(a)
Pred - > Good Normal Bad
Good 50.0 28.1 21.9
Normal 0.0 40.0 60.0
Bad 28.6 28.6 42.9