Table 2.
Audio-Model | Disent. | Audio-only | Word2vec Fusion (Text-only) | DeID (Audio-only) |
---|---|---|---|---|
| ||||
Raw-Audio ECAPA-TDNN | ADV | 0.790 | 0.860 (0.762) | 22.32% |
ComparE16 LSTM-only | USSD | 0.776 | 0.830 (0.762) | 92.87% |
Audio-Model | Disent. | Audio-only | Word2vec Fusion (Text-only) | DeID (Audio-only) |
---|---|---|---|---|
| ||||
Raw-Audio ECAPA-TDNN | ADV | 0.790 | 0.860 (0.762) | 22.32% |
ComparE16 LSTM-only | USSD | 0.776 | 0.830 (0.762) | 92.87% |