Table 3:
Ablation studies on the CMU-MOSI dataset. The complete RAVEN, which models both subword dynamics and word shifts, performs best on all three metrics.
Model (CMU-MOSI) | MAE | Corr | Acc-2 (%) |
---|---|---|---|
RAVEN | 0.915 | 0.691 | 78.0 |
RAVEN w/o SHIFT | 0.954 | 0.666 | 77.7 |
RAVEN w/o SUB | 0.934 | 0.652 | 73.9 |
RAVEN w/o SUB&SHIFT | 1.423 | 0.116 | 50.6 |
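
To make the comparison in Table 3 easier to read, the following minimal sketch recomputes each ablation's gap to the full model from the reported numbers. The names `results` and `gaps_to_full` are illustrative and do not come from the paper; the sketch assumes that lower MAE is better and that higher Corr and Acc-2 are better.

```python
# Values copied from Table 3 (CMU-MOSI): (MAE, Corr, Acc-2).
# All names here are illustrative, not taken from the paper's code.
results = {
    "RAVEN": (0.915, 0.691, 78.0),
    "RAVEN w/o SHIFT": (0.954, 0.666, 77.7),
    "RAVEN w/o SUB": (0.934, 0.652, 73.9),
    "RAVEN w/o SUB&SHIFT": (1.423, 0.116, 50.6),
}


def gaps_to_full(results, full="RAVEN"):
    """Return (MAE increase, Corr drop, Acc-2 drop) of each ablation vs. the full model."""
    f_mae, f_corr, f_acc = results[full]
    return {
        name: (
            round(mae - f_mae, 3),    # assumption: lower MAE is better
            round(f_corr - corr, 3),  # assumption: higher Corr is better
            round(f_acc - acc, 1),    # assumption: higher Acc-2 is better
        )
        for name, (mae, corr, acc) in results.items()
        if name != full
    }


gaps = gaps_to_full(results)
for name, (d_mae, d_corr, d_acc) in gaps.items():
    print(f"{name}: MAE +{d_mae:.3f}, Corr -{d_corr:.3f}, Acc-2 -{d_acc:.1f}")
```

On these numbers, removing both components costs 27.4 Acc-2 points, compared with 0.3 points without SHIFT alone and 4.1 points without SUB alone.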