Table 1:
Research Area | Size | Dataset | Modalities | # Samples | Prediction task |
---|---|---|---|---|---|
Affective Computing | S | MUStARD [24] | {, , } | 690 | sarcasm |
Affective Computing | M | CMU-MOSI [181] | {, , } | 2,199 | sentiment |
Affective Computing | L | UR-FUNNY [64] | {, , } | 16,514 | humor |
Affective Computing | L | CMU-MOSEI [183] | {, , } | 22,777 | sentiment, emotions |
Healthcare | L | MIMIC [78] | {, } | 36,212 | mortality, ICD-9 codes |
Robotics | M | MuJoCo Push [90] | {, , } | 37,990 | object pose |
Robotics | L | Vision&Touch [92] | {, , } | 147,000 | contact, robot pose |
Finance | M | Stocks-F&B | { ×18} | 5,218 | stock price, volatility |
Finance | M | Stocks-Health | { ×63} | 5,218 | stock price, volatility |
Finance | M | Stocks-Tech | { ×100} | 5,218 | stock price, volatility |
HCI | S | ENRICO [93] | {, } | 1,460 | design interface |
Multimedia | S | Kinetics400-S [80] | {, , } | 2,624 | human action |
Multimedia | M | MM-IMDb [8] | {, } | 25,959 | movie genre |
Multimedia | M | AV-MNIST [161] | {, } | 70,000 | digit |
Multimedia | L | Kinetics400-L [80] | {, , } | 306,245 | human action |