Table 6. Action recognition performance.
Training Set | Embeddings | RNN | Loss | Top 1 Acc. (%) | Top 5 Acc. (%) |
---|---|---|---|---|---|
AVMIT | YamNet + EffNetB0 | FRNN | 0.1841 | 94.58 | 99.90 |
MIT 16 | YamNet + EffNetB0 | FRNN | 0.2973 | 89.79 | 99.90 |
AVMIT | YamNet + EffNetB0 | GRU | 0.1600 | 95.73 | 99.90 |
MIT 16 | YamNet + EffNetB0 | GRU | 0.2430 | 92.29 | 99.90 |
AVMIT | YamNet + EffNetB0 | LSTM | 0.1674 | 95.52 | 99.79 |
MIT 16 | YamNet + EffNetB0 | LSTM | 0.2366 | 92.81 | 100 |
AVMIT | VGGish + VGG-16 | FRNN | 0.2980 | 90.73 | 99.79 |
MIT 16 | VGGish + VGG-16 | FRNN | 0.4388 | 84.79 | 99.58 |
AVMIT | VGGish + VGG-16 | GRU | 0.2917 | 91.04 | 99.79 |
MIT 16 | VGGish + VGG-16 | GRU | 0.4108 | 85.83 | 99.69 |
AVMIT | VGGish + VGG-16 | LSTM | 0.2892 | 90.94 | 99.90 |
MIT 16 | VGGish + VGG-16 | LSTM | 0.3527 | 86.98 | 99.90 |