Skip to main content
. 2022 Jan 18;55(6):4755–4808. doi: 10.1007/s10462-021-10116-x

Table A.6.

Video datasets

C1 C2 C3 C4 C5 C6 C7 C8 C9
Dataset name/Publicly available Year Source # Classes # Actor Body part involved Activity type Single/multiple person Size
FE H–O H–H ADL G RT
R1 CAD-60/Pub 2009 Kinect 5 (environments) 12 (activities) 4 (2 M, 2F) Whole body joint Single 60 videos
R2 CAD-120/Pub 2009 Kinect 10 (High level) 10 (sub activity labels) 12 (sub affordance labels) 4 (2 M, 2F) Whole body joint Single 120 videos
R3 MSR Action 3D/Pub 2009 DC 10 subjects, 20 action, 20 3D joints) 10 Whole body Single 336 action files
R4 UT Kinect/Pub 2012 Kinect 10 actions 10 Whole body Single 1.79 GB
R5 AVA/Pub 2018 MC 80 atomic visual actions 192 movies Whole body Both 57,600 videos
R6 UCF-101/Pub 2012 YT 101 actions 2,500 videos Whole body Both 13 K clips 27 h
R7 HMDB51/Pub 2011 YT, GV, MC 51 action classes 3,312 videos Whole body Both 6,766 clips of 2 GB
R8 Charades/Pub 2016 ADL 157 classes 267 Whole body Single 4855 KB
R9 Kinetics 400/Pub 2017 YT 400 400–1000 clips/class Whole body Both 3,00,000
R10 Kinetics 600/Pub 2018 YT 600 600–1000 clips/class Whole body Both 5,00,000
R11 SomethingSomething/Pub 2018 Objects Actions 174 classes H-I actions Whole body H–O interaction 108,499 videos
R12 Weizmann/Pub 2005 ADL 10 action classes 2 subs Whole body Single 90 videos
R13 UCSD/Pub 2013 camera Peds1 and Peds2 Subway People group Surveillance data Peds1: 60 & Peds2: 28

*CIT citations, ADL activities of daily living, M male, F female, YT YouTube, MC movie clip, DC depth camera, Pub publicly available, Prop proprietary