. Author manuscript; available in PMC: 2015 Feb 2.

Published in final edited form as: IEEE Trans Affect Comput. 2014 Oct-Dec;5(4):377–390. doi: 10.1109/TAFFC.2014.2336244

TABLE 1.

Comparison To Prior Work Using Data Set Criteria

Data Set	Greater than 50 people recorded (# people)	Greater than 5,000 Clips (# of clips)	At least 6 emotion categories (# categories)	At least 8 ratersper clip for over 95% of clips (# raters)	All 3 rating modalities (which modalities)
CREMA-D (this work)	✓ (91)	✓ (7,442)	✓ (6)	✓ (4-12,mean 9.8)	✓ (audio, visual, audio-visual)
GEMEP [31]	x(10)	x (1,260)	✓ (18)	✓ (Audio 23, Visual 25, AV 23)	✓ (audio, visual, audio-visual)
De Silva Multimodal [32]	x(2)	x(72)	✓ (6)	✓ (18)	✓ (audio, visual, audio-visual)
Mower Provost [15]	x(1)	x(72)	x(5)	✓ (117)	✓ (audio, visual, audio-visual, AV mismatch)
AV Integration [33]	x(6)	x(60)	x(2)	✓ (8)	✓ (audio, visual, audio-visual, AV mismatch)
AV Synthetic Character [34]	x (1 female voice,1 animated face)	x (210)	x(4)	x (3 to 4 for AV, 6 to 7 for A or V)	✓ (audio, visual, audio-visual)
RekEmozio [35]	x(17)	x (2,720)	x(0)	x (3 to 4)	x (audio for oral, visual for faces)
Vera Am Mittag German Audio- Visual Database [36]	✓ (104)	x (1,421)	x (7, for faces)	✓(Audio : 6 or 17 Face: 8-34,mean 14)	x (audio, visual)
IEMOCAP [37]	x(10)	✓ (10,039)	✓ (9)	x(3)	x (audio-visual)
Chen Bimodal [38]	✓ (100)	✓ (9,900)	✓ (11)	x (None)	x (audio-visual)
HUMAINE [39]	x (≤48, unspecified)	x(48)	✓ (48)	x(6)	x (audio-visual)
RECOLA [40]	x(46)	x(46)	x(2)	x(6)	x (audio-visual)
CHAD [41]	x(42)	✓ (6,228)	✓ (7)	✓ (120)	x (audio)
MAHNOB-HCI [42]	x(27)	x (1,296)	✓ (9)	x(1)	x (self-report)

✓ indicates the criterion is met. x -indicates criterion is not met. Each highlighted cell indicates that the criterion for the column was met by the data set.