Skip to main content
. 2024 Jul 23;7:1363531. doi: 10.3389/frai.2024.1363531

Table 3.

Comparison of rationale-annotated datasets for text classification.

Dataset name Domain Collection Instances Year References
MovieReviews (v.1.0) Product reviews Author 2,000 2007 Zaidan et al., 2007
AmazonReviews Product reviews Crowd 6,000 2007 Blitzer et al., 2007
HotelReviews Product reviews Crowd 109,000 2010 Wang et al., 2010
Nova Social media Crowd 12,000 2011 Guyon et al., 2011
IMDB Product reviews Crowd 25,000 2011 Maas et al., 2011
BeerAdvocate Product reviews Crowd 4,000 2012 McAuley et al., 2012
SST social media crowd 11,855 2013 Socher et al., 2013
WikiAttack Social media Author 1,089 2018 Carton et al., 2018
FEVER Social media Crowd 136,000 2018 Thorne et al., 2018
MovieReviews (v.2.0) Product reviews Crowd 200 2019 DeYoung et al., 2020
Snopes Corpus Social media Crowd 6,422 2019 Hanselowski et al., 2019
HateXplain Social media Crowd 20,148 2020 Mathew et al., 2021
Yelp-HAT Product reviews Crowd 15,000 2020 Sen et al., 2020
RaFoLa Modern slavery Author 989 2021 Mendez et al., 2022
Hummingbird Social media Crowd 500 2021 Hayati et al., 2021
SBIC Social media Author 360 2022 Marasović et al., 2022
DynaSent Product reviews Author 2,880 2023 Jakobsen et al., 2023