Table 3.
Comparison of rationale-annotated datasets for text classification.
| Dataset name | Domain | Collection | Instances | Year | References |
|---|---|---|---|---|---|
| MovieReviews (v.1.0) | Product reviews | Author | 2,000 | 2007 | Zaidan et al., 2007 |
| AmazonReviews | Product reviews | Crowd | 6,000 | 2007 | Blitzer et al., 2007 |
| HotelReviews | Product reviews | Crowd | 109,000 | 2010 | Wang et al., 2010 |
| Nova | Social media | Crowd | 12,000 | 2011 | Guyon et al., 2011 |
| IMDB | Product reviews | Crowd | 25,000 | 2011 | Maas et al., 2011 |
| BeerAdvocate | Product reviews | Crowd | 4,000 | 2012 | McAuley et al., 2012 |
| SST | social media | crowd | 11,855 | 2013 | Socher et al., 2013 |
| WikiAttack | Social media | Author | 1,089 | 2018 | Carton et al., 2018 |
| FEVER | Social media | Crowd | 136,000 | 2018 | Thorne et al., 2018 |
| MovieReviews (v.2.0) | Product reviews | Crowd | 200 | 2019 | DeYoung et al., 2020 |
| Snopes Corpus | Social media | Crowd | 6,422 | 2019 | Hanselowski et al., 2019 |
| HateXplain | Social media | Crowd | 20,148 | 2020 | Mathew et al., 2021 |
| Yelp-HAT | Product reviews | Crowd | 15,000 | 2020 | Sen et al., 2020 |
| RaFoLa | Modern slavery | Author | 989 | 2021 | Mendez et al., 2022 |
| Hummingbird | Social media | Crowd | 500 | 2021 | Hayati et al., 2021 |
| SBIC | Social media | Author | 360 | 2022 | Marasović et al., 2022 |
| DynaSent | Product reviews | Author | 2,880 | 2023 | Jakobsen et al., 2023 |