Table 2.
Description of datasets.
Dataset | #Tweets | #Tweets extracted | Labels | Class distribution |
---|---|---|---|---|
Waseem and Hovy (2016) | 16,000 | 14,949 | Sexist, Racist, Non-Hateful | Hateful: 4839 |
Non-Hateful: 10,110 | ||||
SemEval 2019 (Basile et al., 2019) | 9000 | 9000 | Hateful, Non-Hateful | Hateful: 3783 |
Non-Hateful: 5217 | ||||
(Ziems et al., 2020) | 2400 | 1637 | Hate, Neutral | Hate: 677 |
Neutral: 960 | ||||
(Ziems et al., 2020) | 30M | 10,674 | Hate, Neutral | Hate: 4968 |
Neutral: 5706 | ||||
US elections | 1105 | 1105 | Hate, Neutral | Hate: 665 |
Neutral: 440 |