Table 1. Dataset characteristics.
Quantity | Count | Percentage |
Articles | 3 210 039 | |
Articles with talk page (ATP) | 871 485 | (27.1%) |
Editors who comment on articles | 350 958 | |
Editors with ≥100 comments on ATP | 12 231 | (3.5%) |
Total comments in ATP | 11 041 246 | |
Comments containing ANEW words | 7 414 411 | (67.2%) |
Comments made by editors with ≥100 comments on ATP | 5 480 544 | (49.6%) |
Comments made by these editors used for sentiment analysis (containing ANEW words) | 3 649 297 | (33.1%) |
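
The percentages in Table 1 read as shares of a parent count, but the table does not state the denominators. The sketch below recomputes each share from the counts in the table. The pairing of each quantity with its parent (all articles, all commenting editors, or all talk-page comments) is an assumption, and the dictionary keys and function names are hypothetical labels introduced only for illustration.

```python
# Counts copied from Table 1; the key names are illustrative labels.
COUNTS = {
    "articles": 3_210_039,
    "articles_with_talk_page": 871_485,
    "commenting_editors": 350_958,
    "editors_100_plus": 12_231,
    "total_comments": 11_041_246,
    "comments_with_anew": 7_414_411,
    "comments_by_100_plus": 5_480_544,
    "comments_by_100_plus_with_anew": 3_649_297,
}

# ASSUMPTION: the table does not name its denominators. Each quantity is
# paired here with the count it is presumably a subset of.
PARENT = {
    "articles_with_talk_page": "articles",
    "editors_100_plus": "commenting_editors",
    "comments_with_anew": "total_comments",
    "comments_by_100_plus": "total_comments",
    "comments_by_100_plus_with_anew": "total_comments",
}


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage, rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


def shares(counts=COUNTS, parent=PARENT) -> dict:
    """Compute the percentage share of every quantity that has a parent."""
    return {name: share(counts[name], counts[p]) for name, p in parent.items()}


if __name__ == "__main__":
    for name, pct in shares().items():
        print(f"{name}: {pct}%")
```

Running the script prints one share per row that carries a percentage. Changing an entry in `PARENT` shows how a reported figure would differ under another choice of denominator.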