Skip to main content
. 2018 Apr 23;4(2):e43. doi: 10.2196/publichealth.5789

Table 2.

Sentiment tools based on type of tool: KNN: k-nearest-neighbors; N/A: not applicable; NB: Naïve Bayes; SVM; support vector machines.

Author Tool Annotators Kappa Manually annotated sample Sample size Manually annotated compared with total sample, n (%)
Cole-Lewis et al [29] Produced for study: machine learning classifiers based on 5 categories (NB, KNN, and SVM) 6 .64 250 17,098 250 (1.46)
Desai et al [31] Produced for study: rule based using AFINN (Named after the author, Finn Arup Neilsen) N/A N/A   N/A 993 N/A  
Daniulaityte et al [30] Produced for study: logistic regression, NB, SVM 2 .68 3000 N/A N/A
Myslin et al [34] Produced for study: machine learning (NB, KNN, SVM) 2 >.7 1000 7362 1000 (13.58)
Sofean and Smith [36] Produced for study: 5-fold validation using support vector machines (SVM’s) model using Waikato Environment for Knowledge Analysis toolkit toolkit N/A   N/A 500 N/A N/A
Tighe et al [37] Produced for study: rule based using AFINN N/A N/A N/A 65,000 N/A
Bhattacharya et al [27] Open source: SentiStrength 3 N/A N/A 164,104 N/A
Hawkins et al [33] Open source: machine learning classifier using Python library TextBlob 2+Amazon Mechanical Turk >.79 2216 404,065 2216 (0.55)
Ramagopalan et al [26] Open source: TwitteR R package + Jeffrey Breen’s sentiment analysis code N/A N/A N/A 60,037 N/A
Black et al [28] Commercial: radian6 N/A N/A N/A N/A N/A
Greaves et al [32] Commercial: TheySay N/A N/A 250 198,499 250 (0.13)
Nwosu et al [35] Open source: TopsyPro N/A N/A N/A 683,500 N/A