Table 2.
Classification of Twitter users who tweet about e-cigarettes: Gradient Boosting Regression Trees (GBRT) results comparing full model and metadata-only model.
| User type | Full model (metadata + derived data) | Metadata-only model | ||||
| F1 score, % | Recall, % | Precision, % | F1 score, % | Recall, % | Precision, % | |
| Individual | 91.1 | 92.3 | 89.8 | 83.6 | 86.2 | 81.2 |
| Vaper enthusiast | 47.1 | 40.0 | 57.1 | 16.2 | 12.0 | 25.0 |
| Informed agency | 84.4 | 78.5 | 91.3 | 70.0 | 67.7 | 72.4 |
| Marketer | 81.2 | 85.9 | 77.0 | 65.6 | 72.6 | 59.9 |
| Spammer | 79.5 | 81.1 | 78.0 | 74.8 | 71.9 | 78.0 |
| Average | 83.3 | 83.7 | 83.3 | 72.7 | 73.7 | 72.3 |