Table 3.
Machine learning performance for participant reply categories.
| Participant replies | Sensitivity (SD) | Specificity (SD) | PPVa (SD) | NPVb (SD) | FPRc (SD) | FNRd (SD) | F1-score (SD) |
| General comment | 0.684 (0.121) | 0.893 (0.055) | 0.817 (0.074) | 0.815 (0.073) | 0.107 (0.055) | 0.316 (0.121) | 0.737 (0.079) |
| Thanks | 0.911 (0.050) | 0.959 (0.026) | 0.863 (0.090) | 0.972 (0.027) | 0.041 (0.026) | 0.089 (0.050) | 0.771 (0.099) |
| Question | 0.815 (0.213) | 0.976 (0.014) | 0.474 (0.157) | 0.995 (0.007) | 0.024 (0.014) | 0.185 (0.213) | 0.592 (0.174) |
| Reporting healthy | 0.707 (0.097) | 0.940 (0.037) | 0.601 (0.198) | 0.960 (0.040) | 0.060 (0.037) | 0.293 (0.097) | 0.623 (0.111) |
| Reporting struggle | 0.649 (0.167) | 0.979 (0.012) | 0.696 (0.136) | 0.976 (0.013) | 0.021 (0.012) | 0.351 (0.167) | 0.658 (0.106) |
| Stop | 0.860 (0.147) | 0.993 (0.008) | 0.888 (0.131) | 0.992 (0.009) | 0.007 (0.008) | 0.140 (0.147) | 0.866 (0.116) |
| Other | 0.818 (0.082) | 0.956 (0.029) | 0.740 (0.132) | 0.972 (0.016) | 0.044 (0.029) | 0.182 (0.082) | 0.885 (0.065) |
| Averagee | 0.778 (0.048) | 0.957 (0.012) | 0.726 (0.071) | 0.955 (0.013) | 0.043 (0.012) | 0.222 (0.048) | 0.733 (0.054) |
aPPV: positive predictive value.
bNPV: negative predictive value.
cFPR: false positive rate.
dFNR: false negative rate.
eMacroaveraged.