Skip to main content
. 2020 Mar 31;10:5732. doi: 10.1038/s41598-020-61994-0

Table 1.

Classification of MCI based on complete transcript vs. simulated conversations.

Model AUC F1-Score Sensitivity Specificity
SVM w/ LIWC 0.712 (0.612–0.811) 0.631 (0.500–0.761) 0.680 (0.476–0.886) 0.744 (0.563–0.922)
Supervised DL w/ LIWC 0.689 (0.560–0.818) 0.182 (0.055–0.370) 0.300 (0.010–0.758) 0.767 (0.364–0.970)
SVM w/ SKP 0.797 (0.719–0.879) 0.719 (0.591–0.846) 0.654 (0.473–0.835) 0.939 (0.855–1.0)
Supervised DL w/ SKP 0.811 (0.715–0.907) 0.642 (0.469–0.813) 0.600 (0.366–0.833) 0.911 (0.838–0.984)
RL (T = 5) 0.633 (0.535–0.703) 0.486 (0.288–0.680) 0.459 (0.280–0.630) 0.811 (0.661–0.936)
RL (T = 10) 0.741 (0.631–0.852) 0.590 (0.352–0.829) 0.560 (0.309–0.811) 0.922 (0.823–0.969)
RL (T = 15) 0.721 (0.618–0.827) 0.595 (0.399–0.790) 0.50 (0.327–0.713) 0.922 (0.856–0.987)
RL (T = 20) 0.809 (0.706–0.914) 0.726 (0.551–0.901) 0.620 (0.413–0.827) 0.988 (0.953–1.0)
RL (T = 30) 0.853 (0.796–0.914) 0.801 (0.733–0.880) 0.818 (0.678–0.958) 0.898 (0.828–0.969)
RL(T = 35) 0.859 (0.787–0.952) 0.808 (0.735–0.883) 0.818 (0.677–0.958) 0.911 (0.839–1.0)
Difference 0.0616 (−0.049–0.172) 0.089 (−0.078–0.259) 0.163 (−0.083–0.410) −0.040 (−0.130–0.050)

Abbreviations: Parentheses denotes confidence interval (CI) for the metric. SVM denotes support vector machines classifier, and Supervised DL denotes 2-layer feed-forward neural network classifier. RL denotes reinforcement learning agent. For feature representation of corpus, LIWC is the original word-level embedding used in Asgari et al., 8. SKP denotes a 4800-dimensional Skip-Thought vector embedding was used to represent each conversational turn. A dialogue summary is obtained by averaging across all turn-based responses for each user. We then evaluate the performance of our RL-agent across 10 stratified shuffle splits. Each split uses 65% of data for training and 35% for testing.