Table 8.
Experiments: urgent/not urgent with users self-diagnoses with a single mental disorder (urgent/not urgent + single mental disorder), urgent/not urgent with users self-diagnoses with multiple mental disorders (urgent/not urgent + multiple mental disorders), urgent/not urgent with users diagnosed only with PTSD and evaluated on the expert annotated data (expert), urgent/not urgent with users diagnosed with PTSD or depression (urgent/not urgent using tweets).
| Experiments | Multi-task | Ps | Pm | Rs | Rm | F1s | F1m | ACCs (%) | ACCm (%) | AUCs | AUCm |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Urgent/not urgent + single mental disorder | Suicide + adhd | 0.704 | 0.685 | 0.718 | 0.695 | 0.707 | 0.688 | 72.00 | 70.40 | 0.777 | 0.780 |
| Adhd baseline | – | 0.752 | – | 0.773 | – | 0.753 | – | 76.00 | – | 0.841 | |
| Suicide + anxiety | 0.785 | 0.785 | 0.750 | 0.750 | 0.761 | 0.761 | 79.20 | 79.20 | 0.842 | 0.843 | |
| Anxiety baseline | – | 0.800 | – | 0.812 | – | 0.804 | – | 81.60 | – | 0.876 | |
| Suicide + autism | 0.717 | 0.717 | 0.686 | 0.686 | 0.694 | 0.694 | 73.60 | 73.60 | 0.780 | 0.778 | |
| Autism baseline | – | 0.723 | – | 0.742 | – | 0.715 | – | 72.00 | – | 0.815 | |
| Suicide + bipolar | 0.789 | 0.778 | 0.765 | 0.759 | 0.774 | 0.766 | 80.00 | 79.20 | 0.883 | 0.881 | |
| Bipolar baseline | – | 0.802 | – | 0.822 | – | 0.807 | – | 81.60 | – | 0.886 | |
| Suicide + depress | 0.818 | 0.805 | 0.773 | 0.767 | 0.787 | 0.779 | 81.60 | 80.80 | 0.868 | 0.867 | |
| Depress baseline | – | 0.792 | – | 0.786 | – | 0.789 | – | 80.80 | – | 0.895 | |
| Suicide + ocd | 0.722 | 0.722 | 0.712 | 0.712 | 0.716 | 0.716 | 74.40 | 74.40 | 0.855 | 0.853 | |
| Ocd baseline | – | 0.766 | – | 0.788 | – | 0.756 | – | 76.00 | – | 0.863 | |
| Suicide + ptsd | 0.859 | 0.859 | 0.865 | 0.865 | 0.862 | 0.862 | 87.20 | 87.20 | 0.946 | 0.946 | |
| Suicide + ptsd + EMPATH | 0.863 | 0.849 | 0.856 | 0.834 | 0.859 | 0.840 | 87.20 | 85.60 | 0.943 | 0.943 | |
| Suicide + ptsd + fastText | 0.860 | 0.860 | 0.840 | 0.840 | 0.848 | 0.848 | 86.40 | 86.40 | 0.942 | 0.941 | |
| Ptsd baseline | – | 0.839 | – | 0.866 | – | 0.843 | – | 84.80 | – | 0.932 | |
| Suicide + sch | 0.748 | 0.739 | 0.745 | 0.739 | 0.747 | 0.739 | 76.80 | 76.00 | 0.870 | 0.868 | |
| Schizophrenia baseline | – | 0.808 | – | 0.813 | – | 0.810 | – | 82.40 | – | 0.897 | |
| Suicide baseline | 0.616 | – | 0.625 | – | 0.616 | – | 63.20 | – | 0.680 | - | |
| Urgent/not urgent + multiple mental disorders | Suicide + adhd | 0.777 | 0.777 | 0.738 | 0.738 | 0.750 | 0.750 | 78.40 | 78.40 | 0.869 | 0.869 |
| Adhd baseline | – | 0.775 | – | 0.769 | – | 0.772 | – | 79.20 | – | 0.855 | |
| Suicide + anxiety | 0.846 | 0.843 | 0.838 | 0.843 | 0.842 | 0.843 | 85.60 | 85.60 | 0.921 | 0.921 | |
| Anxiety baseline | – | 0.841 | – | 0.858 | – | 0.847 | – | 85.60 | – | 0.912 | |
| Suicide + autism | 0.839 | 0.839 | 0.806 | 0.806 | 0.818 | 0.818 | 84.00 | 84.00 | 0.901 | 0.900 | |
| Autism baseline | – | 0.826 | – | 0.854 | – | 0.827 | – | 83.20 | – | 0.929 | |
| Suicide + bipolar | 0.846 | 0.858 | 0.786 | 0.808 | 0.802 | 0.824 | 83.20 | 84.80 | 0.901 | 0.901 | |
| Bipolar baseline | – | 0.814 | – | 0.838 | – | 0.817 | – | 82.40 | – | 0.908 | |
| Suicide + depress | 0.784 | 0.784 | 0.775 | 0.775 | 0.779 | 0.779 | 80.00 | 80.00 | 0.881 | 0.881 | |
| Depress baseline | – | 0.797 | – | 0.820 | – | 0.801 | – | 80.80 | – | 0.893 | |
| Suicide + ocd | 0.760 | 0.776 | 0.742 | 0.764 | 0.748 | 0.769 | 77.60 | 79.20 | 0.867 | 0.866 | |
| Ocd baseline | – | 0.765 | – | 0.768 | – | 0.766 | – | 78.40 | – | 0.861 | |
| Suicide + ptsd | 0.851 | 0.851 | 0.854 | 0.854 | 0.853 | 0.853 | 86.40 | 86.40 | 0.942 | 0.942 | |
| Ptsd baseline | – | 0.841 | – | 0.858 | – | 0.847 | – | 85.60 | – | 0.938 | |
| Suicide + sch | 0.833 | 0.842 | 0.842 | 0.848 | 0.837 | 0.845 | 84.80 | 85.60 | 0.910 | 0.911 | |
| Schizophrenia baseline | – | 0.823 | – | 0.849 | – | 0.826 | – | 83.20 | – | 0.937 | |
| Expert | Suicide + ptsd | 0.851 | 0.851 | 0.839 | 0.839 | 0.845 | 0.845 | 86.12 | 86.12 | 0.924 | 0.922 |
| Suicide baseline (expert) | 0.639 | – | 0.646 | – | 0.641 | – | 66.53 | – | 0.643 | - | |
| Urgent/not urgent using tweets | Suicide + more_ptsd | 0.878 | 0.878 | 0.907 | 0.907 | 0.884 | 0.884 | 88.80 | 88.80 | 0.959 | 0.959 |
| Suicide + more_depress | 0.814 | 0.814 | 0.841 | 0.841 | 0.812 | 0.812 | 81.60 | 81.60 | 0.925 | 0.925 | |
| Suicide + ptsd_depress | 0.866 | 0.866 | 0.881 | 0.881 | 0.872 | 0.872 | 88.00 | 88.00 | 0.920 | 0.920 | |
| Suicide + more_ptsd + embed | 0.866 | 0.866 | 0.886 | 0.886 | 0.873 | 0.873 | 88.00 | 88.00 | 0.924 | 0.925 | |
| Suicide + more_ptsd + empath | 0.883 | 0.891 | 0.899 | 0.910 | 0.889 | 0.898 | 89.60 | 90.40 | 0.946 | 0.945 | |
| Baseline (more_ptsd + depress) | – | 0.840 | – | 0.868 | – | 0.829 | – | 83.20 | – | 0.965 |
The bold values represent the highest F1 score for suicide ideation and mental disorder detection.