Table 6.
Results from SVM classification using id-itf and word embeddings as features. Here, I denotes the SVM with id-itf features and W denotes the SVM with word-embedding features. For each token, we average its word embedding along the embedding dimension, so that each token is represented by a single mean value as its word-embedding feature.
Phenotype | Precision % (I) | Precision % (W) | Recall % (I) | Recall % (W) | ROC AUC % (I) | ROC AUC % (W) | F1 % (I) | F1 % (W) |
---|---|---|---|---|---|---|---|---|
Adv. Cancer | 92.10 ± 2.86 | 67.97 ± 9.97 | 36.02 ± 3.70 | 16.87 ± 2.80 | 67.84 ± 1.85 | 58.02 ± 1.40 | 50.83 ± 4.15 | 26.39 ± 4.02 |
Adv. Heart Disease | 90.08 ± 3.49 | 19.21 ± 1.72 | 20.74 ± 1.33 | 18.87 ± 2.28 | 60.10 ± 0.67 | 51.46 ± 1.02 | 33.48 ± 1.86 | 18.93 ± 1.98 |
Adv. Lung Disease | 86.66 ± 1.83 | 37.29 ± 8.22 | 10.22 ± 1.82 | 12.05 ± 3.10 | 55.07 ± 0.90 | 55.05 ± 1.57 | 17.98 ± 3.03 | 17.89 ± 4.31 |
Chronic Neuro | 84.91 ± 3.37 | 25.90 ± 1.93 | 20.40 ± 1.66 | 24.42 ± 2.02 | 59.63 ± 0.88 | 51.82 ± 1.04 | 32.55 ± 2.34 | 24.99 ± 1.84 |
Chronic Pain | 60.83 ± 7.82 | 25.89 ± 1.69 | 06.84 ± 1.19 | 21.49 ± 2.05 | 52.84 ± 0.62 | 53.14 ± 0.89 | 12.21 ± 2.06 | 23.28 ± 1.80 |
Alcohol Abuse | 95.23 ± 2.43 | 17.81 ± 2.45 | 35.76 ± 2.23 | 13.86 ± 2.22 | 67.77 ± 1.16 | 52.58 ± 1.08 | 51.82 ± 2.67 | 15.38 ± 2.25 |
Substance Abuse | 95.83 ± 2.84 | 45.47 ± 8.12 | 28.45 ± 3.46 | 16.20 ± 4.04 | 64.16 ± 1.73 | 57.34 ± 2.03 | 42.79 ± 4.55 | 23.45 ± 5.33 |
Obesity | 60.00 ± 1.63 | 15.51 ± 4.67 | 05.64 ± 1.75 | 06.34 ± 2.39 | 52.82 ± 0.87 | 51.79 ± 1.20 | 10.21 ± 3.08 | 08.71 ± 2.97 |
Psychiatric Disorders | 79.80 ± 4.03 | 20.98 ± 2.19 | 20.32 ± 2.65 | 18.00 ± 2.25 | 59.62 ± 1.38 | 51.36 ± 1.14 | 31.92 ± 3.64 | 19.13 ± 2.07 |
Depression | 77.52 ± 4.21 | 31.63 ± 2.03 | 22.86 ± 1.97 | 30.86 ± 1.85 | 60.15 ± 1.11 | 51.82 ± 1.32 | 35.10 ± 2.72 | 31.06 ± 1.76 |
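
The averaging step described in the caption of Table 6 can be illustrated with a minimal sketch. The function name `token_mean_features` and the toy 4-dimensional embeddings are hypothetical and chosen only for illustration. The source does not say how the per-token values are assembled into a fixed-length SVM input, so the sketch stops at the per-token mean.

```python
from statistics import fmean


def token_mean_features(embeddings):
    """Collapse each token's embedding vector to a single mean value.

    `embeddings` is a list of per-token vectors (one list of floats per
    token). The result has one float per token: the mean taken along the
    embedding dimension.
    """
    return [fmean(vector) for vector in embeddings]


# Hypothetical 4-dimensional embeddings for a three-token note
# (illustrative values, not taken from the paper).
note_embeddings = [
    [0.2, -0.1, 0.4, 0.3],
    [1.0, 0.0, -1.0, 0.0],
    [0.5, 0.5, 0.5, 0.5],
]

features = token_mean_features(note_embeddings)
print(features)
```

Reducing each token to one scalar discards most of the information carried by the embedding, which may be one reason the W columns trail the I columns on most metrics in the table.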