Table 3.
Machine Learning-Derived Algorithm Performance Characteristics in the Partners HealthCare Biobank Validation Cohort
Coded-term Only Algorithms 1 and 2* | Coded + NLP Algorithms 3 and 4† | |||||
---|---|---|---|---|---|---|
Algorithms | Specificity (%) | Sensitivity (%) | PPV (%) | Specificity (%) | Sensitivity (%) | PPV (%) |
Definite SLE vs. Probable SLE or Non-SLE (First Definition) | 99 | 51 | 96 | 99 | 31 | 93 |
98 | 60 | 93 | 98 | 43 | 90 | |
97 | 64 | 90 | 97 | 46 | 87 | |
96 | 67 | 87 | 96 | 48 | 84 | |
95 | 68 | 85 | 95 | 50 | 81 | |
Definite SLE and Probable SLE vs. Non-SLE (Second Definition) | 99 | 22 | 94 | 99 | 16 | 92 |
98 | 37 | 93 | 98 | 30 | 91 | |
97 | 47 | 92 | 97 | 41 | 90 | |
96 | 55 | 90 | 96 | 49 | 90 | |
95 | 62 | 89 | 95 | 56 | 89 | |
93 | 69 | 87 | 94 | 65 | 88 | |
91 | 77 | 85 | 92 | 70 | 86 | |
90 | 79 | 84 | 89 | 77 | 83 | |
86 | 86 | 81 | 86 | 82 | 80 |
Algorithm 1 (coded-only, first case definition: Definite SLE), Algorithm 2 (coded-only, second case definition: Definite SLE and Probable SLE)
Algorithm 3 (coded plus natural language processing, first case definition: Definite SLE), Algorithm 4 (coded plus natural language processing, second case definition: Definite SLE and Probable SLE)