Skip to main content
. Author manuscript; available in PMC: 2020 Aug 1.
Published in final edited form as: Semin Arthritis Rheum. 2019 Jan 4;49(1):84–90. doi: 10.1016/j.semarthrit.2019.01.002

Table 3.

Machine Learning-Derived Algorithm Performance Characteristics in the Partners HealthCare Biobank Validation Cohort

Coded-term Only Algorithms 1 and 2* Coded + NLP Algorithms 3 and 4

Algorithms Specificity (%) Sensitivity (%) PPV (%) Specificity (%) Sensitivity (%) PPV (%)
Definite SLE vs. Probable SLE or Non-SLE (First Definition) 99 51 96 99 31 93
98 60 93 98 43 90
97 64 90 97 46 87
96 67 87 96 48 84
95 68 85 95 50 81

Definite SLE and Probable SLE vs. Non-SLE (Second Definition) 99 22 94 99 16 92
98 37 93 98 30 91
97 47 92 97 41 90
96 55 90 96 49 90
95 62 89 95 56 89
93 69 87 94 65 88
91 77 85 92 70 86
90 79 84 89 77 83
86 86 81 86 82 80
*

Algorithm 1 (coded-only, first case definition: Definite SLE), Algorithm 2 (coded-only, second case definition: Definite SLE and Probable SLE)

Algorithm 3 (coded plus natural language processing, first case definition: Definite SLE), Algorithm 4 (coded plus natural language processing, second case definition: Definite SLE and Probable SLE)