2021 Jun 22;23:174. doi: 10.1186/s13075-021-02553-4

Table 2.

Comparison of baseline characteristics between the machine learning defined case selection (cutoff=0.83) and the two criteria based selections

Patients from the cohort with EHR data and classification data

| Characteristic | Predicted cases based on machine learning (cutoff=0.83) | 1987 criteria-based cases | 2010 criteria-based cases |
|---|---|---|---|
| N | 373 | 357 | 426 |
| Proportion women | 0.65 | 0.63 | 0.66 |
| Proportion anti-CCP2-positive | 0.52 | 0.49 | 0.49 |
| Proportion RF-positive | 0.56 | 0.57 | 0.58 |
| Median DAS44 at baseline | 2.8 | 2.9 | 2.9 |
| Median BMI | 26.0 | 25.6 | 25.6 |
| Median ESR | 25 | 29 | 27 |
| Median CRP | 9.5 | 10.2 | 9.0 |
| Median age at inclusion | 57.2 | 58.6 | 57.2 |
| Median symptom duration at diagnosis (days) | 92.0 | 90.0 | 91.0 |
| Median number of swollen joints | 5 | 6 | 6 |

P values were calculated with the Pearson chi-squared test for proportions and the Mann-Whitney U test for medians: *p<0.05; **p<0.01; ***p<0.001; Not statistically tested
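As an illustration of the proportion test named in the footnote, the following is a minimal sketch of a Pearson chi-squared test on a 2x2 table, written in plain Python. It rests on several assumptions that are not from the article: the counts are reconstructed by rounding proportion times N from the table, no continuity correction is applied, and the two groups are treated as independent even though the selections come from the same cohort and may overlap. The function names are hypothetical, and the sketch is not the authors' analysis code.

```python
import math


def counts_from_proportion(proportion, n):
    """Approximate a positive count from a rounded proportion (assumption:
    the published proportion is positives / N, rounded to two decimals)."""
    positives = round(proportion * n)
    return positives, n - positives


def pearson_chi2_2x2(a, b, c, d):
    """Pearson chi-squared test (no continuity correction) for the table
    [[a, b], [c, d]]. Returns (chi2, p_value) with 1 degree of freedom."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    denom = row1 * row2 * col1 * col2
    if denom == 0:
        raise ValueError("every row and column needs a non-zero total")
    chi2 = n * (a * d - b * c) ** 2 / denom
    # For 1 degree of freedom, the chi-squared survival function
    # equals erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Proportion women: machine learning selection vs 1987 criteria selection.
ml_women, ml_men = counts_from_proportion(0.65, 373)
c87_women, c87_men = counts_from_proportion(0.63, 357)
chi2, p = pearson_chi2_2x2(ml_women, ml_men, c87_women, c87_men)
print(f"chi2 = {chi2:.3f}, p = {p:.3f}")
```

Because the counts are rebuilt from two-decimal proportions, the result is only approximate; the medians in the table cannot be tested this way, since the Mann-Whitney U test needs patient-level values that the table does not report.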