2023 Jun 29;7:e44331. doi: 10.2196/44331

Table 4.

Performance comparison of the baseline and machine learning–optimized matching configurations in SantéMPI in the held-out evaluation sets, for detection of possible record linkages needing manual review.

| Data set | Sensitivity (%), baseline (95% CI) | Sensitivity (%), optimized (change; 95% CI) | Specificity (%), baseline (95% CI) | Specificity (%), optimized (change; 95% CI) | Positive predictive value (%), baseline (95% CI) | Positive predictive value (%), optimized (change; 95% CI) |
| --- | --- | --- | --- | --- | --- | --- |
| FEBRL1^a | 95.0 (90.1 to 98.9) | 100.0 (+5.0%; 1.1 to 9.6) | 100.0 (100.0 to 100.0) | 99.4 (−0.6%; −0.4 to −0.7) | 100.0 (100.0 to 100.0) | 62.9 (−37.1%; −44.4 to −29.8) |
| FEBRL2 | 88.7 (84.3 to 92.8) | 100.0 (+11.3%; 7.0 to 16.0) | 100.0 (100.0 to 100.0) | 99.3 (−0.7%; −0.6 to −0.7) | 100.0 (100.0 to 100.0) | 15.7 (−84.3%; −86.3 to −82.3) |
| FEBRL3 | 87.8 (85.1 to 90.3) | 100.0 (+12.2%; 9.9 to 15.0) | 100.0 (100.0 to 100.0) | 99.3 (−0.7%; −0.7 to −0.7) | 98.9 (97.9 to 99.6) | 26.6 (−72.3%; −74.3 to −70.3) |
| FEBRL4 | 90.2 (88.4 to 92.0) | 100.0 (+9.8%; 8.0 to 11.6) | 100.0 (100.0 to 100.0) | 95.9 (−4.1%; −4.1 to −4.0) | 100.0 (100.0 to 100.0) | 2.4 (−97.6%; −97.7 to −97.5) |
| Hawaii | 93.1 (88.9 to 96.6) | 100.0 (+6.9%; 3.4 to 11.1) | 100.0 (100.0 to 100.0) | 98.9 (−1.1%; −1.2 to −0.9) | 99.4 (97.9 to 100.0) | 31.9 (−67.5%; −71.6 to −63.6) |

^a FEBRL: Freely Extensible Biomedical Record Linkage.
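
The three metrics reported in Table 4 follow the standard confusion-matrix definitions. As a minimal sketch in Python, the function below computes them from raw counts. The function name `linkage_metrics` and all counts are hypothetical and are not taken from the study; they were chosen only to illustrate how a small loss of specificity can coincide with a large drop in positive predictive value when true matches are rare among the candidate record pairs.

```python
def linkage_metrics(tp: int, fp: int, tn: int, fn: int) -> dict[str, float]:
    """Return sensitivity, specificity, and PPV as percentages.

    Assumes every denominator is nonzero (at least one true match,
    one true non-match, and one pair flagged for review).
    """
    return {
        "sensitivity": 100.0 * tp / (tp + fn),  # flagged true matches / all true matches
        "specificity": 100.0 * tn / (tn + fp),  # unflagged non-matches / all non-matches
        "ppv": 100.0 * tp / (tp + fp),          # flagged true matches / all flagged pairs
    }


# Hypothetical counts: 100 true matches among 100,100 candidate pairs.
baseline = linkage_metrics(tp=90, fp=0, tn=100_000, fn=10)
optimized = linkage_metrics(tp=100, fp=700, tn=99_300, fn=0)

for name, result in (("baseline", baseline), ("optimized", optimized)):
    print(name, {metric: round(value, 1) for metric, value in result.items()})
```

With these illustrative counts, the optimized configuration finds every true match, and specificity falls by less than 1 percentage point. Because non-matches outnumber matches by about 1000 to 1, the 700 false positives still dominate the flagged pairs, so positive predictive value drops sharply.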