2023 Jun 29;7:e44331. doi: 10.2196/44331

Table 4.

Performance comparison of the baseline and machine learning–optimized matching configurations in SantéMPI in the held-out evaluation sets, for detection of possible record linkages needing manual review.

Data set	Sensitivity (%)		Specificity (%)		Positive predictive value (%)	
	Baseline (95% CI)	Optimized (change; 95% CI)	Baseline (95% CI)	Optimized (change; 95% CI)	Baseline (95% CI)	Optimized (change; 95% CI)
FEBRL1^a	95.0 (90.1 to 98.9)	100.0 (+5.0%; 1.1 to 9.6)	100.0 (100.0 to 100.0)	99.4 (−0.6%; −0.7 to −0.4)	100.0 (100.0 to 100.0)	62.9 (−37.1%; −44.4 to −29.8)
FEBRL2	88.7 (84.3 to 92.8)	100.0 (+11.3%; 7.0 to 16.0)	100.0 (100.0 to 100.0)	99.3 (−0.7%; −0.7 to −0.6)	100.0 (100.0 to 100.0)	15.7 (−84.3%; −86.3 to −82.3)
FEBRL3	87.8 (85.1 to 90.3)	100.0 (+12.2%; 9.9 to 15.0)	100.0 (100.0 to 100.0)	99.3 (−0.7%; −0.7 to −0.7)	98.9 (97.9 to 99.6)	26.6 (−72.3%; −74.3 to −70.3)
FEBRL4	90.2 (88.4 to 92.0)	100.0 (+9.8%; 8.0 to 11.6)	100.0 (100.0 to 100.0)	95.9 (−4.1%; −4.1 to −4.0)	100.0 (100.0 to 100.0)	2.4 (−97.6%; −97.7 to −97.5)
Hawaii	93.1 (88.9 to 96.6)	100.0 (+6.9%; 3.4 to 11.1)	100.0 (100.0 to 100.0)	98.9 (−1.1%; −1.2 to −0.9)	99.4 (97.9 to 100.0)	31.9 (−67.5%; −71.6 to −63.6)

^aFEBRL: Freely Extensible Biomedical Record Linkage.
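
The three metrics in Table 4 follow from the standard confusion-matrix definitions. The sketch below is illustrative only: the function name `linkage_metrics` and all counts are hypothetical and are not taken from the study or from SantéMPI. It shows how, when true matches are rare among the candidate pairs, a drop in specificity of under 1 percentage point can coincide with a very large drop in positive predictive value, which may help in reading the pattern in the optimized columns.

```python
def linkage_metrics(tp, fp, tn, fn):
    """Return (sensitivity, specificity, PPV) as percentages.

    tp, fp, tn, fn are counts of candidate record pairs:
    true positives, false positives, true negatives, false negatives.
    """
    sensitivity = 100.0 * tp / (tp + fn)  # share of true matches that were flagged
    specificity = 100.0 * tn / (tn + fp)  # share of non-matches that were not flagged
    ppv = 100.0 * tp / (tp + fp)          # share of flagged pairs that are true matches
    return sensitivity, specificity, ppv


# Hypothetical counts (NOT from the study): 100 true matches among
# 100,100 candidate pairs, all of them flagged, plus 900 false alarms.
sens, spec, ppv = linkage_metrics(tp=100, fp=900, tn=99_100, fn=0)
print(f"sensitivity={sens:.1f}%  specificity={spec:.1f}%  PPV={ppv:.1f}%")
```

With these made-up counts, sensitivity is 100.0%, specificity is 99.1%, and the positive predictive value is 10.0%, because the 900 false alarms outnumber the 100 true matches among the flagged pairs.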