Table 4.
Matching results of the four use cases evaluated on their respective ground truth sets of random-selected and manually reviewed record pairs.
| Data | Value, N | Sensitivity (95% CI) | Specificity (95% CI) | Positive predictive value (95% CI) | Negative predictive value (95% CI) | F1-score (95% CI) | ||||||||
| Expert-specified fields | ||||||||||||||
|
|
INPCa | |||||||||||||
|
|
|
MADb | 15,000 | 0.962 (0.958-0.967) | 0.990 (0.987-0.992) | 0.990 (0.988-0.992) | 0.960 (0.955-0.964) | 0.976 (0.974-0.978) | ||||||
|
|
|
MARc | 15,000 | 0.970 (0.966-0.974) | 0.988 (0.986-0.991) | 0.989 (0.987-0.991) | 0.968 (0.964-0.972) | 0.980 (0.977-0.982) | ||||||
|
|
SSAd | |||||||||||||
|
|
|
MAD | 16,500 | 0.781 (0.770-0.792) | 0.995 (0.994-0.996) | 0.989 (0.986-0.992) | 0.890 (0.884-0.895) | 0.873 (0.866-0.879) | ||||||
|
|
|
MAR | 16,500 | 0.785 (0.775-0.796) | 0.995 (0.993-0.996) | 0.989 (0.985-0.991) | 0.892 (0.886-0.897) | 0.875 (0.869-0.882) | ||||||
|
|
NBSe | |||||||||||||
|
|
|
MAD | 15,000 | 0.795 (0.786-0.804) | 0.881 (0.874-0.889) | 0.883 (0.876-0.891) | 0.791 (0.782-0.801) | 0.837 (0.830-0.843) | ||||||
|
|
|
MAR | 15,000 | 0.860 (0.852-0.868) | 0.873 (0.865-0.881) | 0.885 (0.877-0.892) | 0.846 (0.838-0.855) | 0.872 (0.866-0.878) | ||||||
|
|
MCHDf | |||||||||||||
|
|
|
MAD | 15,500 | 0.944 (0.937-0.949) | 0.989 (0.987-0.991) | 0.982 (0.979-0.986) | 0.966 (0.962-0.969) | 0.963 (0.959-0.966) | ||||||
|
|
|
MAR | 15,500 | 0.946 (0.940-0.952) | 0.988 (0.986-0.990) | 0.980 (0.976-0.983) | 0.967 (0.964-0.971) | 0.963 (0.959-0.966) | ||||||
| Data-driven fields | ||||||||||||||
|
|
INPC | |||||||||||||
|
|
|
MAD | 15,000 | 0.579 (0.568-0.590) | 0.988 (0.986-0.991) | 0.982 (0.978-0.985) | 0.682 (0.672-0.690) | 0.729 (0.719-0.737) | ||||||
|
|
|
MAR | 15,000 | 0.970 (0.966-0.974) | 0.987 (0.984-0.989) | 0.988 (0.985-0.990) | 0.968 (0.964-0.972) | 0.979 (0.976-0.981) | ||||||
|
|
SSA | |||||||||||||
|
|
|
MAD | 16,500 | 0.781 (0.770-0.792) | 0.995 (0.994-0.996) | 0.989 (0.986-0.992) | 0.890 (0.884-0.895) | 0.873 (0.866-0.879) | ||||||
|
|
|
MAR | 16,500 | 0.785 (0.775-0.796) | 0.995 (0.993-0.996) | 0.989 (0.985-0.991) | 0.892 (0.886-0.897) | 0.875 (0.869-0.882) | ||||||
|
|
NBS | |||||||||||||
|
|
|
MAD | 15,000 | 0.813 (0.805-0.822) | 0.875 (0.867-0.883) | 0.880 (0.873-0.888) | 0.805 (0.796-0.814) | 0.845 (0.839-0.852) | ||||||
|
|
|
MAR | 15,000 | 0.865 (0.858-0.873) | 0.870 (0.863-0.878) | 0.883 (0.876-0.890) | 0.851 (0.842-0.859) | 0.874 (0.868-0.880) | ||||||
|
|
MCHD | |||||||||||||
|
|
|
MAD | 15,500 | 0.635 (0.622-0.648) | 0.970 (0.967-0.974) | 0.929 (0.921-0.937) | 0.811 (0.804-0.818) | 0.754 (0.745-0.764) | ||||||
|
|
|
MAR | 15,500 | 0.954 (0.948-0.959) | 0.988 (0.985-0.990) | 0.979 (0.976-0.983) | 0.972 (0.968-0.975) | 0.967 (0.963-0.970) | ||||||
aINPC: Indiana Network for Patient Care.
bMAD: missing as disagreement.
cMAR: missing at random.
dSSA: Social Security Administration.
eNBS: newborn screening.
fMCHD: Marion County Health Department.