Skip to main content
. 2022 Sep 29;24(9):e33775. doi: 10.2196/33775

Table 3.

Summary of modeling information by data use case and by blocking scheme.

Data and block Expert-specified fieldsa Data-driven fieldsa
INPC

DB-LN-MB-YBb MRNc FNd SEXe TELf ADRg ZIPh SSNi MRN FN SEX TEL ADR ZIP SSN CITY EMAIL ETH MI NICK STj

DB-MB-YB-ZIP MRN LN FN SEX TEL ADR SSN MRN LN FN SEX TEL ADR SSN CITY EMAIL ETH MI NICK ST

FN-LN-YB MRN SEX DB MB TEL ADR ZIP SSN MRN SEX DB MB TEL ADR ZIP SSN CITY EMAIL MI ST ETH NICK

FN-TEL MRN LN SEX DB MB YB ADR ZIP SSN MRN LN SEX DB MB YB ADR ZIP SSN CITY EMAIL ETH MI NICK ST

SSN MRN LN FN SEX DB MB YB TEL ADR ZIP MRN LN FN SEX DB MB YB TEL ADR ZIP CITY EMAIL ETH MI NICK ST
SSAk

FN-LN-DB-MB-YB SSN MI ZIP SSN MI ZIP

FN-LN-MI-DB-MB ZIP YB SSN ZIP YB SSN

FN-LN-MI-YB DB MB ZIP SSN DB MB ZIP SSN

FN-LN-ZIP MI DB MB YB SSN MI DB MB YB SSN

SSN LN FN MI DB MB YB ZIP LN FN MI DB MB YB ZIP
NBSl

LN-FN MRN SEX DB MB YB TEL ADR ZIP MRN SEXm DB MB YBm TEL ADR ZIP CITY DR_FN DR_LN MI NK_FN NK_LN

MB-DB-ZIP MRN LN FN SEX YB TEL ADR MRN LN FN SEX YB TEL ADR CITY DR_FN DR_LN ETH MI NK_FN NK_LN NICK

MRN LN FN SEX DB MB YB TEL ADR ZIP LN FN SEXm DB MB YB TELm ADR ZIP CITY DR_FN DR_LN ETH MI NK_FN NK_LN ST

NK_LN-NK_FN MRN LN FN SEX DB MB YB TEL ADR ZIP MRN LNm FN SEX DB MB YB TEL ADR ZIP CITY DR_FN DR_LN ETH LN MI NICK ST

TEL MRN LN FN SEX DB MB YB ADR ZIP MRN LN FN SEX DB MB YBm ADR ZIP CITY DR_FN DR_LN ETH MI NK_FN NK_LN ST
MCHDn

LN-FN MRN SEX DB MB YB TEL ADR ZIP MRN SEXm DB MB YBm TEL ADR ZIP CITY DR_FN DR_LN MI NK_FN NK_LN

MB-DB-ZIP MRN LN FN SEX YB TEL ADR MRN LN FN SEX YB TEL ADR CITY DR_FN DR_LN ETH MI NK_FN NK_LN NICK

MRN LN FN SEX DB MB YB TEL ADR ZIP LN FN SEXm DB MB YB TELm ADR ZIP CITY DR_FN DR_LN ETH MI NK_FN NK_LN ST

NK_LN-NK_FN MRN LN FN SEX DB MB YB TEL ADR ZIP MRN LNm FN SEX DB MB YB TEL ADR ZIP CITY DR_FN DR_LN ETH LN MI NICK ST

TEL MRN LN FN SEX DB MB YB ADR ZIP MRN LN FN SEX DB MB YBm ADR ZIP CITY DR_FN DR_LN ETH MI NK_FN NK_LN ST

aColumns “Expert-specified fields” and “Data-driven fields” display the fields used in the Fellegi-Sunter (FS) model.

bDB-LN-MB-YB: day, month, and year of birth and last name.

cMRN: medical record number.

dFN: first name.

eSEX: sex.

fTEL: telephone number.

gADR: address.

hZIP: zip code.

iSSN: Social Security number.

jFields (italicized) selected only by data-driven methods.

kSSA: Social Security Administration.

lNBS: newborn screening.

mFields not selected by the data-driven method but specified by experts.

nMCHD: Marion County Health Department.