Table 3.
Summary of modeling information by data use case and by blocking scheme.
| Data and block | Expert-specified fieldsa | Data-driven fieldsa | |
| INPC | |||
|
|
DB-LN-MB-YBb | MRNc FNd SEXe TELf ADRg ZIPh SSNi | MRN FN SEX TEL ADR ZIP SSN CITY EMAIL ETH MI NICK STj |
|
|
DB-MB-YB-ZIP | MRN LN FN SEX TEL ADR SSN | MRN LN FN SEX TEL ADR SSN CITY EMAIL ETH MI NICK ST |
|
|
FN-LN-YB | MRN SEX DB MB TEL ADR ZIP SSN | MRN SEX DB MB TEL ADR ZIP SSN CITY EMAIL MI ST ETH NICK |
|
|
FN-TEL | MRN LN SEX DB MB YB ADR ZIP SSN | MRN LN SEX DB MB YB ADR ZIP SSN CITY EMAIL ETH MI NICK ST |
|
|
SSN | MRN LN FN SEX DB MB YB TEL ADR ZIP | MRN LN FN SEX DB MB YB TEL ADR ZIP CITY EMAIL ETH MI NICK ST |
| SSAk | |||
|
|
FN-LN-DB-MB-YB | SSN MI ZIP | SSN MI ZIP |
|
|
FN-LN-MI-DB-MB | ZIP YB SSN | ZIP YB SSN |
|
|
FN-LN-MI-YB | DB MB ZIP SSN | DB MB ZIP SSN |
|
|
FN-LN-ZIP | MI DB MB YB SSN | MI DB MB YB SSN |
|
|
SSN | LN FN MI DB MB YB ZIP | LN FN MI DB MB YB ZIP |
| NBSl | |||
|
|
LN-FN | MRN SEX DB MB YB TEL ADR ZIP | MRN SEXm DB MB YBm TEL ADR ZIP CITY DR_FN DR_LN MI NK_FN NK_LN |
|
|
MB-DB-ZIP | MRN LN FN SEX YB TEL ADR | MRN LN FN SEX YB TEL ADR CITY DR_FN DR_LN ETH MI NK_FN NK_LN NICK |
|
|
MRN | LN FN SEX DB MB YB TEL ADR ZIP | LN FN SEXm DB MB YB TELm ADR ZIP CITY DR_FN DR_LN ETH MI NK_FN NK_LN ST |
|
|
NK_LN-NK_FN | MRN LN FN SEX DB MB YB TEL ADR ZIP | MRN LNm FN SEX DB MB YB TEL ADR ZIP CITY DR_FN DR_LN ETH LN MI NICK ST |
|
|
TEL | MRN LN FN SEX DB MB YB ADR ZIP | MRN LN FN SEX DB MB YBm ADR ZIP CITY DR_FN DR_LN ETH MI NK_FN NK_LN ST |
| MCHDn | |||
|
|
LN-FN | MRN SEX DB MB YB TEL ADR ZIP | MRN SEXm DB MB YBm TEL ADR ZIP CITY DR_FN DR_LN MI NK_FN NK_LN |
|
|
MB-DB-ZIP | MRN LN FN SEX YB TEL ADR | MRN LN FN SEX YB TEL ADR CITY DR_FN DR_LN ETH MI NK_FN NK_LN NICK |
|
|
MRN | LN FN SEX DB MB YB TEL ADR ZIP | LN FN SEXm DB MB YB TELm ADR ZIP CITY DR_FN DR_LN ETH MI NK_FN NK_LN ST |
|
|
NK_LN-NK_FN | MRN LN FN SEX DB MB YB TEL ADR ZIP | MRN LNm FN SEX DB MB YB TEL ADR ZIP CITY DR_FN DR_LN ETH LN MI NICK ST |
|
|
TEL | MRN LN FN SEX DB MB YB ADR ZIP | MRN LN FN SEX DB MB YBm ADR ZIP CITY DR_FN DR_LN ETH MI NK_FN NK_LN ST |
aColumns “Expert-specified fields” and “Data-driven fields” display the fields used in the Fellegi-Sunter (FS) model.
bDB-LN-MB-YB: day, month, and year of birth and last name.
cMRN: medical record number.
dFN: first name.
eSEX: sex.
fTEL: telephone number.
gADR: address.
hZIP: zip code.
iSSN: Social Security number.
jFields (italicized) selected only by data-driven methods.
kSSA: Social Security Administration.
lNBS: newborn screening.
mFields not selected by the data-driven method but specified by experts.
nMCHD: Marion County Health Department.