Table 7. Blocking Sensitivity Analysis: Linkage Complexity and Results.
nA is the number of unique records in the Medicare data. nAnB is the total number of possible record pairs after the records are partitioned into blocks defined by gender and ZIP code. nm is the average number of linked records over M = 100 imputations, and 95% CI is the corresponding 95% confidence interval. Run time is the approximate time our Bayesian record linkage algorithm required to complete 500 iterations using a single CPU core on a Linux system.
Blocking Criteria | nA | nAnB | nm | 95% CI | Run time |
---|---|---|---|---|---|
4 digit ZIP | 251,285 | 56,706,359 | 3829.67 | (3814.40, 3844.94) | 30 days |
5 digit ZIP | 247,724 | 13,786,172 | 3608.02 | (3570.91, 3645.14) | 18 days |
6 digit ZIP | 234,331 | 2,666,450 | 2807.21 | (2728.63, 2885.79) | 2 days |
7 digit ZIP | 144,230 | 436,049 | 2767.80 | (2543.67, 2991.93) | < 1 day |
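
The drop in nAnB as the ZIP prefix lengthens can be illustrated with a minimal sketch. It assumes the pair count is the sum, over blocks, of the product of the two files' record counts in that block; the helper names (`blocked_pair_count`, `make_key`), the field names, and the toy records are hypothetical and are not taken from the study data or code.

```python
from collections import Counter


def blocked_pair_count(records_a, records_b, key):
    """Count candidate pairs when only records sharing a block key are compared."""
    counts_a = Counter(key(r) for r in records_a)
    counts_b = Counter(key(r) for r in records_b)
    # Within a block, every record in file A is paired with every record in file B.
    return sum(n_a * counts_b[k] for k, n_a in counts_a.items())


def make_key(zip_digits):
    """Block on gender plus the first `zip_digits` characters of the ZIP code."""
    return lambda r: (r["gender"], r["zip"][:zip_digits])


# Toy records (made up for illustration only).
file_a = [
    {"gender": "F", "zip": "021381234"},
    {"gender": "F", "zip": "021395678"},
    {"gender": "M", "zip": "021381234"},
]
file_b = [
    {"gender": "F", "zip": "021381111"},
    {"gender": "F", "zip": "021392222"},
    {"gender": "M", "zip": "021383333"},
    {"gender": "M", "zip": "100014444"},
]

no_blocking = blocked_pair_count(file_a, file_b, lambda r: 0)  # 3 * 4 = 12 pairs
coarse = blocked_pair_count(file_a, file_b, make_key(4))       # 5 pairs
fine = blocked_pair_count(file_a, file_b, make_key(5))         # 3 pairs
print(no_blocking, coarse, fine)
```

In this toy case, moving from a 4-digit to a 5-digit prefix splits one block into two and removes the cross-block pairs, which mirrors the direction of the trade-off in the table: finer blocks mean fewer comparisons and shorter run times, but a true match whose ZIP codes disagree in a later digit can no longer be linked.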