Skip to main content
. 2013 Jun 5;13:64. doi: 10.1186/1472-6947-13-64

Table 5.

Improvement in predictive ability of data cleaning techniques

  Hospital admissions data Synthetic data
Remove punctuation
a0.08%
+0.08%
Remove alt. missing values
+0.5%
0%
Nickname lookup
−28%
−33%
Sex Imputation NA −5%

a Negative sign (-) refers to decrease in predictive ability, positive sign (+) refers to increase in predictive ability compared to baseline.