. 2014 Feb 7;9(7):1328–1335. doi: 10.2215/CJN.10141013

Table 3.

Hypothetical example of data with five imputed datasets

Imputed Dataset	ID	Age (yr)	Woman	BMI (kg/m²)			PPRA (%)		Stroke	Years Followed
Imputed Dataset	ID	Age (yr)	Woman	15–18.5	25–30	30–45	11–80	80–100	Stroke	Years Followed
1	1	39	No	0	1	0	0	1	No	8.4
2	1	39	No	0	1	0	0	1	No	8.4
3	1	39	No	0	1	0	0	1	No	8.4
4	1	39	No	0	1	0	0	1	No	8.4
5	1	39	No	0	1	0	0	1	No	8.4
1	2	44	Yes	0.35^a	0.34^a	−0.21^a	0	1	Yes	10.9
2	2	44	Yes	0.12^a	0.45^a	0.03^a	0	1	Yes	10.9
3	2	44	Yes	0.21^a	0.27^a	−0.47^a	0	1	Yes	10.9
4	2	44	Yes	−0.01^a	0.97^a	−0.44^a	0	1	Yes	10.9
5	2	44	Yes	0.38^a	0.80^a	0.64^a	0	1	Yes	10.9
1	4	67	No	0	0	0	−0.21^a	0.08^a	No	11.6
2	4	67	No	0	0	0	0.04^a	−0.33^a	No	11.6
3	4	67	No	0	0	0	0.25^a	0.21^a	No	11.6
4	4	67	No	0	0	0	0.31^a	−0.04^a	No	11.6
5	4	67	No	0	0	0	0.69^a	0.07^a	No	11.6

BMI=18.5–25 (normal) and PPRA=0–10 (normal) are used as reference groups and represented by a zero in all dummy variables pertaining to each variable. ID, identification; PPRA, panel reactive antibody.

Multiple imputation may lead to data that are not consistent with the original format; in this case, values imputed for missing observations of categorical (binary) data are continuous. Furthermore, although original categories of a variable may be mutually exclusive, imputed data may not be mutually exclusive, which is appropriate, because the imputed values, per se, do not have any meaning.