Table 3.
Multivariate imputation methods and their advantages and disadvantages for Use Case 1, the multivariate descriptor of health.
Method | Description | Advantages | Disadvantages |
---|---|---|---|
Multivariate Imputation by Chained Equations (MICE)43 | A multiple imputation method using a set of iterative regression models. |
|
|
Nearest Neighbor (NN) estimation | A supervised pattern recognition method based on the distance to each pair of observations based on non-missing variables and imputing based on a weighted mean |
|
|
Random Forest (RF)44 | A sequential, machine learning imputation process that predicts missing data from a training set consisting of observed data |
|
|
Multivariate Normal Imputation (MVNI)45,46 | An iterative process that imputes missing data from multivariate normal distribution parameters using an expectation-maximization algorithm. |
|
|
Selection Model: | Joint distribution of data Y and missingness indicator M is partititioned into f(M,Y|θ, ψ) = f(Y|θ)f(M|Y, ψ). |
|
|
Pattern Mixture Model:. | Joint distribution of data Y and missingness indicator M is partititioned into f(M,Y|ξ,ω) = f(Y|M,ξ)f(M|ω) |
|
|
Bayesian inference: | Likelihood given observed data is augmented with draws of missing data from their full conditional posterior predictive distribution given observed data and a sample of the parameter values from their full conditional distribution |
|
|
Bootstrap imputation47–49 | Methods for bootstrapping after multiple imputation or imputation following bootstrap. |
|
|