TABLE 2.
Various approaches of missing value imputation.
| Approach | Advantages | Limitations | Methods | References |
|---|---|---|---|---|
| Global | Optimal performance when data is homogeneous | Poor performance when data is heterogeneous | BPCA | Jörnsten et al. (2005), Oba et al. (2003), Souto et al. (2015) |
| SVD | Troyanskaya et al. (2001) | |||
| ANNImpute | García-Laencina et al. (2008) | |||
| RNNImpute | Bengio and Gingras (1995) | |||
| Local | Optimal performance when data is heterogeneous | Poor performance when data is homogeneous | KNNImpute | Dubey and Rasool (2021), McNicholas and Murphy (2010), Pan et al. (2011), Ryan et al. (2010) |
| LSImpute | Bo et al. (2004) | |||
| SVRimpute | Wang et al. (2006) | |||
| GMCImpute | Ouyang et al. (2004) | |||
| Hybrid | Optimal performance regardless of local or global correlation | Sub-optimal performance when data is noisy and has high missing rates | LinCmb | Jörnsten et al. (2005) |
| EMDI | Pan et al. (2011) | |||
| RMI | Li et al. (2015) | |||
| VAE, DAPL | Qiu et al. (2020), Qiu et al. (2018) | |||
| Knowledge-assisted | Optimal performance in presence of noisy data | Sub-optimal performance when data has high missing rates | iMISS | Hu et al. (2006) |
| GOImpute | Tuikkala et al. (2006) | |||
| POCSimpute | Gan et al. (2006) | |||
| HAIimpute | Xiang et al. (2008) |