Skip to main content
. Author manuscript; available in PMC: 2024 Sep 12.
Published in final edited form as: Proc Mach Learn Res. 2024 May;238:4195–4203.

Figure 1:

Figure 1:

The procedure of temporal missing value imputation with SADI. Each table represents a patient’s EHR data, with gray cells indicating missing values, orange cells indicating observed values, and blue cells indicating imputed values. The process involves (1) conducting an initial imputation (e.g., using CSDI) on the entire dataset Xii=1n to generate imputed data {X¯i}i=1n, (2) applying a clustering algorithm (e.g., K-SC) to categorize imputed data {X¯i}i=1n into K clusters {X¯Gk}k=1K and acquiring the corresponding original data clusters {XGk}k=1K, (3) training SADI on the K clustered datasets {XGk}k=1K, and (4) sampling imputations for the data clusters using the reverse process of SADI.