Skip to main content
. 2022 Sep 1;23:184. doi: 10.1186/s13059-022-02738-3

Fig. 1.

Fig. 1

Workflow for MIRTH Imputation of Metabolomics Data. a Individual datasets are normalized and rank-tranformed, accounting for left-censoring. b Preprocessed datasets (Di) are combined into a sparse aggregate data matrix (X), which is then factorized into embedding matrices W and H. The product WH yields an imputed data matrix (X^). c Aggregate data from 9 pan-cancer metabolomics datasets with tumor and normal samples reveals poor across-dataset metabolite feature overlap and high degree of missingness