Skip to main content
. Author manuscript; available in PMC: 2024 Mar 11.
Published in final edited form as: Anal Chem. 2020 Mar 10;92(7):5231–5239. doi: 10.1021/acs.analchem.9b05763

Figure 1.

Figure 1.

An overview of the Disparate Metabolomics Data Reassembler (DIMEDR) workflow for integrating potentially incongruent datasets originating from multiple experiments. The algorithm relies on an initial selection of a “primary” dataset, with all other datasets designated as “target” sets. The primary dataset can be the largest or, what is determined to be the highest quality, and serves as the template for creating the unified matrix that data from all other target sets will be integrated into. This is facilitated by extracting persistent spectral features from the primary dataset, called endogenous anchors, that are utilized as reference points for universal data correction. Endogenous anchors are initially identified in each target dataset, and subsequently used to align, identify, and integrate mutual spectral features that are shared with the primary dataset into the unified matrix. Novel features, for which retention times have also been corrected via this process, are collected across all target datasets, grouped, and appended to the unified matrix as well.