. 2024 Jul 9;13(7):512. doi: 10.3390/biology13070512

© 2024 by the authors.

Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

The imputation process using sc-PHENIX. The sc-PHENIX imputation approach for scRNA-seq data consists of two main steps. (A) Construction of the distance matrix ($D_{Dist}$): sc-PHENIX applies PCA and then UMAP (PCA-UMAP). In this PCA-UMAP multidimensional space, sc-PHENIX constructs the best denoised representation of cell-to-cell distances, so that the diffusion process preserves data structures. (B) Diffusion maps for imputation, which proceed in several steps: (i) Construction of the Markov transition matrix $M$ from $D_{Dist}$: sc-PHENIX uses an adaptive Gaussian kernel to generate a non-symmetric affinity matrix ($A_{non-sim}$), which is symmetrized and then normalized to generate $M$. (ii) Diffusion process: $M$ is raised to a chosen power $t$ (a random walk of length $t$, called the "diffusion time") to obtain the exponentiated Markov matrix $M^t$. The $M^t$ graph preserves the continuum structure better than the preceding steps. (iii) Imputation: the exponentiated Markov matrix $M^t$ is multiplied by the single-cell data matrix $D$ to obtain an imputed and denoised scRNA-seq matrix ($D_{imputed}$). Note: The symbol * used in this figure indicates matrix multiplication of $M^t$ and $D$ in a computational formalism, which is equivalent to the formal mathematical notation $M^t \cdot D$. All equations are described in Section 2. (C) Visualization of the exponentiated Markov matrix: $M^t$ is converted into a distance matrix ($D_{Dist}$), and a multidimensional scaling method is then applied to project the data into 2D or 3D. This projection can be used as a heuristic method for quality control of the imputation.
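Steps (B)(i) to (iii) of the caption can be illustrated with a minimal NumPy sketch. This is not the sc-PHENIX implementation: the function name `diffusion_impute`, the parameters `k` and `t`, the use of plain Euclidean distances on a precomputed embedding (standing in for the PCA-UMAP space of step A), and the exact kernel form (bandwidth set to each cell's distance to its `k`-th nearest neighbour, symmetrization by adding the transpose) are all assumptions made for illustration.

```python
import numpy as np

def diffusion_impute(embedding, data, k=5, t=3):
    """Hypothetical sketch: embedding is cells x dims, data is cells x genes."""
    # Pairwise Euclidean distances between cells (stand-in for D_Dist).
    diff = embedding[:, None, :] - embedding[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # Adaptive bandwidth: distance from each cell to its k-th nearest neighbour
    # (column 0 of the sorted row is the cell itself). Assumes k < number of
    # cells and no duplicate points, otherwise sigma could be zero.
    sigma = np.sort(dist, axis=1)[:, k]
    # Non-symmetric affinity, because each row uses its own bandwidth.
    affinity = np.exp(-(dist / sigma[:, None]) ** 2)
    # Symmetrize (assumed: A + A^T), then row-normalize to get the Markov matrix M.
    affinity = affinity + affinity.T
    markov = affinity / affinity.sum(axis=1, keepdims=True)
    # Diffusion: M^t corresponds to a random walk of length t.
    markov_t = np.linalg.matrix_power(markov, t)
    # Imputation: D_imputed = M^t . D
    return markov_t, markov_t @ data

# Toy usage on synthetic data (30 cells, 2 embedding dimensions, 4 genes).
rng = np.random.default_rng(0)
embedding = rng.normal(size=(30, 2))
data = rng.poisson(1.0, size=(30, 4)).astype(float)
markov_t, imputed = diffusion_impute(embedding, data)
```

Because every row of $M^t$ is non-negative and sums to 1, each imputed value is a weighted average of the original values for that gene across cells. Larger `t` spreads the weights over more distant cells and smooths more aggressively.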