Skip to main content
. Author manuscript; available in PMC: 2021 Jul 18.
Published in final edited form as: Nat Biotechnol. 2021 Jan 18;39(5):599–608. doi: 10.1038/s41587-020-00795-2

Figure 1 – Overview of the CopyKAT analysis workflow.

Figure 1 –

a, The CopyKAT workflow begins with a UMI count matrix to order genes by their genomic positions and uses the raw count matrix to perform log-Freeman Turkey Transformation to stabilize variance and smooth outliers using a polynomial dynamic linear model. b, A subset of normal cells is defined using integrative clustering and GMM method to infer the copy number baseline. c, Relative gene expression values in single cells are used for MCMC segmentation and segments are merged by KS testing. d, Aneuploid tumor and normal cell clusters are classified using a normal cell enrichment and GMM distribution tests. e, Clonal substructure of tumor cells are delineated by clustering and subclones are used for differential expression analysis.