Skip to main content
[Preprint]. 2024 Nov 5:2024.03.06.583775. Originally published 2024 Mar 8. [Version 2] doi: 10.1101/2024.03.06.583775

Figure 1: Overview of LongSom.

Figure 1:

a. LongSom’s methodology for detecting somatic SNVs, fusions, and CNAs and subsequently inferring cancer subclones in LR scRNA-seq individual patients data. (1) SNV and (2) fusion candidates are detected from pseudo-bulk samples. (3) High-confidence cancer variants (SNVs and fusions) are selected based on mutated cell fraction in cancer and non-cancer cells. (4) Cells are re-annotated based on high-confidence cancer variants. (5) A new set of candidate variants is called based on re-annotated barcodes. (6) Candidate SNVs are filtered through a set of 10 filters. (7) cells are clustered based on somatic fusions and SNVs. In parallel, (8) gene expression per cell is computed, (9) CNAs are detected, (10) cells are clustered based on CNAs, and (11) CNA clones are incorporated to the fusions and SNVs clustered matrix. b. Candidate nuclear SNV successive filtering steps. Candidates passing all 10 steps are called as somatic SNVs (Methods). c. Candidate mtSNVs filtering steps. ΔMCF represents the difference of mutated cells fraction between cancer and non-cancer cells. Candidates passing all 5 steps are called as somatic mtSNVs (Methods).