Skip to main content
. 2019 Jul 3;9:9580. doi: 10.1038/s41598-019-45842-4

Figure 2.

Figure 2

Standardised quality-control metrics improve clustering quality of scRNA-seq data. tSNE plots showing principle component-based clustering and average silhouette width for: (a) unfiltered data, (b) data filtered using widely-used quality-control metrics and (c) the optimised approach described here (shown in dg). Each point represents an individual cell, groups of cells with similar transcriptomes are referred to as a ‘cluster’ and distinguished by colour. Fibroblast populations are encircled in black. (dg) Cell filtering using our optimised processing pipeline: (d) Multi-way importance plot showing the relative importance of each metric in distinguishing between low-quality events and droplets, (e) Plot of log(nGene) vs. log(nUMI). Data points identified as low-quality events are highlighted in red, (f) Plot of nUMI vs. nGene. The red line indicates the upper nGene threshold (defined as 2.5 MAD above the median) and (g) Plot of nGene vs. percent mito. The red line indicates the upper threshold (defined as 2.5 MAD above the median). Further data associated with this figure can be found in Fig. S5.