Skip to main content
. 2018 Mar 27;7:e33105. doi: 10.7554/eLife.33105

Figure 2. Assessment of single-cell transcriptome sequence purity, diversity and accuracy.

(A) Individually sorted P. falciparum and P. berghei cells from a mixed pool revealed no doublets and little contamination. (B) Distributions of numbers of genes identified as expressed in our three main datasets. (C) Expressed genes (those with at least 10 reads in at least five cells) were representative of average gene length, suggesting that although the reverse transcriptase might not copy the whole of long transcripts, fragments of long genes are still detected. (D) Sequencing library preparation often introduces end bias, where either the 5’ or 3’ end of transcripts tend to be better covered. Our protocol introduced a small 5’-bias, which could be attributable to the reverse transcription sometimes initiating within transcripts in internal polyA regions, rather than in the 3’ poly-A tail.

Figure 2.

Figure 2—figure supplement 1. Dual sorting of P. berghei and P. falciparum cells shows that contamination from ambient RNA is low.

Figure 2—figure supplement 1.

(A) Purified asexual late blood stage of GFP P. falciparum and mCherry P. berghei were mixed at a 1:1 ratio, inactivated in RNAlater, and sorted individually by flow cytometry, gated on respective fluorescent channels. (B) P. falciparum transcripts contaminating P. berghei cells and (C) P. berghei transcripts contaminating P. falciparum cells suggesting that contaminants reflect leakage from lysed cells (D) Histogram of the number of reads (log-transformed) from P. berghei cells that mapped to genomes of P. berghei and P. falciparum. A small proportion of detected genes were from the incorrect species and these tended to be expressed at low levels. (E) Histogram of the number of reads (log-transformed) from P. falciparum cells that mapped to P. falciparum and P. berghei.
Figure 2—figure supplement 2. The GC content of transcript fragments agreed well with the GC content of genes.

Figure 2—figure supplement 2.

There was no apparent over- or under-representation of GC-rich regions.