Skip to main content
. 2013 Nov 27;7(1):23. doi: 10.1186/1479-7364-7-23

Figure 1.

Figure 1

Data analysis flowchart. Before step I, sequencing reads with phred-like quality scores q < 30 were removed. N in steps II, III, and IV reflects the number of reference sequences from corresponding databases. For the alignment in steps II, III, and IV, we combined reference fasta files into ‘supergenomes’ including vector sequences, bacterial genomes, and viral genomes, respectively. Each individual reference sequence in the ‘supergenome’ was treated as a chromosome. All supergenome reference files were indexed before alignment steps.