Table 2.
63 500 25 bp fragments, 100 from each genome, are BLASTed and compared to the N = 15 NBC. BLAST gives 66% of them unique top-scoring hits, where all of them were correct. Almost 34% of the reads have ambiguous top-scoring hits, meaning that there are multiple organisms that have top scores and E-values. Also, even though the exact string or complement exist in the database, 287 fragments receive no hit from BLAST with an E-value of 3000. NBC is able to correctly identify 71% of those. Being that the multiple top-scoring genomes can be randomly chosen as a top hit, we can compare directly, how often BLAST would get the genome correct compared to the NBC. Taking this and the single top hits into consideration, NBC scored 48118 (75.8%) fragments correct while BLAST matched 47889 (75.4%) fragments correct.
63 500 fragments | ||
---|---|---|
BLAST category | Interpretation of BLAST results | NBC's results for the BLAST category |
No. of reads that had Unique Top-scoring hits in BLAST | No. that BLAST got correct | No. that NBC got correct |
41641 | 41641 | 41211 |
No. of reads that had Multiple Top-scoring hits in BLAST | BLAST hits for reads where the multiple top-scoring list contained the correct one/no. of unique top-hits BLAST would get by chance from ambiguous hits | No. that NBC got a correct, Unique Top-hit |
21572 | 21559/6248 | 6702 |
Reads that had No hits in BLAST (E-value of 3000) | Could not be assigned in BLAST | No. that NBC got correct |
287 | 0 | 205 |