Table 2.
Real datasets used for the evaluation of graph aligner tools
Abbr. | Organism | Reference ID | Genome | Repeated | Cov. | Sequencing | Read | Trimmed | Dataset ID |
---|---|---|---|---|---|---|---|---|---|
size | 31-mers (%) | platform | length | reads | |||||
R1 | Bifidobacterium dentium | Nc013714.1 | 2.6 Mbp | 0.4 | 373 X | Illumina MiSeq | 251 bp | SRR1151311 | |
R2 | Escherichia coli K-12 DH10B | NC010473 | 4.5 Mbp | 3.2 | 418 X | Illumina MiSeq | 150 bp | Ill. Data library | |
R3 | Escherichia coli K-12 MG1655 | NC000913 | 4.5 Mbp | 0.6 | 612 X | Illumina GAII | 100 bp | ERA000206 | |
R4 | Salmonella enterica | NC011083.1 | 4.7 Mbp | 0.5 | 97 X | Illumina MiSeq | 239 bp | ✓ | SRR1206093 |
R5 | Pseudomonas aeruginosa | ERR330008 | 6.1 Mbp | 0.6 | 169 X | Illumina MiSeq | 120 bp | ✓ | ERR330008 |
R6 | Homo sapiens Chr. 21 | HG19 | 45.2 Mbp | 4.3 | 29 X | Illumina HiSeq | 100 bp | Ill. Data library | |
R7 | Caenorhabditis elegans | WS222 | 97.6 Mbp | 2.6 | 58 X | Illumina HiSeq | 101 bp | SRR543736 | |
R8 | Drosophila melanogaster | Release 5 | 116.4 Mbp | 1.1 | 52 X | Illumina HiSeq | 100 bp | SRR823377 |