Table 1.
Data sets used in the evaluation of our heuristic extension algorithm, with organism indicating the starting organism, related organisms indicating the related model organisms that BLAST is applied to, library indicating the total number of libraries, size indicating the total number of bases in all the reads after quality trimming, and reference indicating the publication that describes the libraries
| Organism | Related organisms | Library | Size | Reference |
|---|---|---|---|---|
| S. pombe | S. cerevisiae | 32 | 17 G | [12] |
| N. crassa | ||||
| D. melanogaster | D. pseudoobscura | 13 | 9.6 G | [37] |
| A. gambiae | ||||
| H. sapiens | S. boliviensis | 4 | 16 G | [38] |
| M. musculus | ||||
| A. thaliana | A. lyrata | 5 | 16 G | [39] |
| O. sativa | ||||
| L. sericata | D. melanogaster | 9 | 4.6 G | [23] |
| H. glaber | H. sapiens | 13 | 61 G | [24] |
| C. sociabilis | H. sapiens | 10 | 66 G | [25] |
| C. arietinum | A. thaliana | 3 | 8.6 G | [26] |
| M. albus | A. thaliana | 12 | 5.5 G | New data |
| M. siculus | A. thaliana | 12 | 5.4 G | New data |