Table 5.
The Detail Information of the Training Datasets for Five Species
| Kingdom | Species | Promoter | Non-promoter |
Location | |
|---|---|---|---|---|---|
| CDS | Non-CDSa | ||||
| Eukaryotes (300 bp) | H. sapiens | 1,787 | 1,800 | 1,800 | [−249, +50] |
| D. melanogaster | 1,886 | 1,799 | 2,859 | [−249, +50] | |
| Prokaryotes (81 bp) | C. elegans | 598 | 600 | 600 | [−249, +50] |
| B. subtilis | 270 | 300 | 300 | [−60, +20] | |
| E. coli | 741 | 700 | 700 | [−60, +20] | |
CDS, coding sequences.
Intron for eukaryotes and convergent intergenetic region for prokaryotes.