Table 1.
Comparison of accuracy at the genus level for different tools on RefSeq and out-of-distribution datasets across varying sequence lengths
Length | ViTax (RefSeq) | Kraken (RefSeq) | CAT (RefSeq) | PhaGenus (RefSeq) | ViTax (out-of-distribution) | Kraken (out-of-distribution) | CAT (out-of-distribution) | PhaGenus (out-of-distribution) |
---|---|---|---|---|---|---|---|---|
4k | 0.923 | 0.911 | 0.851 | 0.749 | 0.648 | 0.596 | 0.573 | 0.471 |
6k | 0.937 | 0.922 | 0.862 | 0.751 | 0.675 | 0.595 | 0.578 | 0.473 |
8k | 0.948 | 0.927 | 0.866 | 0.750 | 0.688 | 0.600 | 0.580 | 0.477 |
10k | 0.951 | 0.932 | 0.875 | 0.745 | 0.698 | 0.603 | 0.590 | 0.470 |
12k | 0.954 | 0.936 | 0.878 | 0.717 | 0.699 | 0.606 | 0.586 | 0.465 |
14k | 0.957 | 0.936 | 0.880 | 0.737 | 0.700 | 0.608 | 0.586 | 0.460 |
16k | 0.957 | 0.936 | 0.880 | 0.737 | 0.700 | 0.612 | 0.594 | 0.453 |
Complete genome | 0.950 | 0.936 | 0.878 | 0.713 | 0.864 | 0.728 | 0.674 | 0.611 |