TABLE 1.
Accuracy of different methods on GEBA, FDA-ARGROS, and NCTC 3000 datasets.
Expecteda | 16Sb | KmerFinder (Com)c | KmerFinder-(type)d | fIDBACe | fIDBAC-ANI (95%)f | |
GEBA | 973 | 617 (63.41%) | 272 (27.95%) | 908 (93.32%) | 963 (98.97%) | 963 (98.97%) |
FDA-ARGROS | 523 | 365 (69.79%) | 490 (93.69%) | 432 (82.06%) | 501 (95.79%) | 488 (93.31%) |
NCTC-3000 | 996 | 649 (65.16%) | 884 (88.76%) | 676 (67.87%) | 962 (96.58%) | 951 (95.88%) |
Total | 2,492 | 1,631 (65.45%) | 1,646 (66.05%) | 2,016 (80.90%) | 2,427 (97.39%) | 2,402 (96.38%) |
aExpected, the number of queries genomes.
b16S, number of 16S top 1 result that matches the original label.
cKmerFinder (Com), number of KmerFinder top 1 result with its complete genome database that matches the original label.
dKmerFinder (type), number of KmerFinder top 1 result with its type strain genome database that matches 1he original label.
efIDBAC, number of fIDBAC top 1 result with its curated genome database that matches the original label.
ffIDBAC, number of fIDBAC top 1 result with its curated genome database that matches the 95% ANI threshold and the original label.