Table 2.
The precision and recall of MetaGO for the simulated dataset using different k-mer lengths.
| k-mer type | Metric | k = 10 (%) | k = 20 (%) | k = 30 (%) | k = 40 (%) | k = 50 (%) | k = 60 (%) |
|---|---|---|---|---|---|---|---|
| Logicalized k-mers | Precision | –∗ | 99.03 | 99.05 | 99.11 | 99.45 | 99.35 |
| Logicalized k-mers | Recall | –∗ | 89.79 | 92.16 | 98.89 | 97.01 | 95.23 |
| Numerical k-mers | Precision | 99.63 | 96.81 | 96.07 | 97.72 | 98.22 | 98.58 |
| Numerical k-mers | Averaged recall | 23.89 | 95.70 | 97.93 | 98.00 | 96.82 | 94.76 |
The “averaged recall” for numerical k-mers is the mean of two values: the recall for the B. caccae ATCC 43185 genome and the recall for the regions common to strains B. thetaiotaomicron 7330 and B. thetaiotaomicron VPI-5482.
∗When k = 10, no logicalized k-mers were identified, so the corresponding entries are marked with “–”.
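The metrics in this table can be illustrated with a minimal Python sketch. It assumes the standard definitions of precision and recall from true-positive, false-positive, and false-negative counts; how MetaGO counts these at the k-mer level is not stated here, so the function names and all counts below are hypothetical. The only part taken from the note above is that the averaged recall is the plain mean of the two per-target recalls.

```python
from typing import Optional, Sequence


def precision(tp: int, fp: int) -> Optional[float]:
    """Fraction of identified features that are correct.

    Returns None when nothing was identified (tp + fp == 0), because the
    ratio is then undefined; this mirrors a "–" entry in a results table.
    """
    identified = tp + fp
    return tp / identified if identified else None


def recall(tp: int, fn: int) -> Optional[float]:
    """Fraction of true features that were identified."""
    actual = tp + fn
    return tp / actual if actual else None


def averaged_recall(recalls: Sequence[float]) -> float:
    """Unweighted mean of per-target recalls (e.g. one value per genome region)."""
    if not recalls:
        raise ValueError("need at least one recall value")
    return sum(recalls) / len(recalls)


if __name__ == "__main__":
    # Hypothetical counts, for illustration only (not taken from the paper).
    print(precision(tp=990, fp=10))
    print(recall(tp=990, fn=110))
    # No features identified at all: precision is undefined.
    print(precision(tp=0, fp=0))
    # Two hypothetical per-target recalls combined into one averaged value.
    print(averaged_recall([0.96, 0.98]))
```

The unweighted mean treats both target regions equally regardless of their size; a length-weighted mean would be a different design choice and is not what the note above describes.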