Table 1.
Dataset | Source | No. of reads | Read length (bp) | No. of distinct k-mers |
---|---|---|---|---|
Zebrafish RNA-seq | SRX3022435 | 59,741,039 | 101 | 124,740,993 |
Human RNA-seq | SRR957915 | 49,459,840 | 101 | 101,017,526 |
Human chromosome 14 | GAGE (Salzberg et al., 2012) | 36,504,800 | 101 | 99,941,572 |
Whole human genome | SRR034939 | 36,201,642 | 100 | 391,766,120 |
Human gut metagenome | SRR341725 | 25,479,128 | 90 | 103,814,001 |
Human RNA-seq () | SRR957915 | 49,459,840 | 101 | 75,013,109 |
Singletons are not included in the k-mer count. Unless otherwise stated, .