Skip to main content
. 2021 Apr 20;28(4):381–394. doi: 10.1089/cmb.2020.0431

Table 1.

Dataset Characteristics

Dataset Source No. of reads Read length (bp) No. of distinct k-mers
Zebrafish RNA-seq SRX3022435 59,741,039 101 124,740,993
Human RNA-seq SRR957915 49,459,840 101 101,017,526
Human chromosome 14 GAGE (Salzberg et al., 2012) 36,504,800 101 99,941,572
Whole human genome SRR034939 36,201,642 100 391,766,120
Human gut metagenome SRR341725 25,479,128 90 103,814,001
Human RNA-seq (k=61) SRR957915 49,459,840 101 75,013,109

Singletons are not included in the k-mer count. Unless otherwise stated, k=31.