Skip to main content
. 2017 Apr 1;24(4):280–288. doi: 10.1089/cmb.2016.0151

Table 3.

Compression Ratios of the Five Genome Files Using Different Huffman Tree Implementations, SHT, UHT, and UHTL

  SHT, % UHT, % UHTL, %
Data set k = 8 k = 16 k = 32 k = 64 k∈[2,10] k∈[3,10] k∈[2,10] k∈[3,10]
Cholerae 29.10 30.12 32.05 31.11 27.12 26.72 26.01 26.33
Abscessus 29.07 29.07 29.58 30.29 26.72 25.33 25.72 25.44
S. cerevisiae 29.95 29.86 30.27 30.99 27.29 25.97 26.09 25.59
N. crassa 30.59 30.27 32.07 31.70 27.15 26.97 26.34 26.69
Chr22 22.47 22.70 23.97 23.13 20.85 20.57 19.75 20.11

In SHT, the selection of the number of the most frequent k-mers is shown, whereas, in UHT and UHTL, the k-mer range is specified. Note that single bases are also encoded, by default.

SHT, ; UHT ; UHTL, .