2010 May 30;26(14):1699–1703. doi: 10.1093/bioinformatics/btq268

Table 5.

File size by content type, data set A

Data type      File size (%)   Bits per seq.   Bits per base
bin/range       4.7             7.75           0.18
Seq bases      23.5            38.83           0.88
Seq quality    42.6            70.36           1.60
Seq name       25.6            42.28           0.96
Seq other       3.7             6.08           0.14

File sizes from tg_index -z 16384 -d data_type. ‘Seq other’ here is a general per-sequence overhead. The ‘bin/range’ type includes everything needed to draw the Template Display window: sequence positions, mapping quality and read pairings.
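
As a rough cross-check of how the three columns relate, the short Python sketch below sums each column and recomputes the percentage share of each data type from its bits per sequence. The row values are copied from Table 5. The variable and function names are illustrative only, and the "implied read length" is a quantity derived here from the rounded table values (total bits per sequence divided by total bits per base), not a figure stated in the table.

```python
# Values copied from Table 5 (data set A):
# data type -> (file size %, bits per sequence, bits per base)
rows = {
    "bin/range":   (4.7,   7.75, 0.18),
    "Seq bases":   (23.5, 38.83, 0.88),
    "Seq quality": (42.6, 70.36, 1.60),
    "Seq name":    (25.6, 42.28, 0.96),
    "Seq other":   (3.7,   6.08, 0.14),
}

# Column totals. The percentages sum to about 100.1 because each
# entry in the table is rounded to one decimal place.
total_pct = sum(r[0] for r in rows.values())
total_bits_per_seq = sum(r[1] for r in rows.values())
total_bits_per_base = sum(r[2] for r in rows.values())

# Assumption: bits per base = bits per sequence / average read length,
# so the ratio of the two totals estimates the average read length.
implied_read_length = total_bits_per_seq / total_bits_per_base


def share_from_bits(name: str) -> float:
    """Percentage of the file taken by one data type, recomputed from bits per sequence."""
    return 100.0 * rows[name][1] / total_bits_per_seq


if __name__ == "__main__":
    print(f"total bits per sequence: {total_bits_per_seq:.2f}")
    print(f"total bits per base:     {total_bits_per_base:.2f}")
    print(f"implied read length:     {implied_read_length:.1f} bases")
    for name, (pct, _, _) in rows.items():
        print(f"{name:12s} table {pct:5.1f}%  recomputed {share_from_bits(name):5.1f}%")
```

The recomputed shares agree with the File size (%) column to within rounding, which suggests the three columns are different normalisations of the same per-type byte counts.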