Skip to main content
. Author manuscript; available in PMC: 2008 Nov 1.
Published in final edited form as: J Chem Inf Model. 2007 Oct 30;47(6):2098–2109. doi: 10.1021/ci700200n

Table 4.

Average size in bits of compressed molecular fingerprints for different compression schemes, with path and circular substructure features. These values correspond to the values on the y axis in Figures ?? and ?? when x = log Nhash = 30. Headers are not included.

Encoding
Path
Circular
Golomb-Rice[hash] 4094.8 1247.5
Golomb-Rice[posthash] 2066.1 563.1
Golomb-Rice[posthash,sorted]
1879.6
460.8
MOV[hash] 4955.4 1425.5
MOV[posthash] 2954.9 725.7
MOV[posthash,sorted]
1803.3
379.5
MOL[hash] 4489.3 1354.4
MOL[posthash] 2455.1 658.2
MOL[posthash,sorted] 1420.6 316.2