. 2019 Oct 25;20:519. doi: 10.1186/s12859-019-3115-8

Table 3.

An increase in file size was observed per genome added to the graph that demonstrated the compression of data that occurs by collapsing regions of shared aligned sequences into single representative nodes

Number of genomes	1	2	3	4	5	6	10
File size	4,5Mb	5,9Mb	7,6Mb	8,5Mb	11Mb	13Mb	38Mb
Number of nodes	0	3,690	8,106	9,320	13,264	15,355	43,290
Number of edges	0	4,886	10,868	12,485	17,823	22,296	73,652

The compression is related to the similarity of the sequences, as sequences that only differ by few bases will only require a few additional nodes. (Additional file 3)