Skip to main content
. 2011 Apr 19;12:106. doi: 10.1186/1471-2105-12-106

Table 1.

Genomes from the NCBI Genome database for first data set.

Species name Refseq Genes PC
Buchnera aphidicola str. APS NC_002528 607 564
Escherichia coli str. K-12 substr. MG1655 NC_000913 4493 4149
Haemophilus in uenzae Rd KW20 NC_000907 1789 1657
Pasteurella multocida subsp. multocida str. Pm70 NC_002663 2092 2015
Xylella fastidiosa 9a5c NC_002488 2838 2766

Five γ -proteobacteria from the NCBI Genome database, used for detection of approximate gene clusters to generate biological instances of the center string problem. 'Refseq' is the reference sequence from NCBI Genome database, 'PC' the number of protein-coding genes.