Table 1.
Genomes from the NCBI Genome database for first data set.
Species name | Refseq | Genes | PC |
---|---|---|---|
Buchnera aphidicola str. APS | NC_002528 | 607 | 564 |
Escherichia coli str. K-12 substr. MG1655 | NC_000913 | 4493 | 4149 |
Haemophilus in uenzae Rd KW20 | NC_000907 | 1789 | 1657 |
Pasteurella multocida subsp. multocida str. Pm70 | NC_002663 | 2092 | 2015 |
Xylella fastidiosa 9a5c | NC_002488 | 2838 | 2766 |
Five γ -proteobacteria from the NCBI Genome database, used for detection of approximate gene clusters to generate biological instances of the center string problem. 'Refseq' is the reference sequence from NCBI Genome database, 'PC' the number of protein-coding genes.