Skip to main content
. 2011 Apr 19;12:106. doi: 10.1186/1471-2105-12-106

Table 2.

Genomes from the NCBI Genome database for second data set.

Species name Refseq Genes PC
Aquifex aeolicus NC_000918 1580 1529
Clostridium acetobutylicum ATCC 824 NC_003030 3843 3671
Corynebacterium glutamicum ATCC 13032 NC_003450 3073 2993
Deinococcus radiodurans R1 chromosome 1, NC_001263 2687 2629
Deinococcus radiodurans R1 chromosome 2 NC_001264 369 268
Fusobacterium nucleatum NC_003454 2125 2063
Listeria innocua Clip11262 NC_003212 3065 2968
Mesorhizobium loti NC_002678 6804 674
Mycoplasma genitalium NC_000908 524 475
Mycoplasma pneumoniae NC_000912 733 689
Mycoplasma pulmonis NC_002771 815 782
Mycobacterium tuberculosis CDC1551 NC_002755 4293 4189
Ralstonia solanacearum, megaplasmid NC_003296 1684 1676
Ralstonia solanacearum NC_003295 3503 3437
Rickettsia conorii str. Malish 7 NC_003103 1414 1374
Salmonella typhimurium LT2 NC_003197 4620 4423
Staphylococcus aureus subsp. aureus N315 NC_002745 2664 2583
Synechocystis sp. PCC 6803 NC_000911 3229 3179
Thermotoga maritima NC_000853 1928 1858
Ureaplasma urealyticum NC_011374 695 646
Bacillus halodurans C-125 NC_002570 4170 4065
Bacillus subtilis NC_014479 4170 4062
Borrelia burgdorferi NC_001318 890 851
Buchnera sp. APS NC_002528 607 564
Campylobacter jejuni NC_008787 1707 1653
Caulobacter crescentus NC_002696 3819 3737
Chlamydia pneumoniae NC_000922 1122 1052
Chlamydia trachomatis NC_000117 940 895
Escherichia coli O157:H7 NC_002695 5371 5229
Escherichia coli str. K-12 substr. MG1655 NC_000913 4493 4149
Haemophilus influenzae Rd NC_000907 1789 1657
Helicobacter pylori 26695 NC_000915 1627 1573
Helicobacter pylori str. J99 NC_000921 1534 1488
Lactococcus lactis NC_002662 2425 2321
Xylella fastidiosa NC_002488 2838 2766
Neisseria meningitidis serogroup B str. MC58 NC_003112 2225 2063
Pasteurella multocida PM70 NC_002663 2092 2015
Pseudomonas aeruginosa PA01 NC_002516 5669 5566
Rickettsia prowazekii str. Madrid E NC_000963 888 835
Streptococcus pneumoniae NC_012467 2254 2073
Streptococcus pyogenes str. SF370 serotype M1 NC_002737 1810 1696
Treponema pallidum NC_000919 1095 1036
Vibrio cholerae chromosome 1 NC_012668 2897 2768
Vibrio cholerae chromosome 2 NC_012667 1013 1004
Neisseria meningitidis serogroup A str. Z2491 NC_003116 2065 1909
Mycobacterium leprae str. TN NC_002677 2770 1605

Genomes from the NCBI Genome database used for detection of approximate gene clusters to generate biological instances of the center string problem. 'Refseq' is the reference sequence from NCBI Genome database, 'PC' the number of protein-coding genes.