Table 2.
Genomes from the NCBI Genome database for the second data set.
Species name | Refseq | Genes | PC |
---|---|---|---|
Aquifex aeolicus | NC_000918 | 1580 | 1529 |
Clostridium acetobutylicum ATCC 824 | NC_003030 | 3843 | 3671 |
Corynebacterium glutamicum ATCC 13032 | NC_003450 | 3073 | 2993 |
Deinococcus radiodurans R1 chromosome 1 | NC_001263 | 2687 | 2629 |
Deinococcus radiodurans R1 chromosome 2 | NC_001264 | 369 | 268 |
Fusobacterium nucleatum | NC_003454 | 2125 | 2063 |
Listeria innocua Clip11262 | NC_003212 | 3065 | 2968 |
Mesorhizobium loti | NC_002678 | 6804 | 674 |
Mycoplasma genitalium | NC_000908 | 524 | 475 |
Mycoplasma pneumoniae | NC_000912 | 733 | 689 |
Mycoplasma pulmonis | NC_002771 | 815 | 782 |
Mycobacterium tuberculosis CDC1551 | NC_002755 | 4293 | 4189 |
Ralstonia solanacearum, megaplasmid | NC_003296 | 1684 | 1676 |
Ralstonia solanacearum | NC_003295 | 3503 | 3437 |
Rickettsia conorii str. Malish 7 | NC_003103 | 1414 | 1374 |
Salmonella typhimurium LT2 | NC_003197 | 4620 | 4423 |
Staphylococcus aureus subsp. aureus N315 | NC_002745 | 2664 | 2583 |
Synechocystis sp. PCC 6803 | NC_000911 | 3229 | 3179 |
Thermotoga maritima | NC_000853 | 1928 | 1858 |
Ureaplasma urealyticum | NC_011374 | 695 | 646 |
Bacillus halodurans C-125 | NC_002570 | 4170 | 4065 |
Bacillus subtilis | NC_014479 | 4170 | 4062 |
Borrelia burgdorferi | NC_001318 | 890 | 851 |
Buchnera sp. APS | NC_002528 | 607 | 564 |
Campylobacter jejuni | NC_008787 | 1707 | 1653 |
Caulobacter crescentus | NC_002696 | 3819 | 3737 |
Chlamydia pneumoniae | NC_000922 | 1122 | 1052 |
Chlamydia trachomatis | NC_000117 | 940 | 895 |
Escherichia coli O157:H7 | NC_002695 | 5371 | 5229 |
Escherichia coli str. K-12 substr. MG1655 | NC_000913 | 4493 | 4149 |
Haemophilus influenzae Rd | NC_000907 | 1789 | 1657 |
Helicobacter pylori 26695 | NC_000915 | 1627 | 1573 |
Helicobacter pylori str. J99 | NC_000921 | 1534 | 1488 |
Lactococcus lactis | NC_002662 | 2425 | 2321 |
Xylella fastidiosa | NC_002488 | 2838 | 2766 |
Neisseria meningitidis serogroup B str. MC58 | NC_003112 | 2225 | 2063 |
Pasteurella multocida PM70 | NC_002663 | 2092 | 2015 |
Pseudomonas aeruginosa PAO1 | NC_002516 | 5669 | 5566 |
Rickettsia prowazekii str. Madrid E | NC_000963 | 888 | 835 |
Streptococcus pneumoniae | NC_012467 | 2254 | 2073 |
Streptococcus pyogenes str. SF370 serotype M1 | NC_002737 | 1810 | 1696 |
Treponema pallidum | NC_000919 | 1095 | 1036 |
Vibrio cholerae chromosome 1 | NC_012668 | 2897 | 2768 |
Vibrio cholerae chromosome 2 | NC_012667 | 1013 | 1004 |
Neisseria meningitidis serogroup A str. Z2491 | NC_003116 | 2065 | 1909 |
Mycobacterium leprae str. TN | NC_002677 | 2770 | 1605 |
Genomes from the NCBI Genome database used to detect approximate gene clusters, which in turn serve as biological instances of the center string problem. 'Refseq' is the reference sequence accession from the NCBI Genome database; 'PC' is the number of protein-coding genes.