Table 1.
Completely Sequenced Organisms and Other Fragmentary Data Considered in this Analysis
Organism | Domain<sup>a</sup> | Code<sup>b</sup> | ORFs<sup>c</sup> | Partitions<sup>d</sup> |
---|---|---|---|---|
H. influenzae | B | HI | 1713 | 1377 |
M. genitalium | B | MG | 468 | 361 |
Synechocystis sp. | B | Ssp | 3168 | 2002 |
M. pneumoniae | B | MP | 677 | 424 |
H. pylori | B | HP | 1577 | 1226 |
E. coli | B | EC | 4290 | 2473 |
B. subtilis | B | BS | 4100 | 2573 |
B. burgdorferi | B | BB | 850 | 696 |
A. aeolicus | B | AE | 1522 | 1157 |
M. tuberculosis | B | MT | 3924 | 2329 |
T. pallidum | B | TP | 1031 | 852 |
C. trachomatis | B | CT | 877 | 718 |
C. jejuni | B | CJ | 1731 | 1323 |
R. prowazekii | B | RP | 837 | 653 |
M. jannaschii | A | MJ | 1735 | 1180 |
M. thermoautotrophicum | A | MTH | 1871 | 1227 |
A. fulgidus | A | AF | 2437 | 1423 |
P. horikoshii OT3 | A | PH | 2061 | 1373 |
S. cerevisiae | E | SC | 6182 | 4437 |
C. elegans | E | CE | 19,099 | 7558 |
S. pombe<sup>e</sup> | E | SP | 3579 | 2248 |
H. sapiens | E | Hs | | |
M. musculus<sup>e</sup> | E | Mm | | |
Predicted ORF products considered in this study are essentially as described in the original publications: H. influenzae (Fleischmann et al. 1995), M. genitalium (Fraser et al. 1995), M. jannaschii (Bult et al. 1996), Synechocystis sp. strain PCC6803 (Kaneko et al. 1996), M. pneumoniae (Himmelreich et al. 1996), H. pylori (Tomb et al. 1997), E. coli (Blattner et al. 1997), M. thermoautotrophicum (Smith et al. 1997), B. subtilis (Kunst et al. 1997), A. fulgidus (Klenk et al. 1997), B. burgdorferi (Fraser et al. 1997), A. aeolicus (Deckert et al. 1998), M. tuberculosis (Cole et al. 1998), P. horikoshii (Kawarabayasi et al. 1998), T. pallidum (Fraser et al. 1998), C. trachomatis (Stephens et al. 1998), R. prowazekii (Andersson et al. 1998), and C. elegans (The C. elegans Sequencing Consortium 1998). Yeast (S. cerevisiae) ORF products (Goffeau et al. 1997) correspond to those listed on the MIPS server (http://www.mips.biochem.mpg.de/), with a few modifications. The preliminary complete proteome of C. jejuni (ftp.sanger.ac.uk/in/pub/pathogens/Cj/) was also considered.
<sup>a</sup> (B) Bacteria; (A) Archaea; and (E) Eukarya.
<sup>c</sup> The total number of predicted ORF products.
<sup>d</sup> The total number of distinct partitions.
<sup>e</sup> S. pombe ORF products correspond to those at the Sanger ftp server (ftp.sanger.ac.uk under /pub/yeast/sequences/pombe/pompep/); human (H. sapiens) and mouse (M. musculus) sequences correspond to Hsuniq and Mmuniq (Boguski et al. 1995). Incomplete data sets were used in each case: 3579 S. pombe ORF products, representing at least 68% of the total proteome (V. Wood, pers. comm.), and 43,088 and 8,821 sets of clustered ESTs derived from GenBank release 106 for human and mouse, respectively. Hsuniq and Mmuniq were used solely as targets for comparisons with the other organisms.