Skip to main content
letter
. 2003 Dec;13(12):2541–2558. doi: 10.1101/gr.1429003

Table 4.

Number of Genes and Pseudogenes in Completely Sequenced Genomes

Organism Genome size (Mb) No. genes No. pseudogenes No. processed pseudogenes References
R. prowazekii 1.1 834 241 0 Andersson et al. 1998; Ogata et al. 2001
M. leprae 3.3 1604 1116 0 Cole et al. 2001
Y. pestis 4.6 4061 160 0 Parkhill et al. 2001
E. coli K-12 strain 4.6 1100 95 0 Homma et al. 2002
E. coli, O157 strain 5.5 6000 101 0 Homma et al. 2002
S. cerevisiae 12.1 6340 241 0 Harrison et al. 2002a
C. elegans 102.9 20,009 2168 208 Harrison et al. 2001
D. melanogaster 128.3 14,332 110 34 Harrison et al. 2003
A. thaliana 115.4 25,464 >700 ?? Arabidopsis Genome Initiative 2000
H. sapiens 3040 22,000–39,000 13,398 (19,929)a 9747 This study
M. musculus 2493 22,011 14,000 (∼10,000)b (4700)b Waterston et al. 2002
a

The number in the parentheses includes pseudogenic fragments

b

Unpublished results by the authors