Table 1.
Variable | SingleGene | FinishGene | DraftGene | GoldenPath |
---|---|---|---|---|
No. of sequences | 175 | 194 | 1038 | 156500 |
No. of complete genes (partial) | 175 | 206 | 116 (256) | — |
Mean sequence length (kbp) | 7 | 96 | 14 | 17 |
No. of genes/Mbp (estimated) | 144 | 17 | 14 | (10) |
No. of exons/complete gene (partial) | 5.0 | 7.0 | 5.7 (3.0) | — |
Mean C + G% | 49.6 | 45.1 | 45.2 | 39.9 |
No. of aa/complete protein (partial) | 324 | 404 | 321 (170) | — |
Datasets are described in Methods. Some genes in the DraftGene set are represented by multiple partial genes in different draft contigs; data for these partial genes are listed in parentheses. The gene density for the GoldenPath set is an estimate that assumes 30,000 human genes in a 3000-Mbp genome.
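
The parenthesized GoldenPath gene density follows directly from the assumption stated in the table note. A minimal sketch of that arithmetic is given here; the function and constant names are illustrative and do not come from the source.

```python
def genes_per_mbp(gene_count: int, genome_size_mbp: float) -> float:
    """Return gene density as genes per megabase pair."""
    return gene_count / genome_size_mbp


# Assumptions taken from the table note, not measured values.
ASSUMED_HUMAN_GENES = 30_000
ASSUMED_GENOME_MBP = 3_000

goldenpath_density = genes_per_mbp(ASSUMED_HUMAN_GENES, ASSUMED_GENOME_MBP)
print(goldenpath_density)  # 10.0, matching the "(10)" entry in the table
```

The densities for the other datasets are reported as given in the table and are not recomputed here.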