Skip to main content
. 2001 May;11(5):803–816. doi: 10.1101/gr.175701

Table 1.

Summary of Sequence Sets Used in This Study

Variable Dataset


SingleGene FinishGene DraftGene GoldenPath




No. of sequences 175 194 1038 156500
No. of complete genes (partial) 175 206 116 (256)
Mean sequence lengths (kbp) 7 96 14 17
No. of genes/Mbp (estimated) 144 17 14 (10)
No. of exons/complete gene (partial) 5.0 7.0 5.7 (3.0)
Mean C + G% 49.6 45.1 45.2 39.9
No. of aa/complete protein (partial) 324 404 321 (170)

Datasets are described in Methods. Some genes in the DraftGene set are represented by multiple partial genes in different draft contigs, data for these genes are listed in parentheses. Gene density in the GoldenPath set assumes 30,000 human genes in a 3000-Mbp genome.