Skip to main content
. Author manuscript; available in PMC: 2009 Aug 28.
Published in final edited form as: Nature. 2004 May 27;429(6990):369–374. doi: 10.1038/nature02465

Table 2.

Annotation summary

Class* Gene
count
Total length in bp
(% coverage)
Mean gene
length (bp)
Mean exon
length
Number of
exons
Known genes 613 41,424,864 (37.9) 68,440 322 6,628
Novel genes 147 4,707,061 (4.3) 32,024 474 605
Novel transcripts 176 4,655,574 (4.3) 26,974 296 683
Putative genes 213 1,912,134 (1.7) 8,980 244 488
Total genes 1,149 51,435,292 (47.0) 46,407 326 8,404
Processed pseudogenes 397 459,658 (0.4) 1,157 709 475
Unprocessed pseudogenes 29 405,851 (0.4) 13,994 165 216
Total structures 1,575 52,022,232 (48.0) 34,405 342 9,095
*

Classes follow those laid down in ref. 4 and are based on the longest transcript of each gene; the total number of exons from all alternative transcripts is 14,640. Known genes, identical to known human cDNA or protein and have an entry in Locuslink; novel genes, have an ORF and are identical to spliced human ESTs or have some similarity to other cDNA/proteins (all species); novel transcripts, similar to novel genes but have ambiguous ORFs or multiple evidence for non-coding RNA; putative genes, with identity to 1–3 spliced ESTs but do not contain evidence for an ORF; pseudogenes, similar to known proteins but contain a frameshift and/or stop codon that disrupts the ORF.