Table 3. Genome statistics.
Attribute | Value | % of total<sup>a</sup> |
---|---|---|
Genome size (bp) | 4,768,352 | 100.00 |
DNA coding region (bp) | 4,018,014 | 84.26 |
DNA G+C content (bp) | 2,484,311 | 52.10 |
Total genes<sup>b</sup> | 4,885 | 100.00 |
RNA genes | 98 | 2.00 |
Protein-coding genes | 4,691 | 96.03 |
Pseudogenes | 96 | 1.97 |
Genes in paralog clusters | 623 | 13.28 |
Genes assigned to COGs | 3,534 | 75.34 |
Genes with signal peptides | 388 | 8.27 |
Genes with transmembrane helices | 1,096 | 23.36 |
CRISPR repeat | 1 | |
a) The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome.
b) Also includes 96 pseudogenes.
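
The two denominators named in footnote a can be checked with a short calculation. The following is a minimal sketch rather than the annotation pipeline's own method: the helper `percent_of` and the dictionary names are hypothetical, and the counts are copied from Table 3.

```python
# Denominators named in footnote a (values copied from Table 3).
GENOME_SIZE_BP = 4_768_352
PROTEIN_CODING_GENES = 4_691


def percent_of(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to two decimals."""
    return round(100.0 * count / total, 2)


# Rows measured in base pairs use the genome size as the denominator.
genome_based = {
    "DNA coding region (bp)": 4_018_014,
    "DNA G+C content (bp)": 2_484_311,
}

# Rows counting annotated features use the number of protein-coding genes.
gene_based = {
    "Genes in paralog clusters": 623,
    "Genes assigned to COGs": 3_534,
    "Genes with signal peptides": 388,
    "Genes with transmembrane helices": 1_096,
}

percentages = {
    name: percent_of(count, GENOME_SIZE_BP)
    for name, count in genome_based.items()
}
percentages.update(
    (name, percent_of(count, PROTEIN_CODING_GENES))
    for name, count in gene_based.items()
)

for name, pct in percentages.items():
    print(f"{name}: {pct:.2f}")
```

Only rows whose denominator is one of the two totals stated in footnote a are included in this sketch.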