Table 3. Genome statistics
| Attribute | Value | % of Total<sup>a</sup> |
|---|---|---|
| Genome size (bp) | 2,263,488 | 100.00 |
| DNA coding (bp) | 2,137,656 | 94.44 |
| DNA G + C (bp) | 1,552,285 | 68.58 |
| DNA scaffolds | 1 | 100.00 |
| Total genes<sup>b</sup> | 2,333 | 100.00 |
| Protein coding genes | 2,279 | 97.64 |
| RNA genes | 54 | 2.31 |
| Pseudo genes | 1 | 0.04 |
| Genes in internal clusters | 822 | 36.07 |
| Genes with function prediction | 2,072 | 90.92 |
| Genes assigned to COGs | 2,098 | 89.89 |
| Genes with Pfam domains | 1,469 | 64.46 |
| Genes with signal peptides | 113 | 4.96 |
| Genes with transmembrane helices | 460 | 20.18 |
| CRISPR repeats | 8 | 0.34 |
<sup>a</sup>The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.

<sup>b</sup>Pseudogenes may also be counted as protein coding or RNA genes, so their number is not additive under the total gene count.
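
As a rough illustration of the first footnote, the sketch below recomputes a few of the percentages from the values in Table 3, using the genome size as the denominator for base-pair attributes and the protein coding gene count for gene-level attributes. The helper name `percent_of_total` and the choice of rows are assumptions made for this example; this is not the annotation pipeline that produced the table.

```python
# Illustrative sketch only: the helper name and the selected rows are
# assumptions for this example, not part of the original annotation pipeline.

GENOME_SIZE_BP = 2_263_488      # "Genome size (bp)" from Table 3
PROTEIN_CODING_GENES = 2_279    # "Protein coding genes" from Table 3


def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to two decimals."""
    return round(100.0 * count / total, 2)


# Base-pair attributes are expressed relative to the genome size.
coding_pct = percent_of_total(2_137_656, GENOME_SIZE_BP)   # DNA coding (bp)
gc_pct = percent_of_total(1_552_285, GENOME_SIZE_BP)       # DNA G + C (bp)

# Gene-level attributes are expressed relative to the protein coding gene count.
pfam_pct = percent_of_total(1_469, PROTEIN_CODING_GENES)   # Genes with Pfam domains
tm_pct = percent_of_total(460, PROTEIN_CODING_GENES)       # Genes with transmembrane helices

print(coding_pct, gc_pct, pfam_pct, tm_pct)  # prints: 94.44 68.58 64.46 20.18
```

For these four rows the recomputed values agree with the "% of Total" column.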