Table 3. Genome statistics.
| Attribute | Value | % of total<sup>a</sup> |
|---|---|---|
| Genome size (bp) | 5,442,549 | 100.00% |
| DNA coding region (bp) | 4,770,475 | 87.65% |
| DNA G+C content (bp) | 3,045,680 | 55.96% |
| Total genes | 5,139 | 100.00% |
| RNA genes | 112 | 2.18% |
| rRNA operons | 7 | 0.14% |
| Protein-coding genes | 4,951 | 96.34% |
| Pseudogenes | 76 | 1.48% |
| Genes in paralog clusters | 112 | 2.18% |
| Genes assigned to COGs | 3,805 | 74.04% |
| Genes assigned in Pfam domains | 4,183 | 81.39% |
| Genes with signal peptides | 676 | 13.15% |
| Genes with transmembrane helices | 1,228 | 23.89% |
| CRISPR repeats | 1 | |
a) The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome.
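
As a quick check of how the percentage column is derived, the following minimal sketch recomputes three rows from the values in Table 3. The helper name `percent_of_total` is hypothetical. The sketch assumes that base-pair rows are divided by the genome size and that gene-count rows are divided by the total gene count (5,139), which is the row listed as 100.00%.

```python
# Values copied from Table 3; names are illustrative, not from the source.
GENOME_SIZE_BP = 5_442_549
TOTAL_GENES = 5_139  # assumed denominator for gene-count rows


def percent_of_total(value, total):
    """Return value as a percentage of total, rounded to two decimals."""
    return round(100.0 * value / total, 2)


coding_pct = percent_of_total(4_770_475, GENOME_SIZE_BP)    # DNA coding region
gc_pct = percent_of_total(3_045_680, GENOME_SIZE_BP)        # DNA G+C content
protein_coding_pct = percent_of_total(4_951, TOTAL_GENES)   # Protein-coding genes

print(f"Coding region: {coding_pct:.2f}%")
print(f"G+C content: {gc_pct:.2f}%")
print(f"Protein-coding genes: {protein_coding_pct:.2f}%")
```

Under these assumptions the three computed values agree with the 87.65%, 55.96%, and 96.34% reported in the table.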