Table 1. General features of the genome of the Welgevonden strain of E. ruminantium.
Size, bp | 1,516,355 |
G + C content, % | 27.5 |
Protein coding regions,* % | 62.0 |
CDSs total, n | 920 |
Average length, bp | 1,032 |
Probable pseudogenes, n (%) | 32 (3.5) |
Average length, bp | 276 |
Predicted protein CDSs, n (%) | 888 (96.5) |
Average length, bp | 1,059 |
CDSs with functional information,†n (%) | 758 (82.8) |
Conserved hypothetical genes, n (%) | 50 (5.5) |
Genes with no functional information, n (%) | 80 (8.7) |
Stable RNAs | |
rRNAs, n | 3 |
tRNAs, n | 36 |
Other RNAs (tmRNA/rnpB), n | 2 |
Tandem repeats, bp (%) | 82,172 (5.4) |
Dispersed repeats (direct and inverted), bp (%) | 43,976 (2.9) |
Not including pseudogenes.
Includes CDSs with database matches to genes of known function, matches to pfam or prosite entries, or informative hydrophobicity plots.