Abstract
Kosmotoga olearia strain TBF 19.5.1 is a member of the Thermotogales that grows best at 65°C and very well even at 37°C. Information about this organism is important for understanding the evolution of mesophiles from thermophiles. Its genome sequence reveals extensive gene gains and a large content of mobile genetic elements. It also contains putative hydrogenase genes that have no homologs in the other member of the Thermotogales.
GENOME ANNOUNCEMENT
Kosmotoga olearia strain TBF 19.5.1 is a thermophilic anaerobic bacterium belonging to the order Thermotogales and was isolated from oil production fluid collected at the Troll B oil platform in the North Sea. It grows at temperatures from 20 to 80°C with a doubling time of 103 ± 8 min at its optimum growth temperature of 65°C on its preferred carbon source, pyruvate (2). It shows an unusual ability to grow well at suboptimal temperatures, with a doubling time on pyruvate of 175 ± 59 min at 37°C (2). Phylogenetic analyses place it as a close relative to the apparently mesophilic Thermotogales, known as the mesotogas (11). Given this phylogenetic position and its ability to grow well over such a range of temperatures, it represents an interesting opportunity to examine the evolutionary transition from a thermophilic to a mesophilic lifestyle.
The genome of K. olearia was sequenced using a combination of Sanger (8-kbp and 40-kbp fosmids) and 454 sequencing. All general aspects of library construction and sequencing can be found at the Joint Genome Institute website (www.jgi.doe.gov). Reads from all libraries were assembled using the Phred/Phrap/Consed assembler (3–5). Possible misassemblies were corrected and gaps between contigs were closed by editing in Consed or by custom primer walks from subclones or PCR products.
Genes were identified using Prodigal (7) as part of the Oak Ridge National Laboratory genome annotation pipeline, followed by a round of manual curation using the JGI GenePRIMP pipeline (12). In order to determine a product description for each of the predicted coding sequences, they were translated and used to search the National Center for Biotechnology Information (NCBI) nonredundant database and the UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro databases. Noncoding genes and miscellaneous features were predicted using tRNAscan-SE (10), RNAMMer (9), Rfam (6), TMHMM (8), and signalP (1).
The complete K. olearia genome consists of 2,302,126 bp in a single circular chromosome, making it one of the largest sequenced Thermotogales genomes. The genome has an average G+C content of 41.5%, and there are a total of 2,118 predicted protein-coding genes, two ribosomal operons, 46 tRNAs, and 51 pseudogenes.
Compared to the other sequenced genomes in this order, the K. olearia genome has many unique mobile genetic elements, including transposons and group II introns. The large number of mobile genetic elements suggests that this genome undergoes frequent recombination, rearrangement, and gene gain/loss events. This apparent genome instability associated with mobile genetic elements may have allowed K. olearia to acquire genes that allow it to grow at a wide range of temperatures and may also have contributed to its relatively large genome size.
The K. olearia genome contains 21 open reading frames transcribed in the same direction (Kole_0555 to Kole_575), comprising a region unique among the Thermotogales. Encoded in this region are a putative Ni-Fe hydrogenase, NADH:quinone reductase complex, and cation antiporter. These annotations raise the possibility of an uncharacterized phenotype for this organism and perhaps an energy conservation mechanism novel among the Thermotogales.
Nucleotide sequence accession number.
The complete genome sequence of Kosmotoga olearia strain TBF 19.5.1 is available in GenBank under accession number CP001634.
Acknowledgments
This work was supported by funds from the NASA Astrobiology Program (NNX08AQ10G), the U.S. Department of Energy Office of Biological and Environmental Research (DE-PS02-08ER08-12), and the National Science Foundation Assembling the Tree of Life program (DEB0830024). The work conducted by the U.S. Department of Energy Joint Genome Institute is supported by the Office of Science of the U.S. Department of Energy under contract no. DE-AC02-05CH11231.
We thank W. Ford Doolittle for assistance in initiating the sequencing effort.
REFERENCES
- 1. Bendtsen J. D., Nielsen H., von Heijne G., Brunak S. 2004. Improved prediction of signal peptides: SignalP 3.0. J. Mol. Biol. 340:783–795 [DOI] [PubMed] [Google Scholar]
- 2. DiPippo J. L., et al. 2009. Kosmotoga olearia gen. nov., sp. nov., a thermophilic, anaerobic heterotroph isolated from an oil production fluid. Int. J. Syst. Evol. Microbiol. 59:2991–3000 [DOI] [PubMed] [Google Scholar]
- 3. Ewing B., Green P. 1998. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8:186–194 [PubMed] [Google Scholar]
- 4. Ewing B., Hillier L., Wendl M. C., Green P. 1998. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8:175–185 [DOI] [PubMed] [Google Scholar]
- 5. Gordon D., Abajian C., Green P. 1998. Consed: a graphical tool for sequence finishing. Genome Res. 8:195–202 [DOI] [PubMed] [Google Scholar]
- 6. Griffiths-Jones S., Bateman A., Marshall M., Khanna A., Eddy S. R. 2003. Rfam: an RNA family database. Nucleic Acids Res. 31:439–441 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Hyatt D., et al. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Krogh A., Larsson B., von Heijne G., Sonnhammer E. L. 2001. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 305:567–580 [DOI] [PubMed] [Google Scholar]
- 9. Lagesen K., et al. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100–3108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Lowe T. M., Eddy S. R. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Nesbø C. L., Dlutek M., Zhaxybayeva O., Doolittle W. F. 2006. Evidence for existence of “mesotogas,” members of the order Thermotogales adapted to low-temperature environments. Appl. Environ. Microbiol. 72:5061–5068 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Pati A., et al. 2010. GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes. Nat. Methods 7:455–457 [DOI] [PubMed] [Google Scholar]
