Abstract
Rhodobacter sphaeroides is a purple nonsulfur photosynthetic bacterium that is considered a possible source of H2 production. R. sphaeroides KD131, which was isolated from sea mud in South Korea, was found to produce high levels of H2. Here we report the complete and annotated genome sequence of R. sphaeroides KD131.
Rhodobacter sphaeroides KD131 (KCTC12085) is a halophilic, purple nonsulfur photosynthetic bacterium that produces high levels of biohydrogen (H2). The strain, KD131, was originally isolated from sea mud off the coast of DaeBu Island, South Korea (8). Recently, the strain was genetically developed for H2 production under any conditions, regardless of the presence of light (6).
We have determined the complete genome sequence of strain KD131, which consists of two chromosomes and two plasmids, using the traditional shotgun whole-genome sequencing method. To accomplish this, two genome libraries (∼2 kb and ∼35 kb) were generated using randomly sheared genomic DNA of strain KD131. The sequences with as much as five times more genomic information were obtained from the 38,063 chromatograms produced by the libraries using ABI 3730xl autosequencers. The DNA-sequencing chromatograms were then analyzed automatically by base calling, fragment assembly, and contig editing using the Phred/Phrap/Consed software package (http://www.phrap.org) (4). For gap closing and to improve the quality of regions with a low Phred quality score or depth, noncloned PCR products amplified using the primers designed by Consed and genomic DNA as a template were sequenced. In addition, we performed manual curation based on the preannotated information in the draft genome sequence to ensure the quality of the genome database. The final assembly of the total genome was then performed manually based on the location and mate information to avoid improper assembly due to the presence of repeat sequences using Consed. The final predicted error rates were 0.07, 0.05, 0.15, and 0.04/10 kb in chromosome 1, chromosome 2, plasmid A, and plasmid B, respectively.
The complete sequences were analyzed using Glimmer3 (3) for the protein-coding gene, tRNAscan-SE (9) for the tRNA, and RNAmmer (7) for the rRNA. The functions of predicted protein-coding genes were then annotated through comparisons with the databases of UniRef90 (11), NCBI-NR (1), COG (12), and KEGG (5). The genome of R. sphaeroides KD131 consists of chromosome 1, which contains 3,152,792 bp (3,101 open reading frames [ORFs], 39 tRNAs, and one 16S-23S-5S rRNA), chromosome 2, which contains 1,297,647 bp (1,224 ORFs, 15 tRNAs, and three 16S-23S-5S rRNAs), plasmid A, which contains 157,345 bp (142 ORFs), and plasmid B (102 ORFs), which contains 103,355 bp. In addition, the genome has a G+C content of ∼68.7% to ∼70.1%.
The aspect of distribution and content of the genes in strain KD131 were highly similar to those of a closely related cognate strain, 2.4.1, which has been analyzed by the Department of Energy Joint Genome Institute (2). However, the genome of R. sphaeroides KD131 may have a more-developed H2-evolving system that consists mainly of an uptake hydrogenase and two nitrogenases. An uptake hydrogenase gene cluster and a molybdenum nitrogenase gene cluster are found in strain 2.4.1, but another iron nitrogenase gene cluster that is not found in strain 2.4.1 is highly similar to the homologue of Rhodopseudomonas palustris CGA009 (10). The information regarding the complete genome of strain KD131 that we report here will facilitate proteomic and metabolic studies designed to develop this strain for H2 production.
Nucleotide sequence accession numbers.
Genome information for the two chromosomes and two plasmids of Rhodobacter sphaeroides KD131 were deposited in GenBank under the accession numbers CP001150 to CP001153.
Acknowledgments
This study was conducted for the Hydrogen Energy R&D Center, which is one of the 21st Century Frontier R&D Programs funded by the Ministry of Education, Science, and Technology of Korea.
Footnotes
Published ahead of print on 21 November 2008.
REFERENCES
- 1.Benson, D. A., I. Karsch-Mizrachi, D. J. Lipman, J. Ostell, and D. L. Wheeler. 2008. GenBank. Nucleic Acids Res. 36D25-D30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Choudhary, M., X. Zanhua, Y. X. Fu, and S. Kaplan. 2007. Genome analyses of three strains of Rhodobacter sphaeroides: evidence of rapid evolution of chromosome II. J. Bacteriol. 1891914-1921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Delcher, A. L., K. A. Bratke, E. C. Powers, and S. L. Salzberg. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23673-679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gordon, D., C. Desmarais, and P. Green. 2001. Automated finishing with autofinish. Genome Res. 11614-625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kanehisa, M., M. Araki, S. Goto, M. Hattori, M. Hirakawa, M. Itoh, T. Katayama, S. Kawashima, S. Okuda, T. Tokimatsu, and Y. Yamanishi. 2008. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 36D480-D484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kim, E.-J., M.-S. Kim, and J. K. Lee. 2008. Hydrogen evolution under photoheterotrophic and dark fermentative conditions by recombinant Rhodobacter sphaeroides containing the genes for fermentative pyruvate metabolism of Rhodospirillum rubrum. Int. J. Hydrogen Energy 335131-5136. [Google Scholar]
- 7.Lagesen, K., P. F. Hallin, E. A. Rodland, H. H. Staerfeldt, T. Rognes, and D. W. Ussery. 2007. RNammer: consistent annotation of rRNA genes in genomic sequences. Nucleic Acids Res. 353100-3108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lee, I. H., J. Y. Park, D. H. Kho, M. S. Kim, and J. K. Lee. 2002. Reductive effect of H2 uptake and poly-beta-hydroxybutyrate formation on nitrogenase-mediated H2 accumulation of Rhodobacter sphaeroides according to light intensity. Appl. Microbiol. Biotechnol. 60147-153. [DOI] [PubMed] [Google Scholar]
- 9.Lowe, T. M., and S. R. Eddy. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25955-964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Oda, Y., S. K. Samanta, F. E. Rey, L. Wu, X. Liu, T. Yan, J. Zhou, and C. S. Harwood. 2005. Functional genomic analysis of three nitrogenase isozymes in the photosynthetic bacterium Rhodopseudomonas palustris. J. Bacteriol. 1877784-7794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Suzek, B. E., H. Huang, P. McGarvey, R. Mazumder, and C. H. Wu. 2007. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 231282-1288. [DOI] [PubMed] [Google Scholar]
- 12.Tatusov, R. L., E. V. Koonin, and D. J. Lipman. 1997. A genomic perspective on protein families. Science 278631-637. [DOI] [PubMed] [Google Scholar]