Skip to main content
Journal of Bacteriology logoLink to Journal of Bacteriology
. 2012 Aug;194(16):4455–4456. doi: 10.1128/JB.00837-12

Draft Genome Sequence of Gluconobacter oxydans WSH-003, a Strain That Is Extremely Tolerant of Saccharides and Alditols

Lili Gao a,b, Jingwen Zhou a,b,, Jie Liu c, Guocheng Du a,b, Jian Chen a,b,
PMCID: PMC3416238  PMID: 22843589

Abstract

Gluconobacter oxydans is known for its incomplete oxidation of a wide range of alcohols, sugars, and acids in a bioprocess. The corresponding oxidation products are secreted almost completely into the medium. Here, we present the high-quality draft genome sequence of G. oxydans WSH-003, an industrial strain with both high l-sorbose productivity and extreme tolerance to saccharides and alditols.

GENOME ANNOUNCEMENT

Gluconobacter oxydans WSH-003 is an l-sorbose-accumulating strain of industrial interest due to its powerful ability to oxidize d-sorbitol into l-sorbose with a high quantitative yield of over 98% on an industrial scale (4). G. oxydans WSH-003 and its relative derivative strains are widely used for the industrial production of vitamin C in China, which occupies more than 90% of the global vitamin C market. G. oxydans WSH-003 was mutated by different methods from a wild-type strain at least 90 times with reliable records to improve the production of l-sorbose and the tolerance to saccharides and alditols, such as l-sorbose and d-sorbitol. Here, we present the draft genome of G. oxydans WSH-003, the industrial strain with both high l-sorbose productivity and extreme tolerance to saccharides and alditols. The elucidation of the genome sequence might provide a basis for both evolutionary analysis and improvement of the biotechnological usage of the bacterium.

Whole-genome sequencing of G. oxydans WSH-003 was performed using a combination of Illumina Hiseq2000 technology (8) (500-bp single-end shotgun sequencing, 3,728,170 sequencing reads, 85.6-fold coverage) and the Roche Genome Sequencer FLX system (paired ends with an insert size of 2 kb, 1,864,135 reads, 42.8-fold coverage). Assembly of the Illumina sequencing reads was performed with SOAPdenovo 1.05 (7). One hundred seventy-nine contigs were generated using Newbler Assembler (454 Life Sciences, Branford, CT) and assembled into 50 scaffolds using paired-end reads. Physical gaps, repeats, and assembly ambiguities were corrected by multiplex PCR and Sanger sequencing. Protein-coding genes were predicted by using Glimmer 3.02 (3) and Genemark 2.5 (1). Artemis was used for final verification of the annotation results (10). rRNAs and tRNAs were predicted using rRNAmmer (6) and tRNAscan-SE (11), respectively. Each gene was annotated through BLASTP searches against the databases NR, Swiss-Prot, TrEMBL, COG, and KEGG (2, 5, 12).

The complete genome of G. oxydans WSH-003 contains a single circular chromosome of 3,364,884 bp. The overall GC content of the chromosome is 56.77%. The draft genome harbors 3,705 predicted protein-coding sequences (CDSs); a total of 48 tRNA genes and one 16S-23S-5S rRNA were identified. The genome sequences of G. oxydans 621H (2.7Mbp) (GenBank accession no. CP000009) (9) were used as the reference, and the number of reads mapped to the reference gene was 95. The most significant feature of G. oxydans WSH-003 is the ability to produce high-yield l-sorbose. The gene clusters encoding d-sorbitol dehydrogenase (sldhAB, 2,670 bp) and responsible for synthesis of the cofactor pyrroloquinoline quinone (pqqABCDE, 2,979 bp) were identified from the draft genome sequence. The comparison of these two sequences with those of G. oxydans 621H indicated that the homology was 74.8% and 45.8%, respectively. In addition, several genes encoding aldehyde dehydrogenase, alcohol dehydrogenase, glucose dehydrogenase, formaldehyde dehydrogenase, etc., were annotated.

In summary, the genome sequence will facilitate additional bioinformatics and experimental investigations to further improve the vitamin C production. Furthermore, comparative genomics analysis and functional genomics analysis could also be performed to trace the origin and evolution of this organism.

Nucleotide sequence accession numbers.

This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number AHKI00000000. The version described in this paper is the first version, AHKI01000000.

ACKNOWLEDGMENTS

This work was supported by grants from the Major State Basic Research Development Program of China (973 Program, grant 2012CB720806), the National High Technology Research and Development Program of China (863 Program, grant SS2012AA023403), the Priority Academic Program Development of Jiangsu Higher Education Institutions, the 111 Project (grant 111-2-06), and the Enterprise Doctor Gathering Project, Jiangsu Province.

REFERENCES

  • 1. Besemer J, Borodovsky M. 2005. GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res. 33: 451–454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Boeckmann B, et al. 2003. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31: 365–370 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23: 673–679 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Deppenmeier U, Hoffmeister M, Prust C. 2002. Biochemistry and biotechnological applications of Gluconobacter strains. Appl. Microbiol. Biotechnol. 60: 233–242 [DOI] [PubMed] [Google Scholar]
  • 5. Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M. 2004. The KEGG resource for deciphering the genome. Nucleic Acids Res. 32: 277–280 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Lagesen K, et al. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35: 3100–3108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Li R, et al. 2010. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 20: 265–272 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Margulies M, et al. 2005. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437: 376–380 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Prust C, et al. 2005. Complete genome sequence of the acetic acid bacterium Gluconobacter oxydans. Nat. Biotechnol. 23: 195–200 [DOI] [PubMed] [Google Scholar]
  • 10. Rutherford K, et al. 2000. Artemis: sequence visualization and annotation. Bioinformatics 16: 944–945 [DOI] [PubMed] [Google Scholar]
  • 11. Schattner P, Brooks AN, Lowe TM. 2005. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 33: 686–689 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Tatusov RL, et al. 2001. The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29: 22–28 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Journal of Bacteriology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES