Abstract
Here we report the genome sequence of Corynebacterium glutamicum S9114, an industrial producer widely used in production of glutamate in China. Preliminary comparison with the sequences of the Corynebacterium glutamicum strains ATCC 13032 and R revealed some notable mutagenesis that might be related to the high yield of glutamate.
GENOME ANNOUNCEMENT
Corynebacterium glutamicum, a fast-growing and aerobic Gram-positive microorganism able to secrete large amounts of glutamate under suitable conditions (10, 16), has a long history of use for the industrial production of various primary metabolites, including amino acids and nucleotides (5, 6). The complete genome sequences of two variants of C. glutamicum ATCC 13032 have been published (7, 8). Genome sequences have been reported for some other closely related microorganisms, such as Corynebacterium efficiens YS-314 (14) and Corynebacterium glutamicum R (17). Here we report the genome sequence of Corynebacterium glutamicum S9114, an industrial producer generated by conventional mutagenesis (18) which has been widely used in production of glutamate in China. Strain S9114 has a high resistance to high sugar and glutamate concentrations, which could be applied in glutamate production under biotin limitation or an excess of biotin with addition of penicillin.
The genome was sequenced using the Illumina Solexa GA IIx instrument at the Beijing Genomics Institute (BGI; Shenzhen, China). A library containing 500-bp inserts was constructed. Sequencing was performed with the paired-end strategy of 90-bp reads to produce 327 Mb of filtered sequences, representing a 95.5-fold coverage of the genome. The sequences were assembled into 67 contigs and 47 scaffolds using the SOAPdenovo package (12). Some remaining gaps in scaffolds were closed by Sanger sequencing of PCR products.
Genome annotation was performed at the NCBI Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP). Open reading frames (ORFs) were identified by Glimmer 3.02 (4) and Genemark (2). The GenBank NR database, as well as the KEGG (9) and COG (15) databases, was employed for BLASTP identification (1). tRNA and rRNA genes were detected by tRNAscan-SE (13) and RNAmmer (11), respectively.
The chromosome of Corynebacterium glutamicum S9114 is 3,292,392 bp in length, with an average G+C content of 53.93%. In the genome sequence, 3,098 ORFs, 54 tRNA genes, and 1 rRNA fragment were identified. Fourteen pseudogenes were also detected, which coincided with the fact that Corynebacterium glutamicum S9114 was selected by a random mutagenesis process. Moreover, four potential prophage sites were found using Prophage Finder (3).
A brief comparative analysis on some genes related to the glutamate synthesis pathway was conducted on S9114, ATCC 13032, and Corynebacterium glutamicum R. Functions of those genes include serving as a key enzyme in the biosynthesis pathway (pyc, pdh, pepC, odhA, and gltB), nutrient assimilation (sugR, lldR, and amtR), cell wall biosynthesis and glutamate transport (murE, ftsI, and dtsR), and metabolic regulation (odhI, pknG, dtxR, and ripA). The preliminary results showed that some notable mutagenesis might lead to the high glutamate yield of S9114. A detailed report will be included in a future publication, including the results of a complete sequence and functional validation of some mutated proteins.
Nucleotide sequence accession numbers.
This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number AFYA00000000. The version described in this paper is the first version, AFYA01000000.
Acknowledgments
This work was financially supported by the Fundamental Research Funds for the Central Universities, SCUT, project no. 2009ZM0289.
REFERENCES
- 1. Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J. 1990. Basic local alignment search tool. J. Mol. Biol. 215:403–410 [DOI] [PubMed] [Google Scholar]
- 2. Besemer J., Lomsadze A., Borodovsky M. 2001. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res. 29:2607–2618 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Bose M., Barber R. D. 2006. Prophage Finder: a prophage loci prediction tool for prokaryotic genome sequences. In Silico Biol. 6:223–227 [PubMed] [Google Scholar]
- 4. Delcher A. L., Bratke K. A., Powers E. C., Salzberg S. L. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Demain A. L. 2000. Small bugs, big business: the economic power of the microbe. Biotechnol. Adv. 18:499–514 [DOI] [PubMed] [Google Scholar]
- 6. Hermann T. 2003. Industrial production of amino acids by coryneform bacteria. J. Biotechnol. 104:155–172 [DOI] [PubMed] [Google Scholar]
- 7. Ikeda M., Nakagawa S. 2003. The Corynebacterium glutamicum genome: features and impacts on biotechnological processes. Appl. Microbiol. Biotechnol. 62:99–109 [DOI] [PubMed] [Google Scholar]
- 8. Kalinowski J., et al. 2003. The complete Corynebacterium glutamicum ATCC 13032 genome sequence and its impact on the production of L-aspartate-derived amino acids and vitamins. J. Biotechnol. 104:5–25 [DOI] [PubMed] [Google Scholar]
- 9. Kanehisa M., et al. 2008. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 36:D480–D484 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Kumagai H. 2000. Microbial production of amino acids in Japan. Adv. Biochem. Eng. Biotechnol. 69:71–85 [DOI] [PubMed] [Google Scholar]
- 11. Lagesen K., et al. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100–3108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Li R., et al. 2010. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 20:265–272 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Lowe T. M., Eddy S. R. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Nishio Y., et al. 2003. Comparative complete genome sequence analysis of the amino acid replacements responsible for the thermostability of Corynebacterium efficiens. Genome Res. 13:1572–1579 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Tatusov R. L., Galperin M. Y., Natale D. A., Koonin E. V. 2000. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 28:33–36 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Vertes A. A., Inui M., Yukawa H. 2005. Manipulating corynebacteria, from individual genes to chromosomes. Appl. Environ. Microbiol. 71:7633–7642 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Yukawa H., et al. 2007. Comparative analysis of the Corynebacterium glutamicum group and complete genome sequence of strain R. Microbiology 153:1042–1058 [DOI] [PubMed] [Google Scholar]
- 18. Zhang C. Y., Shi Z., Gao P., Duan Z., Mao Z. 2005. On-line prediction of products concentrations in glutamate fermentation using metabolic network model and linear programming. Biochem. Eng. J. 25:99–108 [Google Scholar]