Abstract
Here, we report the complete genome sequence of Bacillus amyloliquefaciens TA208, a strain for industrial production of guanosine and synthesis of ribavirin by assimilation of formamide. Comparison of its genome sequence with those of strains DSM7 and FZB42 revealed horizontal gene transfer represented by unique prophages and restriction-modification systems and indicated significant accumulation of guanosine.
TEXT
With the development of next-generation sequencing technology, microbial genome data have increased explosively (11). However, knowledge on the industrially valuable strains is not yet sufficient, especially at the genome level. Here we report the complete genome sequence of Bacillus amyloliquefaciens TA208, which has been used in production of guanosine and also employed in microbial synthesis of the antiviral drug ribavirin by assimilation of formamide in fermentation. Strain TA208 is an adenine auxotrophic, 8-azaguanine- and methionine sulfoxide-resistant strain generated by conventional mutagenesis (15).
The genome was sequenced using the Illumina HiSeq 2000 at Beijing Genomics Institute (BGI; Shenzhen, China). A library containing 500-bp inserts was constructed. Sequencing was performed with the pair-end strategy of 90-bp reads to produce 419.04 Mb of filtered sequences, representing a 106.4-fold coverage of the genome. The sequences were assembled into 162 contigs and 18 scaffolds using the SOAPdenovo package (8). Gaps were closed by PCR and sequencing of the amplicons by primer walking. The closed genome is a circular chromosome of 3,937,511 bp without plasmids.
Genome annotation was performed at the NCBI Prokaryotic Genomes Automatic Annotation Pipeline, where 4,135 open reading frames (ORFs), six 16S-23S-5S rRNA operons, and 71 tRNAs were identified using Glimmer (4) and Genemark (10), BLAST (13) against the Rfam database (5), and tRNAscan-SE (9), respectively. A 1,259,096-bp fragment (position 1,057,022 to 2,316,118) was found to be inversed in the chromosome of strain TA208 compared to that in strains DSM7 (14) and FZB42 (2) by the Mauve package (3), presumably due to phage infection, since two novel potential prophages flanking this region were identified using Prophage Finder (1). Moreover, six other prophage and prophage remnants were found, providing evidence for horizontal gene transfer. Preliminary analysis revealed sets of restriction-modification systems in strain TA208, including M.BamHI, M.BamHII, R.BamHI, M.H2I (7), and a novel type II system composed of the ORFs BAMTA208_19835 and BAMTA208_19845. Strain TA208 harbors a complete set of genes required for natural competence but does not readily incorporate exogenous DNA (15), which might be caused by the frameshift mutation in comS (6) and the complex restriction-modification systems.
The most noticeable change in the genes for purine de novo synthesis is the presence of a truncated copy of purA in strain TA208, revealing the loss of activity for adenylosuccinate synthetase. Mutations potentially elevating the corresponding activity or abolishing the end product feedback inhibition were also observed in the enzymes encoded by purL (D624N), purM (Q322K, H334Q), and purC (N6S, I89V, H166L) compared to the reference strains. The nonsense mutation in pbuX caused the absence of guanine permease in strain TA208, corresponding to the phenotype of 8-azaguanine resistance. However, most of the other genes encoding the enzymes in de novo purine synthesis pathway, the nucleoside efflux pumps, and their riboswitch containing promoter regions (12) remain unchanged, indicating the potential for further strain improvement.
Nucleotide sequence accession number.
The complete genome sequence of B. amyloliquefaciens TA208 has been deposited in GenBank (accession number CP002627).
ACKNOWLEDGMENTS
This research was funded by the Knowledge Innovation Program of the Chinese Academy of Sciences (KSCX2-EW-J-6) and the National Drug Discovery Program of China (2008ZX09401-05).
Footnotes
Published ahead of print on 22 April 2011.
REFERENCES
- 1. Bose M., Barber R. D. 2006. Prophage Finder: a prophage loci prediction tool for prokaryotic genome sequences. In Silico Biol. 6:223–227 [PubMed] [Google Scholar]
- 2. Chen X. H., et al. 2007. Comparative analysis of the complete genome sequence of the plant growth-promoting bacterium Bacillus amyloliquefaciens FZB42. Nat. Biotechnol. 25:1007–1014 [DOI] [PubMed] [Google Scholar]
- 3. Darling A. C. E., Mau B., Blattner F. R., Perna N. T. 2004. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 14:1394–1403 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Delcher A. L., Harmon D., Kasif S., White O., Salzberg S. L. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 27:4636–4641 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Griffiths-Jones S., et al. 2005. Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 33:D121–D124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Hamoen L. W., Eshuis H., Jongbloed J., Venema G., Sinderen D. V. 1995. A small gene, designated comS, located within the coding region of the fourth amino acid activation domain of srfA, is required for competence development in Bacillus subtilis. Mol. Microbiol. 15:55–63 [DOI] [PubMed] [Google Scholar]
- 7. Langea C., Noyer-Weidnera M., Trautner T. A., Weinerb M., Zahlerb S. A. 1991. M.H2I, a multispecific 5C-DNA methyltransferase encoded by Bacillus amyloliquefaciens phage H2. Gene 100:213–218 [DOI] [PubMed] [Google Scholar]
- 8. Li R., et al. 2010. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 20:265–272 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Lowe T. M., Eddy S. R. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Lukashin A. V., Borodovsky M. 1998. GeneMark. hmm: new solutions for gene finding. Nucleic Acids Res. 26:1107–1115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. MacLean D., Jones J. D. G., Studholme D. J. 2009. Application of ‘next-generation’ sequencing technologies to microbial genetics. Nat. Rev. Microbiol. 7:287–296 [DOI] [PubMed] [Google Scholar]
- 12. Mandal M., Boese B., Barrick J. E., Winkler W. C., Breaker R. R. 2003. Riboswitches control fundamental biochemical pathways in Bacillus subtilis and other bacteria. Cell 113:577–586 [DOI] [PubMed] [Google Scholar]
- 13. McGinnis S., Madden T. L. 2004. BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res. 32:W20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Rückert C., Blom J., Chen X., Reva O., Borriss R. Genome sequence of B. amyloliquefaciens type strain DSM7T reveals differences to plant-associated B. amyloliquefaciens FZB42. J. Biotechnol., in press. doi:10.1016/j.jbiotec.2011.01.006, [DOI] [PubMed] [Google Scholar]
- 15. Zhang G., et al. 2011. Enhancing electro-transformation competency of recalcitrant Bacillus amyloliquefaciens by combining cell-wall weakening and cell-membrane fluidity disturbing. Anal. Biochem. 409:130–137 [DOI] [PubMed] [Google Scholar]