Abstract
Bacillus amyloliquefaciens strain KCP2 was isolated from municipal food waste samples collected in Vallabh Vidyanagar, Gujarat, India. Strain KCP2 is noteworthy due to its ability to produce a thermostable, alkaliphilic α-amylase and a protease. These enzymes have importance in several industrial processes including bread making, brewing, starch processing, pharmacy, and textile industries. Whole genome sequencing of strain KCP2 showed that the estimated genome size was 3.9 Mb, the G + C content was 46%, and it coded for 4113 genes.
Keywords: Bacillus amyloliquefaciens, Ion torrent, NGS, Whole genome sequencing, Amylases, Protease
Here, we report on a draft genome sequence of Bacillus amyloliquefaciens strain KCP2 which produces a thermostable alkaliphilic α-amylase and protease, useful in bread making, brewing, starch processing, pharmacy, and textile industries. Strain KCP2 was isolated from municipal food waste samples collected in Vallabh Vidyanagar, Gujarat, India (Prajapati et al. 2015a). The cells of the strain are aerobic motile rods arranged singly or in chains and stain Gram positive. The strain is oxidase and catalase positive, hydrolyses gelatine and casein, starch, and Tween 40 and Tween, suggesting that it is proteolytic, amylolytic, and lipolytic, respectively. The α-amylase produced by strain KCP2 has been used in the saccharification and fermentation of raw corn starch and food waste for the production of bioethanol (Prajapati et al. 2015b). The strain is also positive in Vogel-Proskauer test and grows on glucose, mannitol, glycerol, glycogen, salicin, cellobiose, fructose, galactose, maltose, ribose, sorbitol, sucrose, and D-xylose while producing acid (Wang et al. 2008; Borriss et al. 2011).
Bacterial DNA from strain KCP2 was extracted and purified (Prajapati et al. 2015b), and single end sequencing was performed using a 318 chipset and 400 bp chemistry on an Ion Torrent Personal Genome Machine (PGM) housed at the Department of Animal Biotechnology College of Veterinary Science and Animal Husbandry, Anand Agricultural University, Gujarat, India using the standard protocols described by the manufacturer (Roche Diagnostics Ltd., United Kingdom). In total, 2,590,427 reads were obtained, which were corrected with Pollux to 2,554,487 reads (Marinier et al. 2015). The genome was assembled using SPAdes version 3.10.1 (Bankevich et al. 2012) and MIRA version 4.0.2 assembler (Chevreux et al. 2004). The two assemblies and reads were used to re-verify the longest and reliable contigs using custom Perl scripts. The assembled draft genome of B. amyloliquefaciens strain KCP2 consisted of 34 scaffolds, a N50 of 439,503 bp with the largest scaffold of 979,499 bp. The genome size was estimated to be 3,906,932 bp which had a GC content of 46%. BUSCO2 analysis (Simão et al. 2015) using bacterial lineage data set in the presence of conserved orthologous genes among species of the genus indicated 98% completeness. Gene annotation was performed using NCBI Prokaryotic Genome Annotation Pipeline (PGAP) (Tatusova et al. 2016), which identified 3681 protein-coding sequences (Table 1). The nearest alpha amylase sequence was found from B. amyloliquefaciens strain DSM-7, a soil bacterium characterized by its enormous potential to produce extracellular enzymes of industrials importance including amylases and proteases. The PGAP annotation confirmed the presence of amylase in the genome. The Multiple Sequence Alignment (MSA) and phylogenetic analysis of α-amylase show wide diversification (Fig. 1). Further in silico study of α-amylase may provide novel thermostability signatures. The genome sequence and their annotation reported here would also be useful genetic resource for engineering metabolism of B. amyloliquefaciens and improve their usability. The comparative genomics analysis of this strain with 52 sequenced genomes of B. amyloliquefaciens is in progress. We expect to yield a better understanding of B. amyloliquefaciens evolution through genome dynamics, population structure, and phylogenies of species groups.
Table 1.
Items | Count |
---|---|
Total Genes | 4113 |
Total CDS | 3999 |
Coding Genes | 3681 |
Coding CDS | 3681 |
Genes (RNA) | 114 |
rRNAs | 9, 9, 6 (5S, 16S, 23S) |
tRNAs | 85 |
ncRNAs | 5 |
Total Pseudo Genes | 318 |
tmRNA | 1 |
Compliance with ethical standards
Conflict of interest
The authors declare that there is no conflict of interest regarding the publication of this article.
Footnotes
Accession number(s) The whole genome data have been deposited at DDBJ/ENA/GenBank under the accession number NMRK00000000. The version described in this paper is version NMRK01000000.
References
- Bankevich A, Nurk S, Antipov D, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Borriss R, Chen X-H, Rueckert C, et al. Relationship of Bacillus amyloliquefaciens clades associated with strains DSM 7T and FZB42T: a proposal for Bacillus amyloliquefaciens subsp. amyloliquefaciens subsp. nov. and Bacillus amyloliquefaciens subsp. plantarum subsp. nov. based on complete genome sequence comparisons. Int J Syst Evol Microbiol. 2011;61:1786–1801. doi: 10.1099/ijs.0.023267-0. [DOI] [PubMed] [Google Scholar]
- Chevreux B, Pfisterer T, Drescher B, et al. Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs. Genome Res. 2004;14:1147–1159. doi: 10.1101/gr.1917404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marinier E, Brown DG, McConkey BJ. Pollux: platform independent error correction of single and mixed genomes. BMC Bioinformatics. 2015;16:10. doi: 10.1186/s12859-014-0435-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prajapati V, Trivedi U, Patel KC. Bioethanol production from the raw corn starch and food waste employing simultaneous saccharification and fermentation approach. Waste and Biomass Valorization. 2015;6:191–200. doi: 10.1007/s12649-014-9338-z. [DOI] [Google Scholar]
- Prajapati VS, Trivedi UB, Patel KC. A statistical approach for the production of thermostable and alklophilic alpha-amylase from Bacillus amyloliquefaciens KCP2 under solid-state fermentation. 3. Biotech. 2015;5:211–220. doi: 10.1007/s13205-014-0213-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simão FA, Waterhouse RM, Ioannidis P, et al. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- Tatusova T, DiCuccio M, Badretdin A, et al. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 2016;44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang L-T, Lee F-L, Tai C-J, Kuo H-P. Bacillus velezensis is a later heterotypic synonym of Bacillus amyloliquefaciens. Int J Syst Evol Microbiol. 2008;58:671–675. doi: 10.1099/ijs.0.65191-0. [DOI] [PubMed] [Google Scholar]