Abstract
In the present study, we report data on the draft genome sequence of a lipopeptide producing rhizospheric Bacillus subtilis GM5 isolate. The genome consists of 4,271,280 bp with a GC-pair content of 43.3%. A total of 4518 genes including 75 tRNA genes, 3 operons coding for rRNA genes and 56 pseudogenes were annotated. Gene clusters responsible for the biosynthesis of secondary metabolites were validated. Six of the thirty-three clusters identified in the genome code for antimicrobial non-ribosomal peptides synthesis. The Whole Genome Shotgun project of B. subtilis GM5 has been deposited in the NCBI database under the accession number NZ_NKJH00000000 (https://www.ncbi.nlm.nih.gov/nuccore/NZ_NKJH00000000.1).
Keywords: Bacillus subtilis, Analysis and assembly of the genome, Antimicrobial lipopeptides
Specifications table
Subject area | Biology |
More specific subject area | Bioinformatics (Genomics) |
Type of data | Table, figures |
How data was acquired | Genome sequencing: Illumina Miseq (USA) |
Bioinformatics approaches: NCBI Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP), the RAST web server (http://rast.nmpdr.org/), the antiSMASH server (http://antismash.secondarymetabolites.org) | |
Data format | Analyzed |
Experimental factors | Genomic DNA from pure culture |
Experimental features | Isolation of bacteria, genome sequencing, draft genome assembly and annotation, active metabolites prediction |
Data source location | B. subtilis GM5 was isolated from the potato rhizosphere (Kazan, Russia) |
Data accessibility | The whole genome sequence of B. subtilis GM5 has been deposited in GenBank under the accession number NZ_NKJH00000000 |
(https://www.ncbi.nlm.nih.gov/nuccore/NZ_NKJH00000000.1) | |
Related research article | « Bacillus subtilis strains with antifungal activity against the phytopathogenic fungi» by Mardanova et al. [1] |
«New Bacillus subtilis Strains as Promising Probiotics» by Hadieva et al. [2] |
Value of the data
-
•
The data on the B. subtilis GM5 genome is resourceful and can be utilized in understanding their potential biotechnological applications. B. subtilis GM5 was concluded to be a promising strain for use as probiotics.
-
•
33 potential clusters of secondary metabolite synthesis have been identified in the genome of B. subtilis GM5 strain, including six gene clusters participating in the biosynthesis of non-ribosomal peptide synthetase (NRPS).
-
•
The data demonstrated here can be used by other researchers working or studying in the field of genome analysis.
-
•
The data presented expands the molecular information on the diversity of B. subtilis.
1. Data
The Bacillus subtilis GM5 isolated from the rhizosphere of potatoes possesses remarkable antimicrobial properties [1] and a probiotic potential [2]. Previous studies have shown that GM5 shows antagonism to pathogenic and opportunistic enterobacteria via the production of cyclic lipo- and dipeptides [1], [2]. The strain was resistant to 1–10% chicken bile and to a wide range of the ambient pH. B. subtilis GM5 possessed proteolytic and phytate-hydrolyzing activity and proved to be safe for model animals. Scanning electron microscopy (SEM) of isolated culture showed the presence of rod-shaped cells that are approximately 0.63–0.71 µm in width and 1.80–2.50 µm in length (Fig. 1).
Based on the homology of the 16 S rRNA gene, strain GM5 shares similarity with B. subtilis 168 (98% for a 1010 bp sequence). Using a 16 S rRNA based tree, the phylogenetic affiliation of B. subtilis GM5 to closely related species within the genus is exhibited (Fig. 2).
The genome of B. subtilis GM5 includes 21 contigs, united in 19 scaffolds (N50 – 551,988 and L50 - 2 bp.). The genome has a size of 4,271,280 bp with a GC-pair content of 43.3%. 4,518 genes were annotated (Table 1), including 4,364 protein coding genes, 75 tRNA genes, 3 operons coding for rRNA genes and 56 pseudogenes. The B. subtilis GM5 strain was compared with probiotic strains Bacillus coagulans S-lac, B. subtilis TO-A JPC [3] and Bacillus toyonensis BCT-7112 [4] (Table 1). B. subtilis GM5 and B. subtilis TO-A JPC share similarity in their primary genomic properties: size, CDS, GC content. (Table 1).
Table 1.
Genome feature | Bacillus subtilis GM5 | Bacillus subtilis TO-A JPC [3] | Bacillus coagulans S-lac [3] | Bacillus toyonensis BCT-7112 [4] |
---|---|---|---|---|
NCBI Accession number | NZ_NKJH00000000.1 | CP011882 | CP011939 | NC_022781.1 |
Genome size (bp) | 4.271.280 | 4.090.708 | 3.694.837 | 5.025.419 |
GC content (%) | 43.3 | 43.8 | 46.2 | 35.5 |
CDS | 4.364 | 4.231 | 4.088 | 4.999 |
Isolation source | rhizosphere | Vibact® | Sporlac® | soil |
Importance of the strain | Probiotic in animal nutrition, biocontrol of plant pathogens | Probiotic for humans and animals | Probiotic for humans and animals | Probiotic in animal nutrition |
Identified secondary metabolite clusters | 15 (33) | 11 | 1 | 7 |
The RAST server predicted 4,479 coding sequences, of which 2101 coding sequences (47%) were annotated as seed subsystem features and 2378 coding sequences (53%) annotated as outside of the seed subsystem (Fig. 3).
Analysis using the antiSMASH program showed that the genome of GM5 has 33 potential gene clusters, of which 15 (Table 1) are responsible for the synthesis of secondary metabolites, including antimicrobial peptides, terpenes, fatty acids and others. A comparative analysis of the 15 clusters showed 6 gene clusters belonging to the following NRPS family: BGC0001095_c1 Fengycin_biosynthetic_gene_cluster (Compound: Fengycin - possesses antifungal properties [5]), BGC0000407_c1 Plipastatin_ biosynthetic_gene_cluster (compound: plipastatin – a known type of surface-active compound [6]), BGC0001089_c1 Bacillaene_biosynthetic_gene_cluster (Compound: bacillaene - known to exhibit antibacterial activity [7]); BGC0001184_c1 Bacilysin_biosynthetic _gene_cluster (compound: bacilysin - possesses antibacterial activity [8]); BGC0000309_c1 Bacillibactin_biosynthetic_gene_cluster (Compound: bacillibactin – an iron chelator [9]); BGC0000433_c1 Surfactin_biosynthetic_gene_ cluster (compound: surfactin - a known type of surface-active compound [5], [6]) (Table 2). In addition, 18 of the identified clusters were only predicted to be putative.
Table 2.
Cluster | Type | Most similar known cluster | MIBiG BGC-ID |
---|---|---|---|
1 | Nrps | Bacillibactin_biosynthetic_gene_cluster (100% of genes show similarity) | BGC0000309_c1 |
2 | Nrps | Bacillaene_biosynthetic_gene_cluster (100% of genes show similarity) | BGC0001089_c1 |
3 | Nrps | Fengycin_biosynthetic_gene_cluster (100% of genes show similarity) | BGC0001095_c1 |
4 | Nrps | Surfactin_biosynthetic_gene_cluster (82% of genes show similarity) | BGC0000433_c1 |
5 | Nrps | Plipastatin_biosynthetic_gene_cluster (38% of genes show similarity) | BGC0000407_c1 |
6 | Nrps | Bacilysin_biosynthetic_gene_cluster (100% of genes show similarity) | BGC0001184_c1 |
Thus, the listed gene clusters found in the genome of B. subtilis GM5 are responsible for the synthesis of the non-ribosomal cyclic lipopeptides: fengycin, plipastin, surfactin, bacillaene, bacilysin and bacillibactin dipeptide. Thus, an important feature of the B. subtilis GM5 genome lies in the fact that much of its genetic material is devoted to the biosynthesis of secondary metabolites. By utilizing these metabolites, B. subtilis GM5 is able to suppress pathogenic and conditionally pathogenic microflora, capable of causing intestinal dysbiosis in chickens. The B. subtilis GM5 strain has great prospects as a potential probiotic.
2. Experimental design, materials and methods
2.1. Morphological analysis
The bacterial morphology was investigated using electron scanning microscopy according to the method described in Ref. [10].
2.2. Phylogeny analysis
The initial phylogeny of the isolate GM5 was studied using 16 S rRNA analysis. Phylogenetic analysis of the strain GM5 was performed using MEGA 7.0.14 software. Phylogenetic tree was generated using the Maximum likelihood (ML) algorithm with 1000 bootstrap iterations. The strain was deposited in the museum of the laboratory "Biosynthesis and Bioengineering of Enzymes" (Kazan Federal University, Russia).
2.3. Genomic DNA preparation
B. subtilis strain GM5 was inoculated in 20 ml of LB medium and grown overnight at 37 °C with rocking rate of 200 rpm. 10 mL were centrifuged at 5000×g for 10 min at 4 °C and genomic DNA was extracted using Kit NucleoSpin® Microbial DNA. The quality of the final DNA sample were evaluated by gel electrophoresis (1.5% agarose gel) and DNA concentration was estimated by using a NanoDrop 2000с Spectrophotometer (Thermo Scientific). In total, 500 ng/µL of genomic DNA was received and sent for the sequencing.
2.4. Genome sequencing and assembly
Whole-genome sequencing of strain GM5 and analysis of genes responsible for the synthesis of antimicrobial peptides were conducted. DNA sequencing was performed using Illumina MiSeq technology by the paired-end method. The quality of the sequencing was checked using the FastQC_v0.11.3. software. De novo assembly and analysis of contigs were carried out using assembler SPAdes_v3.8.1. The statistics of the assembly was calculated with the QUAST_v2.3 program.
2.5. Genome annotation
The genome was annotated using the PGAAP NCBI program and the RAST web server [11].
The antiSMASH program (antibiotics & Secondary Metabolite Analysis Shell) [12] was used to analyze the clusters of the antimicrobial metabolites of the GM5 strain, Bacillus subtilis TO-A JPC, Bacillus coagulans S-lac, Bacillus toyonensis BCT-7112. The complete strain genome (FASTA file) was uploaded to the public web version of antiSMASH. The result of the analysis is presented on an interactive HTML page with SVG graphics, and the various parts of the analysis have been displayed in different panels for each gene cluster.
Acknowledgements
The work is performed in accordance with the Russian Government Program of Competitive Growth of Kazan Federal University and was supported by grant from Russian Scientific Foundation (Project no. 16-16-04062).
Footnotes
Transparency data associated with this article can be found in the online version at https://doi.org/10.1016/j.dib.2018.12.081.
Supplementary data associated with this article can be found in the online version at https://doi.org/10.1016/j.dib.2018.12.081.
Contributor Information
G.F. Hadieva, Email: g.h95@mail.ru.
M.T. Lutfullin, Email: lutfullin.marat2012@yandex.ru.
D.S. Pudova, Email: dasha171711@gmail.com.
Y.A. Akosah, Email: akosah2005@gmail.com.
N.E. Gogoleva, Email: negogoleva@gmail.com.
E.I. Shagimardanova, Email: rjuka@mail.ru.
A.M. Mardanova, Email: mardanovaayslu@mail.ru.
M.R. Sharipova, Email: marsharipova@gmail.com.
Transparency document. Supplementary material
.
Appendix A. Supplementary material
.
References
- 1.Mardanova A.M., Hadieva G.F., Lutfullin M.T., Khilyas I.V., Minnullina L.F., Gilyazeva A.G., Bogomolnaya L.M., Sharipova M.R. Bacillus subtilis strains with antifungal activity against the phytopathogenic fungi. Agric. Sci. 2017;8:1–20. [Google Scholar]
- 2.Khadieva G.F., Lutfullin M.T., Mochalova N.K., Lenina O.A., Sharipova M.R., Mardanova A.M. New Bacillus subtilis strains as promising probiotics. Mikrobiologiya. 2018;87:463–471. [Google Scholar]
- 3.Khatri I., Sharma S., Ramya T.N.C., Subramanian S. Complete genomes of Bacillus coagulans S-lac and Bacillus subtilis TO-A JPC, two phylogenetically distinct probiotics. PLoS One. 2016;11:1–25. doi: 10.1371/journal.pone.0156745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lopes R., Cerdeira L., Tavares G.S., Ruiz J.C., Blom J., Horbcio E.C.A., Mantovani H.C., de Queiroz M.V. Genome analysis reveals insights of the endophytic Bacillus toyonensis BAC3151 as a potentially novel agent for biocontrol of plant pathogens. World J. Microbiol. Biotechnol. 2017;33:1–15. doi: 10.1007/s11274-017-2347-x. [DOI] [PubMed] [Google Scholar]
- 5.Ongena M., Jourdan E., Adam A., Paquot M., Brans A., Joris B., Arpigny J., Thonart P. Surfactin and fengycin lipopeptides of Bacillus subtilis as elicitors of induced systemic resistance in plants. Environ. Microbiol. 2007;9:1084–1090. doi: 10.1111/j.1462-2920.2006.01202.x. [DOI] [PubMed] [Google Scholar]
- 6.Roongsawang N., Thaniyavarn J., Thaniyavarn S., Kameyama T., Haruki M., Imanaka T., Morikawa M., Kanaya S. Isolation and characterization of a halotolerant Bacillus subtilis BBK-1 which produces three kinds of lipopeptides: bacillomycin L, plipastatin, and surfactin. Extremophiles. 2002;6:499–506. doi: 10.1007/s00792-002-0287-2. [DOI] [PubMed] [Google Scholar]
- 7.Um S., Fraimout A., Sapountzis P., Oh D., Poulsen M. The fungus-growing termite Macrotermes natalensis harbors bacillaene-producing Bacillus sp. that inhibit potentially antagonistic fungi. Sci. Rep. 2013;3:1–7. doi: 10.1038/srep03250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chena X.H., Scholz R., Borriss M., Junge H., Mögel G., Kunzc S., Borriss R. Difficidin and bacilysin produced by plant-associated Bacillus amyloliquefaciens are efficient in controlling fire blight disease. J. Biotechnol. 2009;140:38–44. doi: 10.1016/j.jbiotec.2008.10.015. [DOI] [PubMed] [Google Scholar]
- 9.Miethke M., Westers H., Blom E., Kuipers O.P., Marahiel M.A. Iron starvation triggers the stringent response and induces amino acid biosynthesis for bacillibactin production in Bacillus subtilis. J. Bacteriol. 2006;188:8655–8657. doi: 10.1128/JB.01049-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Raho R., Paladini F., Lombardi F.A., Boccarella S., Zunino B., Pollini M. In-situ photo-assisted deposition of silver particles on hydrogel fibers for antibacterial applications. Mater. Sci. Eng. C Mater. Biol. Appl. 2015;55:42–49. doi: 10.1016/j.msec.2015.05.050. [DOI] [PubMed] [Google Scholar]
- 11.Aziz R.K., Bartels D., Best A.A., DeJongh M., Disz T., Edwards R.A. The RAST server: rapid annotations using sub- systems technology. BMC Genom. 2008;9:75. doi: 10.1186/1471-2164-9-75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Weber T., Blin K., Duddela S., Krug D., Kim H.U., Bruccoleri R., Lee S.Y., Fischbach M.A., Müller R., Wohlleben W., Breitling R., Takano E., Medema M.H. antiSMASH 4.0-a comprehensive resource for the genome mining of biosynthetic gene clusters. Nucleic Acids Res. 2015;43:237–243. doi: 10.1093/nar/gkv437. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.