Genome sequences of Bacillus spizenii SHT-15 isolated from cotton (Gossypium hirsutum) rhizosphere in the arid region of Northwest China

Zhichao Meng; XinXiang Niu; Ablimit Nuraliya; Yue Sheng; Hongmei Yang; Ming Chu; Ning Wang; Huifang Bao; Faqiang Zhan; Rong Yang; Kai Lou; Shuang Dou; Zhao Zhang; Yun Chen; Yingwu Shi

doi:10.1128/mra.00536-25

Microbiol Resour Announc. 2026 Jan 26;15(2):e00536-25. doi: 10.1128/mra.00536-25


Editor: Vanja Klepac-Ceraj

PMCID: PMC12896162 PMID: 41586513

ABSTRACT

Bacillus spizenii strain SHT-15 was isolated from cotton rhizosphere soil in Shihezi, Xinjiang, China. Here, we report its whole-genome sequence: the genome is 4.082 Mb in size and comprises 4,185 predicted protein-coding sequences and 96 RNA genes.

KEYWORDS: Bacillus spizenii, cotton Verticillium wilt, antibacterial ability, coding sequence, genome sequencing

ANNOUNCEMENT

Bacillus spizenii, a gram-positive bacterium, is known for secreting antimicrobial peptides and other substances that inhibit plant pathogens (1). It also produces biosurfactants, which enhance its anti-pathogenic efficacy. B. spizenii is extensively studied in molecular and cell biology because of its large genome and genetic versatility (2, 3).

In Shihezi City, Xinjiang, China, the antagonistic bacterium SHT-15 was isolated from the rhizosphere of healthy cotton plants as a candidate for controlling cotton Verticillium wilt (4). Using the dilution plating method, soil samples were diluted to 10⁻⁶, spread onto TSA medium plates, and incubated at 33°C for 24 h. Antibacterial activity was determined by the plate confrontation method, and the dominant antagonistic strains were purified on nutrient agar medium. Pot experiments with a 2×10⁸ CFU/mL bacterial suspension showed that SHT-15 achieved an 89.23% control efficiency against tomato root rot caused by Fusarium oxysporum during the seedling stage.
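The arithmetic that links a plate count to a titer such as the 2×10⁸ CFU/mL suspension can be sketched as follows. This is a minimal illustration, not the authors' protocol: the colony count and the plated volume are hypothetical values chosen for the example, and the function name is an assumption.

```python
def cfu_per_ml(colonies: int, dilution_exponent: int, plated_volume_ml: float) -> float:
    """Estimate CFU/mL of the undiluted sample.

    colonies           -- colonies counted on one plate (hypothetical input)
    dilution_exponent  -- n for a 10^-n dilution (e.g. 6 for 10^-6)
    plated_volume_ml   -- volume of the dilution spread on the plate, in mL
    """
    if colonies < 0 or dilution_exponent < 0 or plated_volume_ml <= 0:
        raise ValueError("inputs must be non-negative, with a positive plated volume")
    # Scale the count back up by the dilution factor and the plated volume.
    return colonies * 10 ** dilution_exponent / plated_volume_ml


# Hypothetical plate: 20 colonies from 0.1 mL of a 10^-6 dilution.
titer = cfu_per_ml(colonies=20, dilution_exponent=6, plated_volume_ml=0.1)
print(f"{titer:.2e} CFU/mL")
```

Counts are normally taken from plates in a countable range and averaged over replicates; the sketch handles a single plate only.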

The 16S rRNA gene was amplified with primers 27F (5′-AGAGTTTGATCCTGGCTCAG-3′) and 1492R (5′-ACGGCTACCTTGTTACGACTT-3′), and sequence similarity was calculated using DNAMAN 8.0 software, revealing 99.52% identity with B. spizenii (NR_112686.1) (5). Additionally, a BLAST search against the NCBI nt database confirmed that the closest matching sequence was B. spizenii (NR_112686.1).
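A percent-identity value of this kind can be illustrated with a short calculation over a pairwise alignment. The sketch below is an assumption-laden illustration, not a description of how DNAMAN computes similarity: it takes two already-aligned sequences (gaps written as "-"), counts identical columns, and divides by the number of aligned columns. The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns; columns that are gaps in both are skipped."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # column carries no information
        columns += 1
        if a == b:
            matches += 1  # a gap paired with a base never reaches here as a match
    if columns == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / columns


# Toy 20-nt alignment with a single mismatch (illustrative only, not study data).
query = "AGAGTTTGATCCTGGCTCAG"
reference = "AGAGTTTGATCATGGCTCAG"
print(percent_identity(query, reference))
```

Different tools treat gaps and alignment ends differently, so identity values computed this way may not match those reported by DNAMAN or BLAST for the same pair of sequences.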

Genomic DNA of B. spizenii SHT-15 was extracted using the Ezup Column Bacterial Genomic DNA Extraction Kit. Bioinformatics analysis was performed on sequence data generated on the Illumina platform. Read quality was checked with FastQC v0.11.7 (6), and reads were trimmed with Trimmomatic v0.39 (7). The clean short reads were assembled using SOAPdenovo v2.04 (8), and the assembly was polished with Pilon v1.22 (9). Assembly quality was evaluated with QUAST v5.0.2 (10), and completeness and contamination were assessed with CheckM v1.1.6 (11). The assembled genome was annotated using the Rapid Annotations using Subsystems Technology (RAST) pipeline (12) and NCBI PGAP v6.5 (13) and was classified against the Genome Taxonomy Database using GTDB-Tk v1.7.0 (14).

The genome of B. spizenii SHT-15, assembled from 279,920 reads, consisted of a 4,081,549-bp chromosome with 3,920 protein-coding sequences, 30 rRNA genes, 86 tRNA genes, and a G+C content of 44.69%. Among 4,034 CDS, 767 are associated with transporters, 1,132 encode transmembrane proteins, and 421 relate to virulence, potentially explaining the strain's inhibitory effect on cotton Verticillium wilt (Fig. 1). Future studies will investigate the mechanism by which B. spizenii SHT-15 inhibits cotton Verticillium wilt.
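The G+C content reported here is the percentage of G and C among the called bases of the assembly. A minimal sketch of that calculation, assuming the assembly is available as FASTA text, is shown below; the function names and the toy record are hypothetical, and ambiguous bases such as N are excluded from the denominator by assumption.

```python
def read_fasta_sequences(text: str) -> dict[str, str]:
    """Parse FASTA-formatted text into {record_id: sequence}."""
    records: dict[str, list[str]] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:].split()
            name = header[0] if header else "unnamed"
            records[name] = []
        elif name is not None:
            records[name].append(line.upper())
    return {key: "".join(parts) for key, parts in records.items()}


def gc_percent(sequence: str) -> float:
    """G+C as a percentage of unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    counts = {base: seq.count(base) for base in "ACGT"}
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return 100.0 * (counts["G"] + counts["C"]) / total


# Toy record (not study data): 8 unambiguous bases, 4 of them G or C.
toy_fasta = ">toy_contig example\nATGCGC\nNNAT\n"
for record_id, seq in read_fasta_sequences(toy_fasta).items():
    print(record_id, len(seq), round(gc_percent(seq), 2))
```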

Fig. 1. Genome-wide map of B. spizenii SHT-15. The outermost circle marks genome size. The second and third circles show CDS on the forward and reverse strands, with colors representing the COG functional categories of the CDS. The fourth circle shows rRNA and tRNA genes. The fifth circle shows GC content: outward red peaks mark regions where the GC content is higher than the genome-wide average, inward blue peaks mark regions where it is lower, and the taller the peak, the greater the deviation from the average. The innermost circle shows the GC skew, calculated as (G − C)/(G + C), which helps distinguish the leading strand from the lagging strand; in general, GC skew is > 0 on the leading strand and < 0 on the lagging strand.
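The GC-skew track in the innermost circle follows directly from the formula (G − C)/(G + C) applied to successive windows along the chromosome. A minimal sketch is given below; the window and step sizes used to draw the figure are not stated in the text, so the values here, like the function names and the toy sequence, are assumptions for illustration.

```python
def gc_skew(sequence: str) -> float:
    """GC skew of one sequence window: (G - C) / (G + C); 0.0 if the window has no G or C."""
    seq = sequence.upper()
    g = seq.count("G")
    c = seq.count("C")
    return 0.0 if g + c == 0 else (g - c) / (g + c)


def windowed_gc_skew(sequence: str, window: int, step: int) -> list[tuple[int, float]]:
    """Return (window_start, skew) for each full window along the sequence."""
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    return [
        (start, gc_skew(sequence[start:start + window]))
        for start in range(0, len(sequence) - window + 1, step)
    ]


# Toy sequence (not study data): G-rich, then C-rich, then mixed.
toy = "GGGGCCCCGGGC"
for start, skew in windowed_gc_skew(toy, window=4, step=4):
    print(start, skew)
```

On a real chromosome, the points where the sign of the skew changes are commonly used as rough indicators of the replication origin and terminus.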

ACKNOWLEDGMENTS

This work was supported by the Xinjiang Major Science and Technology Projects (Grant No. 2022A02005-3) and the Key R&D Project of Xinjiang Uygur Autonomous Region, China (Grant No. 2022B02053-2) awarded to Yingwu Shi, as well as the Project of Fund for Stable Support to Agricultural Sci-Tech Renovation in Xinjiang (Grant Nos. xjnkywdzc2025003-06-02-01 and xjnkywdzc-2024003-66) awarded to Nuraliya Ablimit and Yue Sheng, respectively.

Zhichao Meng, Formal analysis, Investigation, Software, Writing – original draft | Xinxiang Niu, Data curation, Formal analysis, Investigation, Methodology, Writing – review and editing | Hongmei Yang, Data curation, Formal analysis, Investigation, Writing – review and editing | Min Chu, Investigation, Writing – review and editing | Ning Wang, Investigation, Writing – review and editing | Huifang Bao, Investigation, Writing – review and editing | Faqiang Zhan, Investigation, Writing – review and editing | Rong Yang, Investigation, Writing – review and editing | Kai Lou, Investigation, Writing – review and editing | Shuang Dou, Investigation, Writing – review and editing | Zhao Zhang, Investigation, Writing – review and editing | Yun Chen, Investigation, Writing – review and editing | Yingwu Shi, Conceptualization, Data curation, Formal analysis, Funding acquisition, Project administration, Resources, Supervision, Writing – review and editing.

Contributor Information

Yingwu Shi, Email: syw1973@126.com.

Vanja Klepac-Ceraj, Wellesley College, Wellesley, Massachusetts, USA.

DATA AVAILABILITY

The whole-genome shotgun project for B. spizenii SHT-15 has been deposited at DDBJ/ENA/GenBank under the accession CP167793, and the version described in this paper is version CP167793. The raw reads are available under the BioProject accession number PRJNA1149500, and the BioSample accession number is SAMN43249788. The sequence data obtained in this work have been deposited in the NCBI Sequence Read Archive under the accession number SRR33847918. Additionally, the assembled genome sequence is available in GenBank under accession number GCA_041501465.1.

REFERENCES

1. Kunst F, Ogas E, Cano-Prieto C, Bartolini M. 2021. The surfactin-like lipopeptides from Bacillus spp.: natural biodiversity and synthetic biology for a broader application range. Front Bioeng Biotechnol 9. doi: 10.3389/fbioe.2021.623701
2. Zhao H, Shao D, Jiang C, Shi J, Li Q, Huang Q, Rajoka MSR, Yang H, Jin M. 2017. Biological activity of lipopeptides from Bacillus. Appl Microbiol Biotechnol 101:5951–5960. doi: 10.1007/s00253-017-8396-0
3. Chen Q, Gao J, Yang X, Qiu Y, Wang Y, Wang H. 2023. Synergistic effects of Bacillus velezensis SDTB038 and phenamacril on Fusarium crown and root rot of tomato. Plant Pathol 72:1453–1462. doi: 10.1111/ppa.13769
4. Feng ZZ, Chen TC, Duan JN, Chen DX, Cheng JL, An DR. 2012. Screening, identification and antifungal activity of antagonistic rhizospheric Bacillus FB-16 against tobacco black shank. Acta Phytophylacica Sinica 39:224–230. doi: 10.13802/j.cnki.zwbhxb.2012.03.005
5. Johnson JS, Spakowicz DJ, Hong B-Y, Petersen LM, Demkowicz P, Chen L, Leopold SR, Hanson BM, Agresta HO, Gerstein M, Sodergren E, Weinstock GM. 2019. Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis. Nat Commun 10:5029. doi: 10.1038/s41467-019-13036-1
6. Andrews S. 2010. FastQC: a quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
7. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170
8. Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, et al. 2012. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1:18. doi: 10.1186/2047-217X-1-18
9. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963
10. Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086
11. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114
12. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, et al. 2008. The RAST Server: rapid annotations using subsystems technology. BMC Genomics 9:75. doi: 10.1186/1471-2164-9-75
13. Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569
14. Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH, Hancock J. 2019. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36:1925–1927. doi: 10.1093/bioinformatics/btz848

