Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2020 May 14;9(20):e00352-20. doi: 10.1128/MRA.00352-20

Complete Genome Sequence of Bacillus velezensis Strain S4, Isolated from Biochar-Treated Soil

Priscilla P Hempel a, Mengyin Yao b, Sarah Yannarell c,*, Olga Shevchenko d, Franziska Vogt e,*, Nicole Donofrio e, Julia A Maresca b,
Editor: David A Baltrusf
PMCID: PMC7225544  PMID: 32409545

Here, we report the complete genome sequence of Bacillus velezensis strain S4, which was isolated from biochar-amended agricultural soil collected in Smyrna, Delaware. The genome is 4.07 Mbp, encodes 3,918 predicted proteins, and has a GC content of 46.4%.

ABSTRACT

Here, we report the complete genome sequence of Bacillus velezensis strain S4, which was isolated from biochar-amended agricultural soil collected in Smyrna, Delaware. The genome is 4.07 Mbp, encodes 3,918 predicted proteins, and has a GC content of 46.4%.

ANNOUNCEMENT

While investigating biochar-amended soils, we isolated a Bacillus strain. Since soil Bacillus spp. often produce antibiotics (1), we report the genome sequence of Bacillus velezensis strain S4 and its potential to synthesize antimicrobial compounds.

Biochar-amended agricultural soil (∼0.1 g) was collected in 2011 in Smyrna, Delaware, diluted 1:10−10 in sterile water, spread onto Luria-Bertani (LB) agar, and incubated at 28°C until colonies appeared. Individual colonies were restreaked onto LB agar until they were axenic, as determined by microscopy. Isolates were identified by Sanger sequencing of 16S amplicons (primers 8F and 1492R [2]).

Strain S4 was revived on LB agar from a −80°C stock (10% glycerol). One colony was restreaked onto LB agar and grown overnight at 28°C, and then one colony was transferred to 50 ml LB medium and grown overnight. DNA was extracted from 3.5 ml of this culture (Monarch DNA extraction kit, NEB product number T3010).

Single-molecule real-time (SMRT) libraries were prepared using the standard PacBio protocol for 20-kb libraries. DNA fragments larger than 6 kb were size selected using BluePippin (Sage Science). The average fragment size of the library was 12 kb, as measured with a fragment analyzer (Advanced Analytical Technologies, Inc.). Sequencing was completed on a PacBio RS II sequencer in one SMRT cell using P6-C4 chemistry with a 6-h movie.

This sequencing resulted in 150,292 reads (average length, 13,783 bp [range, 0 to 74,529 bp]). Reads were filtered by quality of ≥0.8 and length of ≥1 kb using the PreAssembler Filter v1 protocol within the SMRT Analysis v2.3.0 software (Pacific Biosciences) (3); 90,373 reads passed filtering (average length, 19,683 bp [range, 1,000 to 74,592 bp]), and 84,594 reads were assembled using HGAP3 (4), with 15-kb seed read fragment lengths. This assembly resulted in one contig (4,075,299 bp), which was plotted against itself using Gepard v1.4 (5) (zoom, 5,781; word length, 10; window size, 0; use matrix, DNA). The region of overlap was identified (zoom, 7; word length, 10; window size, 0; use matrix, DNA) and trimmed using custom scripts (https://github.com/MarescaLab/contig_to_linear_genome), producing a 4,065,174-bp circular chromosome with 315× coverage and a GC content of 46.4%. There is a 1-nucleotide discrepancy in the region of overlap at position 1318741. We included the extra nucleotide in the assembly because the sequence with the insertion has greater coverage (305×, compared to <30×), and the top BLASTn hit for the predicted gene (BVELS4_01369) has the additional nucleotide.

Prokka v1.11 (6) and barrnap v0.7 (https://github.com/tseemann/barrnap) were used to predict open reading frames using default parameters. This genome contains 9 rRNA operons, 86 tRNA genes, 1 transfer mRNA gene, and 3,918 predicted protein-coding genes.

Based on its 16S gene sequence, isolate S4 was initially identified as a strain of Bacillus amyloliquefaciens subsp. plantarum, which has been reclassified as B. velezensis subsp. plantarum (7, 8). Pairwise average nucleotide identity based on BLAST (ANIb) values comparing isolate S4 to 10 other Bacillus spp. (7) were calculated using PyANI v0.2.9 (9) with default parameters in anvi’o v6.1 (10). The genome of isolate S4 is most similar to that of B. velezensis subsp. plantarum FZB42 (GenBank accession number CP000560.1) (ANIb, 0.9106) and quite distant from that of Bacillus pumilus 150a (CP027034.1) (ANIb, 0.2497).

AntiSMASH v5.1.0 (11) (bacterial version, program defaults) identified nine gene clusters with >90% similarity to known antibiotic gene clusters. The predicted products include lipopeptides, polyketides, lanthipeptides, and non-ribosomally synthesized peptides.

Data availability.

The raw reads have been deposited in the Sequence Read Archive under accession number PRJNA612570. The assembly is available at NCBI under the accession number CP050424.

ACKNOWLEDGMENTS

This research was funded by the Delaware Environmental Institute and a Unique Strengths Core Facility fee waiver grant from the College of Agriculture and Natural Resources at the University of Delaware.

We gratefully acknowledge Mingxin Guo at Delaware State University for providing the soil sample and the University of Delaware Sequencing and Genotyping Center for the library preparation and sequencing.

REFERENCES

  • 1.Kaspar F, Neubauer P, Gimpel M. 2019. Bioactive secondary metabolites from Bacillus subtilis: a comprehensive review. J Nat Prod 82:2038–2053. doi: 10.1021/acs.jnatprod.9b00110. [DOI] [PubMed] [Google Scholar]
  • 2.Reysenbach AL, Wickham GS, Pace NR. 1994. Phylogenetic analysis of the hyperthermophilic pink filament community in Octopus Spring, Yellowstone National Park. Appl Environ Microbiol 60:2113–2119. doi: 10.1128/AEM.60.6.2113-2119.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Pacific Biosciences. 2017. SMRT Link user guide. Pacific Biosciences of California, Menlo Park, CA: https://www.pacb.com/wp-content/uploads/SMRT-Link-User-Guide-v8.0.pdf. [Google Scholar]
  • 4.Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, Clum A, Copeland A, Huddleston J, Eichler EE, Turner SW, Korlach J. 2013. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods 10:563–569. doi: 10.1038/nmeth.2474. [DOI] [PubMed] [Google Scholar]
  • 5.Krumsiek J, Arnold R, Rattei T. 2007. Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics 23:1026–1028. doi: 10.1093/bioinformatics/btm039. [DOI] [PubMed] [Google Scholar]
  • 6.Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
  • 7.Rabbee M, Ali M, Choi J, Hwang B, Jeong S, Baek K. 2019. Bacillus velezensis: a valuable member of bioactive molecules within plant microbiomes. Molecules 24:1046. doi: 10.3390/molecules24061046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Dunlap CA, Kim S-J, Kwon S-W, Rooney AP. 2016. Bacillus velezensis is not a later heterotypic synonym of Bacillus amyloliquefaciens; Bacillus methylotrophicus, Bacillus amyloliquefaciens subsp. plantarum and “Bacillus oryzicola” are later heterotypic synonyms of Bacillus velezensis based on phylogenomics. Int J Syst Evol Microbiol 66:1212–1217. doi: 10.1099/ijsem.0.000858. [DOI] [PubMed] [Google Scholar]
  • 9.Pritchard L, Glover RH, Humphris S, Elphinstone JG, Toth IK. 2016. Genomics and taxonomy in diagnostics for food security: soft-rotting enterobacterial plant pathogens. Anal Methods 8:12–24. doi: 10.1039/C5AY02550H. [DOI] [Google Scholar]
  • 10.Eren AM, Esen OC, Quince C, Vineis JH, Morrison HG, Sogin ML, Delmont TO. 2015. Anvi’o: an advanced analysis and visualization platform for ‘omics data. PeerJ 3:e1319. doi: 10.7717/peerj.1319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Blin K, Shaw S, Steinke K, Villebro R, Ziemert N, Lee SY, Medema MH, Weber T. 2019. antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res 47:W81–W87. doi: 10.1093/nar/gkz310. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The raw reads have been deposited in the Sequence Read Archive under accession number PRJNA612570. The assembly is available at NCBI under the accession number CP050424.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES