Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2022 May 11;11(6):e00204-22. doi: 10.1128/mra.00204-22

Complete Genome Sequences of Four Parageobacillus Strains Isolated from Soil in Japan

Kentaro Miyazaki a,b,c,, Kae Hosoya a,d
Editor: Irene L G Newtone
PMCID: PMC9202437  PMID: 35543511

ABSTRACT

We isolated four Parageobacillus strains from soil in Japan and completely sequenced their genomes. Three of four strains showed ≥98.9% average nucleotide identity (ANI) to Parageobacillus caldoxylosilyticus S1812T, while one strain, designated KH3-4, showed the highest ANI (91%) to Parageobacillus thermantarcticus M1T, suggesting the species novelty of KH3-4.

ANNOUNCEMENT

Parageobacillus is a genus of betaproteobacteria in the family Burkholderiaceae that is Gram-positive and a facultatively anaerobic thermophile. Parageobacillus species have great biotechnological potential (1), for example, as a source for thermophilic enzymes (2), fuel production (3, 4), and the bioremediation of environmental pollutants (5). At the time of writing, there are six validly named species in the genus Parageobacillus (https://lpsn.dsmz.de/genus/parageobacillus). So far, seven complete genome sequences have been reported for Parageobacillus, including for Parageobacillus caldoxylosilyticus (1 strain), Parageobacillus thermoglucosidasius (4 strains), and Parageobacillus toebii (2 strains).

We collected soil samples from the city of Tsukuba, Japan. The samples were suspended in distilled water and spread over Lennox LB agar (1.6% [wt/vol]) plates. After incubation at 65°C overnight, dozens of well-separated single colonies were isolated; colony PCR was conducted to analyze the 16S rRNA genes using a set of primers, Bac8f(C) and UN1542r (6). Among the colonies, four strains, designated KH1-5, KH1-6, KH3-4, and KH3-5, which were expected to belong to the genus Parageobacillus, were subjected to complete genome analysis.

To prepare the genomic DNA, cells were grown in 5 mL LB broth at 65°C for 24 h with vigorous shaking (200 rpm). The genomic DNA was purified using a blood and cell culture DNA mini kit (Qiagen). For long-read sequencing, unsheared genomic DNA (1 μg) was treated using a short-read eliminator kit (Circulomics) to remove fragments of <10 Kbp, and a library was constructed using a ligation sequencing kit (Oxford Nanopore Technologies [ONT]). Sequencing was performed using a GridION X5 system on a FLO-MIN106 R9.41 revD flow cell (ONT). Base calling was conducted using Guppy v.4.0.11. The raw sequencing data (Table 1) were filtered (Q < 10; length, <1,000 bases) using NanoFilt v.2.7.1 (7). For short-read sequencing, a library was constructed using an MGIEasy FS PCR free DNA library prep set (MGI) with a ~400 to 500-bp insert. Paired-end sequencing (2 × 150 bases) was then performed on a DNBSEQ-400 instrument (MGI). The raw sequencing data (Table 1) were filtered (Q < 30; length, <20 bases) using fastp v.0.20.1 (8). The trimmed long- and short-read data were assembled using Unicycler v.0.4.8 (9), and the assembly was polished using Pilon v.1.24 (10). Each strain contained a single circular chromosome, and KH3-5 contained one circular plasmid; the circularity was confirmed using Unicycler.

TABLE 1.

Sequencing metrics for the four Parageobacillus strains in this study

Strain BioSample accession no. Chromosome or plasmid DNBSEQ (short-read) data
GridION (long-read) data
Length (bp) GC content (%) GenBank accession no.
No. of paired-end reads Total length (Mb) SRA accession no. No. of reads N50 (bp) Total length (Mb) SRA accession no.
P. caldoxylosilyticus KH1-5 SAMD00442691 Chromosome 7,592,538 1,139 DRR346603 136,964 5,757 552 DRR346607 3,850,765 44.3 AP025623
P. caldoxylosilyticus KH1-6 SAMD00442692 Chromosome 7,161,769 1,074 DRR346604 179,515 9,837 1,244 DRR346608 3,850,773 44.3 AP025624
Parageobacillus sp. KH3-4 SAMD00442693 Chromosome 9,226,524 1,384 DRR346605 981,529 4,531 3,090 DRR346609 3,816,932 43.0 AP025627
P. caldoxylosilyticus KH3-5 SAMD00442694 Chromosome 7,158,442 1,074 DRR346606 1,107,180 4,644 3,572 DRR346610 3,832,285 44.2 AP025625
Plasmid (pPcaKH3-5b) 6,889 51.7 AP025626

Automatic annotation was conducted using DFAST v.1.2.15 (11); the genomic features are summarized in Table 1. A JSpecies analysis (12) revealed that KH1-5, KH1-6, and KH3-5 showed ≥98.9% average nucleotide identity (ANI) to each other and to the type strain of P. caldoxylosilyticus (strain S1812; GenBank accession number GCF_019272935.1), while KH3-4 showed the highest ANI (91.9%) to the type strain of P. thermantarcticus (strain M1; GCF_900111865.1), suggesting the species novelty of KH3-4 (95% ANI being the cutoff for the delineation of a species). For all software, default parameters were used.

Data availability.

All four Parageobacillus strains reported in this paper are associated with BioProject accession number PRJDB12551. The BioSample accession numbers, genome sequences, and raw sequencing data are available under the accession numbers listed in Table 1.

ACKNOWLEDGMENTS

This work was partly supported by the following grants awarded to K.M. from the Japan Society for the Promotion of Science (JSPS): a Grant-in-Aid for Scientific Research (A) (19H00936) and a Grant-in-Aid for Challenging Research (Pioneering) (19H05538). Part of this study was conducted under the Summer Internship Program of Nihon University.

Contributor Information

Kentaro Miyazaki, Email: miyazaki@icb.osaka-u.ac.jp.

Irene L. G. Newton, Indiana University, Bloomington

REFERENCES

  • 1.Hussein AH, Lisowska BK, Leak DJ. 2015. The genus Geobacillus and their biotechnological potential. Adv Appl Microbiol 92:1–48. doi: 10.1016/bs.aambs.2015.03.001. [DOI] [PubMed] [Google Scholar]
  • 2.De Maayer P, Brumm PJ, Mead DA, Cowan DA. 2014. Comparative analysis of the Geobacillus hemicellulose utilization locus reveals a highly variable target for improved hemicellulolysis. BMC Genomics 15:836. doi: 10.1186/1471-2164-15-836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Cripps RE, Eley K, Leak DJ, Rudd B, Taylor M, Todd M, Boakes S, Martin S, Atkinson T. 2009. Metabolic engineering of Geobacillus thermoglucosidasius for high yield ethanol production. Metab Eng 11:398–408. doi: 10.1016/j.ymben.2009.08.005. [DOI] [PubMed] [Google Scholar]
  • 4.Zhou J, Wu K, Rao CV. 2016. Evolutionary engineering of Geobacillus thermoglucosidasius for improved ethanol production. Biotechnol Bioeng 113:2156–2167. doi: 10.1002/bit.25983. [DOI] [PubMed] [Google Scholar]
  • 5.Moxley E, Puerta-Fernández E, Gómez EJ, Gonzalez JM. 2019. Influence of abiotic factors temperature and water content on bacterial 2-chlorophenol biodegradation in soils. Front Environ Sci 7:41. doi: 10.3389/fenvs.2019.00041. [DOI] [Google Scholar]
  • 6.Miyazaki K, Sato M, Tsukuda M. 2017. PCR primer design for 16S rRNAs for experimental horizontal gene transfer test in Escherichia coli. Front Bioeng Biotechnol 5:14. doi: 10.3389/fbioe.2017.00014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.De Coster W, D'Hert S, Schultz DT, Cruts M, Van Broeckhoven C. 2018. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics 34:2666–2669. doi: 10.1093/bioinformatics/bty149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Chen S, Zhou Y, Chen Y, Gu J. 2018. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Tanizawa Y, Fujisawa T, Kaminuma E, Nakamura Y, Arita M. 2016. DFAST and DAGA: Web-based integrated genome annotation tools and resources. Biosci Microbiota Food Health 35:173–184. doi: 10.12938/bmfh.16-003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Richter M, Rosselló-Móra R, Oliver Glöckner F, Peplies J. 2016. JSpeciesWS: a Web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics 32:929–931. doi: 10.1093/bioinformatics/btv681. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All four Parageobacillus strains reported in this paper are associated with BioProject accession number PRJDB12551. The BioSample accession numbers, genome sequences, and raw sequencing data are available under the accession numbers listed in Table 1.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES