ABSTRACT
We isolated four Parageobacillus strains from soil in Japan and completely sequenced their genomes. Three of four strains showed ≥98.9% average nucleotide identity (ANI) to Parageobacillus caldoxylosilyticus S1812T, while one strain, designated KH3-4, showed the highest ANI (91%) to Parageobacillus thermantarcticus M1T, suggesting the species novelty of KH3-4.
ANNOUNCEMENT
Parageobacillus is a genus of betaproteobacteria in the family Burkholderiaceae that is Gram-positive and a facultatively anaerobic thermophile. Parageobacillus species have great biotechnological potential (1), for example, as a source for thermophilic enzymes (2), fuel production (3, 4), and the bioremediation of environmental pollutants (5). At the time of writing, there are six validly named species in the genus Parageobacillus (https://lpsn.dsmz.de/genus/parageobacillus). So far, seven complete genome sequences have been reported for Parageobacillus, including for Parageobacillus caldoxylosilyticus (1 strain), Parageobacillus thermoglucosidasius (4 strains), and Parageobacillus toebii (2 strains).
We collected soil samples from the city of Tsukuba, Japan. The samples were suspended in distilled water and spread over Lennox LB agar (1.6% [wt/vol]) plates. After incubation at 65°C overnight, dozens of well-separated single colonies were isolated; colony PCR was conducted to analyze the 16S rRNA genes using a set of primers, Bac8f(C) and UN1542r (6). Among the colonies, four strains, designated KH1-5, KH1-6, KH3-4, and KH3-5, which were expected to belong to the genus Parageobacillus, were subjected to complete genome analysis.
To prepare the genomic DNA, cells were grown in 5 mL LB broth at 65°C for 24 h with vigorous shaking (200 rpm). The genomic DNA was purified using a blood and cell culture DNA mini kit (Qiagen). For long-read sequencing, unsheared genomic DNA (1 μg) was treated using a short-read eliminator kit (Circulomics) to remove fragments of <10 Kbp, and a library was constructed using a ligation sequencing kit (Oxford Nanopore Technologies [ONT]). Sequencing was performed using a GridION X5 system on a FLO-MIN106 R9.41 revD flow cell (ONT). Base calling was conducted using Guppy v.4.0.11. The raw sequencing data (Table 1) were filtered (Q < 10; length, <1,000 bases) using NanoFilt v.2.7.1 (7). For short-read sequencing, a library was constructed using an MGIEasy FS PCR free DNA library prep set (MGI) with a ~400 to 500-bp insert. Paired-end sequencing (2 × 150 bases) was then performed on a DNBSEQ-400 instrument (MGI). The raw sequencing data (Table 1) were filtered (Q < 30; length, <20 bases) using fastp v.0.20.1 (8). The trimmed long- and short-read data were assembled using Unicycler v.0.4.8 (9), and the assembly was polished using Pilon v.1.24 (10). Each strain contained a single circular chromosome, and KH3-5 contained one circular plasmid; the circularity was confirmed using Unicycler.
TABLE 1.
Sequencing metrics for the four Parageobacillus strains in this study
| Strain | BioSample accession no. | Chromosome or plasmid | DNBSEQ (short-read) data |
GridION (long-read) data |
Length (bp) | GC content (%) | GenBank accession no. | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| No. of paired-end reads | Total length (Mb) | SRA accession no. | No. of reads | N50 (bp) | Total length (Mb) | SRA accession no. | ||||||
| P. caldoxylosilyticus KH1-5 | SAMD00442691 | Chromosome | 7,592,538 | 1,139 | DRR346603 | 136,964 | 5,757 | 552 | DRR346607 | 3,850,765 | 44.3 | AP025623 |
| P. caldoxylosilyticus KH1-6 | SAMD00442692 | Chromosome | 7,161,769 | 1,074 | DRR346604 | 179,515 | 9,837 | 1,244 | DRR346608 | 3,850,773 | 44.3 | AP025624 |
| Parageobacillus sp. KH3-4 | SAMD00442693 | Chromosome | 9,226,524 | 1,384 | DRR346605 | 981,529 | 4,531 | 3,090 | DRR346609 | 3,816,932 | 43.0 | AP025627 |
| P. caldoxylosilyticus KH3-5 | SAMD00442694 | Chromosome | 7,158,442 | 1,074 | DRR346606 | 1,107,180 | 4,644 | 3,572 | DRR346610 | 3,832,285 | 44.2 | AP025625 |
| Plasmid (pPcaKH3-5b) | 6,889 | 51.7 | AP025626 | |||||||||
Automatic annotation was conducted using DFAST v.1.2.15 (11); the genomic features are summarized in Table 1. A JSpecies analysis (12) revealed that KH1-5, KH1-6, and KH3-5 showed ≥98.9% average nucleotide identity (ANI) to each other and to the type strain of P. caldoxylosilyticus (strain S1812; GenBank accession number GCF_019272935.1), while KH3-4 showed the highest ANI (91.9%) to the type strain of P. thermantarcticus (strain M1; GCF_900111865.1), suggesting the species novelty of KH3-4 (95% ANI being the cutoff for the delineation of a species). For all software, default parameters were used.
Data availability.
All four Parageobacillus strains reported in this paper are associated with BioProject accession number PRJDB12551. The BioSample accession numbers, genome sequences, and raw sequencing data are available under the accession numbers listed in Table 1.
ACKNOWLEDGMENTS
This work was partly supported by the following grants awarded to K.M. from the Japan Society for the Promotion of Science (JSPS): a Grant-in-Aid for Scientific Research (A) (19H00936) and a Grant-in-Aid for Challenging Research (Pioneering) (19H05538). Part of this study was conducted under the Summer Internship Program of Nihon University.
Contributor Information
Kentaro Miyazaki, Email: miyazaki@icb.osaka-u.ac.jp.
Irene L. G. Newton, Indiana University, Bloomington
REFERENCES
- 1.Hussein AH, Lisowska BK, Leak DJ. 2015. The genus Geobacillus and their biotechnological potential. Adv Appl Microbiol 92:1–48. doi: 10.1016/bs.aambs.2015.03.001. [DOI] [PubMed] [Google Scholar]
- 2.De Maayer P, Brumm PJ, Mead DA, Cowan DA. 2014. Comparative analysis of the Geobacillus hemicellulose utilization locus reveals a highly variable target for improved hemicellulolysis. BMC Genomics 15:836. doi: 10.1186/1471-2164-15-836. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Cripps RE, Eley K, Leak DJ, Rudd B, Taylor M, Todd M, Boakes S, Martin S, Atkinson T. 2009. Metabolic engineering of Geobacillus thermoglucosidasius for high yield ethanol production. Metab Eng 11:398–408. doi: 10.1016/j.ymben.2009.08.005. [DOI] [PubMed] [Google Scholar]
- 4.Zhou J, Wu K, Rao CV. 2016. Evolutionary engineering of Geobacillus thermoglucosidasius for improved ethanol production. Biotechnol Bioeng 113:2156–2167. doi: 10.1002/bit.25983. [DOI] [PubMed] [Google Scholar]
- 5.Moxley E, Puerta-Fernández E, Gómez EJ, Gonzalez JM. 2019. Influence of abiotic factors temperature and water content on bacterial 2-chlorophenol biodegradation in soils. Front Environ Sci 7:41. doi: 10.3389/fenvs.2019.00041. [DOI] [Google Scholar]
- 6.Miyazaki K, Sato M, Tsukuda M. 2017. PCR primer design for 16S rRNAs for experimental horizontal gene transfer test in Escherichia coli. Front Bioeng Biotechnol 5:14. doi: 10.3389/fbioe.2017.00014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.De Coster W, D'Hert S, Schultz DT, Cruts M, Van Broeckhoven C. 2018. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics 34:2666–2669. doi: 10.1093/bioinformatics/bty149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chen S, Zhou Y, Chen Y, Gu J. 2018. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Tanizawa Y, Fujisawa T, Kaminuma E, Nakamura Y, Arita M. 2016. DFAST and DAGA: Web-based integrated genome annotation tools and resources. Biosci Microbiota Food Health 35:173–184. doi: 10.12938/bmfh.16-003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Richter M, Rosselló-Móra R, Oliver Glöckner F, Peplies J. 2016. JSpeciesWS: a Web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics 32:929–931. doi: 10.1093/bioinformatics/btv681. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
All four Parageobacillus strains reported in this paper are associated with BioProject accession number PRJDB12551. The BioSample accession numbers, genome sequences, and raw sequencing data are available under the accession numbers listed in Table 1.
