ABSTRACT
We present the complete genome of Opitutales bacterium ASA1, isolated from soil. The genome is 5,821,695 bp with 4,638 protein-coding sequences. The genome data suggest that this strain belongs to the class Opitutae of the phylum Verrucomicrobiota, and its genome has six unique biosynthetic gene clusters associated with secondary metabolites.
KEYWORDS: bacteria, genome, Verrucomicrobiota
ANNOUNCEMENT
The class Opitutae of the phylum Verrucomicrobiota consists of physiologically and ecologically diverse members, including some soil isolates with ultra-small cell-size characteristics (1–3). Here, we present the complete genome sequence of Opitutales strain ASA1.
The microbial sample was collected 5–15 cm below the surface of garden soil at the AIST Hokkaido Center (43.0201 N 141.4182 E). Soil samples were suspended in sterilized saline (0.9% NaCl) and diluted in a 10-fold series. Aliquots (100 µL) from each dilution were inoculated onto 100-fold diluted PYG (containing peptone, yeast extract, and glucose) medium prepared with autoclaving phosphate and agar separately (pH 8.5) (4) and incubated at 25°C in the dark. After 14 days of incubation, small colonies were picked up and transferred to fresh agar plates for pure isolation. The 16S rRNA gene of the resulting strain ASA1 was PCR-amplified using two universal primers, 27F and 1492R, and its partial sequencing was conducted with the universal primer 907R.
For genome sequencing, strain ASA1 was cultured (pH 8.3) in R2A broth (DAIGO; Nihon Pharmaceutical, Tokyo) at 30°C for 2 weeks. DNA extraction was carried out using a phenol-chloroform method (5). After DNA shearing to approximately 10- to 20-kbp fragments using g-TUBE (Covaris), a sequencing library was prepared using a SMRTbell gDNA Sample Amplification Kit (PacBio) and SMRTbell Express Template Prep Kit v.2.0 (PacBio). The library was sequenced with Sequel IIe (PacBio), and then HiFi reads were obtained using SMRT Link v.12.0.0.177059. Note that the default settings were used for all software for subsequent data processes unless otherwise specified. PCR adapters, PCR duplicates, and ≤1,000-bp short reads were removed with lima verson 2.7.1, pbmarkdup v.1.0.3, and Filtlong v.0.2.1 (https://github.com/rrwick/Filtlong, with the options “--min_length 1000” and “--keep_percent 90”), respectively. Finally, 45,742 qualified HiFi reads with an average length of 6,486 bp were obtained (the total sequence length was 296,678,282 bp). The obtained reads were assembled with Flye v.2.9.1-b1780 using the “--pacbio-hifi” mode and the option “-i 3” (6), and then the assembly graphs for assessing completeness was visualized using Bandage v.0.8.1 (7). The assembled circular genome was annotated with DFAST v.1.2.11 (8). To identify the full-length 16S rRNA gene sequence, we conducted a BLASTn search against the National Center for Biotechnology Information nt/nr database (accessed on 12 September 2023). We also calculated the average nucleotide identity of strain ASA1 in comparison to the type strain database using DFAST. Additionally, we employed the antiSMASH v.7.0 online tool to predict biosynthetic gene clusters (9).
The genome consists of a single chromosome of 5,821,695 bp with a 65.6% G + C content. The chromosome contains a total of 4,690 genes, including 4,638 protein-coding genes, 3 rRNA genes, 48 tRNA genes, and 1 CRISPR gene. Strain ASA1 is affiliated with the class Opitutae (Opitutia corrig.) of the phylum Verrucomicrobiota. However, it exhibits low 16S rRNA gene sequence identities (<93%) to the closest type strain, Alterococcus agarolyticus (accession no. NR_036763.1), and an undescribed strain, Opitutus sp. strain CFH (KY039339.1). Furthermore, the antiSMASH analysis identified six unique biosynthetic gene clusters in the ASA1 genome (Table 1).
TABLE 1.
Summary of predicted BGCs in the Opitutales bacterium strain ASA1 genome by antiSMASH v.7.0a
| Genomic region | BGC type | Gene position | Most similar known cluster | Most similar type in MIBiG comparison (similarity score, %) | |
|---|---|---|---|---|---|
| From | To | ||||
| Region 1 | NRPS | 3,252,427 | 3,293,703 | NA | NRP + Alkaloid from Pseudomonas fluorescens (0.31) |
| Region 2 | NRPS | 3,572,624 | 3,618,738 | NA | NRP + Alkaloid from Pseudomonas fluorescens (0.31) |
| Region 3 | T3PKS | 3,629,430 | 3,670,473 | NA | Polyketide from Candidatus Entotheonella serta (0.32) |
| Region 4 | Arylpolyene | 5,402,678 | 5,443,805 | NA | Other from Planctomycetales bacterium 10988 (0.33) |
| Region 5 | NRPS-like | 5,540,659 | 5,581,974 | NA | NRP + Saccharide from Streptomyces hygroscopicus (0.34) |
| Region 6 | T1PKS and NRPS | 5,776,738 | 5,821,695 | NA | NRP from Pseudomonas sp. J465 (0.67) |
BGC, biosynthetic gene cluster; NRP, nonribosomal peptide; NRPS, nonribosomal peptide synthase; T3PKS, Type III polyketide synthase.
ACKNOWLEDGMENTS
This study was partially funded by a JSPS KAKENHI Grant-in-Aid for Early-Career Scientists (JP20K15443 to K.I.) and a MEXT KAKENHI Grant-in-Aid for Scientific Research on Innovative Areas “Post-Koch Ecology” (JP19H05679 and JP19H05683 to R.N.). M.S.I was supported by a MEXT scholarship for young talent.
Contributor Information
Ryosuke Nakai, Email: nakai-ryosuke@aist.go.jp.
Kensuke Igarashi, Email: igarashi.kensuke@aist.go.jp.
Julia A. Maresca, SUNY College of Environmental Science and Forestry, Syracuse, New York, United States
DATA AVAILABILITY
The ASA1 genome sequence was deposited in the DDBJ/ENA/GenBank database under accession number AP028972.1 (BioProject/BioSample PRJDB16742/SAMD00647589; DDBJ Sequence Read Archive DRA017385).
REFERENCES
- 1. Janssen PH, Schuhmann A, Mörschel E, Rainey FA. 1997. Novel anaerobic ultramicrobacteria belonging to the Verrucomicrobiales lineage of bacterial descent isolated by dilution culture from anoxic rice paddy soil. Appl Environ Microbiol 63:1382–1388. doi: 10.1128/aem.63.4.1382-1388.1997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Nakai R. 2020. Size matters: ultra-small and filterable microorganisms in the environment. Microbes Environ 35:ME20025. doi: 10.1264/jsme2.ME20025 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Rodrigues JLM, Isanapong J. 2014. The family Opitutaceae, p 751–756. In Rosenberg E, DeLong EF, Lory S, Stackebrandt E, Thompson F (ed), The Prokaryotes. Springer, Berlin Heidelberg, Berlin, Heidelberg. [Google Scholar]
- 4. Kato S, Yamagishi A, Daimon S, Kawasaki K, Tamaki H, Kitagawa W, Abe A, Tanaka M, Sone T, Asano K, Kamagata Y. 2018. Isolation of previously uncultured slow-growing bacteria by using a simple modification in the preparation of agar media. Appl Environ Microbiol 84:e00807-18. doi: 10.1128/AEM.00807-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Sambrook J, Fritsch EF, Maniatis T. 1989. Molecular cloning: a laboratory manual. In Cold spring harbor laboratory press [Google Scholar]
- 6. Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8 [DOI] [PubMed] [Google Scholar]
- 7. Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics 31:3350–3352. doi: 10.1093/bioinformatics/btv383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Tanizawa Y, Fujisawa T, Nakamura Y. 2018. DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication. Bioinformatics 34:1037–1039. doi: 10.1093/bioinformatics/btx713 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Blin K, Shaw S, Augustijn HE, Reitz ZL, Biermann F, Alanjary M, Fetter A, Terlouw BR, Metcalf WW, Helfrich EJN, van Wezel GP, Medema MH, Weber T. 2023. antiSMASH 7.0: new and improved predictions for detection, regulation, chemical structures and visualisation. Nucleic Acids Res 51:W46–W50. doi: 10.1093/nar/gkad344 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The ASA1 genome sequence was deposited in the DDBJ/ENA/GenBank database under accession number AP028972.1 (BioProject/BioSample PRJDB16742/SAMD00647589; DDBJ Sequence Read Archive DRA017385).
