ABSTRACT
Desulfomicrobium sp. strain ZS1 is an obligate anaerobic, sulfate-reducing member of the Desulfobacterota from Zodletone Spring, an anoxic sulfide-rich spring in southwestern Oklahoma. Its complete genome was sequenced using a combination of Illumina and Oxford Nanopore platforms and encodes 3,364 proteins and 81 RNAs on a single chromosome.
ANNOUNCEMENT
The anoxic sediments and air-exposed water of Zodletone Spring (Kiowa County, OK, global positioning system [GPS] coordinates 35.002444 N, 98.688167 W) host some of the most highly diverse microbial communities known, encompassing over 60 phyla and candidate phyla of Bacteria and Archaea (1). The interplay of anaerobic and aerobic sulfur-based metabolism in Zodletone may be reminiscent of physiological adaptations characteristic of microbial life during Archean Earth leading to the great oxygenation event (2). Desulfomicrobium sp. strain ZS1 was isolated from a Zodletone sediment sample (1) as a strict anaerobic, motile chemolithoheterotroph, reducing sulfate with lactate as electron donor. A pure culture was obtained following streaking to single colonies on DSMZ medium 63 at 25°C under 85% N2, 10% CO2, and 5% H2.
For genomic DNA isolation, strain ZS1 was grown in 50 mL liquid DSMZ medium 63 for 5 days at 25°C. All subsequent protocols followed manufacturers’ instructions. DNA was isolated using the Promega Wizard HMW DNA extraction kit. A short insert library was prepared using the Illumina Nextera XT DNA library preparation kit, followed by sequencing (2 × 250-nucleotide reads) on a MiSeq instrument (Illumina, Inc., San Diego, CA), yielding 1 million paired reads. Trimmomatic v0.36 (3) was used for quality-based trimming. Long-read sequencing was performed using the Oxford Nanopore ligation sequencing kit followed by sequencing on a MinION R9.4.1 device (Oxford Nanopore Technologies [ONT], Inc., Cambridge MA), yielding 1.9 Gb with an N50 of 8.2 kbp. All subsequent data analyses were performed using software defaults. Base calling and long-read polishing were performed using ONT’s Guppy and Medaka v1.5, respectively. First-pass assemblies of the long reads were generated using the Trycycler v0.5.0 pipeline (4) with Miniasm/Minpolish v0.3-r179 (5), Flye v2.9 (6), Raven v1.5.3 (7), and wtdbg2 v2.5 (8) assemblers, followed by polishing of the consensus single contig with the Illumina short reads using Polypolish v0.4.3 (9) and POLCA v4.0.5 (10) in tandem. Circularity was confirmed as part of Trycycler pipeline assembly and further verified by back mapping of the Illumina reads. The genome is 3,867,579 bp long, with an average coverage of 479× and a G+C% of 58.9.
To predict and annotate the genes, we used the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v6.1 (11). The genome encodes 3,364 proteins and 81 RNAs, which include three rRNA operons. The gene encoding the chromosomal replication initiator protein DnaA was set as the first gene. Comparative genomic analyses were performed using software implemented in KBase (12) as follows. A phylogenetic tree constructed using SpeciesTree v2.2.0, using a set of 49 core, universal bacterial genes, placed Desulfomicrobium sp. ZS1 closest to Desulfomicrobium baculatum strain XT, the genus type species (NCBI accession number PRJNA29527) (13, 14) (Fig. 1). Whole-genome average nucleotide identity (ANI) between the two genomes, calculated using FastANI v0.1.3 (15), is 94%, suggesting that ZS1 is a species closely related to D. baculatum. Desulfomicrobium sp. ZS1 will facilitate studies on the evolution of microbial sulfur metabolism and adaptation to anoxic and microoxic environments.
FIG 1.
Phylogenetic tree of Desulfomicrobium sp. ZS1 and related Desulfobacterota species, based on 49 core, universal bacterial proteins. Numbers at the nodes indicate support values. The scale bar indicates estimated amino acid substitutions per site.
Data availability.
The annotated genome sequence has been deposited in GenBank under the accession number CP100351. The version described in this article is CP100351.1. The Nanopore and Illumina reads are available in the NCBI Sequence Read Archive (SRA) under the accession numbers SRR21699889 and SRR20017258, respectively.
ACKNOWLEDGMENTS
This research was funded by the National Science Foundation grants 2016371 (to M.P.) and 2016423 (to N.H.Y. and M.S.E.). Oak Ridge National Laboratory is managed by UT-Battelle, LLC, for the U.S. Department of Energy under contract DE-AC05-00OR22725.
Footnotes
For a companion article on this topic, see https://doi.org/10.1128/mBio.00016-22.
Contributor Information
Mircea Podar, Email: podarm@ornl.gov.
Vanja Klepac-Ceraj, Wellesley College.
REFERENCES
- 1.Hahn CR, Farag IF, Murphy CL, Podar M, Elshahed MS, Youssef NH. 2022. Microbial diversity and sulfur cycling in an early earth analogue: from ancient novelty to modern commonality. mBio 13:e0001622. doi: 10.1128/mbio.00016-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Buhring SI, Sievert SM, Jonkers HM, Ertefai T, Elshahed MS, Krumholz LR, Hinrichs KU. 2011. Insights into chemotaxonomic composition and carbon cycling of phototrophic communities in an artesian sulfur-rich spring (Zodletone, Oklahoma, USA), a possible analog for ancient microbial mat systems. Geobiology 9:166–179. doi: 10.1111/j.1472-4669.2010.00268.x. [DOI] [PubMed] [Google Scholar]
- 3.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wick RR, Judd LM, Cerdeira LT, Hawkey J, Meric G, Vezina B, Wyres KL, Holt KE. 2021. Trycycler: consensus long-read assemblies for bacterial genomes. Genome Biol 22:266. doi: 10.1186/s13059-021-02483-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Li H. 2016. Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences. Bioinformatics 32:2103–2110. doi: 10.1093/bioinformatics/btw152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8. [DOI] [PubMed] [Google Scholar]
- 7.Vaser R, Šikić M. 2021. Time- and memory-efficient genome assembly with Raven. Nat Comput Sci 1:332–336. doi: 10.1038/s43588-021-00073-4. [DOI] [PubMed] [Google Scholar]
- 8.Ruan J, Li H. 2020. Fast and accurate long-read assembly with wtdbg2. Nat Methods 17:155–158. doi: 10.1038/s41592-019-0669-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wick RR, Holt KE. 2022. Polypolish: short-read polishing of long-read bacterial genome assemblies. PLoS Comput Biol 18:e1009802. doi: 10.1371/journal.pcbi.1009802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zimin AV, Salzberg SL. 2020. The genome polishing tool POLCA makes fast and accurate corrections in genome assemblies. PLoS Comput Biol 16:e1007981. doi: 10.1371/journal.pcbi.1007981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, Dehal P, Ware D, Perez F, Canon S, Sneddon MW, Henderson ML, Riehl WJ, Murphy-Olson D, Chan SY, Kamimura RT, Kumari S, Drake MM, Brettin TS, Glass EM, Chivian D, Gunter D, Weston DJ, Allen BH, Baumohl J, Best AA, Bowen B, Brenner SE, Bun CC, Chandonia JM, Chia JM, Colasanti R, Conrad N, Davis JJ, Davison BH, DeJongh M, Devoid S, Dietrich E, Dubchak I, Edirisinghe JN, Fang G, Faria JP, Frybarger PM, Gerlach W, Gerstein M, Greiner A, Gurtowski J, Haun HL, He F, Jain R, Joachimiak MP, Keegan KP, Kondo S, Kumar V, Land ML, Meyer F, Mills M, Novichkov PS, Oh T, Olsen GJ, Olson R, Parrello B, Pasternak S, Pearson E, Poon SS, Price GA, Ramakrishnan S, Ranjan P, Ronald PC, Schatz MC, Seaver SMD, Shukla M, Sutormin RA, Syed MH, Thomason J, Tintle NL, Wang D, Xia F, Yoo H, Yoo S, Yu D. 2018. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nat Biotechnol 36:566–569. doi: 10.1038/nbt.4163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Anonymous. 1994. Validation of the publication of new names and new combinations previously effectively published outside the IJSB. List no. 49. Int J Syst Bacteriol 44:370–371. [DOI] [PubMed] [Google Scholar]
- 14.Copeland A, Spring S, Goker M, Schneider S, Lapidus A, Del Rio TG, Tice H, Cheng JF, Chen F, Nolan M, Bruce D, Goodwin L, Pitluck S, Ivanova N, Mavrommatis K, Ovchinnikova G, Pati A, Chen A, Palaniappan K, Land M, Hauser L, Chang YJ, Jeffries CC, Meincke L, Sims D, Brettin T, Detter JC, Han C, Chain P, Bristow J, Eisen JA, Markowitz V, Hugenholtz P, Kyrpides NC, Klenk HP, Lucas S. 2009. Complete genome sequence of Desulfomicrobium baculatum type strain (X). Stand Genomic Sci 1:29–37. doi: 10.4056/sigs.13134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Jain C, Rodriguez RL, Phillippy AM, Konstantinidis KT, Aluru S. 2018. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun 9:5114. doi: 10.1038/s41467-018-07641-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The annotated genome sequence has been deposited in GenBank under the accession number CP100351. The version described in this article is CP100351.1. The Nanopore and Illumina reads are available in the NCBI Sequence Read Archive (SRA) under the accession numbers SRR21699889 and SRR20017258, respectively.

