ABSTRACT
The complete genome sequence of the thermoacidophilic archaeon Metallosphaera sedula (DSM 5348) is reported here. M. sedula, originally isolated from a volcanic field in Italy, is a prolific iron-oxidizing archaeon with applications in bioleaching of sulfide minerals.
KEYWORDS: Metallosphaera sedula, thermoacidophile, archeaon
ANNOUNCEMENT
Metallosphaera sedula is an obligate aerobe, Chemolithoautotroph/heterotroph, isolated from a solfataric field in Italy, with optimal growth at pH of 2.0 and 73°C (1, 2). It was assigned to the new genus Metallosphaera in 1989 and is described as an iron-oxidizing archaea with applications in biomining of sulfide minerals (2, 3). It can also be grown on meteorites (4). The previous M. sedula reference genome was removed from NCBI due to inability to verify source material. The closed genome sequence of M. sedula DSM 5348 is reported here, consisting of 2,190,631 bp.
M. sedula was obtained from DSMZ and grown from −80°C freezer stocks at 70°C in modified DSM88 medium. NEB Monarch Genomic DNA Purification Kit (New England Biolabs, USA) was used to extract and purify genomic DNA from liquid cultures. The 1× dsDNA HS Assay Kit (Invitrogen, Q33231) was used to quantify DNA on a Qubit 4.0 fluorimeter. Oxford Nanopore Technologies (UK) Native Barcoding Kit (SQK-NBD112-24) was used for DNA barcoding, and Oxford Nanopore Technologies (UK) R9.4.1 flow cell (FLO-MIN106D) on a MinION Mk1B was used for DNA sequencing without size selection Sequencing was carried out for 72 h for the library using MinKNOW v22.12.7, while Guppy v6.4.6 was used for live high-accuracy GPU base calling and quality verification of the run, respectively.
Guppy base-calling workflow was used to trim the long reads initially, and NanoFilt v2.8.0 (5) was used to filter the reads, with read quality and length cutoffs set at 10 and 1,000, respectively. The ONT long reads were then used to generate a circularized contig for each strain using Flye assembler (v2.9.1-b1780) (6). Rotation of the assembled genome to start at the dnaA gene was carried out using the fixstart command in Circlator v1.5.5 (7). GraphMap v0.5.2 (8) was used for mapping the reads, SAMtools v1.16.1 (9) was used for indexing and sorting the mapped reads, and Pilon v1.24 (10) and Medaka v1.11.1 (Oxford Nanopore Technologies, UK) were used for error correction and polishing of the assemblies. Error correction and polishing were done using the trimmed and filtered reads. QUAST v5.2.0 (11), Pilon (10), and CheckM v1.2.2 (12) were used to generate quality and statistics reports for the finalized assemblies. Annotation of final assemblies was carried out using PGAP2023-10-03.build7061 . All software was used with default parameters unless specified otherwise. A summary of assembly statistics is shown in Table 1.
TABLE 1.
Statistics for M. sedula genome assembly
| Metallosphaera sedula | |
|---|---|
| Genome Accession ID | CP139956 |
| BioProject | PRJNA1046088 |
| BioSample | SAMN38472205 |
| Sequence Read Archive | SRR27048297 |
| # of Reads | 126,414 |
| Total Reads | 571,623,454 |
| Read n50 | 7,468 |
| Genome Size | 2,190,631 |
| GC Content | 46.22% |
| Read Coverage | 240 |
| Completeness | 100.00 |
| Contamination | 0.00 |
| # of Predicted Genes | 2,367 |
ACKNOWLEDGMENTS
This work was supported, in part, by the US Air Force Office of Sponsored Research (AFOSR) award FA9550-20-1-0216 and the US National Science Foundation award CBET-1802939. D.J.W. and R.G.B. acknowledge support from NIH Biotechnology Traineeships (NIH T32 GM008776-16).
Contributor Information
Robert M. Kelly, Email: rmkelly@ncsu.edu.
Kenneth M. Stedman, Portland State University, Portland, Oregon, United States
DATA AVAILABILITY
To access the data, the accession numbers, including raw reads SRA, BioSample, BioProject, and Genome, used in this project, previously deposited at NCBI, are provided in Table 1.
REFERENCES
- 1. Manesh MJH, Willard DJ, Lewis AM, Kelly RM. 2024. Extremely thermoacidophilic archaea for metal bioleaching: what do their genomes tell us?. Bioresour Technol 391:129988. doi: 10.1016/j.biortech.2023.129988 [DOI] [PubMed] [Google Scholar]
- 2. Huber G, Spinnler C, Gambacorta A, Stetter KO. 1989. Metallosphaera sedula gen, and sp. nov. represents a new genus of aerobic, metal-mobilizing, thermoacidophilic archaebacteria. Systematic and Applied Microbiology 12:38–47. doi: 10.1016/S0723-2020(89)80038-4 [DOI] [Google Scholar]
- 3. Auernik KS, Maezato Y, Blum PH, Kelly RM. 2008. The genome sequence of the metal-mobilizing, extremely thermoacidophilic archaeon Metallosphaera sedula provides insights into bioleaching-associated metabolism. Appl Environ Microbiol 74:682–692. doi: 10.1128/AEM.02019-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Kölbl D, Pignitter M, Somoza V, Schimak MP, Strbak O, Blazevic A, Milojevic T. 2017. Exploring fingerprints of the extreme thermoacidophile Metallosphaera sedula grown on synthetic martian regolith materials as the sole energy sources. Front Microbiol 8:1918. doi: 10.3389/fmicb.2017.01918 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. De Coster W, D’Hert S, Schultz DT, Cruts M, Van Broeckhoven C. 2018. Nanopack: visualizing and processing long-read sequencing data. Bioinformatics 34:2666–2669. doi: 10.1093/bioinformatics/bty149 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8 [DOI] [PubMed] [Google Scholar]
- 7. Hunt M, Silva ND, Otto TD, Parkhill J, Keane JA, Harris SR. 2015. Circlator: automated circularization of genome assemblies using long sequencing reads. Genome Biol 16:294. doi: 10.1186/s13059-015-0849-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Sović I, Šikić M, Wilm A, Fenlon SN, Chen S, Nagarajan N. 2016. Fast and sensitive mapping of nanopore sequencing reads with GraphMap. Nat Commun 7:11307. doi: 10.1038/ncomms11307 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. 2021. Twelve years of SAMtools and BCFtools. Gigascience 10:giab008. doi: 10.1093/gigascience/giab008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Mikheenko A, Prjibelski A, Saveliev V, Antipov D, Gurevich A. 2018. Versatile genome assembly evaluation with QUAST-LG. Bioinformatics 34:i142–i150. doi: 10.1093/bioinformatics/bty266 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
To access the data, the accession numbers, including raw reads SRA, BioSample, BioProject, and Genome, used in this project, previously deposited at NCBI, are provided in Table 1.
