An extremely halophilic archaeon, Salarchaeum sp. strain JOR-1, was isolated from the east coast of the Dead Sea, Kingdom of Jordan, and sequenced using single-molecule real-time (SMRT) sequencing. The GC-rich 2.5-Mbp genome was composed of a circular chromosome and a megaplasmid. The genome contained 2,633 genes and was incorporated into HaloWeb (https://halo.umbc.edu/).
ABSTRACT
An extremely halophilic archaeon, Salarchaeum sp. strain JOR-1, was isolated from the east coast of the Dead Sea, Kingdom of Jordan, and sequenced using single-molecule real-time (SMRT) sequencing. The GC-rich 2.5-Mbp genome was composed of a circular chromosome and a megaplasmid. The genome contained 2,633 genes and was incorporated into HaloWeb (https://halo.umbc.edu/).
ANNOUNCEMENT
Halophilic microbes capable of surviving extreme conditions are of interest for biotechnology and astrobiology (1, 2). To increase our access to genomes of these novel microbes, an extremely halophilic archaeon, Salarchaeum sp. strain JOR-1, was isolated from the east coast of the Dead Sea, Kingdom of Jordan, and sequenced. As a result, the number of halophilic archaea from the Dead Sea with a completed genome in GenBank, which includes Halobaculum gomorrense and Haloarcula marismortui, is expanded to three genera (3, 4). These microbes provide the basis for comparative genomic analysis of Haloarchaea isolated from below sea level.
To isolate the new haloarchaeal strain, brine was sampled from 30 cm below the surface of the Dead Sea (31.6998272 N, 35.5811240 E), 415 m below sea level, with a temperature of 34.4°C. The brine was inoculated into Halorubrum medium plus trace metals (HLM+), and growth was stimulated with shaking at 220 rpm at 37°C under white light illumination in an Innova 43R shaker (5). The enrichment cultures were plated onto HLM+ agar plates, and the isolate, an extremely halophilic pigmented halophilic microorganism, was purified by 3 rounds of streaking.
After growing a shaking culture in liquid HLM+ to the stationary phase, nucleic acids were extracted using the classic phenol extraction method of Marmur modified for Haloarchaea (6), and sequencing was performed using the PacBio Sequel platform (Pacific Biosciences, Menlo Park, CA). Using the manufacturer’s 20-kb library preparation protocol with minor modifications, SMRTbell libraries were prepared from 2 μg genomic DNA sheared to 40 kb with a Megaruptor device (Diagenode, Inc., Danville, NJ). The library was sequenced on a single-molecule real-time (SMRT) cell using the Sequel binding kit version 3.0, 10 h of collection time, and 2 h of preextension time (7). Sequencing subreads were filtered and assembled de novo using Hierarchical Genome Assembly Process version 4 (HGAP4) with default parameters. There were 342,641 filtered subreads (mean length, 8,697 bp; mean coverage, 745×) in the preassembly, and 2,782× coverage of the chromosome and 2,237× coverage of plasmid pJOR178 in the final assembly. HGAP4 automatically determined circularity for pJOR178, while the chromosome assembled as a linear contig with 75,204 bp of repeated sequence at the two ends. One copy of the repeat was removed and the contig was circularized; resequencing showed that the single copy was consistent in coverage with the remainder of the molecule.
The Salarchaeum sp. JOR-1 genome sequence was annotated using NCBI’s Prokaryotic Genome Annotation Pipeline (PGAP) build 3190 (8), incorporated into HaloWeb (version r1559404112; https://halo.umbc.edu/), and analyzed using HaloWeb (9) and EMBOSS version 6.6.0.0 (http://www.bioinformatics.nl/cgi-bin/emboss). Default parameters were used for all software unless otherwise noted. The JOR-1 genome was 2,523,371 bp long, with a GC content of 66.2%, with the 2,344,456-bp circular chromosome having 66.3% GC content and the 178,915-bp circular plasmid pJOR178 having 63.8% GC content. Based on its 16S rRNA sequence, the genome was most closely related to that of Salarchaeum japonicum, but it also exhibited similarity to other strains (90 to 99%).
The JOR-1 genome contained 2,633 genes, including a single rRNA operon and 45 tRNA genes. The proteome was highly acidic (10), with a calculated mean pI value of 4.7 based on EMBOSS Pepstats. Nearly all core haloarchaeal orthologous groups (cHOGs) were encoded in the JOR-1 genome (11, 12). It contained multiple copies of genes from several families, which included those encoding the origin recognition complex and general transcription factors, as well as more than 2 dozen transposases (13, 14). A gene cluster for gas vesicle nanoparticles was also present (15). Kinetic data from the sequencing results revealed two major methylated motifs, (m4C)ATG (98.7%) and AGCG(m6A)NC (99.3%), and the gene products recognizing these sequences are named M.SspJOR1I and SspJOR1II, respectively. Cleavage with the m5C-dependent restriction enzyme MspJI (New England Biolabs) revealed an additional motif, GGW(m5C)C, and the gene product recognizing this sequence is named M.SspJOR1III (16).
Data availability.
The Salarchaeum sp. JOR-1 genome sequence has been deposited in GenBank under the accession numbers CP042240 and CP042241, and raw data are available in the NCBI Sequence Read Archive under the accession number SRX6627796.
ACKNOWLEDGMENTS
The DasSarma laboratory was supported by NASA grant 80NSSC19K0563. F.L.M. was supported by the Fulbright Fellowship Program.
B.P.A. and R.J.R. work for New England Biolabs, a company that sells research reagents, including restriction enzymes and DNA methyltransferases, to the scientific community.
REFERENCES
- 1.DasSarma S, DasSarma P. 2017. Halophiles In Encyclopedia of life science. John Wiley & Sons, Hoboken, NJ. [Google Scholar]
- 2.DasSarma S, Schwieterman EW. 2018. Early evolution of purple retinal pigments on Earth and implications for exoplanet biosignatures. Int J Astrobiol:1–10. doi: 10.1017/S1473550418000423. [DOI] [Google Scholar]
- 3.DasSarma S, Fomenkov A, DasSarma SL, Vincze T, DasSarma P, Roberts RJ. 2019. Methylomes of two extremely halophilic Archaea species, Haloarcula marismortui and Haloferax mediterranei. Microbiol Resour Announc 8:e00577-19. doi: 10.1128/MRA.00577-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Goo YA, Roach J, Glusman G, Baliga NS, Deutsch K, Pan M, Kennedy S, DasSarma S, Ng WV, Hood L. 2004. Low-pass sequencing for microbial comparative genomics. BMC Genomics 5:3. doi: 10.1186/1471-2164-5-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Anderson IJ, DasSarma P, Lucas S, Copeland A, Lapidus A, Del Rio TG, Tice H, Dalin E, Bruce DC, Goodwin L, Pitluck S, Sims D, Brettin TS, Detter JC, Han CS, Larimer F, Hauser L, Land M, Ivanova N, Richardson P, Cavicchioli R, DasSarma S, Woese CR, Kyrpides NC. 2016. Complete genome sequence of the Antarctic Halorubrum lacusprofundi type strain ACAM 34. Stand Genomic Sci 11:70. doi: 10.1186/s40793-016-0194-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ng WL, Yang CF, Halladay JT, Arora P, DasSarma S. 1995. Protocol 25. Isolation of genomic and plasmid DNAs from Halobacterium halobium, p 179–184. In DasSarma S, Fleischmann EM, (ed), Archaea, a laboratory manual: halophiles. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. [Google Scholar]
- 7.Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, Clum A, Copeland A, Huddleston J, Eichler EE, Turner SW, Korlach J. 2013. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods 10:563–569. doi: 10.1038/nmeth.2474. [DOI] [PubMed] [Google Scholar]
- 8.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.DasSarma SL, Capes MD, DasSarma P, DasSarma S. 2010. HaloWeb: the haloarchaeal genomes database. Saline Systems 6:12. doi: 10.1186/1746-1448-6-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.DasSarma S, DasSarma P. 2015. Halophiles and their enzymes: negativity put to good use. Curr Opin Microbiol 25:120–126. doi: 10.1016/j.mib.2015.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Capes MD, DasSarma P, DasSarma S. 2012. The core and unique proteins of haloarchaea. BMC Genomics 13:39. doi: 10.1186/1471-2164-13-39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kennedy SP, Ng WV, Salzberg SL, Hood L, DasSarma S. 2001. Understanding the adaptation of Halobacterium species NRC-1 to its extreme environment through computational analysis of its genome sequence. Genome Res 11:1641–1650. doi: 10.1101/gr.190201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Capes MD, Coker JA, Gessler R, Grinblat-Huse V, DasSarma SL, Jacob CG, Kim J-M, DasSarma P, DasSarma S. 2011. The information transfer system of halophilic archaea. Plasmid 65:77–101. doi: 10.1016/j.plasmid.2010.11.005. [DOI] [PubMed] [Google Scholar]
- 14.DasSarma S. 2004. Genome sequence of an extremely halophilic archaeon, p 383–399. In Fraser CM, Read T, Nelson KE (ed), Microbial genomes. Humana Press, Inc, Totowa, NJ. [Google Scholar]
- 15.DasSarma S, DasSarma P. 2015. Gas vesicle nanoparticles for antigen display. Vaccines (Basel) 3:686–702. doi: 10.3390/vaccines3030686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cohen-Karni D, Xu D, Apone L, Fomenkov A, Sun Z, Davis PJ, Kinney SR, Yamada-Mabuchi M, Xu SY, Davis T, Pradhan S, Roberts RJ, Zheng Y. 2011. The MspJI family of modification-dependent restriction endonucleases for epigenetic studies. Proc Natl Acad Sci U S A 108:11040–11045. doi: 10.1073/pnas.1018448108. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The Salarchaeum sp. JOR-1 genome sequence has been deposited in GenBank under the accession numbers CP042240 and CP042241, and raw data are available in the NCBI Sequence Read Archive under the accession number SRX6627796.