Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2024 Jun 11;13(7):e00084-24. doi: 10.1128/mra.00084-24

Complete genome sequence of a haloalkaliphilic heterotrophic bacterium Halomonas salifodinae IM328

Zheyi Wang 1,2, Niping Yang 3, Mengshuang Liu 1,2, Dahe Zhao 1, Hua Xiang 1, Yanning Zheng 1,
Editor: Irene L G Newton4
PMCID: PMC11256830  PMID: 38860781

ABSTRACT

The genome of a halophilic bacterium Halomonas salifodinae IM328 was completely sequenced in order to offer convenience for the research such as the synthesis of compatible solutes. The genome contains a circular chromosome which was sequenced by PacBio system.

KEYWORDS: genome, DNA sequencing, Halomonas salifodinae, compatible solutes

ANNOUNCEMENT

Halomonas salifodinae IM328, which is a Gram-negative haloalkaliphilic heterotrophic bacterium, was previously isolated from the sediment of Yanhaizi Lake, Inner Mongolia, China; 108°26′14″N, 40°9′6′E (1). The sample was diluted and spread on HM-3.6 solid medium plates containing the following (g/L): sodium chloride 36, potassium chloride 2, magnesium sulfate heptahydrate 1, calcium chloride 0.27, sodium bromide 0.23, sodium bicarbonate 0.06, peptone 5, yeast extract 10, ferric chloride 0.001, pH 8.5 (adjusted with NaOH), and agar 1.5% (wt/vol) for solid medium. As a halophilic bacterium that could reach high optical density at 600 nm (above 2.0 with flask) and tolerate relatively high salinity, IM328 has the potential for a high yield of compatible solutes such as ectoine (2) and betaine (3). Since extremophiles have become a popular chassis for industrial bioproduction of high-value chemicals such as hydroxyectoine (4), which is costly to produce through chemical catalysis, complete genome sequencing is required for proper genetic modification in H. salifodinae IM328.

The whole genome sequencing was performed at Azenta Genewiz Biotech Co. Ltd, China with Illumina NovaSeq6000 platform and PacBio Sequel lle system. For genome DNA preparation, the cells were incubated at 37℃ overnight in 80 Luria-Bertani liquid medium, which contains 10.0 g/L of tryptone, 5.0 g/L of yeast extract, and 80.0 g/L of NaCl. Cells cultured in 10 mL liquid medium were collected. The genome DNA was extracted by HiPure Soil DNA Kit (Magen, China). For NovaSeq6000 platform, 100 ng gDNA was randomly broken by Covaris S220 and repaired by End Prep Enzyme Mix using VAHTS Universal Plus DNA Library Prep Kit for Illumina V2 (Vazyme, China). The concentration and quality of the library were examined by Qubit 3.0 Fluorometer (Invitrogen, Carlsbad, CA, USA) and Agilent 2100 Bioanalyzer System, successively. The read length of Illumina NovaSeq6000 platform was set to 2 × 150 bp, and fastp (v0.23.0) (5) is used to remove the adapter or primer sequences and reads <75 bp for quality control. The reads with Q20 <40% were also removed using fastp. Clean reads (40,047,768) (from 40,470,970 total reads) and 6,000,990,546 clean bases were obtained. For PacBio Sequel lle system (6), 5 µg gDNA was randomly broken using PacBio Covaris g-Tube DNA Shearing for SMRTbell Prep Kit 3.0, followed by quantification using Qubit 3.0 Fluorometer. The size of the library was checked to be appropriate using Agilent 2100 Bioanalyzer System before sequencing following the instructions of the manufacturer. A total of 21,341 sequences (average length: 7,603.45 bp) were obtained with an N50 value of 8,745.

PacBio reads were assembled using Hifiasm (v0.13-r308) (7) and Canu (v1.7) (8) with default settings for identifying the overlaps and trimming of mismatching bases, followed by recorrecting with software Pilon (v1.2.2) (9) using Illumina raw data. Out of a total of 21,341 reads, 21,339 reads were mapped to the genome with a coverage of 100% with a mean depth of 41× and assembly N50 value of 3,952,647. The genome assembly contains a single chromosome with a length of 3,952,647 nucleotides (66.5% guanine-cytosine content), and the origin of the circular genome was rotated to the start of chromosomal replication initiator protein gene DnaA. The sequence was deposited in the GenBank under the accession number CP141893.

ACKNOWLEDGMENTS

This study was funded by the National Key R&D Program of China (2020YFA0906800), the Strategic Priority Research Program of the Chinese Academy of Sciences (XDA28030201), and the National Natural Science Foundation of China (92351302, 91851102, 32070034, and 32270056).

Contributor Information

Yanning Zheng, Email: zhengyn@im.ac.cn.

Irene L. G. Newton, Indiana University, Bloomington, Bloomington, Indiana, USA

DATA AVAILABILITY

Sequences are available in NCBI databases under BioProject PRJNA1057535, Biosample SAMN39136389, and GenBank CP141893 (chromosome), and the Pacbio raw data and Illumina raw data have been deposited in SRA with the accession numbers SRR28162546 and SRR28904962.

REFERENCES

  • 1. Zhang M, Xue Q, Zhang S, Zhou H, Xu T, Zhou J, Zheng Y, Li M, Kumar S, Zhao D, Xiang H. 2021. Development of whole-cell catalyst system for sulfide biotreatment based on the engineered haloalkaliphilic bacterium. AMB Express 11:142. doi: 10.1186/s13568-021-01302-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Liu MS, Liu H, Shi M, Jiang MY, Li LL, Zheng YN. 2021. Microbial production of ectoine and hydroxyectoine as high-value chemicals. Microb Cell Fact 20:76. doi: 10.1186/s12934-021-01567-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Jungmann L, Hoffmann SL, Lang C, De Agazio R, Becker J, Kohlstedt M, Wittmann C. 2022. High-efficiency production of 5-hydroxyectoine using metabolically engineered Corynebacterium glutamicum. Microb Cell Fact 21:274. doi: 10.1186/s12934-022-02003-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Chen WC, Hsu CC, Wang LF, Lan JCW, Chang YK, Wei YH. 2019. Exploring useful fermentation strategies for the production of hydroxyectoine with a halophilic strain, Halomonas salina BCRC 17875. J Biosci Bioeng 128:332–336. doi: 10.1016/j.jbiosc.2019.02.015 [DOI] [PubMed] [Google Scholar]
  • 5. Chen S. 2023. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. iMeta 2:e107. doi: 10.1002/imt2.107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. McCarthy A. 2010. Third generation DNA sequencing: pacific biosciences' single molecule real time technology. Chem Biol 17:675–676. doi: 10.1016/j.chembiol.2010.07.004 [DOI] [PubMed] [Google Scholar]
  • 7. Cheng H, Concepcion GT, Feng X, Zhang H, Li H. 2021. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods 18:170–175. doi: 10.1038/s41592-020-01056-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Nurk S, Walenz BP, Rhie A, Vollger MR, Logsdon GA, Grothe R, Miga KH, Eichler EE, Phillippy AM, Koren S. 2020. HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads. Genome Res 30:1291–1305. doi: 10.1101/gr.263566.120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng QD, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Sequences are available in NCBI databases under BioProject PRJNA1057535, Biosample SAMN39136389, and GenBank CP141893 (chromosome), and the Pacbio raw data and Illumina raw data have been deposited in SRA with the accession numbers SRR28162546 and SRR28904962.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES