Abstract
The draft genome sequence of Halorubrum sp. SAH-A6, isolated from commercial rock salts of the Danakil depression, Ethiopia. The genome comprised 3,325,770 bp, with the G + C content of 68.0%. The strain has many genes which are responsible for secondary metabolites biosynthesis, transport and catabolism as compared to other Halorubrum archaea members. Abundant genes responsible for numerous transport systems, solute accumulation, and aromatic/sulfur decomposition were detected. The first genomic analysis encourages further research on comparative genomics, and biotechnological applications. The NCBI accession number for this genome is SAMN04278861 and ID: 4278861 and strain deposited with accession number KCTC 43215.
Keywords: Halorubrum sp. SAH-A6, Genome, Rock salts, Danakil depression, Ethiopia
1. Resource table
| Name of resource | Halorubrum species strain SAH-A6 |
|---|---|
| Institution | This strain is available from Korean Collection for Type Cultures (KCTC) with the accession number KCTC 43215 |
| Person who created resource | Ashagrie Gibtan, Mingyeong Woo, Dokyung Oh, Kyounghee Park, Han-Seung Lee, Jae Hak Sohn, Dong-Woo Lee, Jung-Kue Shin, Sang-Jae Lee,⁎ |
| Contact person and email | Sang-Jae Lee, sans76@silla.ac.kr |
| Date archived/stock date | June 2, 2016 |
| Type of resource | Whole genome sequence data |
| Link to directly related literature that employed/validated this resource | http://www.ncbi.nlm.nih.gov/genome/14537?genome_assembly_id=263668 |
| Information in public databases | http://www.ncbi.nlm.nih.gov/genome/14537?genome_assembly_id=263668 |
2. Resource details
The genus Halorubrum was originally proposed by McGenity and Grant [1] and constitutes a large group of extremely halophilic aerobic archaea belonging to the family Halobacteriaceae. At the time of writing, 31 species have been described in the genus Halorubrum [2] which are widely distributed in diverse natural and artificial hypersaline environments such as marine salterns, salt lakes, soda lakes, saline soils, salt fermented foods, and salt preserved food products [3], [4]. Hence, more investigations at genomic level are required to improve our understanding of its ecology, physiology, genetics, and potentiality in biotechnological applications. Halorubrum sp. SAH-A6 strain was isolated from the commercial rock salt produced from the Danakil depression of Ethiopia. Currently, neither genome of this species nor Halorubrum genome from commercial rock salt of the Danakil depression of Ethiopia reported. To fill this gap, Halorubrum sp. SAH-A6 was chosen for genome sequencing.
3. Specifications
| Organism | Halorubrum sp. |
|---|---|
| Strain | SAH-A6 |
| Sequencer or array type | PacBio RS II |
| Data format | Analyzed |
| Experimental factors | Archaea strain |
| Experimental features | Assembled and annotated whole genome |
| Consent | N/A |
| Sample source location | Rock salts of the Danakil depression, Ethiopia |
The draft genome sequence of Halorubrum sp. SAH-A6, isolated from commercial rock salts of the Danakil depression, Ethiopia. The assembled genome comprised 3,325,770 bp, with high G + C content of 68.0% (Table 1). The strain has many genes which are responsible for secondary metabolites biosynthesis, transport, and catabolism as compared to other Halorubrum Archaea. In addition, strain SAH-A6 use universal strategies for extreme adaptation as indicated by the genome. Abundant genes responsible for numerous transport systems, solute accumulation, and aromatic/sulfur decomposition were detected. The subsystem category distribution statistics for Halorubrum sp. strain SAH-A6 were shown in Fig. 1.
Table 1.
Comparison of the genomic feature of Halorubrum sp. SAH-A6 strain with various halophilic Halorubrum strains. The information of the reference genomes was obtained from NCBI data base.
| Organism | BioProject | Resource | Genome size | Contigs | G + C (%) | r + tRNA |
|---|---|---|---|---|---|---|
| H. sp. SAH-A6a | PRJNA302707 | Danakil depression, Ethiopia | 3,325,770 | 3 | 68.0 | 6 + 45 |
|
H. lipolyticum DSM 21995 |
PRJNA188614 | Xin-Jiang, China | 3,425,042 | 41 | 68.0 | 3 + 44 |
|
H. aidingense JCM 13560 |
PRJNA188616 | Xin-Jiang, China | 3,108,525 | 37 | 67.2 | 4 + 49 |
|
H. kocurii JCM 14978 |
PRJNA188615 | Inner Mongolia, China | 3,619,738 | 105 | 66.9 | 1 + 46 |
|
H. lacusprofundi ATCC 49239 |
PRJNA58807 | Deep Lake, Antarctica | 3,692,576 | 3 | 64.0 | 9 + 51 |
|
H. saccharovorum DSM 1137 |
PRJNA188612 | California, USA | 3,423,703 | 72 | 66.9 | 2 + 445 |
|
H. coriense DSM 10284 |
PRJNA188619 | Geelong, Australia | 3,645,313 | 69 | 67.0 | 3 + 48 |
|
H. distributum JCM 10118 |
PRJNA188621 | Turkmenistan | 3,306,135 | 68 | 68.1 | 4 + 43 |
This study.
Fig. 1.
The subsystem category distribution statistics for Halorubrum sp. strain SAH-A6. The whole genome sequence of SAH-A6 was annotated using the Rapid Annotation System Technology (RAST) server. The pie chart showed the count of each subsystem feature and the subsystem coverage.
The genomic analysis showed that the overall central metabolism of SAH-A6 seems to be similar to other Halorubrum species. All members of Halorubrum share the same genes that are responsible for full glycolysis/gluconeogenesis, citrate cycle, pentose phosphate, and pyruvate pathways and sugars metabolism. This shows a horizontal gene transfer within the genus. However, metabolic differences were predicted in many other pathways. Among them, for example, the number of genes which are responsible for secondary metabolites biosynthesis, transport, and catabolism are very high in SAH-A6 strain as compared to other Halorubrum groups.
Further genomic analysis of strain SAH-A6 showed the genetic capacity for adaptation to harsh environments. Unlike other Halorubrum groups, SAH-A6 has much more genes responsible for inorganic ion transport, energy conversion, amino acid transport, and metabolism which can help it to cope with the hot, saline, and nutrient limited environments. Strain SAH-A6 revealed the presence of numerous ionic regulation genes, including magnesium, and copper transport, arsenic pump-driving, ABC transporters, cobalt-zinc-cadmium resistance, and P-type ATPase. These genes help SAH-A6 and other microbes to overcome the high metallic ion in rock salt as compared to other saline environments. Apart from this, strain SAH-A6 is using genes such as stress response, heat shock proteins, DNA repair systems, maintenance of membrane fluidity, and accumulation of compatible solutes as indicated by the genome. In addition, SAH-A6 also has other unique feature for adaptation in slow growth in nutrient-poor commercial rock salt in that it possesses a single rRNA operon. However, fewer genes encoding transposase, lipid transport, and metabolism were found in the SAH-A6 genome, compared with other Halorubrum members.
Phylogenetic tree was built based on neighbor joining tree with the alignment of the 16S rRNA gene sequences (~ 1470 bp) showing the relationship between Halorubrum genomes available at the EzTaxon data base and SAH-A6 using MEGA6 [5] (Supplementary Fig. 1).
4. Materials and methods
The genome sequencing was performed using a single molecule real-time (SMRT) sequencing platform on the PacBio RS II (Pacific Biosciences, Menlo Park, CA) [6]. Genomic DNA was extracted using a standard genomic DNA isolation kit (Promega, USA). The whole genome sequence of strain SAH-A6 was performed using single SMRT cell with a single 180 min movie (Pacific Biosciences) with P6C4 chemistry. The open reading frames of the assembled genome were predicted and annotated using the hierarchical genome-assembly process (HGAP) [7] protocol RS HGAP Assembly 2 in SMRT analysis version 2.3.0 (Pacific Biosciences; https://github.com/PacificBiosciences/SMRT-Analysis), IMG-ER [8], NCBI COG function [9], Pfam information [10], and EzTaxon [11] database. The rRNA and tRNA genes were identified using RNAmmer 1.2 [12] and tRNA scan-SE 1.23 [13], respectively. The whole genome sequence of SAH-A6 was annotated using the Rapid Annotation System Technology (RAST) server. The pie chart showed the count of each subsystem feature and the subsystem coverage.
5. Direct link to deposited data
http://www.ncbi.nlm.nih.gov/genome/14537?genome_assembly_id=263668
The following are the supplementary data related to this article.
Phylogenetic tree constructed using the neighbor-joining method based on 16SrRNA gene sequences, showing the taxonomic position of strain SAH-A6 in the genus Halorubrum. The information of the reference genomes was obtained from EzTaxon data base.
Alignment of the 16S rRNA gene of SAH-A6 with other strains.
Conflict of interest
The authors have nothing to disclose.
Verification and authentication
The whole draft genomic sequence of Halorubrum sp. SAH-A6 (Bio project PRJNA302707) has been deposited at NCBI GenBank database under accession numbers SAMN04278861 and ID: 4278861. This strain is available from Korean Collection for Type Cultures (KCTC) with the accession number KCTC 43215.
Acknowledgements
This work was supported by the Human Resource Training Program for Regional Innovation & Creativity through the Ministry of Education (MOE) & National Research Foundation of Korea (NRF-2014H1C1A1066945) and by the Basic Science Research Program through the National Research Foundation of Korea, which is funded by the Ministry of Education, Science and Technology (NRF-2014R1A1A1006415).
References
- 1.McGenity T.J., Grant W.D. Transfer of Halobacterium saccharovorum, Halobacterium sodomense, Halobacterium trapanicum NRC 34021 and Halobacterium lacusprofundi to the genus Halorubrumgen. nov., as Halorubrum saccharovorum comb. nov., Halorubrum sodomense comb. nov., Halorubrum trapanicum comb. nov., and Halorubrum lacusprofundi comb. nov. Int. J. Syst. Bacteriol. 1995;18:237–243. [Google Scholar]
- 2.Parte A.C. LPSN-list of prokaryotic names with standing in nomenclature. Nucleic Acids Res. 2016;42(Database issue):D613–D616. doi: 10.1093/nar/gkt1111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Fullmer M.S., Soucy S.M., Swithers K.S., Makkay A.M., Wheeler R., Ventosa A., Papke R.T. Population and genomic analysis of the genus Halorubrum. Front. Microbiol. 2014;5(140):1–15. doi: 10.3389/fmicb.2014.00140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lee H.S. Diversity of halophilic archaea in fermented foods and human intestines and their application. J. Microbiol. Biotechnol. 2013;23:1645–1653. doi: 10.4014/jmb.1308.08015. [DOI] [PubMed] [Google Scholar]
- 5.Tamura K., Stecher G., Peterson D., Filipski A., Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol. Biol. Evol. 2013;30:2725–2729. doi: 10.1093/molbev/mst197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Eid J., Fehr A., Gray J., Luong K., Lyle J., Otto G., Turner S. Real-time DNA sequencing from single polymerase molecules. Science. 2009;323(5910):133–138. doi: 10.1126/science.1162986. [DOI] [PubMed] [Google Scholar]
- 7.Chin C.S., Alexander D.H., Marks P., Klammer A.A., Drake J., Heiner C., Korlach J. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods. 2013;10(6):563–569. doi: 10.1038/nmeth.2474. [DOI] [PubMed] [Google Scholar]
- 8.Markowitz V.M., Mavromatis K., Ivanova N.N., Chen I.M., Chu K., Kyrpides N.C. IMG ER: a system for microbial genome annotation expert review and curation. Bioinformatics. 2009;25(17):2271–2278. doi: 10.1093/bioinformatics/btp393. [DOI] [PubMed] [Google Scholar]
- 9.Tatusov R.L., Galperin M.Y., Natale D.A., Koonin E.V. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2000;28(1):33–36. doi: 10.1093/nar/28.1.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Finn R.D., Bateman A., Clements J., Coggill P., Eberhardt R.Y., Eddy S.R., Punta M. Pfam: the protein families database. Nucleic Acids Res. 2014;42:D222–D230. doi: 10.1093/nar/gkt1223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kim O.S., Cho Y.J., Lee K., Yoon S.H., Kim M., Na H., Chun J. Introducing EzTaxon-e: a prokaryotic 16S rRNA gene sequence database with phylotypes that represent uncultured species. Int. J. Syst. Evol. Microbiol. 2012;62(Pt 3):716–721. doi: 10.1099/ijs.0.038075-0. [DOI] [PubMed] [Google Scholar]
- 12.Lagesen K., Hallin P., Rødland E.A., Staerfeldt H.H., Rognes T., Ussery D.W. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35(9):3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lowe T.M., Eddy S.R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Phylogenetic tree constructed using the neighbor-joining method based on 16SrRNA gene sequences, showing the taxonomic position of strain SAH-A6 in the genus Halorubrum. The information of the reference genomes was obtained from EzTaxon data base.
Alignment of the 16S rRNA gene of SAH-A6 with other strains.

