Erythrobacter flavus strain KJ5 (formerly called Erythrobacter sp. strain KJ5) is a yellowish marine bacterium that was isolated from a hard coral in the Karimunjawa Islands of Indonesia.
ABSTRACT
Erythrobacter flavus strain KJ5 (formerly called Erythrobacter sp. strain KJ5) is a yellowish marine bacterium that was isolated from a hard coral in the Karimunjawa Islands of Indonesia. Here, we report the complete genome sequence of the bacterium and provide a useful resource for studies of the biosynthetic pathways of its unique carotenoids.
ANNOUNCEMENT
Erythrobacter flavus strain KJ5 (formerly called Erythrobacter sp. strain KJ5) is a yellowish marine bacterium that was isolated from a tropical sea of the Karimunjawa Islands in Indonesia (1). This bacterium was originally found as a possible symbiont of Acropora nasuta, a hard coral, and was found to be close relative of Erythrobacter flavus strain SW-46 according to its 16S RNA sequence (1). Preliminary analysis of the pigment content of this bacterium revealed that it lacks bacteriochlorophyll a but possesses at least 16 species of carotenoids, including β-carotene and zeaxanthin (2). Molecular species of the three most dominant carotenoids are still unknown, but they have been found to have more polarity than zeaxanthin (2). Moreover, two of these three carotenoids are diminished by saponification, indicating that these two pigments might be esterified by hydrophilic compounds (e.g., sugars, sulfate) (2). We analyzed the exact structures of these carotenoids, and simultaneously, we performed genome sequence analysis of strain KJ5 to determine the biosynthetic pathways of these carotenoids.
Strain KJ5 was cultivated in an Erythrobacter medium as reported by Shioi (3). Genomic DNA was extracted using the DNeasy blood and tissue kit (Qiagen, Venlo, Netherlands) with the manufacturer’s instructions. The sequencing library was synthesized using the TruSeq DNA Nano kit (Illumina, Inc., San Diego, CA). Whole-genome sequencing was performed by a massive parallel MiSeq sequencer (Illumina, Inc.) with a read length of 302 bp and an average insert size of 800 bp. Reads were filtered for quality values of >30 with the adaptor trimming option using the CLC Genomics Workbench v.10.0.1. (Qiagen). De novo genome assembly was performed using SPAdes v.3.0.1 (4) and was checked with QUAST v.5.0.2 (5) and Bandage v.0.8.1 (6), both with default settings. The single assembly was finalized by removing the terminal overhanging region. Gene prediction and functional annotation were performed using the DFAST pipeline v.1.1.4 with default settings (7). The start codon of the dnaA gene was defined as the +1 position of the chromosome. The mean genome coverage was 103×.
The complete genome sequence of strain KJ5 consists of a 2,819,202-bp circular chromosome with an average G+C content of 63.92%. Functional annotation revealed a total of 2,713 coding sequences, 48 tRNA genes, and 2 copies of the rRNA operons. The 16S rRNA gene sequence of strain KJ5 has 99.89% similarity with that of Erythrobacter flavus strain VG1, isolated from the Red Sea (GenBank accession number NZ_CP022528), according to a BLASTn search using the 16S rRNA gene sequence of strain KJ5 as a query and using strain VG1 as a reference. The synteny of the genes was also highly similar between strain KJ5 and strain VG1 over the whole genomic sequences, which were analyzed by constructing a genome rearrangement map using amino acid sequences by Genome Traveler v.3.0.25 with default settings (In Silico Biology, Inc., Yokohama, Japan).
According to the annotation, strain KJ5 was found to lack the genes encoding enzymes responsible for biosynthesis from protoporphyrin IX to bacteriochlorophyll a. On the other hand, for the carotenoid synthetic pathway, all the genes responsible for biosynthesis from geranylgeranyl diphosphate to β-carotene and zeaxanthin were found. These results completely fit the pigment content of the bacterium (2).
Data availability.
The genome sequence of Erythrobacter flavus strain KJ5 has been deposited in DDBJ/ENA/GenBank under the accession number AP019389. The accession number of the original read data set in the SRA is DRR162742.
ACKNOWLEDGMENTS
This study was supported by the competence research grant and World Class Research (number 061/SP2H/LT/K7/KM/2018 and 7/E/KPT/2019) to T.H.P.B., Ministry of Research, Technology, and Higher Education of the Republic of Indonesia.
REFERENCES
- 1.Wusqy NK, Limantara L, Karwur FF. 2014. Exploration, isolation and quantification of β-carotene from bacterial symbiont of Acropora sp. Microbiol Indones 8:58–64. [Google Scholar]
- 2.Juliadiningtyas AD, Pringgenies D, Heriyanto, Salim KP, Radjasa OK, Shioi Y, Limantara L, Brotosudarmo THP. 2018. Preliminary investigation of the carotenoid composition of Erythrobacter sp. strain KJ5 by high-performance liquid chromatography and mass spectrometry. Philipp J Sci 147:91–98. [Google Scholar]
- 3.Shioi Y. 1986. Growth characteristics and substrate specificity of aerobic photosynthetic bacterium, Erythrobacter sp. (OCh 114). Plant Cell Physiol 27:567–572. doi: 10.1093/oxfordjournals.pcp.a077135. [DOI] [Google Scholar]
- 4.Bankevich A, Nurk S, Antipov D, Gurevich A, Dvorkin M, Kulikov AS, Lesin V, Nikolenko S, Pham S, Prjibelski A, Pyshkin A, Sirotkin A, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics 31:3350–3352. doi: 10.1093/bioinformatics/btv383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Tanizawa Y, Fujisawa T, Nakamura Y. 2018. DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication. Bioinformatics 34:1037–1039. doi: 10.1093/bioinformatics/btx713. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome sequence of Erythrobacter flavus strain KJ5 has been deposited in DDBJ/ENA/GenBank under the accession number AP019389. The accession number of the original read data set in the SRA is DRR162742.