Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2023 Jan 4;12(1):e01164-22. doi: 10.1128/mra.01164-22

Complete Genome Sequence of a Cognatishimia activa Strain Assembled from the Phycosphere of the Marine Diatom Skeletonema tropicum

Kangli Guo a,b,#, ShuaiShuai Xu a,c,#, Yuanhao Liu a, Zhuo Chen d,, Shengwei Hou a,b,e,
Editor: Simon Rouxf
PMCID: PMC9872673  PMID: 36598233

ABSTRACT

Cognatishimia activa, previously known as Thalassobius activus, has been frequently isolated from marine environments. Here, we present the complete genome sequence of C. activa strain SOCE 004, assembled from the phycosphere of a long-term laboratory-maintained culture of the diatom Skeletonema tropicum. The complete genome is 3,211,994 bp long, with an average G+C content of 53.69%. The genome contains 3,195 genes, including 3,133 protein-coding genes, 50 tRNAs, and 3 copies each of 5S, 16S, and 23S rRNA genes.

ANNOUNCEMENT

Bacteria inhabiting the phycosphere of marine algae have a noteworthy effect on phytoplankton growth and proliferation via nutrient exchanges and biochemical communications (1, 2). Skeletonema tropicum is a warm water diatom species restricted to tropical and subtropical areas (3, 4). Several studies have shown that Skeletonema spp. have a complex commensal microbiota (5, 6), and members of the family Rhodobacteraceae have been frequently found to be associated with Skeletonema and other diatoms in long-term cultures (5, 7). The chain-forming Skeletonema aggregates serve as complex phycosphere systems to study algal-epibiont interactions (8, 9), while the limited genomic information available about the commensal microbiota hampers our understanding of microbial roles in these microenvironments (10, 11).

Here, we report the circular genome sequence of Cognatishimia activa (“Thalassobius activus”) SOCE 004, which was assembled from the microbiome of the diatom S. tropicum. The S. tropicum strain was obtained from the Research Center for Harmful Algal and Marine Biology of Jinan University; the strain was originally collected from the Pearl River Estuary (113.71°E, 21.96°N). Since its isolation, it has been continuously cultivated in artificial seawater supplemented with f/2 medium in a laboratory for more than 8 years. Cultures have been maintained at 20°C under a 12:12-h light/dark regime with a light intensity of 100 μmol photons m−2·s−1. This nonaxenic culture provides a great opportunity to identify long-term bacterium-diatom interactions.

DNA was extracted using the Biospin bacteria genomic DNA extraction kit (Bioer Technology, Hangzhou, China), following the instructions of the manufacturer. Briefly, cells growing in the logarithmic phase were lysed with proteinase K and sodium dodecyl sulfate. Then, genomic DNA was extracted using the column extraction method and quantified using a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, MA, USA). Finally, the DNA was sequenced using a combination of the QitanTech Nanopore (Qitan Technology Co., Beijing, China) and Illumina (Biomarker Technology, Beijing, China) sequencing platforms. For QitanTech Nanopore sequencing, a library was prepared using the QSK v1.1.1 kit according to the manufacturer’s instructions (Qitan Technology Co.) and sequenced on a QCell-384 flow cell. This Nanopore sequencing generated 4.6 Gbp raw data (quality, >Q7; 1.76 million reads), with an N50 value of ~6.24 Kbp and a mean read length of ~2.62 Kbp. For Illumina sequencing, a paired-end short-read (2 × 150-bp) library was prepared following the Illumina Nextera XT protocol and sequenced on an Illumina NovaSeq 6000 platform (Biomarker Technology, Beijing, China). The Illumina sequencing resulted in 39.74 Gbp clean reads (quality, >Q30; 132 million reads).

The Nanopore reads were base called and demultiplexed using Hound v1.1 (Qitan Technology Co.). Adapter sequences were removed using Porechop v0.2.4 (12). The sequence quality was assessed using NanoPack v1.25.0 (13). Flye v2.9 (14) and Canu v2.2 (15) were used to assemble the long-read sequences generated by the QitanTech Nanopore sequencing. The assemblies were then polished using Pilon v1.24 (16) and NextPolish v1.4.0 (17) with short reads generated using the Illumina platform. Unicycler v0.4.8 (12) was used to identify and clean circular contigs from the polished assemblies. A circular contig with a 16S rRNA gene similar to that of C. activa strain 2012CJ37-3 (98.65% identity via BLASTN) was selected for further analysis. A maximum likelihood phylogenetic tree was constructed using IQ-Tree v1.6.12 and visualized using the iTOL Web server (https://itol.embl.de) (Fig. 1). All tools were run with default parameters unless otherwise specified.

FIG 1.

FIG 1

Maximum likelihood phylogenetic tree of C. activa and other closely related species. Closely related strains were obtained from the NCBI database, and only 16S rRNA sequences with valid names were selected. The 16S sequences were aligned using MUSCLE 5.1. The maximum likelihood phylogenetic tree was inferred from sequence alignment of the best-fitting model automatically selected using ModelFinder in IQ-Tree v1.6.12. The tree file generated was visualized using the iTOL Web server. The lengths of the branches represent phylogenetic distances from the reference genome. The three C. activa 16S sequences are shown in red. Three Marivita strains (highlighted in blue) were used as the outgroup to root the tree.

The circular assembly of C. activa SOCE 004 was annotated using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) (18). The complete genome sequence is 3,211,994 bp long, with an average G+C content of 53.69%. In total, 3,195 genes were annotated in the C. activa SOCE 004 genome. Among them, 3,133 are coding sequences (CDSs), 50 are tRNAs, 9 are rRNAs (5S, 16S, and 23S), and 3 are noncoding RNAs (ncRNAs). Several genes involved in nitrogen metabolism were also predicted, including nitrite/sulfite reductase (nirD and nirB) and denitrifying reductase (narI, narH, and narJ) genes. The availability of this genome will help us understand how Cognatishimia spp. adapt to the Skeletonema phycosphere.

Data availability.

The complete genome sequence of C. activa SOCE 004 has been deposited at GenBank under the accession number CP096147. All data are available under BioProject accession number PRJNA827970 and BioSample accession number SAMN27646840. The raw Illumina and Nanopore data were submitted to the SRA and are available via accession numbers SRR22269038 and SRR22262228, respectively.

ACKNOWLEDGMENTS

This study was supported by the National Natural Science Foundation of China (42276163 and 31970367), the MEL Visiting Fellowship of Xiamen University (MELRS2210), the Guangdong Basic and Applied Basic Research Foundation (2021B1515120080), and the Experimental Teaching Demonstration Center project of Guangdong Province (SJZLGC202111).

Contributor Information

Zhuo Chen, Email: chenzhuo4357@sdnu.edu.cn.

Shengwei Hou, Email: housw@sustech.edu.cn.

Simon Roux, DOE Joint Genome Institute.

REFERENCES

  • 1.Amin SA, Parker MS, Virginia Armbrust E. 2012. Interactions between diatoms and bacteria. Microbiol Mol Biol Rev 76:667–684. doi: 10.1128/MMBR.00007-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hou S, López-Pérez M, Pfreundt U, Belkin N, Stüber K, Huettel B, Reinhardt R, Berman-Frank I, Rodriguez-Valera F, Hess WR. 2018. Benefit from decline: the primary transcriptome of Alteromonas macleodii str. Te101 during Trichodesmium demise. ISME J 12:981–996. doi: 10.1038/s41396-017-0034-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Liu D, Jiang J, Wang Y, Zhang Y, Di B. 2012. Large scale northward expansion of warm water species Skeletonema tropicum (Bacillariophyceae) in China seas. Chinese J Oceanol Limnol 30:519–527. doi: 10.1007/s00343-012-1249-x. [DOI] [Google Scholar]
  • 4.Liu S, Wang Y, Xu Q, Zhang M, Chen N. 2021. Comparative analysis of full-length mitochondrial genomes of five Skeletonema species reveals conserved genome organization and recent speciation. BMC Genomics 22:746. doi: 10.1186/s12864-021-07999-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Grossart H-P, Levold F, Allgaier M, Simon M, Brinkhoff T. 2005. Marine diatom species harbour distinct bacterial communities. Environ Microbiol 7:860–873. doi: 10.1111/j.1462-2920.2005.00759.x. [DOI] [PubMed] [Google Scholar]
  • 6.Grossart H-P, Czub G, Simon M. 2006. Algae–bacteria interactions and their effects on aggregation and organic matter flux in the sea. Environ Microbiol 8:1074–1084. doi: 10.1111/j.1462-2920.2006.00999.x. [DOI] [PubMed] [Google Scholar]
  • 7.Amin SA, Hmelo LR, van Tol HM, Durham BP, Carlson LT, Heal KR, Morales RL, Berthiaume CT, Parker MS, Djunaedi B, Ingalls AE, Parsek MR, Moran MA, Armbrust EV. 2015. Interaction and signalling between a cosmopolitan phytoplankton and associated bacteria. Nature 522:98–101. doi: 10.1038/nature14488. [DOI] [PubMed] [Google Scholar]
  • 8.Gutierrez T, Whitman WB, Huntemann M, Copeland A, Chen A, Kyrpides N, Markowitz V, Pillay M, Ivanova N, Mikhailova N, Ovchinnikova G, Andersen E, Pati A, Stamatis D, Reddy TBK, Ngan CY, Chovatia M, Daum C, Shapiro N, Cantor MN, Woyke T. 2016. Genome sequence of Marinobacter sp. strain MCTG268 isolated from the cosmopolitan marine diatom Skeletonema costatum. Genome Announc 4:e00937-16. doi: 10.1128/genomeA.00937-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Zhu J, Cheng K, Chen G, Zhou J, Cai Z. 2022. Complete genome sequence of Vibrio maritimus BH16, a siderophore-producing mutualistic bacterium isolated from diatom Skeletonema costatum. Mol Plant Microbe Interact 35:723–726. doi: 10.1094/MPMI-01-22-0025-A. [DOI] [PubMed] [Google Scholar]
  • 10.Crenn K, Duffieux D, Jeanthon C. 2018. Bacterial epibiotic communities of ubiquitous and abundant marine diatoms are distinct in short- and long-term associations. Front Microbiol 9:2879. doi: 10.3389/fmicb.2018.02879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Jackrel SL, Yang JW, Schmidt KC, Denef VJ. 2021. Host specificity of microbiome assembly and its fitness effects in phytoplankton. ISME J 15:774–788. doi: 10.1038/s41396-020-00812-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.De Coster W, D'Hert S, Schultz DT, Cruts M, Van Broeckhoven C. 2018. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics 34:2666–2669. doi: 10.1093/bioinformatics/bty149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8. [DOI] [PubMed] [Google Scholar]
  • 15.Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Hu J, Fan J, Sun Z, Liu S. 2020. NextPolish: a fast and efficient genome polishing tool for long-read assembly. Bioinformatics 36:2253–2255. doi: 10.1093/bioinformatics/btz891. [DOI] [PubMed] [Google Scholar]
  • 18.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The complete genome sequence of C. activa SOCE 004 has been deposited at GenBank under the accession number CP096147. All data are available under BioProject accession number PRJNA827970 and BioSample accession number SAMN27646840. The raw Illumina and Nanopore data were submitted to the SRA and are available via accession numbers SRR22269038 and SRR22262228, respectively.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES