ABSTRACT
Here, we present HetDA_MAG_SS10, a metagenome-assembled genome (MAG) from an enrichment of a heterocystous diazotroph originally living in association with Trichodesmium spp. obtained near Station ALOHA in the North Pacific Ocean. HetDA_MAG_SS10, an alphaproteobacterium in the order Micavibrionales, is proposed to be photoheterotrophic via rhodopsin and has the potential for dimethylsulfoniopropionate (DMSP) demethylation.
ANNOUNCEMENT
Here, we describe HetDA_MAG_SS10 a genome for a novel epibiont of the cyanobacterial HetDA enrichment. Samples were collected from the oligotrophic North Pacific Ocean near Station ALOHA, as detailed by Momper et al. (1). Briefly, Trichodesmium spp. were collected from the top 10 m with a 130-μm plankton net, and colonies were hand-picked and incubated in sterile YBC-II medium at 24°C with a 12-h light (100 μmol photons m−2 s−1)/12-h dark cycle. The enrichment was grown under the aforementioned conditions for 5 years prior to sequencing and was transferred to fresh medium every month. A 500-mL subsample of the enrichment was gravity filtered onto a 5.0-μm polycarbonate filter, and DNA was extracted using the Qiagen DNeasy PowerSoil kit. The DNA was sent to the University of Southern California Epigenome Center and sequenced on an Illumina MiSeq sequencer (paired-end 250-bp reads, with a total of 4,725,335 raw reads) using the MiSeq reagent kit v2 with 300 cycles. Reads were trimmed using Trimmomatic v0.38 (parameters: –phred33, ILLUMINACLIP:TruSeq3-PE.fa:2:30:10 SLIDINGWINDOW:10:28 MINLEN:50) (2). These reads were subsampled at the following percentages: 1, 3, 5, 10, 25, 30, and 50. Subsampled reads were assembled using MEGAHIT v1.2.8 (3). As described by Tully et al. (4), primary contigs from initial MEGAHIT runs were passed through CD-HIT-EST v4.8.1 (parameters: –M 500000 –c 0.99 –n 10) to identify overlapping primary contigs (5), followed by a secondary assembly in Minimus2 (AMOS v3.1.0) (parameters: –D OVERLAP = 1000 MINID = 98) (6). The Minimus2-generated contigs and the primary contigs that did not assemble with Minimus2 were combined to create the final contigs. The reads were mapped using Bowtie2 v2.3.5 (7) and filtered using CoverM v0.4.0 (–min-read-percent-identity 0.95 –min-read-aligned-percent 0.75) to calculate coverage (8). The assembly was binned using MetaBat2 v2.12.1 (9), BinSanity-wf v0.3.8 (10), and Concoct v1.1.0 (11) using default parameters for each, and DASTool v1.1.2 (12) was used to determine a final set of metagenome-assembled genomes (MAGs). All bins were run through MetaSanity v1.3.0 (13), FuncSanity, and PhyloSanity to provide annotations, using the default configurations and provided databases.
The closest relative of HetDA_MAG_SS10, with an average nucleotide identity (ANI) value of 77.19%, was Micavibrionales strain TMED2 (GenBank assembly accession number GCA_002168225.1), and HetDA_MAG_SS10 was determined to be novel via GTDB-tk (14)-generated ANI and relative evolutionary divergence (RED) scores. It has 16 contigs, a genome size of 3,186,557 bp, an N50 value of 457,653 bp, a coding density of 91%, and a GC content of 57.9%. It is 99.57% complete with 2.17% redundancy, as assessed using CheckM (15), and falls in the Alphaproteobacteria order Micavibrionales.
HetDA_MAG_SS10 contains most of the genes required for glycolysis and the tricarboxylic acid (TCA) cycle and lacks ribulose-1,5-bisphosphate carboxylase-oxygenase (RuBisCo). It contains the cytochrome bd complex (KEGG Orthology codes K00425 and K00426), which is associated with microaerobic respiration (16). Based on the presence of retinal biosynthesis genes (KEGG Orthology codes K06443, K02291, and K10027), β-carotene 15,15′-monooxygenase (TIGRFAM accession number TIGR03753), and rhodopsin (PFAM accession number PF01036), this organism is predicted to be capable of photoheterotrophy. HetDA_MAG_SS10 may also play a role in sulfur cycling, because it contains the dmdA gene for dimethylsulfoniopropionate (DMSP) demethylase, which is involved in DMSP degradation (17).
In summary, our MAG is predicted to be photoheterotrophic and to have roles in sulfur cycling, with its nearest relatives being obligatory parasites.
Data availability.
Raw sequences and MAGs are available under BioProject accession number PRJNA719568. Raw reads are available in the SRA under accession number SRR14140256. The genome is available under BioSample accession number SAMN18613316.
ACKNOWLEDGMENTS
This work was funded by the University of Southern California.
We thank Joshua Roemer and Samantha Lei for their initial input.
This work and was part of the laboratory component of BISC431, Aquatic Microbiology.
E.A.W. grew the cultures, extracted DNA, and provided sequences and samples. J.F.H. and E.A.W. advised on manuscript content. M.S. and E.D.G. performed all bioinformatic analyses. M.S. wrote the first draft.
Contributor Information
Eric A. Webb, Email: eawebb@usc.edu.
Frank J. Stewart, Montana State University
REFERENCES
- 1.Momper LM, Reese BK, Carvalho G, Lee P, Webb EA. 2015. A novel cohabitation between two diazotrophic cyanobacteria in the oligotrophic ocean. ISME J 9:882–893. doi: 10.1038/ismej.2014.186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Li D, Luo R, Liu C-M, Leung C-M, Ting H-F, Sadakane K, Yamashita H, Lam T-W. 2016. MEGAHIT v1.0: a fast and scalable metagenome assembler driven by advanced methodologies and community practices. Methods 102:3–11. doi: 10.1016/j.ymeth.2016.02.020. [DOI] [PubMed] [Google Scholar]
- 4.Tully BJ, Graham ED, Heidelberg JF. 2018. The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans. Sci Data 5:170203. doi: 10.1038/sdata.2017.203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Fu L, Niu B, Zhu Z, Wu S, Li W. 2012. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28:3150–3152. doi: 10.1093/bioinformatics/bts565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Treangen TJ, Sommer DD, Angly FE, Koren S, Pop M. 2011. Next generation sequence assembly with AMOS. Curr Protoc Bioinformatics 33:11.8.1–11.8.18. doi: 10.1002/0471250953.bi1108s33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Woodcroft BJ. 2007. CoverM. https://github.com/wwood/CoverM.
- 9.Kang DD, Li F, Kirton E, Thomas A, Egan R, An H, Wang Z. 2019. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ 7:e7359. doi: 10.7717/peerj.7359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Graham ED, Heidelberg JF, Tully BJ. 2017. BinSanity: unsupervised clustering of environmental microbial assemblies using coverage and affinity propagation. PeerJ 5:e3035. doi: 10.7717/peerj.3035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Alneberg J, Bjarnason BS, de Bruijn I, Schirmer M, Quick J, Ijaz UZ, Lahti L, Loman NJ, Andersson AF, Quince C. 2014. Binning metagenomic contigs by coverage and composition. Nat Methods 11:1144–1146. doi: 10.1038/nmeth.3103. [DOI] [PubMed] [Google Scholar]
- 12.Sieber CMK, Probst AJ, Sharrar A, Thomas BC, Hess M, Tringe SG, Banfield JF. 2018. Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nat Microbiol 3:836–843. doi: 10.1038/s41564-018-0171-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Neely CJ, Graham ED, Tully BJ. 2020. MetaSanity: an integrated microbial genome evaluation and annotation pipeline. Bioinformatics 36:4341–4344. doi: 10.1093/bioinformatics/btaa512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. 2019. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36:1925–1927. doi: 10.1093/bioinformatics/btz848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Giuffrè A, Borisov VB, Arese M, Sarti P, Forte E. 2014. Cytochrome bd oxidase and bacterial tolerance to oxidative and nitrosative stress. Biochim Biophys Acta Bioenerg 1837:1178–1187. doi: 10.1016/j.bbabio.2014.01.016. [DOI] [PubMed] [Google Scholar]
- 17.Kiene RP, Linn LJ. 2000. Distribution and turnover of dissolved DMSP and its relationship with bacterial production and dimethylsulfide in the Gulf of Mexico. Limnol Oceanogr 45:849–861. doi: 10.4319/lo.2000.45.4.0849. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Raw sequences and MAGs are available under BioProject accession number PRJNA719568. Raw reads are available in the SRA under accession number SRR14140256. The genome is available under BioSample accession number SAMN18613316.
