Here, the complete assembly of the Moritella marina MP-1 (ATCC 15381) genome, combining Illumina and long Nanopore reads, is presented. The gapless assembly consists of a 4.7-Mb circular chromosome and a 26-kb plasmid, with a G+C content of 40.7%, and will assist in further studies of the molecular pathways in this biotechnologically significant organism.
ABSTRACT
Here, the complete assembly of the Moritella marina MP-1 (ATCC 15381) genome, combining Illumina and long Nanopore reads, is presented. The gapless assembly consists of a 4.7-Mb circular chromosome and a 26-kb plasmid, with a G+C content of 40.7%, and will assist in further studies of the molecular pathways in this biotechnologically significant organism.
ANNOUNCEMENT
There is a high demand for polyunsaturated fatty acids (PUFAs) as a food supplement (1, 2). The marine bacterium Moritella marina (MP-1) has been reported to produce unusually high levels of the PUFA docosahexaenoic acid (DHA) (3, 4). This makes MP-1 interesting with regard to the biotechnological production of DHA (5). Currently, only 2 draft genome assemblies for MP-1 are publicly available (GenBank accession numbers GCA_000291685.1 and GCA_000381865.1). To address this, we used a hybrid assembly approach to generate a complete genome.
MP-1 was obtained from ATCC and grown in marine broth medium 2216 (catalog number BD279110) at 18°C for 72 hours in a conical flask agitated at 150 rpm. Two DNA preparations were purified from cultured bacteria using the NucleoSpin tissue kit (MachereyNagel), with one being subjected to paired-end sequencing on the HiSeq 2500 Illumina platform (MR DNA, TX) and the other used for Nanopore sequencing (Oxford Nanopore Technologies [ONT]) on a MinION instrument in our lab. The Illumina library was prepared using the Nextera DNA sample preparation kit (Illumina) with 41-ng input DNA, following the manufacturer’s user guide, which yielded an average library size of 1,370 bp, as determined with a 2100 bioanalyzer (Agilent Technologies). Library preparation for Nanopore sequencing was performed with the ONT rapid sequencing kit (catalog number SQK-RAD004) using 832-ng DNA as the input. A single sequencing run on a SpotON flow cell (R9.4.1) yielded 2.93 × 106 reads, with a total of 8.99 × 109 bases that were called with Albacore v2.2.6. Default parameters were used for all software unless specified otherwise. By filtering for reads longer than 20 kb and quality scores (Q) higher than 10 using the BBmap tool suite (v37.90) (6), 5,407 reads with a total of 1.31 × 108 bases, corresponding to ∼28× coverage, were selected as input for the Canu assembly pipeline (v1.7) (7) set to an estimated genome size of 4.7 × 106 bp. This resulted in assembly A of a gapless circular contig with a length of 4,733,441 bp. The use of a data set including shorter reads (>8 kb and Q > 10) from the same MinION run for assembly B with Canu at the same settings revealed a 26-kb circular extrachromosomal element. Analysis with PPR-Meta (1.0) identified the large contig as a bacterial chromosome and the 26-kb contig as a plasmid (8).
The chromosome assembled in A and the plasmid from B were used for downstream processing for sequence improvement. The increase in sequence accuracy was evaluated by comparison to the two available draft assemblies using MUMmer (9, 10). We employed signal-level analysis using NanoPolish (11), which increased the average alignment identity from 99.33% to 99.80%. (12). Using the previously generated Illumina data set (12.8 × 106 reads, 250-bp length, ∼624× genome coverage), we ran 4 iterations of Pilon (v1.23), leading to final alignment identities of 99.98% to both draft assemblies. Duplicate overhangs were removed manually, resulting in final polished sequences of a 4,734,363-bp chromosome and a 26,062-bp plasmid.
Annotation with PGAP upon submission to NCBI yielded a total of 4,278 genes, including 198 RNA genes, comprised of 141 tRNA, 53 rRNA, and 4 noncoding RNA (ncRNA) genes (13). We expect that the availability of the complete genome sequence of MP-1 will pave the way for a more comprehensive study of this biotechnologically significant organism.
Data availability.
The complete genome sequence has been deposited in GenBank under the accession number GCA_008931805 (CP044398 [plasmid] and CP044399 [chromosome]). The version described in this paper is the first version, GCA_008931805.1. Unfiltered raw sequencing reads have been deposited in the NCBI Sequence Read Archive under the accession numbers SRX6654081 (Nanopore) and SRX6827227 (Illumina).
ACKNOWLEDGMENTS
This project was also supported by the German Federal Ministry of Economic Affairs (BMWi) under the grant number 50WB1836. This project was also supported by the German Academic Exchange service (DAAD) under grant number 57339330 (DAAD-Hochschulpartnerschaften). We acknowledge support of this work by the project “The National Research Infrastructures on Integrated Structural Biology, Drug Screening Efforts and Drug Target Functional Characterization,” “INSPIRED” (MIS 5002550), which is implemented under the Action “Reinforcement of the Research and Innovation Infrastructure,” funded by the Operational Program “Competitiveness, Entrepreneurship and Innovation” (NSRF 2014-2020) and cofinanced by Greece and the European Union (European Regional Development Fund).
The responsibility for the content of this publication lies with us.
REFERENCES
- 1.Wiktorowska-Owczarek A, Berezińska M, Nowak JZ. 2015. PUFAs: structures, metabolism and functions. Adv Clin Exp Med 24:931–941. doi: 10.17219/acem/31243. [DOI] [PubMed] [Google Scholar]
- 2.Tocher DR, Betancor MB, Sprague M, Olsen RE, Napier JA. 2019. Omega-3 long-chain polyunsaturated fatty acids, EPA and DHA: bridging the gap between supply and demand. Nutrients 11:89. doi: 10.3390/nu11010089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Urakawa H, Kita-Tsukamoto K, Steven SE, Ohwada K, Colwell RR. 1998. A proposal to transfer Vibrio marinus (Russell 1891) to a new genus Moritella gen. nov. as Moritella marina comb. nov. FEMS Microbiol Lett 165:373–378. doi: 10.1111/j.1574-6968.1998.tb13173.x. [DOI] [PubMed] [Google Scholar]
- 4.Delong EF, Yayanos AA. 1986. Biochemical function and ecological significance of novel bacterial lipids in deep-sea procaryotes. Appl Environ Microbiol 51:730–737. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Moi IM, Leow ATC, Ali MSM, Rahman R, Salleh AB, Sabri S. 2018. Polyunsaturated fatty acids in marine bacteria and strategies to enhance their production. Appl Microbiol Biotechnol 102:5811–5826. doi: 10.1007/s00253-018-9063-9. [DOI] [PubMed] [Google Scholar]
- 6.Bushnell B. 2014. BBMap. https://sourceforge.net/projects/bbmap/.
- 7.Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Fang Z, Tan J, Wu S, Li M, Xu C, Xie Z, Zhu H. 2019. PPR-Meta: a tool for identifying phages and plasmids from metagenomic fragments using deep learning. Gigascience 8:giz066. doi: 10.1093/gigascience/giz066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Kautharapu KB, Jarboe LR. 2012. Genome sequence of the psychrophilic deep-sea bacterium Moritella marina MP-1 (ATCC 15381). J Bacteriol 194:6296–6297. doi: 10.1128/JB.01382-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL. 2004. Versatile and open software for comparing large genomes. Genome Biol 5:R12. doi: 10.1186/gb-2004-5-2-r12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Loman NJ, Quick J, Simpson JT. 2015. A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat Methods 12:733–735. doi: 10.1038/nmeth.3444. [DOI] [PubMed] [Google Scholar]
- 12.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The complete genome sequence has been deposited in GenBank under the accession number GCA_008931805 (CP044398 [plasmid] and CP044399 [chromosome]). The version described in this paper is the first version, GCA_008931805.1. Unfiltered raw sequencing reads have been deposited in the NCBI Sequence Read Archive under the accession numbers SRX6654081 (Nanopore) and SRX6827227 (Illumina).