Abstract
We present here the genome sequence of Shewanella corallii strain A687 isolated from pufferfish Sphoeroides spengleri (Family Tetraodontidae). The assembly consists of 5,215,037 bp and contains 284 contigs, with a G+C content of 50.3%.
Keywords: Shewanella, gammaproteobacteria, genome, gene prediction, secondary metabolism
The genus Shewanella comprises 67 recognized species. These bacteria are Gram-negative, rod-shaped, facultatively anaerobic gammaproteobacteria and frequently isolated from marine environments (MacDonell and Colwell, 1985; Sugimoto et al., 2018). Shewanella species are involved in the production of antimicrobial metabolites and tetrodotoxin, a strong neurotoxin (Matsui et al., 1989; Simidu et al., 1990; Magarlamov et al., 2017). Shewanella corallii was recorded from red sea coral (Shnit-Orland, 2010). The aim of the present study was to determine the genome sequence of Shewanella corallii strain A687.
S. corallii A687 was isolated from pufferfish Sphoeroides spengleri (Family Tetraodontidae), in Arraial do Cabo (Brazil) in 2016. Genomic DNA was extracted according to Pitcher’s protocol (Pitcher et al., 1989) and used for 300-bp paired-end library preparation with Nextera XT DNA Sample Preparation Kit. The genome was sequenced using MiSeq (Illumina, San Diego, CA, USA) as previously described (Walter et al., 2016). Sequences obtained were pre-processed using PRINSEQ software to remove reads smaller than 35 bp and low-score sequences (Phred 30) (Schmieder and Edwards, 2011). Sequence reads were assembled using A5-Miseq (Coil et al., 2015) and CAP3 software (Huang and Madan, 1999). The gene prediction and functional annotation were performed using the RAST server (Overbeek et al., 2014). Secondary metabolism was analyzed by antiSMASH (Weber et al., 2015) and clustered regularly interspaced short palindromic repeat (CRISPR) arrays were determined with CRISPRFinder (Grissa et al., 2007).
The sequencing generated a total of 4,557,272 reads and 768,079,097 bp that were assembled in 284 contigs (N50=298,540 bp). The estimated genome size is 5,215,037 bp with G+C of 50.3%, and a coverage of 146-fold. RAST predicted 4,555 coding sequences, and 175 RNA sequences (147 tRNAs, 11 16S rRNAs, 6 23S rRNAs, and 11 5S rRNAs). Analyzing the genes predicted by RAST, a total of 74 genes were involved in resistance to antibiotics and toxic compounds including copper homeostasis (N=8); bile hydrolysis (N=2); cobalt-zinc-cadmium resistance (N=20); resistance to fluoroquinolones (N=4); arsenic resistance (N=4); copper homeostasis: copper tolerance (N=6); tetracycline resistance, ribosome protection type II (N=2); beta-lactamase (N=5); multidrug resistance efflux pumps (N=22); resistance to chromium compounds (N=1); 24 genes for the metabolism of aromatic compounds, including salicylate ester, chloroaromatic and quinate degradation. Phage elements sequences (N=44) were found in this genome, and CRISPRs arrays candidates were predicted in four sequences. We searched through subsystems for genes associated to symbiosis. Genes encoding type I, (lapBCDE, lapL, lapP, RTX and TolC) and type II (TadA, TadB, TadC, VirB11, RcpC, CpaB and CpaF) secretion systems were detected. We also identified 16 genes related to vitamin B12 synthesis and four LuxR gene families (Bondarev et al., 2013).
Data availability
This whole-genome shotgun project has been deposited in GenBank under the accession number QRCS00000000. The version described in this paper is the first version.
Acknowledgments
The authors thank CNPq, CAPES (Brasil - Finance code 001), and FAPERJ for their financial support. This paper is part of the D.Sc. requirements of Gustavo Pitta Reis de Azevedo (Genetics Graduate Program at the Federal University of Rio de Janeiro). LACT thanks FAPESP.
Footnotes
Associate Editor: Ana Tereza R. Vasconcelos
Conflict of Interest
The authors declare no conflicts of interests related to this work.
Author Contributions
GPRA and PHCP carried out the genomic and computational analysis, analyzed the results and wrote the manuscript. HKM, APBM, LV, BSS and LACT, GDG helped analyze the results, participated in the discussion and in the draft of the manuscript. LL, GC and LA carried out the genome sequencing and helped analyzing the results. DAT helped with genomic and computational analysis and drafted the manuscript. CCT and FLT conceived the study, analysed the data and wrote the manuscript. All authors read and approved the final manuscript.
References
- Bondarev V, Richter M, Romano S, Piel J, Schwedt A, Schulz-Vogt HN. The genus Pseudovibrio contains metabolically versatile bacteria adapted for symbiosis. Environ Microbiol. 2013;15:2095–2113. doi: 10.1111/1462-2920.12123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coil D, Jospin G, Darling AE. A5-miseq: An updated pipeline to assemble microbial genomes from Illumina MiSeq data. Bioinformatics. 2015;31:587–589. doi: 10.1093/bioinformatics/btu661. [DOI] [PubMed] [Google Scholar]
- Grissa I, Vergnaud G, Pourcel C. CRISPRFinder: A Web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 2007;35:W52–W57. doi: 10.1093/nar/gkm360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang X, Madan A. CAP3: A DNA sequence assembly program. Genome Res. 1999;9:868–877. doi: 10.1101/gr.9.9.868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacDonell MT, Colwell RR. Phylogeny of the Vibrionaceae, and recommendation for two new genera, Listonella and Shewanella . Syst Appl Microbiol. 1985;6:171–182. [Google Scholar]
- Magarlamov TY, Melnikova DI, Chernyshev AV. Tetrodotoxin-producing bacteria: Detection, distribution and migration of the toxin in aquatic systems. Toxins (Basel) 2017;9:166. doi: 10.3390/toxins9050166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matsui T, Taketsugu S, Kodama K, Shiba-Ishii A, Yamamori K, Shimizu C, Shira-Ishi A. Production of tetrodotoxin by the intestinal bacteria of a puffer fish Takifugu niphobles . Nippon Suisan Gakkaishi. 1989;55:2199–2203. [Google Scholar]
- Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, Edwards RA, Gerdes S, Parrello B, Shukla M, et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST) Nucleic Acids Res. 2014;42:D206–D214. doi: 10.1093/nar/gkt1226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pitcher DG, Saunders NA, Owen RJ. Rapid extraction of bacterial genomic DNA with guanidium thiocyanate. Lett Appl Microbiol. 1989;8:151–156. [Google Scholar]
- Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27:863–864. doi: 10.1093/bioinformatics/btr026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shnit-Orland M, Sivan A, Kushmaro A. Shewanella corallii sp. nov., a marine bacterium isolated from a Red Sea coral. Int J Syst Evol Microbiol. 2010;60:2293–2297. doi: 10.1099/ijs.0.015768-0. [DOI] [PubMed] [Google Scholar]
- Simidu U, Kita-Tsukamoto K, Yasumoto T, Yotsu M. Taxonomy of four marine bacterial strains that produce tetrodotoxin. Int J Syst Evol Microbiol. 1990;40:331–336. doi: 10.1099/00207713-40-4-331. [DOI] [PubMed] [Google Scholar]
- Sugimoto Y, Maruyama F, Suzuki S. Draft genome sequence of a Shewanella halifaxensis strain isolated from the intestine of marine Red Seabream (Pagrus major), which includes an integrative conjugative element with macrolide resistance genes. Genome Announc. 2018;6:e00297–18. doi: 10.1128/genomeA.00297-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walter JM, Tschoeke DA, Meirelles PM, de Oliveira L, Leomil L, Tenório M, Valle R, Salomon PS, Thompson CC, Thompson FL. Taxonomic and functional metagenomic signature of turfs in the Abrolhos reef system (Brazil) PLoS One. 2016;11:e0161168. doi: 10.1371/journal.pone.0161168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weber T, Blin K, Duddela S, Krug D, Kim HU, Bruccoleri R, Lee SY, Fischbach MA, Müller R, Wohlleben W, et al. AntiSMASH 3.0 — a comprehensive resource for the genome mining of biosynthetic gene clusters. Nucleic Acids Res. 2015;43:W237–W243. doi: 10.1093/nar/gkv437. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
This whole-genome shotgun project has been deposited in GenBank under the accession number QRCS00000000. The version described in this paper is the first version.
