Skip to main content
Genomics Data logoLink to Genomics Data
. 2016 Aug 11;9:143–144. doi: 10.1016/j.gdata.2016.08.008

Draft genome sequence of pathogenic bacteria Vibrio parahaemolyticus strain Ba94C2, associated with acute hepatopancreatic necrosis disease isolate from South America

Leda Restrepo a,, Bonny Bayot a, Irma Betancourt a, Andres Pinzón b
PMCID: PMC4992007  PMID: 27570736

Abstract

Vibrio parahaemolyticus is a pathogenic bacteria which has been associated to the early mortality syndrome (EMS) also known as hepatopancreatic necrosis disease (AHPND) causing high mortality in shrimp farms. Pathogenic strains contain two homologous genes related to insecticidal toxin genes, PirA and PirB, these toxin genes are located on a plasmid contained within the bacteria. Genomic sequences have allowed the finding of two strains with a divergent structure related to the geographic region from where they were found. The isolates from the geographic collection of Southeast Asia and Mexico show variable regions on the plasmid genome, indicating that even though they are not alike they still conserve the toxin genes. In this paper, we report for the first time, a pathogenic V. parahaemolyticus strain in shrimp from South America that showed symptoms of AHPND. The genomic analysis revealed that this strain of V. parahaemolyticus found in South America appears to be more related to the Southeast Asia as compared to the Mexican strains. This finding is of major importance for the shrimp industry, especially in regards to the urgent need for disease control strategies to avoid large EMS outbreaks and economic loss, and to determine its dispersion in South America. The whole-genome shotgun project of V. parahaemolyticus strain Ba94C2 have been deposited at DDBJ/EMBL/GenBank under the accession PRJNA335761.

Keywords: Acute hepatopancreatic necrosis disease, Draft genome, Geographical variation, Shrimp, Vibrio parahaemolyticus


Specifications
Organism Vibrio parahaemolyticus
Strain Ba94C2
Sequencer Illumina MiSeq
Data format assembled
Experimental factors microbial strain originally isolated from necrotizing lesions
on the hepatopancreas of shrimps
Experimental features whole genome shotgun sequencing followed by genome
assembly and gene description
Consent N/A
Sample source location South America

1. Direct link to deposited data

Deposited data can be found here: http://www.ncbi.nlm.nih.gov/bioproject/335761.

2. Experimental design, materials and methods

Vibrio parahaemolyticus is a member of the family Vibrionaceae and can be found in diverse marine habitats, which is typically known to cause food-borne gastroenteritis in humans. The pathogen causing EMS or AHPND has been identified as a V. parahaemolyticus strain by Tran et al. [1]. One bacterial strain was isolated from shrimp hepatopancreas obtained in cultured Penaeus (Litopenaeus) vannamei. The bacterial DNA sample was prepared according to the phenol-chloroform method of Sambrook and Russell [2]. The genome of V. parahaemolyticus strain was generated using an Illumina TruSeq DNA PCR-Free Library (Illumina Cambridge Ltd., UK) was performed on the Illumina MiSeq PE300 platform. We obtained 1,235,827 reads, for a total of 280.9 Mbp. After trimming and quality analysis, one hundred and fifteen contigs were de novo assembled using SPAdes [3], for a genome size of 2.07 Mbp (46.0 coverage average). We also assembled the sequenced genome by reference, by means of Bowtie using Vibrio parahaemolyticus RIMD 2210633 genome as template (5.16 Mbp size, with a GC content of 45.40%). Gene prediction was performed by GenePRIMP [4]. Structural and functional annotation was performed over coding sequences using RAST server [5]. Briefly, sequences were annotated using BLAST against non-redundant protein databases from NCBI (E-value ≤ 1e − 7 and minimal alignment length percentage ≥ 50%), Swiss-Prot [6], Clusters of Orthologous Groups (COG), Kyoto Encyclopedia of Genes and Genomes (KEGG) [7], [8] and Gene Ontology (GO) [9].

The V. parahaemolyticus strain Ba94C2 de novo final assembly consisted of 40 scaffolds, 2.07 Mbp in size, with 113 untranslated RNA sequences (Table 1) and 1932 predicted coding regions, represented by two chromosomes (Ch) of approximately 3.28 Mbp (Ch I) and 1.877 Mbp (Ch II). We also found one large extrachromosomal plasmid, previously reported in all AHPND strains [10], [11], [12], [13]. The contig number 16 (69.9 kbp length with a 66.9 coverage average) aligned significantly to V. parahaemolyticus strain 13–028/A3 plasmid from Vietnam [14]. The putative virulence genes hereby annotated were also identified and used to diagnose AHPND by PCR.

Table 1.

Genome characteristics of V. parahaemolyticus strain Ba94C2 draft genome.

Attributes Values
Assembly size (bp) 2,070,435
Total number of contigs 115
Contig N50 (bp) 34,478
Total number of scaffolds 40
Scaffold N50 (bp) 294,884
L50 3
GC content % 45.7
CDS 1932
rRNAs 12
tRNAs 101

A homologue of the insecticidal Photorhabdus insect-related binary toxin PirA/B was also identified in this study. This one is flanked by an arrangement of SWAT-3 type transposases (VSWAT3 like transposases). This might suggest that the infection cluster could be a mobile element, which implies that it is interchangeable between different Vibrio populations. Nevertheless, in this region, we were not able to identify either resistance genes or secretion systems pivotal for the infection process. Therefore, this virulence cluster should be further tested to evaluate if it can only be transmitted between populations where organisms have a pre-adaptation to the acquisition of insecticidal production genes.

Most coding gene functions were predicted by the functional categorization of COGs and GO. Gene functional distribution showed three pathogenicity mechanism systems of invasion and intracellular resistance involved in the protein synthesis (SSU ribosomal proteins): SSU ribosomal protein S2p (SAe), SSU ribosomal protein S15p (S13e), SSU ribosomal protein S16p, SSU ribosomal protein S18p (zinc-independent), SSU ribosomal protein S6p, SSU ribosomal protein S7p (S5e), SSU ribosomal protein S12p (S23e) and translation elongation factor G. We also found at least twelve systems that contribute to the resistance of antibiotics and toxic compounds, related to copper homeostasis and tolerance, and multidrug resistance efflux pumps. Likewise, we found sections with this kind of system also described in previous studies for V. cholerae, which are: tetracycline resistance and ribosome protection type (translation elongation factor G paralog), cobalt-zinc-cadmium resistance protein, CopG protein, copper-translocating and copper homeostasis protein CutE, magnesium and cobalt efflux protein CorC, acriflavin resistance protein, multi antimicrobial extrusion protein (Na (+)/drug antiporter) and MATE family of MDR efflux pumps. We were also able to annotate at least three secretion systems (T1SS, T2SS and T4SS).

In conclusion, in this study, we sequenced a V. parahaemolyticus strain isolated from a shrimp pond in South America; this being a pathogenic bacteria related to AHPND. This bacterial strain exhibited the putative toxin genes (PirA/B) as well as a plasmid sequence similar to the one previously reported in Southeast Asia isolates, but less related to the strain from Mexico. The results of this study warrant a more profound analysis of this emergent disease and the implementation of disease control strategies for cultured shrimp to avoid future disease outbreaks. This may also contribute to progress fundamental knowledge on pathogens to develop new disease control strategies.

3. Nucleotide sequence accession number

The whole-genome shotgun project of V. parahaemolyticus Ba94C2 have been deposited at DDBJ/EMBL/GenBank under the accession PRJNA335761.

Acknowledgments

This work was funded by the Secretaria de Educación Superior, Ciencia, Tecnología e Innovación (SENESCYT) in the framework of the PIC-14-CENAIM-003 project Desarrollo e implementación de métodos de control y prevención de enfermedades en especies acuáticas de uso comercial y uso potencial en maricultura y repoblación. We thank Leandro Bajaña for his technical help in this study.

References

  • 1.Tran L., Nunan L., Redman R.M., Mohney L.L., Pantoja C.R., Fitzsimmons K., Lightner D.V. Determination of the infectious nature of the agent of acute hepatopancreatic necrosis syndrome affecting penaeid shrimp. Dis. Aquat. Org. 2013;105(1):45–55. doi: 10.3354/dao02621. [DOI] [PubMed] [Google Scholar]
  • 2.Sambrook J., Russell D.W. Purification of nucleic acids by extraction with phenol: chloroform. Cold Spring Harb. Protoc. 2006;1 doi: 10.1101/pdb.prot4455. (pdb-prot4455) [DOI] [PubMed] [Google Scholar]
  • 3.Bankevich A., Nurk S., Antipov D., Gurevich A.A., Dvorkin M., Kulikov A.S. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 2012;19(5):455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Pati A., Ivanova N.N., Mikhailova N., Ovchinnikova G., Hooper S.D., Lykidis A., Kyrpides N.C. GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes. Nat. Methods. 2010;7(6):455–457. doi: 10.1038/nmeth.1457. [DOI] [PubMed] [Google Scholar]
  • 5.Aziz R.K., Bartels D., Best A.A., DeJongh M., Disz T., Edwards R.A. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008;9(1):1. doi: 10.1186/1471-2164-9-75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Boeckmann B., Bairoch A., Apweiler R., Blatter M.C., Estreicher A., Gasteiger E. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003;31(1):365–370. doi: 10.1093/nar/gkg095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Tatusov R.L., Fedorova N.D., Jackson J.D., Jacobs A.R., Kiryutin B., Koonin E.V. The COG database: an updated version includes eukaryotes. BMC Bioinf. 2003;4(1):1. doi: 10.1186/1471-2105-4-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Kanehisa M., Goto S., Kawashima S., Okuno Y., Hattori M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004;32(Suppl. 1):D277–D280. doi: 10.1093/nar/gkh063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M. Gene Ontology: tool for the unification of biology. Nat. Genet. 2000;25(1):25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kondo H., Tinwongger S., Proespraiwong P., Mavichak R., Unajak S., Nozaki R., Hirono I. Draft genome sequences of six strains of Vibrio parahaemolyticus isolated from early mortality syndrome/acute hepatopancreatic necrosis disease shrimp in Thailand. Genome Announc. 2014;2(2) doi: 10.1128/genomeA.00221-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Gomez‐Gil B., Soto‐Rodríguez S., Lozano R., Betancourt‐Lozano M. Draft genome sequence of Vibrio parahaemolyticus strain M0605, which causes severe mortalities of shrimps in Mexico. Genome Announc. 2014;2(2) doi: 10.1128/genomeA.00055-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Yang Y.T., Chen I.T., Lee C.T., Chen C.Y., Lin S.S., Hor L.I. Draft genome sequences of four strains of Vibrio parahaemolyticus, three of which cause early mortality syndrome/acute hepatopancreatic necrosis disease in shrimp in China and Thailand. Genome Announc. 2014;2(5) doi: 10.1128/genomeA.00816-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Han J.E., Tang K.F., Tran L.H., Lightner D.V. Photorhabdus insect‐related (Pir) toxin‐like genes in a plasmid of Vibrio parahaemolyticus, the causative agent of acute hepatopancreatic necrosis disease (AHPND) of shrimp. Dis. Aquat. Org. 2015;113(1):33. doi: 10.3354/dao02830. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Yang Y.T., Chen I.T., Lee C.T., Chen C.Y., Lin S.S., Hor L.I. Draft genome sequences of four strains of Vibrio parahaemolyticus, three of which cause early mortality syndrome/acute hepatopancreatic necrosis disease in shrimp in China and Thailand. Genome Announc. 2014;2(5) doi: 10.1128/genomeA.00816-14. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genomics Data are provided here courtesy of Elsevier

RESOURCES