Skip to main content
Data in Brief logoLink to Data in Brief
. 2024 Jun 3;55:110588. doi: 10.1016/j.dib.2024.110588

Nuclear and mitochondrial genome datasets for spiny lobsters genus Panulirus (Decapoda: Achelata: Palinuridae)

J Antonio Baeza a,b,c,, Alyssa Baker a, Michael Childress c, Stacy Pirro d
PMCID: PMC11225021  PMID: 38974010

Abstract

Spiny lobsters (Decapoda: Palinuridae) in the genus Panulirus are targets of lucrative fisheries globally and have relevant ecological functions in tropical and subtropical environments. Only a few, but increasing, number of genetic and genomic resources exist for them. Nuclear and mitochondrial genome assemblies can provide insights into their phylogenetic relationships and support fishery management strategies in species that are heavily exploited. Herein, using Illumina short reads whole genome sequencing, we assembled the nuclear and mitochondrial genomes of a total of 14 species. Genomic DNA was extracted from specimens deposited at Clemson University Crustacean Collection and sequenced in a HiSeq X Ten system. The number of paired-end (PE) reads generated for the different studied species varied between 219,917,346 in P. argus and 70,215,423 in P. cygnus. Nuclear and mitochondrial genomes were ‘de novo’ assembled. Nuclear genomes ranged between 1,624,400,357 bp in P. guttatus and 935,571,898 bp in P. cygnus with scaffold numbers varying between 466,583 in P. versicolor and 852,228 in P. longipes. Mitochondrial genomes varied between 15,613 bp and 15,768 bp in P. pascuensis and P. versicolor, respectively. The totality of the short reads, nuclear, and mitochondrial genome assemblies are available at NCBI's GenBank.

Keywords: Mitogenome, Assembly, Crustacea, Arthropoda


Specifications Table

Subject Crustacean Science
Specific subject area Genomics
Data format Raw and Analyzed
Type of data Table
Data collection Genomic DNA was extracted from 14 spiny and slipper lobster muscle tissue samples using the DNeasy Blood and Tissue Kit. Libraries were prepared using the Illumina TruSeq kit (Illumina, San Diego, CA, USA). Next generation sequencing was performed using Illumina HiSeq X Ten. Illumina reads were assembled using SPAdes v2.5. Sequence contamination was removed using FSCR v3.0 and GX. Contigs were integrated using Zanfona v1.0. Mitochondrial genomes were de novo assembled using GetOrgnanelle v1.6.4 using as a 'seed' the mitochondrial genome of P. argus (MH068821).
Data source location Institution: Clemson University Crustacean Collection
City: Clemson
State: South Carolina
Country: United States
Data accessibility Repository name: NCBI GenBank
Data identification number:
Bioproject IDs: PRJNA1014903, PRJNA996201–PRJNA996212, PRJNA996222
BioSample IDs: SAMN36530433–SAMN36530444, SAMN36531993, SAMN37343104
SRA Accession Numbers: SRR13036344, SRR25338398, SRR25338439, SRR25338495, SRR25338518, SRR25338519, SRR25338784, SRR25338798, SRR25340716, SRR25340736, SRR25340737, SRR25341181, SRR25341344, SRR25341383
Mitogenome Accessions: NC_039671, OR612305–OR612317
Nuclear Assemblies: ASM3808873, ASM3236138, ASM3236140, ASM3236144, ASM3236148, ASM3236176, ASM3236186, ASM3236188, ASM3236170, ASM3227352, ASM3227360, ASM3227372. ASM3227384, ASM3478053

1. Value of the Data

  • These nuclear and mitochondrial genome sequences will aid in evaluating phylogenetic relationships among species in the genus, understanding diversification processes, and exploring the historical biogeography of spiny lobsters.

  • The dataset will be useful for bioprospecting and marine biodiversity monitoring using non-intrusive environmental DNA (eDNA) approaches.

  • The generated dataset can be used as a base to detect misidentification/mislabeling of spiny lobsters in the marketplace and in monitoring and enforcing fisheries and trade management restrictions.

2. Background

The infraorder Achelata consists of the exclusively marine spiny (fam. Palinuridae) and slipper (fam. Scyllaridae) lobsters, which are characterized by enlarged antennae, a long-lived phyllosoma larval stage, and the absence of chelae [1,2]. Panulirus is the most specious genus of spiny lobster and contains 20 species and 5 subspecies that are targeted by lucrative tropical and subtropical fisheries worldwide [2]. The aim of this study is to assemble nuclear and mitochondrial genomes in spiny lobsters belonging to the genus Panulirus using a Illumina short read whole genome sequencing (WGS) strategy. The genomic datasets produced by this study represent new resources to aid with conservation and management via application in eDNA sampling and in species identification via barcoding and have further use in evaluating the adaptive evolution of protein-coding genes.

3. Data Description

Illumina short read WGS was conducted for a total of 13 species belonging to the spiny lobster genus Panulirus and the slipper lobster Scyllarides nodifer to assemble nuclear and mitochondrial genomes. Nuclear genomes were assembled using SPAdes v2.5, FSCR v3.0, and Zanfona v1.0 and are available in the National Center for Biotechnology Information (NCBI) Genbank (Table 1). Draft nuclear genomes ranged from 935.6 Mb in P. cygnus to 1.6 Gb in P. guttatus and consisted of 618,989 to 989,875 contigs in P. laevicauda and P. pascuensis, respectively. Scaffold numbers varied between 466,583 in P. versicolor and 852,228 in P. longipes. Mitogenome sequences were assembled via the GetOrganelle v1.6.4 pipeline and are available at NCBI GenBank accession numbers NC_039671 and OR612306–OR612317. Mitogenome lengths varied between 15,613 bp and 15,768 bp in P. pascuensis and P. versicolor, respectively. Corresponding raw reads are available as SRA datasets in NCBI GenBank under BioProjects PRJNA996201-PRJNA996212 and PRJNA1014903. Nuclear, mitochondrial, and raw read assembly details are summarized in Table 1.

Table 1.

Raw reads, nuclear, and mitochondrial assemblies for spiny lobsters Panulirus spp. and the slipper lobster Scyllarides nodifer.

Raw Reads
Mitochondrial Genome
Nuclear Genome
Species Name SRA Spots (bp) Bases (G) GC (%) Genbank Accession Number Length (bp) Mitogenome Coverage Genbank Accession Code Assembly Length (bp) Assembly Coverage Contigs (n) Scaffolds (n)
Panulirus argus SRR13036344 219,917,346 66 44.9 NC_039671 15,739 70x ASM3808873 1812,672,735 90x 434,038 424,852
Panulirus cygnus SRR25340737 70,215,423 21.1 44.4 OR612313 15,731 30x ASM3236148 935,571,898 50x 926,678 844,337
Panulirus gracilis SRR25338518 76,845,122 23.1 43.4 OR612307 15,745 30x ASM3236144 1279,827,122 50x 751,506 679,949
Panulirus guttatus SRR25338519 91,313,030 27.4 45.2 OR612312 15,702 30x ASM3236138 1624,400,357 50x 746,306 657,369
Panulirus homarus SRR25340716 82,043,440 24.6 43.3 OR612305 15,665 30x ASM3236140 1313,005,488 50x 704,650 618,289
Panulirus inflatus SRR25341181 78,968,575 23.7 43.0 OR612314 15,670 30x ASM3236176 1321,811,166 50x 705,867 632,377
Panulirus interruptus SRR25338398 83,625,231 25.1 44.8 OR612311 15,657 30x ASM3227372 1407,497,722 50x 859,704 748,487
Panulirus laevicauda SRR25338798 82,408,003 24.7 43.2 OR612306 15,675 30x ASM3227360 1371,664,687 50x 618,989 552,489
Panulirus longipes SRR25338439 80,710,679 24.2 43.8 OR612309 15,706 30x ASM3227384 1243,798,248 50x 972,289 852,228
Panulirus marginatus SRR25341383 86,771,275 26.0 43.9 OR612310 15,725 30x ASM3236188 1276,116,527 50x 959,315 842,519
Panulirus ornatus SRR25341344 82,272,566 24.7 43.9 OR612315 15,677 30x ASM3227352 1364,558,317 50x 739,006 666,649
Panulirus pascuensis SRR25340736 74,534,229 22.4 44.3 OR612316 15,613 30x ASM3236186 1147,801,627 50x 989,875 848,215
Panulirus versicolor SRR25338784 89,867,779 27.0 43.4 OR612308 15,768 30x ASM3236170 1465,571,352 50x 518,312 466,583
Scyllarides nodifer SRR25338495 81,609,917 24.5 44.0 OR612317 15,648 30x ASM3478053 1014,165,050 50x 888,648 801,177

4. Experimental Design, Materials and Methods

4.1. Specimens, DNA extraction, and sequencing

All specimens used for sequencing belonging to the different species of Panulirus were available at the Clemson University Crustacean Collection (Table 1). Small tissue samples (approx. 5 mm3) were dissected from pereiopods and immediately stored separately in sterile centrifuge tubes containing ethyl alcohol (95 %) that were shipped to Iridian Genomes, Inc. (Bethesda, MD) for genomic DNA (gDNA) extraction and next generation sequencing (NGS). gDNA was extracted from each tissue sample with the DNeasy Blood and Tissue Kit (Qiagen, Germany) using the manufacturer's protocol. Library preparation for each sample was performed using the Illumina TruSeq kit following the manufacturer's instructions. NGS was performed in a Illumina HiSeq X Ten system (Illumina, San Diego, CA, USA) using a 2 × 150 cycle. Between 219,917,346 and 70,215,423 pairs (PE) of reads were produced, respectively, for P. argus in P. cygnus by Iridian Genomes and are available in the short read archive (SRA) repository (Bioprojects: PRJNA1014903, PRJNA996201–PRJNA996212, PRJNA996222; BioSamples: SAMN36530433–SAMN36530444, SAMN36531993, SAMN37343104; SRA accession number: see Table 1) at NCBI's GenBank.

4.2. Nuclear genome assembly

The totality of the reads available for each specimen were used for nuclear genome assembly. First, the raw reads were trimmed of adapter sequences and low-quality regions with Trimmomatic v0.33 [3]. Next, trimmed sequences were assembled using SPAdes v2.5 [4]. Then, we used NCBI's sequence contamination screening pipeline FSCR v3.0 and GX (https://github.com/ncbi/fcs/wiki/FCS-GX) [5] for identifying and removing technical and biological contaminants from the newly assembled genomes. Lastly, we applied a finishing step to the assembly with the pipeline Zanfona v1.0 [6] to make additional contig joins based on conserved regions in related species.

4.3. Mitochondrial genome assembly

The mitochondrial genomes of each of the studied species were ‘de novo’ assembled with the program GetOrganelle v1.6.4 [7]. The complete mitochondrial genome of the congeneric Caribbean spiny lobster Panulirus argus (GenBank's accession number MH068821- [8]) was used as a ‘seed’ for each of the assemblies. All assemblies were run using k-mer sizes of 21, 55, 85, and 115.

Limitations

Non Applicable

Ethics Statement

The authors declare that this work follows the ethical requirements for publication in Data in Brief and does not involve human subjects or animal experiments that require ethical approval.

CRediT Author Statement

J. A. Baeza: Conceptualization, Resources, Methodology, Formal analysis, Data curation, Visualization, Writing – original draft. A. Baker: Methodology, Formal analysis, Data curation, Visualization, Writing – review & editing. S. Pirro: Resources, Methodology, Formal analysis, Data curation, Visualization, Writing – review & editing. M. Childress: Resources, Writing – review & editing.

Acknowledgments

J.A. Baeza thanks Dr. Vincent. P. Richards for bioinformatics support. The great majority of the specimens available in the collection were donated by Dr. R. W. George to Dr. M. Childress.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data Availability

References

  • 1.Holthuis L.B. XVI. – On a small collection of isopod crustacea from the greenhouses of the Royal Botanic Gardens, Kew. Ann. Mag. Nat. Hist. 1946;98:122–137. doi: 10.1080/00222934608654533. [DOI] [Google Scholar]
  • 2.Holthuis L.B. FAO species catalogue. Vol. 13. Marine lobsters of the world. An annotated and illustrated catalogue of species of interest to fisheries known to date. FAO Fish. Synop. 1991;125:13. [Google Scholar]
  • 3.Bolger A.M., Lohse M., Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Bankevich A., Nurk S., Antipov D., Gurevich A.A., Dvorkin M., Kulikov A.S., Pevzner P.A. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Astashyn A., Tvedte E.S., Sweeney D., Sapojnikov V., Bouk N., Joukov V., Mozes E., Strope P.K., Sylla P.M., Wagner L., Bidwell S.L. Rapid and sensitive detection of genome contamination at scale with FCS-GX. bioRxiv. 2023 doi: 10.1101/2023.06.02.543519. 2023-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.M. Kieras, Zanfona, a genome finishing process for use with paired-end short reads (2021) https://github.com/zanfona734/zanfona.
  • 7.Jin J.J., Yu W.B., Yang J.B., Song Y., DePamphilis C.W., Yi T.S., Li D.Z. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 2020;21:241. doi: 10.1186/s13059-020-02154-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Baeza J.A. The complete mitochondrial genome of the Caribbean spiny lobster Panulirus argus. Nat., Sci. Rep. 2018;8:17690. doi: 10.1038/s41598-018-36132-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement


Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES