Skip to main content
Data in Brief logoLink to Data in Brief
. 2023 Jul 21;49:109432. doi: 10.1016/j.dib.2023.109432

Mitochondrial genome datasets for the sweetpotato weevil, Cylas formicarius elegantulus (Coleoptera: Brentidae), collected in the United States

Sharon A Andreason a,, Zachary Lahey a, Dongyan Zhao b, Katherine Mejia-Guerra b, Livy H Williams a, Moira Sheehan b, Alvin M Simmons a, Phillip A Wadl a
PMCID: PMC10403706  PMID: 37547168

Abstract

The sweetpotato weevil, Cylas formicarius elegantulus (Summers) (Coleoptera: Brentidae), is one of the most destructive pests of sweetpotato worldwide. Genomic analyses of sweetpotato weevils can provide insights into their genetic diversity, population structure, and dispersal as well as provide information to support management strategies. Adult sweetpotato weevils were collected by various methods from Ipomoea batatas L. (sweetpotato) or I. coccinea L. (red morning glory) in the U.S. states of Georgia, Hawaii, South Carolina, and Texas. Genomic DNA was extracted from individual weevil specimens and sequenced using Illumina NovaSeq. A total of 181 GB of 150 base pair (bp) paired-end reads were generated for 40 specimens. Mitochondrial genomes were assembled for each specimen via reference mapping and annotated using Geneious Prime. Full mitochondrial genome sequences range from 17,141 to 17,152 bp with an average GC content of 21.8% and average coverage of 3307 × . A maximum likelihood phylogenetic analysis considering the mitochondrial protein coding genes is provided. Mitochondrial genomes and assembled reads are deposited in NCBI GenBank, providing 40 mitogenomes of C. formicarius elegantulus collected in the U.S.

Keywords: Mitogenome, mtDNA, Assembly, Annotation, Coleoptera, Brentidae


Specifications Table

Subject Biological Sciences
Specific subject area Entomology and Insect Science
Type of data Raw sequencing reads, Assemblies, Annotations, Tables, Figures
How the data were acquired The data were acquired from whole-body insect specimen genomic DNA extraction followed by whole-genome skim sequencing using Illumina NovaSeq 6000. Insect mitochondrial genome data and filtered SRA datasets were obtained by mapping paired-end reads to a reference genome available in GenBank using Geneious Prime.
Data format Raw, filtered, and analyzed
Description of data collection Sweetpotato weevil specimens were collected from Ipomoea batatas or I. coccinea by hand or pheromone-baited pitfall trap collection. Genomic DNA was extracted from individual whole-body adult insect specimens. DNA purity and concentration were measured before sequencing. Sequences were obtained by the Illumina NovaSeq 6000 followed by reference-guided assembly using Geneious Prime.
Data source location Sweetpotato weevil specimens were collected in sweetpotato fields or greenhouses in Georgia (Tifton); Hawaii (Akaka Falls and Onomea); South Carolina (Charleston), and Texas (Cherokee County), USA.
Data accessibility Raw sequence data, assembled mitochondrial genomes, and genome annotations are available in NCBI GenBank.
Repository name: NCBI GenBank
Data identification number: BioProject PRJNA945076; Accessions OQ763214–OQ763253
Direct URL to data: Cylas formicarius (ID 945076) – BioProject – NCBI (nih.gov)

Value of the Data

  • These data are useful for analysis of intraspecific divergence within the mitochondrial genomes of sweetpotato weevils.

  • These mitochondrial genome sequences will be a useful resource for entomologists and pest management professionals seeking to analyze and determine the genetic differences among sweetpotato weevil populations.

  • The data can be used to identify SNPs and other genetic markers to discriminate weevil populations, to study phylogenetic relationships among weevil populations, to determine region of origin, and to develop sequence-based weevil identification tools.

1. Objective

Adult Cylas formicarius elegantulus were collected for genome sequencing and analysis of genetic diversity and population structure of weevils captured across a wide swath of the U.S. sweetpotato production areas. Mitochondrial genomes were assembled for the purpose of providing the first mitochondrial genomes for sweetpotato weevils collected in the U.S., for determining intraspecific sequence divergence within mitochondrial genes, and for inferring phylogenetic relationships among sweetpotato weevil populations.

2. Data Description

Skim sequencing was performed on C. formicarius elegantulus (Fig. 1) genomic DNA to generate genomic sequences for population genetics studies. Weevil specimen and collection details are summarized in Table 1. Mitochondrial genome sequences were assembled in Geneious Prime and are available in NCBI Genbank at Accessions OQ763214–OQ763253. Assembly results are summarized in Table 2. Corresponding mapped reads are available as SRA datasets under BioProject PRJNA945076: Sweetpotato weevil genomics [1]. Mitogenomes were annotated using Geneious Prime. A maximum likelihood (ML) phylogenetic analysis was performed on the mitochondrial protein coding genes (Fig. 2).

Fig. 1.

Fig 1

Adult Cylas formicarius elegantulus. Photo credit: Mike Quinn, TexasEnto.net.

Table 1.

Cylas formicarius elegantulus specimens analyzed in this study.

Specimen Location Collection Host Sex
GAB1 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GABF03 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GABF4 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GACC05 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GACC13 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GACC16 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GACC17 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GACC3 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GACC4 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GACC7 Tifton, GA Pheromone baited pitfall trap Ipomoea batatas Male
GAGS1 Tifton, GA Pheromone baited pitfall trap Ipomoea coccinea Male
GAGS3 Tifton, GA Pheromone baited pitfall trap Ipomoea coccinea Male
HI_Ref Hawaii AgPest 100 reference specimen Ipomoea batatas Female
HIF01 Akaka Falls, HI Hand-catch (field) Ipomoea batatas Female
HIF02 Akaka Falls, HI Hand-catch (field) Ipomoea batatas Female
HIF04 Akaka Falls, HI Hand-catch (field) Ipomoea batatas Female
HIF05 Onomea, HI Hand-catch (field) Ipomoea batatas Female
HIF06 Onomea, HI Hand-catch (field) Ipomoea batatas Female
HIF07 Onomea, HI Hand-catch (field) Ipomoea batatas Female
HIF08 Onomea, HI Hand-catch (field) Ipomoea batatas Female
HIM02 Akaka Falls, HI Hand-catch (field) Ipomoea batatas Male
HIM03 Akaka Falls, HI Hand-catch (field) Ipomoea batatas Male
HIM4 Akaka Falls, HI Hand-catch (field) Ipomoea batatas Male
HIM5 Onomea, HI Hand-catch (field) Ipomoea batatas Male
HIM08 Onomea, HI Hand-catch (field) Ipomoea batatas Male
SCF13 Charleston, SC Hand-catch (colony) Ipomoea batatas Female
SCF14 Charleston, SC Hand-catch (colony) Ipomoea batatas Female
SCF06 Charleston, SC Hand-catch (colony) Ipomoea batatas Female
SCF07 Charleston, SC Hand-catch (colony) Ipomoea batatas Female
SC0F9 Charleston, SC Hand-catch (colony) Ipomoea batatas Female
SCM10 Charleston, SC Hand-catch (colony) Ipomoea batatas Male
SCM11 Charleston, SC Hand-catch (colony) Ipomoea batatas Male
SCM14 Charleston, SC Hand-catch (colony) Ipomoea batatas Male
SCM15 Charleston, SC Hand-catch (colony) Ipomoea batatas Male
SCM02 Charleston, SC Hand-catch (colony) Ipomoea batatas Male
SCM03 Charleston, SC Hand-catch (colony) Ipomoea batatas Male
SCM05 Charleston, SC Hand-catch (colony) Ipomoea batatas Male
SCM06 Charleston, SC Hand-catch (colony) Ipomoea batatas Male
SCM09 Charleston, SC Hand-catch (colony) Ipomoea batatas Male
TX1 Cherokee County, TX Hand-catch (greenhouse) Ipomoea batatas Female

Table 2.

Cylas formicarius elegantulus mitochondrial genome assemblies.

Specimen Mitogenome Accession # Sequence Length (bp) SRA Accession # # Mapped Reads Average Coverage % GC
GAB1 OQ763214 17,149 SRX19730765 90,031 787 21.8
GABF03 OQ763215 17,152 SRX19740992 102,315 895 21.8
GABF4 OQ763216 17,147 SRX19740993 97,074 849 21.8
GACC05 OQ763219 17,146 SRX19741004 769,202 6729 21.8
GACC13 OQ763221 17,147 SRX19741015 605,513 5297 21.8
GACC16 OQ763222 17,147 SRX19741025 532,052 4654 21.8
GACC17 OQ763223 17,147 SRX19741026 715,022 6255 21.8
GACC3 OQ763217 17,146 SRX19741027 89,879 786 21.8
GACC4 OQ763218 17,146 SRX19741028 180,156 1576 21.9
GACC7 OQ763220 17,146 SRX19741029 703,276 6153 21.8
GAGS1 OQ763224 17,148 SRX19741030 539,348 4718 21.9
GAGS3 OQ763225 17,148 SRX19740994 417,995 3656 21.9
HI_Ref OQ763238 17,145 SRX19740995 552,278 4832 21.9
HIF01 OQ763226 17,146 SRX19740996 262,776 2299 21.9
HIF02 OQ763227 17,145 SRX19740997 359,207 3143 21.9
HIF04 OQ763228 17,144 SRX19740998 254,933 2231 21.9
HIF05 OQ763229 17,141 SRX19740999 339,006 2967 21.9
HIF06 OQ763230 17,144 SRX19741000 288,496 2524 21.9
HIF07 OQ763231 17,146 SRX19741001 412,562 3609 21.9
HIF08 OQ763232 17,146 SRX19741002 354,908 3105 21.9
HIM02 OQ763233 17,146 SRX19741003 544,098 4760 21.9
HIM03 OQ763234 17,147 SRX19741005 694,717 6077 21.9
HIM4 OQ763235 17,146 SRX19741006 379,117 3317 21.9
HIM5 OQ763236 17,149 SRX19741007 694,703 6076 21.9
HIM08 OQ763237 17,146 SRX19741008 348,555 3049 21.9
SCF13 OQ763242 17,148 SRX19741009 231,530 2025 21.8
SCF14 OQ763243 17,149 SRX19741010 166,687 1458 21.8
SCF06 OQ763239 17,148 SRX19741011 193,265 1691 21.8
SCF07 OQ763240 17,144 SRX19741012 224,505 1964 21.8
SCF09 OQ763241 17,149 SRX19741013 188,219 1646 21.8
SCM10 OQ763249 17,150 SRX19741014 462,114 4042 21.8
SCM11 OQ763250 17,148 SRX19741016 293,155 2564 21.8
SCM14 OQ763251 17,148 SRX19741017 387,807 3392 21.8
SCM15 OQ763252 17,144 SRX19741018 342,605 2998 21.8
SCM02 OQ763244 17,148 SRX19741019 484,612 4239 21.8
SCM03 OQ763245 17,146 SRX19741020 605,989 5301 21.8
SCM05 OQ763246 17,144 SRX19741021 317,974 2782 21.8
SCM06 OQ763247 17,146 SRX19741022 130,730 1144 21.8
SCM09 OQ763248 17,148 SRX19741023 540,294 4726 21.8
TX1 OQ763253 17,148 SRX19741024 223,843 1958 21.8

Fig. 2.

Fig 2

Maximum Likelihood phylogenetic analysis of Cylas formicarius elegantulus mitochondrial protein-coding genes (nucleotides). Support values are indicated by colored circles placed at nodes. Nodes without support values within a specific clade represent identical sequences (e.g., HIM4, HIF01, HIM03). Cylas formicarius elegantulus photo credit: Mike Quinn, TexasEnto.net.

3. Experimental Design, Materials and Methods

3.1. Specimen Collection, DNA Extraction, and Sequencing

Adult C. formicarius elegantulus were collected from locations in the U.S. states of Georgia, Hawaii, South Carolina, and Texas by various methods detailed in Table 1. Genomic DNA was extracted from individual whole-body male and female sweetpotato weevils with the DNeasy Plant Mini Kit (Qiagen, Venlo, Netherlands) with modifications [2]. DNA quantity was measured on a NanoDrop 2000 spectrophotometer (ThermoFisher Scientific, Waltham, MA, USA). Whole-genome skim sequencing was done using Illumina NovaSeq 6000 at Novogene (www.novogene.com).

3.2. Mitochondrial Genome Assembly and Annotation

Mitochondrial genomes of 40 sweetpotato weevils were assembled in Geneious Prime version 2022.0.2 using the Map to Reference tool with the C. formicarius complete mitochondrial genome assembly from China as a reference (NCBI Reference Sequence: NC_046580.1; [3]). The Geneious mapper was set at medium sensitivity and five iterations assembled the fastq paired-end sequence datasets. Assemblies were circularized by trimming overlapping ends. Complete mitogenomes and assembled read datasets were deposited in GenBank (BioProject PRJNA945076). Mitogenome sequences were annotated using NC_046580.1 as a reference and the ‘Annotate from Database’ feature in Geneious Prime. Annotations are available in GenBank.

3.3. Phylogenetic Analysis

Phylogenetic analyses were conducted under an ML framework in IQ-TREE (v.2.1.3) [4]. The nucleotide sequences for each of the 13 mitochondrial protein coding genes in 43 taxa were aligned with MAFFT (v.7.249) [5], and the best nucleotide substitution model for each gene was selected with ModelFinder [6]. Branch support was estimated with 1000 ultrafast bootstrap replicates [7]. Ten independent tree searches were performed, and the tree with the greatest log-likelihood score was taken as the ML tree (Fig. 2).

Ethics Statements

The work involving insect invertebrates detailed herein complied with ARRIVE guidelines and the National Institutes of Health guide for the care and use of laboratory animals (NIH Publications No. 8023, revised 1978).

CRediT authorship contribution statement

Sharon A. Andreason: Conceptualization, Methodology, Formal analysis, Data curation, Visualization, Writing – original draft. Zachary Lahey: Methodology, Software, Formal analysis, Visualization, Writing – review & editing. Dongyan Zhao: Conceptualization, Methodology, Formal analysis. Katherine Mejia-Guerra: Conceptualization, Methodology, Formal analysis. Livy H. Williams: Resources, Writing – review & editing. Moira Sheehan: Conceptualization, Methodology, Writing – review & editing. Alvin M. Simmons: Resources, Writing – review & editing. Phillip A. Wadl: Conceptualization, Resources, Writing – review & editing.

Acknowledgments

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The authors thank John Coffey and Lance Lawrence for assistance in specimen collection and colony maintenance in South Carolina, Drs. Karen Harris-Shultz and Grant McQuate for collecting specimens from Georgia and Hawaii, respectively. The second author is a participant of the Oak Ridge Institute for Science and Education (ORISE) Agricultural Research Service (ARS) Research Participation Program, supported by the USDA-ARS, U.S. Vegetable Laboratory in Charleston, SC, USA. This work has been funded by the USDA. Mention of trade names or commercial products in this article is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the USDA. USDA is an equal opportunity provider and employer.

Data Availability

References

  • 1.Andreason S.A., Lahey Z., Zhao D., Mejia-Guerra K., Williams L.H., Sheehan M., Simmons A.M., Wadl P.A. Sweetpotato weevil genomics. NCBI GenBank. 2023 doi: 10.1016/j.dib.2023.109432. BioProject PRJNA945076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Wadl P.A., Williams L.H., III, Harris-Shultz K., McQuate G. Method for DNA isolation from sweetpotato weevil (Coleoptera: Curculionidae) collected in pheromone-baited traps. J Econ Entomol. 2019;112:1001–1003. doi: 10.1093/jee/toy390. [DOI] [PubMed] [Google Scholar]
  • 3.Yang H., Li Y. Complete mitochondrial genome of Cylas formicarius (Coleoptera: Brentidae) from China. Mitochondrial DNA Part B. 2019;4:1241–1242. doi: 10.1080/23802359.2019.1591247. [DOI] [Google Scholar]
  • 4.Minh B.Q., Schmidt H.A., Chernomor O., Schrempf D., Woodhams M.D., von Haeseler A., Lanfear R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Molecular Biology and Evolution. 2020;37:1530–1534. doi: 10.1093/molbev/msaa015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Katoh K., Standley D.M. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Molecular Biology and Evolution. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kalyaanamoorthy S., Minh B.Q., Wong T.K.F., von Haeseler A., Jermiin L.S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hoang D.T., Chernomor O., von Haeseler A., Minh B.Q., Vinh L.S. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Molecular Biology and Evolution. 2018;35:518–522. doi: 10.1093/molbev/msx281. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement


Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES