Skip to main content
Data in Brief logoLink to Data in Brief
. 2023 Dec 18;52:109989. doi: 10.1016/j.dib.2023.109989

Metagenome-assembled bacterial genomes recovered from the datasets of Spodoptera frugiperda (Smith) (Lepidoptera: Noctuidae)

Francisco Javier Flores Gallardo a, José Luis Hernández Flores b,, Selene Aguilera Aguirre c, Miguel Ángel Ramos López a, Jackeline Lizzeta Arvizu Gómez d, Carlos Saldaña Gutierrez e, María Carlota García Gutiérrez a, José Alberto Rodríguez Morales f, Juan Campos Guillén a,
PMCID: PMC10788226  PMID: 38226032

Abstract

Spodoptera frugiperda (Smith) (Lepidoptera: Noctuidae), also known as the fall armyworm, is an economically important and widespread polyphagous pest. Microorganisms associated to this insect during life cycle play important ecological roles. We report 3 metagenome-assembled bacterial genomes reconstructed from a metagenome dataset obtained from S. frugiperda larvae F3 3rd-instar reared using artificial diet under laboratory conditions. Genome data for Enterococcus casseliflavus indicated a genome length of 3,659,8333 bp and GC content of 42.54%. Genome data for E. mundtii indicated a genome length of 2,921,701 bp and GC content of 38.37%. Finally, genome data for Lactiplantibacillus plantarum indicated a genome length of 3,298,601 bp, GC content of 44.31%. Genome analysis allowed us to identify genus-specific protein families (PLFams), transporters and antibiotic resistance-related genes among others. DNA sequences were deposited in National Center for Biotechnology Information (https://www.ncbi.nlm.nih.gov/) as Bioproject accession PRJNA899064.

Keywords: Armyworm, Enterococcus casseliflavus, Enterococcus mundtii, Lactiplantibacillus plantarum, Metagenome-assembled genomes


Specifications Table

Subject Biology
Specific subject area Microbiology, genomics
Type of data Tables, Figures
How the data were acquired The Shotgun DNA sequencing was performed on Illumina NovaSeq Sequencer at Zymo Research, Irvine, CA. Generating 1.1 Gb of paired-end reads of 150 bp in length
Data format Raw, Analyzed, filtered, and assembled genome sequence
Description of data collection DNA was extracted from S. frugiperda larvae for metagenome sequencing. Genome sequences of 3 bacterial species were reconstructed from the metagenome datasets
Data source location Institution: Universidad Autónoma de Querétaro
City/Town/Region: Querétaro
Country: México
Latitude and longitude: 20°35′28″N 100°24′36″E
Data accessibility Repository name: Data was submitted to NCBI GenBank in the public repository.
Data identification number:
PRJNA899064
Direct URL to data: https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA899064
BioSamples: SAMN36465420; SAMN36465421; SAMN36465422

1. Value of the Data

  • The Genomes of E. casseliflavus, E. mundtii and L. plantarum can provide insights for the understanding of bacterial interaction with S. frugiperda.

  • These bacterial genomes data are applicable for comparative genomic and taxonomic purposes.

  • These data are valuable resources for researchers working in the field of S. frugiperda microbiome to understand ecological interactions and use of biological control agents.

  • Data will help to expand the knowledge of bacteria associated to healthy larvae under laboratory-rearing conditions or their interactions with the artificial diet.

2. Objective

Healthy colonies of insects are a mandatory requirement for biocontrol experiments. In this regard, endogenous microbiota of S. frugiperda might influence growth development and overall state of the insect. However, little is known about S. frugiperda microbiota during rearing using artificial diet under laboratory conditions. Therefore, the aim of the present work was to identify relevant genomic features and functional genes from 3rd-instar larvae of S. frugiperda-related bacteria with a potential ecological role, through a metagenome-assembled bacterial genome approach.

3. Data Description

This data contains metagenome-assembled bacterial genome using shotgun metagenomic sequencing of two 3rd-instar larvae of S. frugiperda reared using artificial diet under laboratory conditions [1,2]. The sequencing result was of 1.1 Gb paired-end reads of 150 bp in length. Table 1 provides the MAGs available in the dataset. Bacterial binning analyzed in CheckM [3] with high-quality produced with > 99.8 % completeness and < 2.1% contamination (Table 1). The BV-BRC metagenomic binning service [3] show that the genome (Fig. 1) for E. casseliflavus [4] contains 41 contigs with genome length of 3,659,833 bp, a mean coverage of 272.59 and GC content of 42.54%. The annotated genome identifies 3338 proteins belong to genus-specific protein families (PLFams) and 3664 protein coding sequence (CDS), 1 virulence factor according to VFDB source, 33 transporters and 41 antibiotic resistance-related genes. The genome for E. mundtii [5] contains 47 contigs with genome length of 2,921,701 bp, a mean coverage of 378.03 and GC content of 38.37%. The annotated genome identifies 2739 proteins belong to genus-specific protein families (PLFams) and 2923 protein coding sequence (CDS), 2 virulence factor according to VFDB source, 15 transporters and 38 antibiotic resistance-related genes. The genome for L. plantarum [6] contains 122 contigs with genome length of 3,298,601 bp, a mean coverage of 14.81 and GC content of 44.31%. The annotated genome identifies 2893 proteins belong to genus-specific protein families (PLFams) and 3258 protein coding sequence (CDS), not detected virulence factor, 16 transporters and 28 antibiotic resistance-related genes. Table 2 lists the antibiotic resistance genes present in each bacteria specie.

Table 1.

General features of metagenome-assembled bacterial genomes generated from 3rd-instar larvae of S. frugiperda reared using artificial diet under laboratory conditions.

Features Enterococcus casseliflavus Enterococcus mundtii Lactiplantibacillus plantarum
MAG identification bin.2.37734 bin.4.53346 bin.3.1590
Biosample ID SAMN36465420 SAMN36465422 SAMN36465421
Accession no. JAUQTC000000000 JAUQTE000000000 JAUQTD000000000
Completeness (%) 100 99.8 100
Contamination (%) 0.8 1.7 2.1
Contig count 41 47 122
DNA size (bp) 3,659,833 2,921,701 3,298,601
Mean Coverage 272.59 378.03 14.81
GC Content (%) 42.54 38.37 44.31
CDS 3664 2923 3258
Proteins with PATRIC genus-specific family (PLfam) assignments 3388 2739 2893
Virulence Factor (VFDB) 1 2 Not detected
Transporter (TCDB) 33 15 16
Antibiotic Resistance genes (PATRIC) 41 38 28

Fig. 1.

Fig 1

Fig 1

Fig 1

Circular genome map of Lactiplantibacillus plantarum (A), Enterococcus casseliflavus (B) and Enterococcus mundtti (C). The upper figure shows from outer to inner rings, the contigs, CDS on the forward strand, CDS on the reverse strand, antibiotic resistance genes, transporters, GC content and GC skew. The color of the CDS indicates the subsystem that these genes belong. The lower figure is a PATRIC annotation using RAST tool kit (RASTtk), which shows the CDS for subsystem functional assignments to which these genes belong. The numbers provided in parentheses on the right side of the subsystem name indicate the count of subsystems and the count of genes associated with the subsystem name.

Table 2.

Antibiotic resistance genes present in each bacterial species annotated from PATRIC database using the pipelines at Bacterial and Viral Bioinformatics Resource Center (BV-BRC).

Antibiotic resistance genes Enterococcus casseliflavus Enterococcus mundtii Lactobacillus plantarum
aac(6′)-Ie-aph(2”) (and related aacs) - + -
alr + + +
ddl + + +
dfrE + + -
fusA + + +
efrB + + -
tuf + + +
fabk - - +
folA + + +
folp - - +
fosX
gdpD
gidB
gyrA
gyrB
inhA, fabL
ileS
kasA
liaF
liaR
liaS
mprF
murA
pgsA
rho
rlmA(II)
rpoB
rpoC
rpsJ
rpsL
OmpR
VanR
VanC
VanR
BaeS
vanT
vanXY
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
-
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
-
-
-
-
-
+
-
+
+
+
+
+
+
+
-
-
-
+
+
+
+
+
+
+
+
+
+
-
-
-
-
-
-

4. Experimental Design, Materials and Methods

4.1. Rearing of S. frugiperda

In this study, S. frugiperda (instar L4-L5) were freshly collected from maize field crops during 2022, transported in containers, and reared using artificial diet under laboratory conditions [1,2].

4.2. DNA Extraction

For DNA extraction and metagenomics analysis, two complete F3 larvae (instar L3) were selected to obtain only one DNA sample. DNA was isolated using the ZymoBIOMICS DNA Miniprep Kit (Zymo Research, Irvine, CA) following the manufacturer's instructions. The genomic DNA was processed and analyzed with the Shotgun Metagenomic Sequencing Service (Zymo Research, Irvine, CA).

4.3. Sequencing and Assembly

Sequencing libraries were prepared with Illumina® DNA Library Prep Kit (Illumina, San Diego, CA) and the final library was sequenced on the platform NovaSeq® (Illumina, San Diego, CA). Generating 1.1 Gb of paired-end reads of 150 bp in length. Bioinformatics analyses were made using the pipelines at Bacterial and Viral Bioinformatics Resource Center (BV-BRC) and was submitted to the Metagenomic Binning Service [3,[7], [8], [9], [10]. Each set of binned contigs was annotated using RAST tool kit (RASTtk) [10]. All software were run with default parameters.

Data Accessibility

The raw sequence data were deposited at the National Centre for Biotechnology Information (NCBI) database under the project number PRJNA899064. The sequences of MAGs are available at GenBank under the genome accessions summarized in Table 1.

Ethics Statements

This work did not involve any human subjects, animals or species that require ethical approval.

CRediT authorship contribution statement

Francisco Javier Flores Gallardo: Methodology, Software. José Luis Hernández Flores: Conceptualization, Methodology, Software, Writing – original draft. Selene Aguilera Aguirre: Methodology, Software, Formal analysis. Miguel Ángel Ramos López: Writing – original draft. Jackeline Lizzeta Arvizu Gómez: Investigation, Formal analysis. Carlos Saldaña Gutierrez: Investigation. María Carlota García Gutiérrez: Methodology. José Alberto Rodríguez Morales: Methodology. Juan Campos Guillén: Conceptualization, Methodology, Software, Writing – original draft.

Acknowledgments

Funding: This study was partially financed by the Universidad Autónoma de Querétaro (FONDEC-UAQ-2022; FOPER-2022).

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Contributor Information

José Luis Hernández Flores, Email: jose.hernandezf@cinvestav.mx.

Juan Campos Guillén, Email: juan.campos@uaq.mx.

Data Availability

References

  • 1.Zavala-Sánchez MÁ, Rodríguez-Chávez JL, Figueroa-Brito R, et al. Bioactivity of 1-octacosanol from Senna crotalarioides (Fabaceae: Caesalpinioideae) to control Spodoptera frugiperda (Lepidoptera: Noctuidae) Fla. Entomol. 2020;102(4):731–737. [Google Scholar]
  • 2.Pinto JRL, Torres AF, Truzi CC, Vieira NF, Vacari AM, De Bortoli SA. Artificial corn-based diet for rearing Spodoptera frugiperda (Lepidoptera: Noctuidae) J. Insect Sci. 2019;19(4):2. doi: 10.1093/jisesa/iez052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Parks D.H., Imelfort M., Skennerton C.T., Hugenholtz P., Tyson G.W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Collins M.D., Jones D., Farrow J.A.E., Kilpper-Bälz R., Schleifer K.H. Enterococcus avium nom. rev., comb. nov.; E. casseliflavus nom. rev., comb. nov.; E. durans nom. rev., comb. nov.; E. gallinarum comb. nov.; and E. malodoratus sp. nov. Int. J. Syst. Evol. Microbiol. 1984;34(2):220–223. [Google Scholar]
  • 5.Collins M.D., Farrow J.A., Jones D. Enterococcus mundtii sp. nov. Int. J. Syst. Evol. Microbiol. 1986;36(1):8–12. [Google Scholar]
  • 6.Zheng J, Wittouck S, Salvetti E, Franz CMAP, Harris HMB, Mattarelli P, O'Toole PW, Pot B, Vandamme P, Walter J, Watanabe K, Wuyts S, Felis GE, Gänzle MG, Lebeer S. A taxonomic note on the genus Lactobacillus: Description of 23 novel genera, emended description of the genus Lactobacillus Beijerinck 1901, and union of Lactobacillaceae and Leuconostocaceae. Int. J. Syst. Evol. Microbiol. 2020;70(4):2782–2858. doi: 10.1099/ijsem.0.004107. Apr. [DOI] [PubMed] [Google Scholar]
  • 7.Parrello B., Butler R., Chlenski P., Pusch G.D., Overbeek R. Supervised extraction of near-complete genomes from metagenomic samples: a new service in PATRIC. PLOS One. 2021;16 doi: 10.1371/journal.pone.0250092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nurk S., Meleshko D., Korobeynikov A., Pevzner P.A. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017;27:824–834. doi: 10.1101/gr.213959.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Li D., et al. MEGAHIT v1. 0: a fast and scalable metagenome assembler driven by advanced methodologies and community practices. Methods. 2016;102:3–11. doi: 10.1016/j.ymeth.2016.02.020. [DOI] [PubMed] [Google Scholar]
  • 10.Brettin T., et al. RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci. Rep. 2015;5:8365. doi: 10.1038/srep08365. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The raw sequence data were deposited at the National Centre for Biotechnology Information (NCBI) database under the project number PRJNA899064. The sequences of MAGs are available at GenBank under the genome accessions summarized in Table 1.


Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES