A chromosome-level genome assembly of the heteronomous hyperparasitoid wasp Encarsia sophia

Xiaoming Man; Cong Huang; Shengyong Wu; Jianyang Guo; Fanghao Wan; Frédéric Francis; Nianwan Yang; Wanxue Liu

doi:10.1038/s41597-024-04040-2

. 2024 Nov 19;11:1250. doi: 10.1038/s41597-024-04040-2

A chromosome-level genome assembly of the heteronomous hyperparasitoid wasp Encarsia sophia

Xiaoming Man ^1,², Cong Huang ¹, Shengyong Wu ¹, Jianyang Guo ¹, Fanghao Wan ¹, Frédéric Francis ², Nianwan Yang ^1,^3,^✉, Wanxue Liu ^1,^✉

PMCID: PMC11577007 PMID: 39562567

Abstract

Encarsia sophia, a heteronomous hyperparasitoid wasp, is a well-known biological control agent, but its genomic information is limited, hindering molecular investigations and understanding of multitrophic interactions. In this study, we present a chromosome-level genome assembly for E. sophia using Illumina, PacBio HiFi, and Hi-C technologies. The assembled genome size is 398.3 Mb, with a contig N50 of 1.0 Mb and a scaffold N50 of 74.0 Mb. The BUSCO completeness score is 97.1%, and genome coverage reaches 99.1%. Utilizing Hi-C assisted assembly, the genome was organized into five chromosomes, with a mounting rate of 95.1%. Repetitive sequences make up 54.6% of the genome, and 14,914 protein-coding genes were predicted, with 95.5% functionally annotated. The high-quality genome assembly of E. sophia is a significant achievement, marking the first complete genome for a heteronomous hyperparasitoid wasp. This milestone offers valuable insights into the evolution and host interactions of heteronomous hyperparasitoids, laying the foundation for extensive research in biological control.

Subject terms: Entomology, Genome assembly algorithms

Background & Summary

The Hymenoptera, one of the four largest orders in the class Insecta, is one of the most species-rich groups of insects. With the advancement of sequencing technologies, this order has become a hotspot in insect genomics research^1,2. Currently, the number of sequenced Hymenoptera genomes has reached 557 (on April 2024, based on statistics from NCBI), with 388 species sequenced in the past three years, and annotation information submitted for 125 species. Among these sequenced Hymenoptera species, 258 belong to parasitoids, primarily including 36 species of Cynipoidea, 75 species of Chalcidoidea, 98 species of Ichneumonoidea, 42 species of Proctotrupoidea, 6 species of Chrysidoidea, and 1 species of Orussidea.

Encarsia sophia (Hymenoptera: Aphelinidae) is a dominant parasitoid of the “super pest” Bemisia tabaci (Hemiptera: Aleyrodidae), serving as a crucial biological control agent against global populations of whiteflies due to its remarkable parasitic and destructive capabilities on the host^3–5. The reproductive strategy of this parasitoid is rather unique, being a typical heteronomous hyperparasitoid. Males and females develop heteronomously, obtaining their nutritional resources from different host insects. Females, the primary parasitoids, arise from fertilized eggs and parasitize directly within the target host insect, feeding on the larvae or nymphs of the host to complete their development. Conversely, males, arising from unfertilized eggs, act as hyperparasitoids and can only parasitize secondary hosts, i.e., those already parasitized by the primary parasitoids, feeding on the larvae of the primary parasitoids to complete their development^6–9. Here, mated female E. sophia parasitize directly within the nymphs of the B. tabaci, laying fertilized eggs that develop into female offspring, serving as primary parasitoids. Unmated females, on the other hand, can only parasitize secondary hosts, laying unfertilized eggs within the nymphs of conspecific or heterospecific parasitoids already parasitized within the whitefly nymphs, producing male offspring, acting as hyperparasitoids^10,11. So far, no genome of a heteronomous parasitoid has been reported. In order to gain deeper insights into the characteristics of such parasitoids, we conducted whole-genome sequencing and chromosomal-level assembly of E. sophia using Illumina, PacBio, and Hi-C technologies.

Methods

Parasitoid Wasp Collection and Sequencing

Encarsia sophia population, introduced in 2008 from the Vegetable Pest Integrated Management Laboratory at Texas A&M University, USA. They were reared in the insectarium of the Laboratory of Biological Invasion Research at the Langfang Research and Development Base of the Chinese Academy of Agricultural Sciences, using cotton plant B. tabaci nymphs as hosts (26 ± 1 °C, RH65 ± 5%, light cycle 14 L:10D). The B. tabaci laboratory population originates from the MEAM1 population maintained by the Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences (CAAS), in a greenhouse at the Institute of Plant Protection, CAAS, with no history of pesticide use. The cotton variety used is CCRI 49. E sophia is a typical heteronomous hyperparasitoid with a unique reproductive strategy: females act as primary parasitoids, parasitizing first- to fourth-instar B. tabaci nymphs (primary hosts). In contrast, solitary females produce male offspring, acting as secondary parasitoids parasitizing conspecific or heterospecific parasitoid larvae inside B. tabaci nymphs (secondary hosts). Given that males are secondary parasitoids, we collected newly emerged females for sequencing. To obtain newly emerged parasitoids, we used insect pins to transfer females from black pupae to centrifuge tubes (1.5 mL). We conducted daily checks for newly emerged adults and collected a total of 4,000 females for DNA extraction using the QIAamp DNA Mini Kit (QIAGEN). Following extraction, the purity, concentration, and integrity of the DNA were evaluated with the NanoDrop 2000&8000, Qubit Fluorometer, and Agilent 4200 Bioanalyzer, respectively.

Genome size estimation and assembly

The high-quality DNA samples from E. sophia were randomly sheared using a Covaris ultrasonic disruptor. Subsequent steps, such as end repair, A-tailing, adapter ligation, purification, and PCR amplification, were performed to complete the library construction process. The constructed library was subjected to paired-end sequencing using Illumina HiSeq. By removing reads with adapter sequences and those containing more than 10% uncertain bases (N), as well as discarding single-end reads where the proportion of low-quality bases (quality score below 5) exceeds 20%, we obtained the filtered clean reads. Then, a k-mer frequency histogram was generated using Jellyfish 2.2.7 with the following parameters: “-G 2 -m 17 -C -o kmercount.”¹², yielding the following estimations: a genome size of 412.21 Mbp, corrected to 404.2 Mbp, heterozygosity rate of 0.52%, and a repeat sequence proportion of 52.84% (Fig. 1). To obtain the preliminary genome assembly of E. sophia, we utilized 49,702,845,900 bp of second-generation sequencing data and assembled it using the Soapdenovo software. The assembly was then scaffolded using kmer41. The initial assembly results showed that the genome of E. sophia had a contig N50 of 1,272 bp with a total length of 318,591,742 bp, and a scaffold N50 of 2,192 bp with a total length of 328,391,604 bp (Table 1).

Fig. 1 — *Encarsia sophia* genome feature statistics obtained by Kmer analysis.

Table 1.

Encarsia sophia genome assembly to scaffold results.

	Total_length	Total_number	Max_length	N50_length	N90_length
Contig	318,591,742	699,645	91,064	1,272	133
Scaffold	328,391,604	601,156	178,874	2,192	146

Open in a new tab

Sequencing was conducted using the PacBio platform, resulting in a total sequencing volume of 148 G with a coverage depth of 366.16X (calculated based on the survey-estimated genome size of 404.20 M). Additionally, a short-insert library was prepared and sequenced using the Illumina platform (Table 2). Using the sequencing data, de novo assembly of the E. sophia genome was performed with HiFiasm¹³. The genome assembled by Hifiasm has a length of 398.19 Mbp, with a contig N50 of 1.33 Mbp (sequences above 100 bp were selected for the assembly results) (Table 3).

Table 2.

Summary of DNA/RNA sequencing data utilized for the genome assembly of Encarsia sophia.

Library	Insert size(bp)	Total data (G)	Read length (bp)	Sequence coverage (X)
Illumina	350	49.70	150	122.96
PacBio	—	148	—	366.16
Hi-C	350	2.37	150	98.54

Open in a new tab

Table 3.

Encarsia sophia genome denovo assembly results statistics.

	Total_length	Total_number	Max_length	N50_length	N90_length
Contig	338,576,684	1,144	295,958	1,327,545	136,066
Scaffold	328,391,604	1,144	295,958	1,327,545	136,066

Open in a new tab

To obtain the chromosome-level genome of E. sophia, a Hi-C sequencing library was constructed using Hi-C technology¹⁴, incorporating DNA from 20,000 female adults. Hi-C data were obtained from the sequencing, and the contigs/scaffolds assembled were anchored to approximate chromosome-level using the All-hic software¹⁵. Subsequently, the juicebox software (https://github.com/aidenlab/Juicebox) was utilized for manual correction based on chromosomal interaction intensity, resulting in the final chromosome-level genome of E. sophia (Table 4). Following Hi-C-assisted assembly, the E. sophia genome assembled at the chromosome-level comprises a total of 5 sequences, with an additional 189 sequences remaining unassembled at the chromosome-level. The total genome length is 398,274,414 bp, of which 378,887,893 bp is assembled onto chromosomes (Fig. 2). The genome mapping rate achieved is 95.1% (Tables 5, 6). (Results were based on contigs above 100 bp for assembly statistics).

Table 4.

Statistical results of the Encarsia sophia genome assembly, both from the initial de novo assembly and after Hi-C scaffolding.

Sample	Contig length	Scaffold length	Contig number	Scaffold number
Total	398,185,814	398,274,414	1,080	194
Max	4,052,312	163,268,332	—	—
Number>=2000	—	—	1080	194
N50	715,578	73,963,014	161	2
N60	558,990	72,460,500	224	3
N70	435,605	72,460,500	304	3
N80	326,172	38,401,749	410	4
N90	185,480	30,794,298	570	5

Open in a new tab

Fig. 2 — A genome-wide Hi-C interaction map of *Encarsia sophia* (5 chromosomes, 100 kb resolution) is shown, with a color gradient on the right indicating the interaction strength. Intrachromosomal interactions (red squares along the diagonal) are markedly more intense than interchromosomal interactions (light yellow squares).

Table 5.

Encarsia sophia single chromosome cluster number and length statistics of Hi-C assemble.

Sequeues ID	Cluster number	Sequeues length
Chr1	349	163,268,332
Chr2	164	73,963,014
Chr3	157	72,460,500
Chr4	142	38,401,749
Chr5	79	30,794,298

Open in a new tab

Table 6.

Encarsia sophia genome mapping rate of de novo and afer Hi-C scaffolding.

Class	Scaffold number	Total length
Place	5	378,887,893
Unplace	189	19,386,521
Total	194	398,274,414
Mapping rate	95.13%

Open in a new tab

Genome quality assessment

We employed different methods to assess the sequence integrity, consistency, and accuracy of the genome assembly. Firstly, the integrity of the E. sophia genome assembly was assessed using BUSCO with the insecta-odb10 database¹⁶, employing software such as MetaEuk and HMMER. The assembly resulted in 97.1% complete BUSCO genes, with 92.1% being single-copy genes and 5.0% being completely duplicated genes. Additionally, a core gene library comprising 248 conservative genes present in six eukaryotic model organisms was used for CEGMA assessment¹⁷ using tblastn, genewise, and geneid software. The assembly successfully identified 233 out of 248 core eukaryotic genes, indicating a completeness rate of 93.9%. Secondly, the sequence consistency of the E. sophia genome was assessed by aligning short-insert library reads using BWA software (http://bio-bwa.sourceforge.net/)¹⁸. The analysis revealed a HiFi reads alignment rate of approximately 97.6% and a genome coverage rate of around 99.1%, demonstrating strong consistency between the reads and the assembled genome. SNP calling was performed using samtools (http://samtools.sourceforge.net/) on the BWA alignment results, and after filtering and statistical analysis¹⁹, the genome exhibited a heterozygous SNP rate of 0.317095% and a homozygous SNP rate of 0.000943%, demonstrating excellent single-base accuracy in the assembly. Thirdly, the sequence accuracy of the E. sophia genome was assessed using Merqury software (https://github.com/marbl/merqury) with Illumina sequencing data. The quality value (Qv) of the genome, calculated based on K-mer using the Merqury-mash module^20,21, was determined to be 33.6653, indicating a base accuracy rate exceeding 99.9%. In conclusion, the E. sophia genome assembly exhibits good consistency, completeness, and accuracy (Table 7).

Table 7.

Encarsia sophia genome assembly quality assessment results.

Evaluation indicators	results
BUSCO	C:97.1%[S:92.1%,D:5.0%],F:0.6%,M:2.3%,n:1367
CEGMA	93.95%Completeness
Reads	97.58% Mapping rate;99.10% Coverage
SNP	0.317095% Heterozygosis;0.000943% Homology
Qv	33.6653

Open in a new tab

Genome annotation

Our approach to repetitive annotation utilizes a thorough strategy that combines homology alignment with de novo search to detect repetitive sequences across the entire genome. We utilized TRF (http://tandem.bu.edu/trf/trf.html)²² for ab initio prediction, extracting tandem repeat sequences. For homology-based prediction, we utilized the standard Repbase database (http://www.girinst.org/repbase)²³, employing RepeatMasker (http://www.repeatmasker.org/)²⁴ and its internal script, RepeatProteinMask, to identify repetitive regions with default settings. In the de novo prediction process, we applied LTR_FINDER (http://tlife.fudan.edu.cn/ltr_finder/)²⁵, RepeatScout (http://www.repeatmasker.org/), and RepeatModeler (http://www.repeatmasker.org/RepeatModeler.html)²⁶ to create a de novo repetitive element database. Subsequently, all repetitive sequences longer than 100 bp with an ‘N’ content below 5% were included in the initial transposable element (TE) library. This custom library, created by merging Repbase with our de novo TE library and refined using uclust to remove redundancy, was then utilized by RepeatMasker for the identification of repetitive sequences at the DNA level. The E. sophia genome contains 214.7 Mb of repetitive sequences, constituting 53.92% of the genome. Among them, long terminal repeats (LTRs) are the most abundant, accounting for 34.59% of the total, followed by Unknown (12.17%), 7.18% DNA elements, 3.96% long interspersed nuclear elements (LINEs), and short interspersed nuclear elements (SINEs) at a mere 0.02% (Table 8).

Table 8.

Encarsia sophia genome repeat sequence classification result statistics.

Repeat type	Denovo + Repbase		TE Proteins		Combined TEs
Repeat type	Length(bp)	% in Genome	Length(bp)	% in Genome	Length(bp)	% in Genome
DNA	26,727,384	6.71	5,680,941	1.43	28,604,198	7.18
LINE	12,742,058	3.20	5,263,490	1.32	15,788,154	3.96
SINE	73,932	0.02	0	0.00	73,932	0.02
LTR	135,720,939	34.08	19,125,339	4.80	137,771,019	34.59
Unknown	48,475,861	12.17	1,305	0.00	48,477,166	12.17
Total	213,763,901	53.67	30,069,622	7.55	214,739,217	53.92

Open in a new tab

Note: LINE (Long Interspersed Nuclear Elements): Repetitive sequences dispersed throughout the genome, each with repeat units exceeding 1000 bp. SINE (Short Interspersed Nuclear Elements): Short repetitive sequences dispersed throughout the genome, each with repeat units less than 50 bp.

LTR (Long Terminal Repeats): Sequences characterized by the presence of long terminal repeats on both ends. Unknown: Repeat sequences that could not be classified by RepeatMasker. Total: Represents non-redundant data after removing overlaps between different classifications. Denovo + Repbase: Combined results from RepeatScout, RepeatModeler, LTR_FINDER, and Piler, integrated with the RepBase nucleic acid library and processed with Uclust software according to the 80-80-80 rule. Annotation was performed with RepeatMasker to identify transposon elements in the genome. TE Proteins: Transposon elements identified by separately annotating the genome using TE proteins from the RepBase protein library with RepeatProteinMask software. Combined TEs: An integrated dataset of the above two methods, after removing redundancy. This result does not include data from TRF identification.

The protein-coding gene annotation in the E. sophia genome integrates de novo prediction, homology-based approaches, and RNA-Seq-supported modeling for gene prediction²⁷. For de novo gene prediction, our automated gene prediction pipeline utilized Augustus (v3.2.3) (http://bioinf.uni-greifswald.de/augustus/)²⁸, Geneid (v1.4), Genescan (v1.0), GlimmerHMM (v3.04) (http://ccb.jhu.edu/software/glimmerhmm/)²⁹, and SNAP (http://homepage.mac.com/iankorf/)³⁰. Homologous protein sequences were downloaded from NCBI Nasonia vitripennis (Nvit), Ceratosolen solmsi (Csol), Copidosoma floridanum (Cflo), Trichogramma brassicae (Tbra), Trichomalopsis sarcophagae (Tsar), Trichogramma pretiosum (Tpre). Using TblastN (v2.2.26; E-value ≤ 1e−5), protein sequences were aligned to the E. sophia genome³¹, and GeneWise (v2.4.1)³² software was employed to align matching proteins with homologous genomic sequences for accurate splice alignment and prediction of gene structures within each protein region. We constructed seven RNA-seq libraries, including different developmental stages of female E. sophia (600 eggs, Bemisia tabaci nymphs parasitized for <24 hours, dissected for host sampling; 200 first-instar larvae, B. tabaci nymphs parasitized for 48–60 hours, dissected for host sampling; 200 second-instar larvae, B. tabaci nymphs parasitized for 72–84 hours, dissected for host sampling; 80 third-instar larvae, B. tabaci nymphs parasitized for 120–132 hours, dissected for host sampling; 40 prepupae, B. tabaci nymphs parasitized for 168–178 hours, sampled after removing the host shell; 30 pupae, B. tabaci nymphs parasitized for 216–228 hours, sampled after removing the host shell; 50 adults, eclosed within < 24 hours.). Total RNA extracted from the aforementioned samples were used for library preparation, and sequencing was performed on the Illumina NovaSeq6000 platform³³. The sequencing output generated a total of 60.51 G raw data, and after filtering, 59.88 G clean data was used for genome annotation. For genome annotation, the transcriptome was assembled using Trinity (v2.1.1)³⁴. To refine the annotation, RNA-Seq data were processed with Hisat (v2.0.4)³⁵ under default settings to identify exonic regions and splice sites. The alignment results were subsequently used as input for Stringtie (v1.3.3)³⁶ with its default parameters, facilitating genome-guided transcriptome assembly. A comprehensive, non-redundant reference gene set was then created by merging the predictions from all three methods using EvidenceModeler (EVM, v1.1.1)³⁷, which incorporated masked transposable elements for accurate gene prediction. A total of 14,914 protein-coding genes were predicted in E. sophia genome. The average length of predicted genes was 11,273.01 base pairs, with an average protein-coding region length of 1,451.53 bp. The average lengths of exons and introns were 275.58 and 2,301.66 bp, respectively. On average, each gene contained 5.27 exons (Table 9, Fig. 3).

Table 9.

Encarsia sophia statistical results of genome gene structure prediction.

	Gene set	Number	Average transcript length(bp)	Average CDS length(bp)	Average exons per gene	Average exon length(bp)	Average intron length(bp)
De novo	Augustus	15,956	8,411.35	1,489.68	4.99	298.43	1,733.98
	Glimmer HMM	33,861	10,499.07	761.97	3.49	218.60	3,917.21
	SNAP	23,924	23,075.70	887.93	7.08	125.46	3,650.71
	Geneid	31,571	5,185.88	928.60	3.29	282.56	1,861.97
	Genscan	21,484	11,826.00	1,330.44	5.30	250.93	2,439.65
Homolog	Cflo	10,815	6,875.42	1,389.21	4.89	284.35	1,411.96
	Tsar	11,077	5,268.28	1,326.65	4.62	287.03	1,088.25
	Tbra	7,750	5,957.41	1,246.99	4.23	295.03	1,459.88
	Nvit	11,634	7,063.07	1,451.86	4.99	290.80	1,405.37
	Csol	9,492	8,239.81	1,511.51	5.42	278.99	1,522.99
	Tpre	10,678	7,095.35	1,427.50	4.97	287.27	1,427.96
RNAseq	PASA	23,430	9,543.93	1,123.95	4.09	274.77	2,724.45
RNAseq	Transcripts	51,252	14,530.86	2,300.38	4.39	524.37	3,611.08
EVM		17,419	8,973.75	1,359.08	4.86	279.83	1,974.37
Pasa-update*		17,270	10,398.83	1,374.41	4.88	281.68	2,326.25
Final set*		14,914	11,273.01	1,451.53	5.27	275.58	2,301.66

Open in a new tab

Note: *Includes UTR regions, while others do not.

Fig. 3 — Circular plot illustrating the chromosome-level genome assembly results for *Encarsia sophia*. A: chromosome information, B: gene density, C: GC content, D: ncRNA density, E: repeat density.

Gene functions were determined by aligning the E. sophia protein sequences with the Swiss-Prot database using Blastp, applying a threshold E-value of ≤1e−5 to identify the best matches. InterProScan70 (v5.31)³⁸ was used to annotate protein motifs and domains through searches across various public databases such as ProDom, PRINTS, Pfam, PANTHER, PROSITE and SMART. Gene Ontology (GO) IDs were subsequently assigned based on the relevant InterPro entries. We mapped the genes to the NR20 database using the closest BLAST hits from the Swissprot20 database³⁹ (E-value < 10–5) and DIAMOND (v0.8.22)/BLAST hits (E-value < 10-5). Furthermore, the genome was aligned with KEGG pathways⁴⁰ to determine the best match for each gene. Ultimately, 14,245 genes (95.5% of the total) in E. sophia genome were successfully annotated in at least one database⁴¹ (Table 10).

Table 10.

Functional annotation of Encarsia sophia proteins.

Type	Number	Percent(%)
Swissprot	10,110	67.80
Nr	13,514	90.60
KEGG	10,710	71.80
InterPro	13,363	89.60
GO	8,160	54.70
Pfam	10,103	67.70
Total annotated	14,245	95.50

Open in a new tab

To annotate non-coding RNAs (ncRNAs) in the E. sophia genome, tRNA genes were predicted using the tRNAscan-SE tool (http://lowelab.ucsc.edu/tRNAscan-SE/)⁴². Given the high conservation of rRNA sequences, we used sequences from closely related species as references and applied BLAST to identify rRNAs. Other ncRNAs, such as snRNAs and miRNAs, were detected by querying the Rfam database⁴³ with the infernal software (http://infernal.janelia.org/)⁴⁴, employing default parameters. In the end, a total of 1,457 non-coding RNAs were predicted, comprising 513 micro-RNAs (miRNAs), 514 transfer RNAs (tRNAs), 328 ribosomal RNAs (rRNAs), and 102 small nuclear RNAs (snRNAs) (Table 11).

Table 11.

Encarsia sophia genome non-coding RNA statistical results.

	Type	Copy number	Average length(bp)	Total length(bp)	% of genome
miRNA		513	146.54	75,174	0.018875
tRNA		514	74.33	38,206	0.009593
rRNA	rRNA	328	209.06	68,572	0.017217
	18S	95	289.37	27,490	0.006902
	28S	215	182.87	39,318	0.009872
	5.8S	18	98	1,764	0.000443
	5S	0	0	0	0
snRNA	snRNA	102	156.51	15,964	0.004008
	CD-box	15	146.40	2,196	0.000551
	HACA-box	11	188.09	2,069	0.000519
	splicing	75	154.25	11,569	0.002905
	scaRNA	1	130	130	0.000033
	Unknown	0	0	0	0

Open in a new tab

Data Records

The sequencing data for the E. sophia genome, including Illumina, PacBio, and Hi-C datasets, have been deposited in the Sequence Read Archive (SRA) at the National Center for Biotechnology Information (NCBI) under accession numbers SRR29702816, SRR29702817, SRR29702818^45–47, and in the Genome Sequence Archive (GSA) of the National Genomics Data Center (NGDC) under accession numbers BioProject PRJNA1131600 (NCBI) and CRA017569⁴⁸ (NGDC). The transcriptome data used for annotation, covering various developmental stages of female E. sophia, have been stored in the SRA of NCBI and the GSA of NGDC: Egg (SRR29702811⁴⁹, CRR1218365), 1st instar larva (SRR29702815⁵⁰, CRR1218361), 2nd instar larva (SRR29702814⁵¹, CRR1218362), 3rd instar larva (SRR29702813⁵², CRR1218363), prepupa (SRR29702810⁵³, CRR1218366), pupa (SRR29702809⁵⁴, CRR1218367), and adult (SRR29702812⁵⁵, CRR1218364). This Whole Genome Shotgun project has been deposited at GenBank under the accession JBFBOU000000000⁵⁶. The genome annotation files of Encarsia sophia are available in figshare under a 10.6084/m9.figshare.2642675241.

Technical Validation

The quality, concentration, and integrity of the DNA samples were evaluated using a NanoDrop 2000&8000, a Qubit 3.0 Fluorometer (Thermo Fisher Scientific, USA), and an Agilent 4200 Bioanalyzer (Agilent Technologies, CA, USA), respectively. RNA integrity was assessed using the RNA Nano 6000 kit on the Bioanalyzer 2100 system (Agilent Technologies, CA, USA). High-quality DNA and RNA were selected for library preparation and sequencing. Genome assembly integrity was verified using BUSCO (Benchmarking Universal Single-Copy Orthologs: http://busco.ezlab.org/) and CEGMA (Core Eukaryotic Genes Mapping Approach: http://korflab.ucdavis.edu/datasets/cegma/). The short-read sequences from the fragment library were mapped to the assembled genome using BWA software (http://bio-bwa.sourceforge.net/), and alignment rates, genome coverage, and depth distribution were analyzed to evaluate the completeness and uniformity of the assembly. Additionally, the genome’s quality value (Qv) was determined using the Merqury-mash module (https://github.com/marbl/merqury) to assess the sequence accuracy of the assembled genome.

Acknowledgements

This work was supported by the National Key Research and Development Program of China (2023YFC2605200), the Tianshan Talent Program for Outstanding Young Scientific and Technological Talents (2022TSYCCX0084) and National Natural Science Foundation of China (32072493).

Author contributions

N.W.Y. and W.X.L. conceived and designed the study; X.M.M. and S.H.W. collected the samples for sequencing; X.M.M. and C.H. performed the data analysis; X.M.M. wrote the draft manuscript; X.M.M., C.H., J.Y.G., F.H.W., W.X.L., N.W.Y. and F.F. discussed the results and improved and revised the manuscript.

Code availability

Data processing followed the standard protocols and guidelines of the relevant bioinformatics software, with default parameters applied unless specified otherwise. Details on the software versions and specific parameters are provided in the Methods section.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Nianwan Yang, Email: yangnianwan@caas.cn.

Wanxue Liu, Email: liuwanxue@caas.cn.

References

1.Ye, X. H. et al. A chromosome-level genome assembly of the parasitoid wasp Pteromalus puparum. Mol Ecol Resour.20, 1384–1402 (2020). [DOI] [PubMed] [Google Scholar]
2.Zhong, Y. W. et al. A chromosome-level genome assembly of the parasitoid wasp Eretmocerus hayati. Sci Data.10, 585 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Charles, O. A review of management of major arthropod pests affecting cassava production in Sub-Saharan Africa. Crop Prot.175, 1–15 (2024). [Google Scholar]
4.Katono, K. et al. Effect of Bemisia tabaci SSA1 host density and cassava genotype on host feeding capacity and parasitism by two Hymenoptera parasitoid species. Biocontrol Sci Technol.33, 19–34 (2022). [Google Scholar]
5.Caspary, R. et al. Cutting Dipping Application of Flupyradifurone against Cassava Whiteflies Bemisia tabaci and Impact on Its Parasitism in Cassava. Insects.14, 796 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Walter, G. H. Divergent male ontogenies in Aphelinidae (Hymenoptera, Chalcidoidea): A simplified classification and a suggested evolutionary sequence. Biol. J. Linn. Soc. Lond.19, 63–82 (1983). [Google Scholar]
7.Mills, N. J. et al. Prospective modelling in biological control: An analysis of the dynamics of heteronomous hyperparasitism in a cotton-whitefly-parasitoid system. J Appl Ecol.33, 1379–1394 (1996). [Google Scholar]
8.Williams, T. Invasion and displacement of experimental populations of a conventional parasitoid by a heteronomous hyperparasitoid. Biocontrol Sci Technol.6, 603–618 (1996). [Google Scholar]
9.Hunter, M. S. et al. Evolution and behavioral ecology of heteronomous aphelinid parasitoids. Annu. Rev. Entomol.46, 251–290 (2001). [DOI] [PubMed] [Google Scholar]
10.Yang, N. W. et al. Shifting preference between oviposition vs. host-feeding under changing host densities in two aphelinid parasitoids. PLoS One.7, e41189 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Xu, H. Y. et al. Competitive interactions between parasitoids provide new insight into host suppression. PLoS One.8, e82003 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics.27, 764 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Cheng, H. et al. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods.18, 1–6 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Belaghzal, H. et al. Hi-C 2.0: An optimized Hi-C procedure for high-resolution genome-wide mapping of chromosome conformation. Methods123, 56–65 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Zhang, X. et al. Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data. Nat. Plants.5, 833–845 (2019). [DOI] [PubMed] [Google Scholar]
16.Manni, M. et al. BUSCO Update: Novel and streamlined workfows along with broader and deeper phylogenetic coverage for scoring of Eukaryotic, Prokaryotic, and Viral genomes. Mol. Biol. Evol.38, 4647–4654 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Genis, P. et al. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics23, 1061–1067 (2007). [DOI] [PubMed] [Google Scholar]
18.Langmead, B. et al. Fast gapped-read alignment with Bowtie 2. Nat. Methods.9, 357–359 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Li, H. A. statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics.27, 2987–2993 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res.27, 722–736 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Rhie, A. et al. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol.21, 245 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res.27, 573–580 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Bao, W. et al. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA.6, 11 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Tarailo-Graovac, M. et al. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics4, 10 (2009). [DOI] [PubMed] [Google Scholar]
25.Xu, Z. et al. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res.35, 265–268 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci USA117, 9451–9457 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Mei, Y. et al. InsectBase 2.0: a comprehensive gene resource for insects. Nucleic Acids Res.50, D1040–D1045 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res.34, W435–439 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Majoros, W. H. et al. TigrScan and GlimmerHMM: two open-source ab initio eukaryotic gene-finders. Bioinformatics20, 2878–2879 (2004). [DOI] [PubMed] [Google Scholar]
30.Korf, I. Gene fnding in novel genomes. BMC Bioinformatics5, 59 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics10, 421 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Birney, E. et al. GeneWise and Genomewise. Genome Res.14, 988–995 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Cock, P. J. A. et al. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res.38, 1767–1771 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Bolger, A. M. et al. Trimmomatic: a fexible trimmer for Illumina sequence data. Bioinformatics30, 2114–2120 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Kim, D. et al. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods.12, 357–360 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol.33, 290–295 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol.9, R7 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Jones, P. et al. InterProScan 5: genome-scale protein function classifcation. Bioinformatics30, 1236–1240 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Bairoch, A. et al. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res.28, 45–48 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Kanehisa, M. et al. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res.28, 27–30 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Man, X. Genome assembly and annotation files of Encarsia sophia. figshare10.6084/m9.figshare.26426752 (2024).
42.Chan, P. P. et al. tRNAscan-SE 2.0: improved detection and functional classifcation of transfer RNA genes. Nucleic Acids. Res.49, 9077–9096 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Kalvari, I. et al. Rfam 14: expanded coverage of metagenomic, viral and microRNA families. Nucleic Acids. Res.49, D192–D200 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Nawrocki, E. P. et al. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics29, 2933–2935 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
45.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702816 (2024).
46.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702817 (2024).
47.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702818 (2024).
48.CNCB Genome Sequence Archivehttps://bigd.big.ac.cn/gsa/browse/CRA017569 (2024).
49.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702811 (2024).
50.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702815 (2024).
51.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702814 (2024).
52.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702813 (2024).
53.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702810 (2024).
54.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702809 (2024).
55.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702812 (2024).
56.Man, X. Encarsia sophia isolate IPP_NW_YANG_2024, whole genome shotgun sequencing project. GenBankhttps://identifiers.org/ncbi/insdc:JBFBOU000000000.1 (2024).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

Man, X. Genome assembly and annotation files of Encarsia sophia. figshare10.6084/m9.figshare.26426752 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702816 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702817 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702818 (2024).
CNCB Genome Sequence Archivehttps://bigd.big.ac.cn/gsa/browse/CRA017569 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702811 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702815 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702814 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702813 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702810 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702809 (2024).
NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702812 (2024).
Man, X. Encarsia sophia isolate IPP_NW_YANG_2024, whole genome shotgun sequencing project. GenBankhttps://identifiers.org/ncbi/insdc:JBFBOU000000000.1 (2024).

Data Availability Statement

[CR1] 1.Ye, X. H. et al. A chromosome-level genome assembly of the parasitoid wasp Pteromalus puparum. Mol Ecol Resour.20, 1384–1402 (2020). [DOI] [PubMed] [Google Scholar]

[CR2] 2.Zhong, Y. W. et al. A chromosome-level genome assembly of the parasitoid wasp Eretmocerus hayati. Sci Data.10, 585 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Charles, O. A review of management of major arthropod pests affecting cassava production in Sub-Saharan Africa. Crop Prot.175, 1–15 (2024). [Google Scholar]

[CR4] 4.Katono, K. et al. Effect of Bemisia tabaci SSA1 host density and cassava genotype on host feeding capacity and parasitism by two Hymenoptera parasitoid species. Biocontrol Sci Technol.33, 19–34 (2022). [Google Scholar]

[CR5] 5.Caspary, R. et al. Cutting Dipping Application of Flupyradifurone against Cassava Whiteflies Bemisia tabaci and Impact on Its Parasitism in Cassava. Insects.14, 796 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Walter, G. H. Divergent male ontogenies in Aphelinidae (Hymenoptera, Chalcidoidea): A simplified classification and a suggested evolutionary sequence. Biol. J. Linn. Soc. Lond.19, 63–82 (1983). [Google Scholar]

[CR7] 7.Mills, N. J. et al. Prospective modelling in biological control: An analysis of the dynamics of heteronomous hyperparasitism in a cotton-whitefly-parasitoid system. J Appl Ecol.33, 1379–1394 (1996). [Google Scholar]

[CR8] 8.Williams, T. Invasion and displacement of experimental populations of a conventional parasitoid by a heteronomous hyperparasitoid. Biocontrol Sci Technol.6, 603–618 (1996). [Google Scholar]

[CR9] 9.Hunter, M. S. et al. Evolution and behavioral ecology of heteronomous aphelinid parasitoids. Annu. Rev. Entomol.46, 251–290 (2001). [DOI] [PubMed] [Google Scholar]

[CR10] 10.Yang, N. W. et al. Shifting preference between oviposition vs. host-feeding under changing host densities in two aphelinid parasitoids. PLoS One.7, e41189 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Xu, H. Y. et al. Competitive interactions between parasitoids provide new insight into host suppression. PLoS One.8, e82003 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics.27, 764 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Cheng, H. et al. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods.18, 1–6 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Belaghzal, H. et al. Hi-C 2.0: An optimized Hi-C procedure for high-resolution genome-wide mapping of chromosome conformation. Methods123, 56–65 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Zhang, X. et al. Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data. Nat. Plants.5, 833–845 (2019). [DOI] [PubMed] [Google Scholar]

[CR16] 16.Manni, M. et al. BUSCO Update: Novel and streamlined workfows along with broader and deeper phylogenetic coverage for scoring of Eukaryotic, Prokaryotic, and Viral genomes. Mol. Biol. Evol.38, 4647–4654 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Genis, P. et al. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics23, 1061–1067 (2007). [DOI] [PubMed] [Google Scholar]

[CR18] 18.Langmead, B. et al. Fast gapped-read alignment with Bowtie 2. Nat. Methods.9, 357–359 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Li, H. A. statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics.27, 2987–2993 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res.27, 722–736 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Rhie, A. et al. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol.21, 245 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res.27, 573–580 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Bao, W. et al. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA.6, 11 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Tarailo-Graovac, M. et al. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics4, 10 (2009). [DOI] [PubMed] [Google Scholar]

[CR25] 25.Xu, Z. et al. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res.35, 265–268 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci USA117, 9451–9457 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Mei, Y. et al. InsectBase 2.0: a comprehensive gene resource for insects. Nucleic Acids Res.50, D1040–D1045 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res.34, W435–439 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Majoros, W. H. et al. TigrScan and GlimmerHMM: two open-source ab initio eukaryotic gene-finders. Bioinformatics20, 2878–2879 (2004). [DOI] [PubMed] [Google Scholar]

[CR30] 30.Korf, I. Gene fnding in novel genomes. BMC Bioinformatics5, 59 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR31] 31.Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics10, 421 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR32] 32.Birney, E. et al. GeneWise and Genomewise. Genome Res.14, 988–995 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR33] 33.Cock, P. J. A. et al. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res.38, 1767–1771 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] 34.Bolger, A. M. et al. Trimmomatic: a fexible trimmer for Illumina sequence data. Bioinformatics30, 2114–2120 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Kim, D. et al. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods.12, 357–360 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] 36.Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol.33, 290–295 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR37] 37.Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol.9, R7 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR38] 38.Jones, P. et al. InterProScan 5: genome-scale protein function classifcation. Bioinformatics30, 1236–1240 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR39] 39.Bairoch, A. et al. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res.28, 45–48 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Kanehisa, M. et al. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res.28, 27–30 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] 41.Man, X. Genome assembly and annotation files of Encarsia sophia. figshare10.6084/m9.figshare.26426752 (2024).

[CR42] 42.Chan, P. P. et al. tRNAscan-SE 2.0: improved detection and functional classifcation of transfer RNA genes. Nucleic Acids. Res.49, 9077–9096 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Kalvari, I. et al. Rfam 14: expanded coverage of metagenomic, viral and microRNA families. Nucleic Acids. Res.49, D192–D200 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Nawrocki, E. P. et al. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics29, 2933–2935 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR45] 45.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702816 (2024).

[CR46] 46.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702817 (2024).

[CR47] 47.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702818 (2024).

[CR48] 48.CNCB Genome Sequence Archivehttps://bigd.big.ac.cn/gsa/browse/CRA017569 (2024).

[CR49] 49.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702811 (2024).

[CR50] 50.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702815 (2024).

[CR51] 51.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702814 (2024).

[CR52] 52.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702813 (2024).

[CR53] 53.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702810 (2024).

[CR54] 54.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702809 (2024).

[CR55] 55.NCBI Sequence Read Archivehttps://identifiers.org/ncbi/insdc.sra:SRR29702812 (2024).

[CR56] 56.Man, X. Encarsia sophia isolate IPP_NW_YANG_2024, whole genome shotgun sequencing project. GenBankhttps://identifiers.org/ncbi/insdc:JBFBOU000000000.1 (2024).

PERMALINK

A chromosome-level genome assembly of the heteronomous hyperparasitoid wasp Encarsia sophia

Xiaoming Man

Cong Huang

Shengyong Wu

Jianyang Guo

Fanghao Wan

Frédéric Francis

Nianwan Yang

Wanxue Liu

Abstract

Background & Summary

Methods

Parasitoid Wasp Collection and Sequencing

Genome size estimation and assembly

Fig. 1.

Table 1.

Table 2.

Table 3.

Table 4.

Fig. 2.

Table 5.

Table 6.

Genome quality assessment

Table 7.

Genome annotation

Table 8.

Table 9.

Fig. 3.

Table 10.

Table 11.

Data Records

Technical Validation

Acknowledgements

Author contributions

Code availability

Competing interests

Footnotes

Contributor Information

References

Associated Data

Data Citations

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases