Skip to main content
Genome Biology and Evolution logoLink to Genome Biology and Evolution
. 2024 Sep 27;16(10):evae205. doi: 10.1093/gbe/evae205

A Chromosome-Scale and Annotated Reference Genome Assembly of Plecia longiforceps Duda, 1934 (Diptera: Bibionidae)

Jonghwan Choi 1,2, Taemin Kang 3, Sun-Jae Park 4, Seunggwan Shin 5,6,
Editor: John Wang
PMCID: PMC11474240  PMID: 39331700

Abstract

Urbanization is a leading factor effecting global biodiversity, driving rapid evolutionary processes in the local biota. Species that adapt and proliferate in city environments can become pests, with human activities facilitating their dispersal and excessive outbreaks. Here we present the first genome data of Plecia longiforceps, a lovebug pest in Eastern Asia with intensive aggregations recently occurring in the Seoul Metropolitan Area of Korea. PacBio HiFi and ONT Pore-C sequencing data were used to construct a highly continuous assembly with a total size of 707 Mb and 8 major pseudochromosomes, its integrity supported by the N50 length of 98.1 Mb and 96.8% BUSCO completeness. Structural and functional annotation using transcriptome data and ab initio predictions revealed a high proportion (69.3%) of repeat sequences, and synteny analysis with Bibio marci showed high levels of genomic collinearity. The genome will serve as an essential resource for both population genomics and molecular research on lovebug dispersal and outbreaks, and also implement studies on the eco-evolutionary processes of insects in urbanizing habitats.

Keywords: lovebug, March fly, Pore-C, urban pest

Graphical Abstract

Graphical abstract.

Graphical abstract


Significance.

The lovebug species Plecia longiforceps is a recently highlighted urban pest exhibiting active patterns of outbreaks and range expansion in eastern Asia. We present a high-quality, chromosome-scale, and annotated reference genome for P. longiforceps, providing fundamental data for population and evolutionary studies to investigate the process of the species’ dispersal and adaptation to novel environments. Our results also effectively demonstrate the integration of HiFi and Pore-C sequencing technologies for de novo genome assembly in arthropods.

Introduction

Urbanization, with its rapidly increasing speed and scale, is one of the most prominent factors of environmental change and its subsequent effects on biodiversity (Simkin et al. 2022). Cities bring degradation and fragmentation of natural habitats, but also introduce novel landscapes and microclimates, all of which have substantial impacts on the species composition and interactions within the biotic community (Gibb and Hochuli 2002; Faeth et al. 2005). Furthermore, the complex transportation network of humans and goods in urban areas is often accompanied by the importation of foreign organisms, and together with the aforementioned alterations of environmental resistance, facilitates the establishment and proliferation of nonnative species (Byers 2002; McKinney 2006). These evolutionary challenges posed on the local biota drive rapid processes of adaptation (Alberti et al. 2017), as selection toward advantageous traits in human environments, such as pesticide resistance, tolerance to heat island effects, and better dispersal abilities, is frequently observed in urban communities (e.g. Piano et al. 2016; Zhu et al. 2016).

Some species that successfully adapt to human environments can become pests, defined on several criteria including their ability to threaten public health, damage artifacts, and occur in bothering numbers (Robinson 1996). Many species of true flies (Diptera) fall into the last category, as they seek benefit in survival and mating by living in large groups as eggs, larvae, or adults (Krivosheina et al. 2019). Urban environments and human activities can contribute to more intensive aggregations, as drainage systems, home gardens, and city parks provide suitable habitats for larvae, while artificial heat and volatiles attract adults (Busvine 1980; Krivosheina et al. 2019). Outbreaks also occur in the case of invasive species, with biological and environmental components determining the time and intensity of emerging populations (Crooks and Soulé 1999). Therefore, it is important to assess adaptive and evolutionary factors for effective management of outbreak pests in urban surroundings.

Due to their large size and weak, slow flying habits, mating aggregations of March flies (Bibionidae) can be especially conspicuous and thus disturbing to the public. Seasonal emergences of the St. Mark's Fly (Bibio marci) and Fever Fly (Dilophus febrilis) are regarded as nuisance pests in Europe (Busvine 1980). The Common Lovebug (Plecia nearctica) has a well-documented history of dispersal and explosive swarming along the US Gulf Coast, associated with anthropogenic factors including import through human transportation, habitation in highway subgrades, and attraction to common pollutants (Hetrick 1970; Buschman 1976; Callahan et al. 1985). However, compared with research on the ecology and pest status of March flies, little work has been done on the genomic or evolutionary level.

Plecia longiforceps is the latest member of Bibionidae to draw attention as urban pests, showing outbreak patterns closely resembling that of P. nearctica. First described from southeast China (Duda 1934), the insect spread along the Okinawa islands (Fitzgerald and Nakamura 2015; Tone and Osada 2020), and is recently causing major concern in the Seoul Metropolitan Area of Korea as mating swarms reach residential and commercial areas, with changes in seasonality suggesting adaptation to higher latitudes (Kim et al. 2022). As public demands grow for explanations to this emerging source of disturbance, research is required on the evolutionary background of successful expansions in East Asia and proliferation in urban habitats. In addition to field work and physiological experiments, genomic data will be critical in tracking the process of their dispersal, comparing urban and rural populations, and finding adaptive signals in city-associated traits. To address the need of molecular information, we constructed a chromosome-level and annotated genome assembly of P. longiforceps, based on high-fidelity (HiFi) long-read sequencing, Pore-C multiway contact sequencing, and RNA-sequencing data. Providing the first high-quality reference genome for the genus Plecia, we aim to facilitate genomic and evolutionary studies to understand the dispersal and adaptation processes of lovebugs in novel habitats.

Results and Discussion

Genome Assembly

The estimated genome size based on the k-mer distribution of HiFi reads was 702 Mb with 1.1% heterozygosity and 44.1% repeat content (supplementary fig. S1, Supplementary Material online). From the HiFi assembly, 1.0% (32 contigs) was identified as bacterial and nematode sequences and removed (supplementary fig. S2, Supplementary Material online). The final assembly using Pore-C contact information (Fig. 1a and b, Table 1) consisted of 491 scaffolds with a total length of 707 Mb, in accordance to the k-mer based estimate; also, more than 92.4% of the genome (653 Mb) was anchored to the eight largest chromosomal pseudomolecules. Genome statistics showed the N50 scaffold length of 98.1 Mb, 33.2% GC content, and 96.8% BUSCO completeness (94.1% single-copy, 2.6% duplicated, 1.2% fragmented, and 3.2% missing) (Table 1), demonstrating the high continuity and completeness of the de novo genome assembly.

Fig. 1.

Fig. 1.

Chromosome-scale genome assembly of P. longiforceps. a) Pore-C contact map of chromosomal interactions within the P. longiforceps genome assembly, showing eight major pseudomolecules. Outer boxes (blue) indicate final chromosome-level scaffolds, and inner boxes (green) are contigs after error correction in YaHS. b) Snail plot representing assembly statistics, including genome size, largest scaffold length, N50 and N90 lengths, GC composition, and BUSCO evaluation metrics. c) Genomic synteny between the eight major pseudochromosomes of P. longiforceps (pl1 to pl8) and B. marci chromosomes (bm1 to bm5, bmX). Repeat and gene densities of each pseudochromosome are depicted above, calculated in overlapping 100 kb windows.

Table 1.

Summary of genome assembly and annotation statistics for P. longiforceps

Item Value
Assembly statistics Genome size (bp) 706,927,914
Number of scaffolds 491
Number of contigs 801
Longest scaffold length (bp) 192,788,022
Scaffold N50 (bp) 98,122,942
Scaffold N90 (bp) 19,750,146
GC content (%) 33.21
BUSCO scores (insecta_odb10) C: 96.7% [S: 94.1%, D: 2.6%], F: 1.2%, M: 2.1%, n: 1367
Annotation statistics Repeat elements (length) 1,668,214 (489,828,923 bp)
Total gene models 12,648
Protein-coding genes 12,198
Annotated proteins 10,699
BUSCO scores (insecta_odb10) C: 83.1% [S: 81.1%, D: 2.0%], F: 3.1%, M: 13.8%, n: 1367

Structural and Functional Annotation

The combined reference transcriptome consisted of 210,898 sequences with 94.9% BUSCO completeness (1,297 complete BUSCOs), and individual assemblies also had 88.6% to 92.8% completeness (supplementary fig. S3, Supplementary Material online). Of the P. longiforceps genome, 69.3% (490 Mb) was masked as repetitive, including retroelements (26.9%), DNA transposons (8.2%), and simple repeats (4.0%). However, the largest portion was unclassified (208 Mb, 29.4%), implying the lack of genomic information on related taxa. In MAKER, 12,198 genes with an average length of 7,584.42 bp were predicted, encoding proteins with an average of 467.46 amino acids, comprising 13.1% of the genome; 450 tRNAs were also predicted. The final annotated gene set showed 83.1% BUSCO completeness (Fig. 1c, Table 1, supplementary table S1, Supplementary Material online).

Functions of 10,699 gene models (87.7%) were assigned based on at least one hit from the Swiss-Prot, NR, Pfam, CDD, and eggNOG databases; 8,250 and 4,283 were assigned GO terms and KEGG pathways, respectively (Table 1, supplementary table S1, Supplementary Material online). Notably, Pfam searches assigned 128 genes to the cytochrome P450 (CYP) family (PF00067), which is relatively a high number, among insects (Dermauw et al. 2020). CYPs are well known for their role in insecticide resistance, mediating the detoxification of commonly used pesticides including pyrethroids and neonicotinoids (Manjon et al. 2018), and large CYPomes have been reported in common pests such as mosquitoes (Yang and Liu 2011) and house flies (Scott et al. 2014). The increased number of CYPs in the P. longiforceps genome may imply tolerance to insecticides, especially considering that intense swarms were observed in areas under heavy spraying due to a previous outbreak of stick insects (Jung et al. 2020). For other detoxification genes, glutathione S-transferases (PF00043 and PF02798; 35 genes) and carboxylesterases (PF00135; 57 genes) showed higher numbers of annotations compared with model species (Koirala et al. 2022; Cruse et al. 2023), while ABC transporters (PF00005, 50 genes) were moderate (Wu et al. 2019), and UDP-glucosyltransferases (PF00201, 4 genes) were exceptionally scarce (Ahn et al. 2012). Another gene group of interest, heat shock proteins (HSPs) are key factors in tolerance to stresses such as heat, cold, and desiccation, and thus indicators of adaptive competence in novel environments (King and MacRae 2015). HSPs found in this study include 28 Hsp40 (PF00226), 10 Hsp60 (PF00118), 9 Hsp70 (PF00012), and 3 Hsp90 (PF00183) annotations. Although these putative numbers of detoxification- and heat resistance-related genes will require further curation to identify full-length, partial or fragmented genes, and pseudogenes, the annotated regions provide a basis for investigations on the adaptation of P. longiforceps to urban conditions.

Genome Synteny

Synteny analysis between the P. longiforceps (pl1 to pl8) and B. marci (bm1 to bm5, bmX) genomes (Fig. 1c) identified 433 collinearity blocks containing 7,878 genes, showing high levels of collinearity between chromosomal units, specifically pl1 with bm3 and bm5, pl2 with bm1, pl3 with bm4, and pl4 and pl6 with bm2. Also, a large portion of pl8 was syntenic to the X chromosome of B. marci, suggesting its relation with sex determination. Based on this preliminary data, research on the karyotype and sex chromosome system of P. longiforceps will be required to elucidate the processes of chromosomal rearrangement in the evolution of March flies.

Conclusion

Prior to this study, no genomic data had been published for the genus Plecia. In the family Bibionidae, two chromosome-scale assemblies (B. marci [GCA_910594885], D. febrilis [GCA_958336335]) were available, with only the former annotated. As the first genomic data for Plecia, our data will facilitate comprehensive studies on the evolutionary background of their range expansion and outbreak phenomena. The assembly can serve as a high-quality reference genome to identify genomic markers such as microsatellites and single-nucleotide polymorphisms, which have been widely used in population genomics of invasive pests (e.g. Schmidt et al. 2021; Kang et al. 2023). Furthermore, through comparative studies with P. nearctica, and also with relatively rural species such as P. adiastola, we can identify advantageous traits in inhabiting city areas, and factors effecting explosive increases in population. Overall, we hope this data provides a fundamental basis for understanding the adaptation and outbreaks of lovebugs in urban ecosystems.

Materials and Methods

Sample Collection and Sequencing

Final-instar larvae of P. longiforceps were collected from forest soil at Neulsolgil Park, Incheon, Korea (37°23′20.2″N 126°43′14.3″E) in April 2023, and were reared to obtain pupa and adults. Identification was confirmed based on morphological characteristics of male terminalia (Fitzgerald and Nakamura 2015; Kim et al. 2022). Samples were flash-frozen in liquid nitrogen and stored in −80 °C.

HiFi and Pore-C experiments were conducted at the National Instrumentation Center for Environmental Management, Korea. High molecular weight genomic DNA was extracted from a single male adult specimen using the Wizard Genomic DNA Purification Kit (Promega, USA). Extract quality was assessed by 1% agarose gel electrophoresis and quantified with the Quantus Fluorometer (Promega, USA). The HiFi library was prepared using the SMRTbell prep kit 3.0 (Pacific Biosciences (PacBio), USA) and sequenced with two runs on a PacBio Sequel IIe platform. Reads with a quality score of Q20 or higher were obtained using the Circular Consensus Sequencing protocol in SMRT Link ver. 12.0.0.177059.

Twenty male adult specimens were used for Pore-C DNA extractions following the standard procedure (Deshpande et al. 2022) with slight adaptations (supplementary methods, Supplementary Material online). Pore-C libraries were constructed using the Ligation Sequencing Kit V10 (Oxford Nanopore Technologies [ONT], UK) and sequenced on an ONT PromethION platform. Reads with a phred quality score of 7 or higher were selected using the Guppy 6.5.7 software provided by ONT.

RNA-sequencing procedures were conducted at Macrogen, Korea. Total RNA was extracted from four male adults, and one each of female adult, pupa, and final-instar larva, using the RNeasy Mini Kit (Qiagen, Germany). Extractions from male adults were combined into a single sample. RNA integrity was checked using TapeStation (Agilent, USA). RNA-seq libraries were prepared using the TruSeq Stranded mRNA Library Prep Kit (Illumina, USA) and sequenced on an Illumina NovaSeq6000 platform.

Genome Assembly

Genome profiles were estimated from raw HiFi data using GenomeScope 2.0 (Ranallo-Benavidez et al. 2020) based on k-mer (k = 21) frequency. HiFi reads were assembled using Hifiasm-0.19.5 (Cheng et al. 2021). Contaminants were identified and removed from the draft assembly using BLAST + 2.14.1 (Camacho et al. 2009). Pore-C data was processed using the Workflow Pore-C pipeline (https://github.com/epi2me-labs/wf-pore-c) developed by ONT. Results were input to YaHS (Zhou et al. 2023) for chromosome-level scaffolding, and chromosomal contact information was visualized in Juicebox 2.15 (Durand et al. 2016). Genome completeness was assessed with BUSCO 5.4.7 (Manni et al. 2021) using the insecta_odb10 dataset.

Transcriptome Assembly

The four RNA-seq datasets were preprocessed to remove Illumina adapters using Trimmomatic 0.39 (Bolger et al. 2014), foreign contaminants using Kraken2-2.1.3 (Wood et al. 2019), and rRNA sequences using SortMeRNA 4.3.6 (Kopylova et al. 2012). The datasets were concatenated into a single pair for de novo transcriptome assembly using Trinity v2.15.1 (Grabherr et al. 2011). MMseqs2 v.15.6f452 (Steinegger and Söding 2017) was used to extract representative sequences to produce a reference transcriptome for P. longiforceps. Each life stage/sex transcriptome was also assembled separately using Trinity. Completeness of each assembly was assessed with BUSCO using the insecta_odb10 dataset.

Gene Annotation and Synteny Analysis

The MAKER v3.01.04 pipeline (Campbell et al. 2014) was used for gene annotation. A taxon-specific repeat model was generated using RepeatModeler 2.0.5 (Flynn et al. 2020) and combined with FamDB models for repeat identification in RepeatMasker 4.1.5 (Smit et al. 2013). The repeat model, reference transcriptome, and Diptera protein data from Swiss-Prot were used for the initial round of gene prediction. SNAP v.2006-07-28 (Korf 2004) and AUGUSTUS 3.5.0 (Stanke et al. 2008) were trained for ab initio gene prediction in the next two rounds of MAKER to produce the final transcript and protein sequence datasets. Completeness of the annotated protein set was assessed with BUSCO using the insecta_odb10 dataset.

MMseqs2 was used to search the protein models against Swiss-Prot, NCBI NR proteins, Pfam, and CDD databases, and eggNOG-mapper v2.1.12 (Cantalapiedra et al. 2021) was used to search against the EggNOG database and assign GO terms and KEGG pathways. For genome synteny analysis with the published B. marci genome (Sivell et al. 2021), MMseqs2 was used to search for matching protein sequences between P. longiforceps and B. marci, and collinearity blocks were generated using MCScanX (Wang et al. 2012). Detailed procedures on genome assembly and annotation are available in supplementary methods, Supplementary Material online.

Supplementary Material

evae205_Supplementary_Data

Acknowledgments

We thank Sangil Kim, Seunghun Jung, Sang Gyun Jin, and Jeong Jun Lee (Seoul National University) for their critical advice and support throughout this project. We also thank Hwayong An and Soyeon Kang (National Instrumentation Center for Environmental Management) for their contribution in HiFi and Pore-C sequencing experiments, Youngsung Joo and Hanyoung Choi (Seoul National University) for their provision of rearing chambers for P. longiforceps larvae, and Chenzi Zhou (University of Cambridge) for his help on data analysis using YaHS.

Contributor Information

Jonghwan Choi, School of Biological Sciences and Institute of Biodiversity, Seoul National University, Seoul, Korea; Comparative Medicine Disease Research Center, Seoul National University, Seoul, Korea.

Taemin Kang, National Institute of Biological Resources, Incheon, Korea.

Sun-Jae Park, National Institute of Biological Resources, Incheon, Korea.

Seunggwan Shin, School of Biological Sciences and Institute of Biodiversity, Seoul National University, Seoul, Korea; Comparative Medicine Disease Research Center, Seoul National University, Seoul, Korea.

Supplementary Material

Supplementary material is available at Genome Biology and Evolution online.

Funding

This work was supported by the National Institute of Biological Resources funded by the Ministry of Environment of the Republic of Korea [NIBR202405104; NIBR202410201]; the National Research Foundation funded by the Ministry of Science and ICT of the Republic of Korea [2021R1C1C1003452; 2021R1A5A1033157], and the Creative-Pioneering Researcher Program through Seoul National University.

Data Availability

Raw sequencing data, genome assembly, and transcriptome assemblies are deposited at NCBI under the BioProject PRJNA1126849. Predicted protein and transcript models are available at Figshare (DOI: 10.6084/m9.figshare.26112745).

Literature Cited

  1. Ahn SJ, Vogel H, Heckel DG. Comparative analysis of the UDP-glycosyltransferase multigene family in insects. Insect Biochem Mol Biol. 2012:42(2):133–147. 10.1016/j.ibmb.2011.11.006. [DOI] [PubMed] [Google Scholar]
  2. Alberti M, Correa C, Marzluff JM, Hendry AP, Palkovacs EP, Gotanda KM, Hunt VM, Apgar TM, Zhou Y. Global urban signatures of phenotypic change in animal and plant populations. Proc Natl Acad Sci U S A. 2017:114(34):8951–8956. 10.1073/pnas.1606034114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics. 2014:30(15):2114–2120. 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Buschman LL. Invasion of Florida by the “lovebug” Plecia nearctica (Diptera: Bibionidae). Fla Entomol. 1976:59(2):191–194. 10.2307/3493971. [DOI] [Google Scholar]
  5. Busvine JR. Insects and hygiene: the biology and control of insect pests of medical and domestic importance. New York: Springer New York; 1980. [Google Scholar]
  6. Byers JE. Impact of non-indigenous species on natives enhanced by anthropogenic alteration of selection regimes. Oikos. 2002:97(3):449–458. 10.1034/j.1600-0706.2002.970316.x. [DOI] [Google Scholar]
  7. Callahan PS, Carlysle TC, Denmark HA. Mechanism of attraction of the lovebug, Plecia nearctica, to southern highways: further evidence for the IR-dielectric waveguide theory of insect olfaction. Appl Opt. 1985:24(8):1088–1093. 10.1364/AO.24.001088. [DOI] [PubMed] [Google Scholar]
  8. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009:10(1):421. 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Campbell MS, Holt C, Moore B, Yandell M. Genome annotation and curation using MAKER and MAKER-P. Curr Protoc Bioinformatics. 2014:48(1):4.11.1–4.11.39. 10.1002/0471250953.bi0411s48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol Biol Evol. 2021:38(12):5825–5829. 10.1093/molbev/msab293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cheng H, Concepcion GT, Feng X, Zhang H, Li H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021:18(2):170–175. 10.1038/s41592-020-01056-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Crooks J, Soulé ME. Lag times in population explosions of invasive species: causes and implications. In: Sandlund OT, editor. Invasive species and biodiversity management. Amsterdam: Kluwer Academic Publishers; 1999. p. 103–125. [Google Scholar]
  13. Cruse C, Moural TW, Zhu F. Dynamic roles of insect carboxyl/cholinesterases in chemical adaptation. Insects. 2023:14(2):194. 10.3390/insects14020194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dermauw W, Van Leeuwen T, Feyereisen R. Diversity and evolution of the P450 family in arthropods. Insect Biochem Mol Biol. 2020:127:103490. 10.1016/j.ibmb.2020.103490. [DOI] [PubMed] [Google Scholar]
  15. Deshpande AS, Ulahannan N, Pendleton M, Dai X, Ly L, Behr JM, Schwenk S, Liao W, Augello MA, Tyer C, et al. Identifying synergistic high-order 3D chromatin conformations from genome-scale nanopore concatemer sequencing. Nat Biotechnol. 2022:40(10):1488–1499. 10.1038/s41587-022-01289-z. [DOI] [PubMed] [Google Scholar]
  16. Duda O. Diptera. 2. Lonchopteridae, Bibionidae und Muscidae acalyptratae. Schwedisch-chinesische wissenschaftlische expedition nach den nordwestlichen Provinzen Chinas, unter Leitung von Dr. Sven Hedin und Prof. Su Ping-chang. Ark Zool. 1934:26B:1–6. (In German). [Google Scholar]
  17. Durand NC, Robinson JT, Shamim MS, Machol I, Mesirov JP, Lander ES, Aiden EL. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 2016:3(1):99–101. 10.1016/j.cels.2015.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Faeth SH, Warren PS, Shochat E, Marussich WA. Trophic dynamics in urban communities. BioScience. 2005:55(5):399–407. 10.1641/0006-3568(2005)055[0399:TDIUC]2.0.CO;2. [DOI] [Google Scholar]
  19. Fitzgerald SJ, Nakamura T. New record of Plecia longiforceps Duda, 1934 from Yaeyama Islands, Japan (Diptera: Bibionidae). Jpn J Syst Entomol. 2015:21(2):351–354. [Google Scholar]
  20. Flynn JM, Hubley R, Goubert C, Rosen J, Clark AG, Feschotte C, Smit AF. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci U S A. 2020:117(17):9451–9457. 10.1073/pnas.1921046117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gibb H, Hochuli DF. Habitat fragmentation in an urban environment: large and small fragments support different arthropod assemblages. Biol Conserv. 2022:106(1):91–100. 10.1016/S0006-3207(01)00232-4. [DOI] [Google Scholar]
  22. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, et al. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat Biotechnol. 2011:29(7):644–652. 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hetrick LA. Biology of the “Love-Bug”, Plecia nearctica (Diptera: Bibionidae). Fla Entomol. 1970:53(1):23–26. 10.2307/3493110. [DOI] [Google Scholar]
  24. Jung J-K, Nam Y, Kim D, Lee S-H, Lim J-H, Choi WI, Kim E-S. Tree-crown defoliation caused by outbreak of forest insect pests in Korea during 2020. Korean J Appl Entomol. 2020:59(4):409–410. 10.5656/KSAE.2020.10.0.054. [DOI] [Google Scholar]
  25. Kang JH, Ham D, Park SH, Hwang JM, Park S-J, Baek MJ, Bae YJ. Population genetic structure of a recent insect invasion: a gall midge, Asynapta groverae (Diptera: Cecidomyiidae) in South Korea since the first outbreak in 2008. Sci Rep. 2023:13(1):2812. 10.1038/s41598-023-29782-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kim S, Jung S, Choi J, Tsai C-L, Farrell BD, Shin S. History does not repeat itself; it rhymes: range expansion and outbreak of Plecia longiforceps (Diptera: Bibionidae) in East Asia. J Integr Pest Manag. 2022:13(1):31. 10.1093/jipm/pmac026. [DOI] [Google Scholar]
  27. King AM, MacRae TH. Insect heat shock proteins during stress and diapause. Annu Rev Entomol. 2015:60(1):59–75. 10.1146/annurev-ento-011613-162107. [DOI] [PubMed] [Google Scholar]
  28. Koirala BKS, Moural T, Zhu F. Functional and structural diversity of insect glutathione S-transferases in xenobiotic adaptation. Int J Biol Sci. 2022:18(15):5713–5723. 10.7150/ijbs.77141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kopylova E, Noé L, Touzet H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012:28(24):3211–3217. 10.1093/bioinformatics/bts611. [DOI] [PubMed] [Google Scholar]
  30. Korf I. Gene finding in novel genomes. BMC Bioinformatics. 2004:5(1):59. 10.1186/1471-2105-5-59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Krivosheina MG, Krivosheina NP, Krivosheina GG, Ozerov AL. Outbreaks of soil-dwelling Diptera (Insecta): has the phenomenon a natural or an anthropogenic origin. Entomol Rev. 2019:99(6):733–743. 10.1134/S0013873819060034. [DOI] [Google Scholar]
  32. Manjon C, Troczka BJ, Zaworra M, Beadle K, Randall E, Hertlein G, Singh KS, Zimmer CT, Homem RA, Lueke B, et al. Unravelling the molecular determinants of bee sensitivity to neonicotinoid insecticides. Curr Biol. 2018:28(7):1137–1143.e5. 10.1016/j.cub.2018.02.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol. 2021:38(10):4647–4654. 10.1093/molbev/msab199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. McKinney ML. Urbanization as a major cause of biotic homogenization. Biol Conserv. 2006:127(3):247–260. 10.1016/j.biocon.2005.09.005. [DOI] [Google Scholar]
  35. Piano E, De Wolf K, Bona F, Bonte D, Bowler DE, Isaia M, Lens L, Merckx T, Mertens D, van Kerckvoorde M, et al. Urbanization drives community shifts towards thermophilic and dispersive species at local and landscape scales. Glob Change Biol. 2016:23(7):2554–2564. 10.1111/gcb.13606. [DOI] [PubMed] [Google Scholar]
  36. Ranallo-Benavidez TR, Jaron KS, Schatz MC. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat Commun. 2020:11(1):1432. 10.1038/s41467-020-14998-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Robinson WH. Urban entomolgy: insect and mite pests in the human environment. New York: Taylor & Francis; 1996. [Google Scholar]
  38. Schmidt TL, Swan T, Chung J, Karl S, Demok S, Yang Q, Field MA, Muzari MO, Ehlers G, Brugh M, et al. Spatial population genomics of a recent mosquito invasion. Mol Ecol. 2021:30(5):1174–1189. 10.1111/mec.15792. [DOI] [PubMed] [Google Scholar]
  39. Scott JG, Warren WC, Beukeboom LW, Bopp D, Clark AG, Giers SD, Hediger M, Jones AK, Kasai S, Leichter CA, et al. Genome of the house fly, Musca domestica L., a global vector of diseases with adaptations to a septic environment. Genome Biol. 2014:15(10):466. 10.1186/s13059-014-0466-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Simkin RD, Seto KC, McDonald RI, Jetz W. Biodiversity impacts and conservation implications of urban land expansion projected to 2050. Proc Natl Acad Sci U S A. 2022:119(12):e2117297119. 10.1073/pnas.2117297119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Sivell O; Natural History Museum Genome Acquisition Lab; Darwin Tree of Life Barcoding collective; Wellcome Sanger Institute Tree of Life programme; Wellcome Sanger Institute Scientific Operations: DNA Pipelines collective; Tree of Life Core Informatics collective; Darwin Tree of Life Consortium . The genome sequence of the St Mark's fly, Bibio marci (Linnaeus, 1758). Wellcome Open Res. 2021:6:285. 10.12688/wellcomeopenres.17265.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Smit AFA, Hubley R, Green P.. 2013. RepeatMasker Open-4.0. http://www.repeatmasker.org.
  43. Stanke M, Diekhans M, Baertsch R, Haussler D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008:24(5):637–644. 10.1093/bioinformatics/btn013. [DOI] [PubMed] [Google Scholar]
  44. Steinegger M, Söding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol. 2017:35(11):1026–1028. 10.1038/nbt.3988. [DOI] [PubMed] [Google Scholar]
  45. Tone K, Osada M. Report of the Plecia longiforceps outbreak in Okinawa Island. Insects Loochoos. 2020:44:81–82. (In Japanese). [Google Scholar]
  46. Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, Lee T-H, Jin H, Marler B, Guo H, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012:40(7):e49. 10.1093/nar/gkr1293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019:20(1):257. 10.1186/s13059-019-1891-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Wu C, Chakrabarty S, Jin M, Liu K, Xiao Y. Insect ATP-binding cassette (ABC) transporters: roles in xenobiotic detoxification and Bt insecticidal activity. Int J Mol Sci. 2019:20(11):2829. 10.3390/ijms20112829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Yang T, Liu N. Genome analysis of cytochrome P450s and their expression profiles in insecticide resistant mosquitoes, Culex quinquefasciatus. PLoS One. 2011:6(12):e29418. 10.1371/journal.pone.0029418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Zhou C, McCarthy SA, Durbin R. YaHS: yet another Hi-C scaffolding tool. Bioinformatics. 2023:39(1):btac808. 10.1093/bioinformatics/btac808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Zhu F, Lavine L, O'Neal S, Lavine M, Foss C, Walsh D. Insecticide resistance and management strategies in urban ecosystems. Insects. 2016:7(1):2. 10.3390/insects7010002. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

evae205_Supplementary_Data

Data Availability Statement

Raw sequencing data, genome assembly, and transcriptome assemblies are deposited at NCBI under the BioProject PRJNA1126849. Predicted protein and transcript models are available at Figshare (DOI: 10.6084/m9.figshare.26112745).


Articles from Genome Biology and Evolution are provided here courtesy of Oxford University Press

RESOURCES