Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2025 Dec 11;16:39. doi: 10.1038/s41598-025-28795-9

Morphological traits and mitogenome of Thaumapsylla breviceps orientalis endemic to China provide insight into the evolution of the order Siphonaptera

Ju Pu 1, Xiaoxia Lin 1, Xiaobin Huang 1, Xiaoyan Zheng 1, Xianzheng Zhang 1, Wenge Dong 1,
PMCID: PMC12764565  PMID: 41381548

Abstract

The Thaumapsylla breviceps orientalis is endemic to China and exhibits extreme host specificity (monoxenous). This study reports the first mitogenome sequenced for the family Ischnopsyllidae and provides comprehensive analysis of both the morphological characteristics and mitogenome of T. b. orientalis. The assembled mitogenome is 15,631 bp in length with a high AT content (78.5%), and its codon usage bias is predominantly shaped by natural selection. Evolutionary analysis based on evolutionary rates and nucleotide diversity across different families within Siphonaptera revealed that Pulicidae has the fastest evolutionary rate and the highest nucleotide diversity, a pattern likely driven by differences in their hosts and habitats. We observed that early-diverging flea lineages are predominantly polyxenous, whereas later-diverging lineages are primarily pleioxenous or monoxenous. Phylogenetic results indicate that the taxonomic status of the families Ctenophthalmidae, Vermipsyllidae, and Hystrichopsyllidae requires further study and revision. This research addresses a key knowledge gap in Ischnopsyllidae mitogenomics and clarifies the phylogenetic relationships within the order Siphonaptera.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-025-28795-9.

Keywords: Thaumapsylla breviceps orientalis, Ischnopsyllidae, Mitogenome, Phylogeny

Subject terms: Taxonomy, Genome, Bayesian inference

Introduction

The known family Ischnopsyllidae comprises 2 subfamilies, 5 tribes, 20 genera and 162 species, of which 6 genera and 30 species have been recorded in China1. The Ischnopsyllidae is a distinct family and morphologically closely related to the Leptopsyllidae and Ceratophyllidae. Fleas do not choose their hosts randomly, although opportunistic behavior may occur, their choice is influenced by specific requirements for conditions such as light, airflow, temperature, blood, hair, and host behavior2. Based on host specificity, fleas are classified into three primary types: (1) Polyxenous, characterized by low host specificity and the ability to parasitize a wide range of hosts; (2) Pleioxenous, restricted to hosts within a single family; (3) Monoxenous, exhibiting high host specificity, typically limited to a single host species or closely related species within the same genus. Thaumapsylla breviceps orientalis is a typical monoxenous flea species, primarily parasitizing fruit bats (mainly Rousettus leschenaultii)3. The high host specificity of T. b. orientalis may be attributed to several proposed factors: (1) the optimal body temperature of bats provides a suitable microenvironment for these fleas4; (2) the unique chemical composition of bat blood and their distinct olfactory cues may facilitate host recognition and preference5; and (3) the specific microclimatic conditions within bat caves may further influence flea survival and reproduction6. T. b. orientalis had no record for medical significance, its primary host (Rousettus leschenaultii) can carry various viruses, such as Coronavirus, Nipah virus, and Bartonella79, thereby serving as a critical component in a potential “virus reservoir - vector - new host” transmission chain.

Mitogenome also known as the second genetic system, was the primary genetic material outside nuclear in animals10. Owing to its strictly maternal inheritance, high mutation rate, and low recombination rate, the mitogenome serves as a powerful molecular marker and was widely used in studies of insect phylogeny, population genetics, and the identification of cryptic species11. Throughout evolutionary history, the base composition of most insect mitogenomes has remained relatively conserved. However, their structural organization can exhibit significant variation, including gene rearrangements, indels, as well as mutations in tRNA anticodons and the presence of atypical start codons12,13. Research on Siphonapteran mitogenomes has progressed more slowly compared to other major insect orders such as Diptera, Lepidoptera, and Coleoptera1416. Notably, a complete mitogenome from the family Ischnopsyllidae has not been available until now. To address this gap and to understand the evolution and phylogeny within Siphonaptera, we combined a detailed morphological description of T. b. orientalis with the first sequencing and analysis of its mitogenome. This study provides a valuable genomic resource for understanding the evolution and phylogeny of the Ischnopsyllidae.

Materials and methods

Specimen collection, morphological identification, DNA extraction and mitogenome sequencing

In June 2023, 17 individuals (male: 7, female: 10) of T. b. orientalis were collected from Rousettus leschenaultii (Chiroptera: Pteropodidae) captured in Baoshan City, Yunnan Province. The fleas were placed in Eppendorf tubes containing 95% ethanol, and data on the habitat, host and weather were recorded. Subsequently, they were transported to the laboratory for storage in an ultra-low temperature refrigerator at − 80 °C. All specimens were collected and preserved with the approval of the Animal Ethics Committee of Dali University (approval number: MECDU-201912-20).

The collected T. b. orientalis individuals were identified primarily based on morphological characteristics described in Fauna Sinica Insecta Siphonaptera1. Specimens selected for morphological documentation were sealed and dried in a 47 °C incubator for 3–5 days. Subsequently, both male and female specimens were photographed using an X-type optical microscope. The remaining specimens were sent to Shenzhen Huitong Biotechnology Co., Ltd. (China) for DNA analysis. Following tests for DNA purity and integrity, a sequencing library with an insertion fragment of 350 bp was constructed using the NEB Next Ultra DNA Library Prep Kit for Illumina (NEB, USA). Double-end sequencing was then performed on an Illumina Novaseq 6000 platform. Raw data were processed with the fastp software (https://github.com/OpenGene/fastp) to obtain clean reads. The filtering criteria included: removal of reads containing over 5% N-bases, removal of reads with more than 50% of bases having a quality score ≤ 5, and removal of adapter-contaminated reads.

Mitochondrial sequence assembly, annotation and analysis

The mitogenome was assemble using MitoZ 2.317. The reliability of the assembled was evaluated with BWA v 0.7.1718 and Samtools v 0.1.2019. To further validate the assembly, we used Geneious Prime 11.020 with the following parameters: a minimum overlap of 50 bp and a minimum overlap homology of 100%. PCGs and rRNA genes were identified using BLAST21. The tRNA genes were predicted with tRNAscan SE22 and ARWEN23, and all gene annotations were manually verified and corrected. The annotated mitogenome sequence of T. b. orientalis was deposited in GenBank (accession number: PP973737). Subsequent analyses were performed using a suite of tools, including CodonW v.1.4.2 (https://sourceforge.net/projects/codonw/), R package24, MEGA25 and DnaSP v 526. Nucleotide composition biases were assessed by calculating the AT-skew and GC-skew using the formulas: AT-skew = (A - T) / (A + T) and GC-skew = (G - C) / (G + C).

Phylogenetic analysis

The Boreus elegans (Carpenter, 1935) was selected as the outgroup for phylogenetic tree construction. A concatenated nucleotide dataset based on 13 PCGs and two rRNA genes was analyzed using both Bayesian inference (BI)27 and Maximum likelihood (ML)28 methods. Species information is provided in Table S2. First, the sequences of the 13 PCGs and two rRNA genes were individually aligned using MAFF29 and optimized with MACSE v2.6030. The resulting PCGs alignments were then trimmed with Gblocks31, the rRNA gene alignments were trimmed with trimAl32. The processed sequences were concatenated into a supermatrix. Nucleotide substitution saturation was assessed for this dataset using DAMBE33. The results (Iss < Iss.c, p < 0.05) indicated no significant saturation, confirming its suitability for phylogenetic reconstruction. Second, the optimal nucleotide substitution model was determined using ModelFinder34 (Table S3). An ML tree was constructed with IQ-TREE35, performing 1000 bootstrap replicates. A BI tree was constructed using MrBayes 3.2.6, running four independent Markov Chain Monte Carlo (MCMC) chains for 1 million generations each and sampling every 1000 generations. Finally, FigTree v.1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/) was used to visualize and finalize the phylogenetic trees for presentation.

Results

Morphological characteristics

The head of T. b. orientalis was shorter than that of most fleas (Ceratophyllidae, Hystrichopsyllidae, Pulicidae and Ctenophthalmidae). It was fracticipital, characterized by its short and wide shape, which distinguished it from most other flea species. There were 2 genal combs. Column 1 of frons bristle had 6 to 7 bristles. There was one ocular bristle, and the club was long and narrow. Males had 36 to 38 pronotal combs, while females had 34 to 37. The mesothorax was longer than both the prothorax and the metathorax. The coxal bristle on the fore leg were numerous and densely arranged. There were 3 notches on the posterior edge of the tibia. An antepygidial bristle was present in both males and females, with one bristle per individual. There was no boundary between the immovable process and the clasper, and the movable process was a wide and short oval. The hilla was round. The presence of a convex margin along the apex of the basimere was an important character distinguishing T. b. orientalis from other male subspecies (Fig. 1).

Fig. 1.

Fig. 1

Morphological characteristics of a (♂) & b (♀) of T. b. orientalis from China. Image was acquired using a Leica DM3000 microscope.

Mitogenome structure and organization

The insect mitogenome was a double-stranded circular DNA molecule. The mitogenome of T. b. orientalis (GenBank accession number: PP973737) was 15,631 bp in size (the non-coding region was not complete and was not discussed in this paper). It contained 37 genes typical of metazoan animals, with 23 genes were located on the major chain (J-strand), including nine protein-encoding genes (nad2, cox1, cox2, atp8, atp6, cox3, nad3, nad6 and cob) and fourteen transfer RNA genes (trnI, trnM, trnW, trnL2 (taa), trnK, trnD, trnG, trnA, trnR, trnN, trnS1 (tct), trnE, trnT and trnS2 (tga)). The remaining fourteen genes were located on the minor chain (N-strand), including four protein-encoding genes (nad1, nad4, nad4L and nad5), eight transfer RNA genes (trnY, trnC, trnQ, trnV, trnL1 (tag), trnP, trnH and trnF) and two rRNA genes (rrnL and rrnS) (Fig. 2). The mitogenome base composition of T. b. orientalis was A: 37.8%, T: 40.7%, C: 13.1%, G: 8.4%, with a high AT content (78.5%). The AT content at the first codon position was 71.8%, 68.7% at the second codon position, and as high as 89.9% at the third codon position, indicating a stronger AT preference at the third codon position (Table 1). There were fourteen gene overlap regions (totaling 35 bp), and the largest overlap region was 7 bp (between atp8 and atp6 / nad4 and nad4L). There were ten intergenic regions (totaling 33 bp), and the largest intergenic region was 17 bp (between trnS2 and nad1) (Table S1).

Fig. 2.

Fig. 2

Organization of the T. b. orientalis mitogenome. tRNA genes were shown with the single-letter abbreviations of their corresponding amino acids. The two leucines are denoted by L1 (tag) and L2 (taa), and the two serines are denoted by S1 (tct) and S2 (tag).

Table 1.

Nucleotide composition and skewness of the T. b. orientalis mitogenome.

Gene A% T% C% G% AT% AT-skew GC-skew
PCGs 30.5 41.8 14.3 13.5 72.3 -0.156 -0.03
1st codon position 34.5 37.3 11.1 17.1 71.8 -0.039 0.212
2nd codon position 21.3 47.4 17.3 14.0 68.7 -0.380 -0.107
3rd codon position 43.5 46.4 5.9 4.3 89.9 -0.032 -0.164
rRNAs 39.2 40.3 6.5 13.9 79.5 -0.013 0.363
tRNAs 40.1 38.1 10.1 11.8 78.2 0.026 0.075
Whole genome 37.8 40.7 13.1 8.4 78.5 -0.037 -0.221

Protein coding genes, codon usage bias and tRNA secondary structure analysis

All PCGs of T. b. orientalis used ATN (ATT/ATC/ATG/ATA) as start codon, and all PCGs used TAA or TAG as stop codons except for nad5 and nad4 with incomplete T– as stop codons (Table S1). Codon usage bias directly affects translational efficiency. As this process occurs at the mRNA level (where T was replaced by U), the codons consist of A, U, G, and C. Analysis of the 13 PCGs identified 27 preferred codons (RSCU > 1), including UUA, UCU, CCU, CGA, ACU, GUA and GGA. The least frequently used codons were UCG, CUG, CGC, ACG, AGC, GCG and GGC (Fig. 3a). In the ENC-plot, except for cob, nad4L, nad4, atp6 and cox2 with Nc > 35, the other eight genes had Nc < 35. Most genes were located considerably below the standard curve, indicating that codon usage bias was primarily influenced by natural selection (Fig. 3b). The neutral curve showed that the points for each gene did not fall on the diagonal, and the correlation between GC12 and GC3s was not significant (R = 0.105). This suggests that the base composition of GC12 and GC3s differed and that the genomic GC content was highly conserved, with codon usage bias being more strongly influenced by natural selection (Fig. 3c). The PR2 analysis revealed that all genes were distributed away from the central point (0.5, 0.5), with A ≠ T and G ≠ C, indicating that both selection and mutation pressures collectively shaped codon usage bias (Fig. 3d).

Fig. 3.

Fig. 3

Preference analysis of 13 PCGs of T. b. orientalis. (a Relative Synonymous Codon Usage; b: ENC-plot; c: Neutral curve; d: PR2)

The 22 tRNA genes of T. b. orientalis ranged in length from 63 to 70 bp (Table S1). Among them, only trnS1 lacked the D-arm, while the remaining 21 tRNAs had typical cloverleaf-like structures. Most tRNAs formed typical Watson-Crick pairing, with a few mismatches (T-G: 15, T-T: 2, C-A: 1). Most of these mismatches occurred at the junction of the stem and the loop (Fig. S1).

Analysis of evolutionary differences among Siphonaptera based on evolutionary rate, nucleotide diversity and codon usage bias

Due to the availability of only a single sequenced mitogenome from the families Hystrichopsyllidae, Pygiopsyllidae, Ischnopsyllidae, and Vermipsyllidae, the average Ka/Ks values were not calculated for these groups. The average Ka/Ks values for the remaining families were ranked as follows: Pulicidae (0.7457) > Ceratophyllidae (0.5767) > Leptopsyllidae (0.5702) > Ctenophthalmidae (0.5504). Furthermore, the species with the highest evolutionary rates were Ctenocephalides felis (Bouche, 1835): 0.8423, C. f. felis (Bouche, 1835): 0.7999, C. orientis (Jordan, 1925): 0.7868, C. canis (Curtis, 1826): 0.7219, and Xenopsylla cheopis (Rothschild, 1903): 0.6873 (Table S2). Families that diverged around the Late Cretaceous (Pulicidae, Pygiopsyllidae, and Ctenophthalmidae) were predominantly characterized by polyxenous. In contrast, families that diverged during the Paleocene to Eocene (Hystrichopsyllidae, Vermipsyllidae, Leptopsyllidae, and Ceratophyllidae) were predominantly characterized by pleioxenous or monoxenous (Table 2). Additionally, the flea species parasitizing Rattus tanezumi and R. norvegicus were highly diverse. Among the 25 sequenced Siphonaptera mitogenomes in this study, 12 and 9 species were associated with R. tanezumi and R. norvegicus, respectively (Table 2).

Table 2.

Divergence times and host information for fleas.

Divergence times Species Host

Pulicidae

(Late Cretaceous)

Ctenocephalides felis Felis catus, Canidae, Rattus tanezumi, Lepus sinensis, Tupaia belangeri, Homo sapiens(Carnivora, Rodentia, Scandentia, Primates)
Ctenocephalides felis felis Felis catus, Canidae, Rattus tanezumi, Lepus sinensis, Tupaia belangeri, Homo sapiens(Carnivora, Rodentia, Scandentia, Primates)
Ctenocephalides orientis It is mainly parasite on the family Canidae, a few mammals (Carnivora).
Ctenocephalides canis It is mainly parasitic in the family Canidae, a few mammals (Carnivora).
Pulex irritans It is parasite on 15 orders, 77 genera and 130 species.
Xenopsylla cheopis Rattus norvegicus, Rattus tanezumi, Rattus rattus, Spermophilus dauricus, Apodemus agrarius, Suncus murinus, Suncus etruscus (Rodentia, Eulipotyphla)
Hystrichopsyllidae (Paleocene to Eocene.) Hystrichopsylla weida qinlingensis Anourosorex squamipes, Sorex cylindricauda (Family Soricidae)

Vermipsyllidae

(Paleocene to Eocene.)

Dorcadia ioffi Ovis aries, Capra hircus (Family Bovidae)

Pygiopsyllidae

(Late Cretaceous)

Aviostivalius aklossi bispiniformis Leopoladmys edwardsi, Rattus tanezumi, Rattus nitidus, Niviventer fulvescen, Rattus rattus, Niviventer confucianus, Tupaia belangeri, Anourosorex squamipes, Berylmys bowersi, Rattus brunneusculus, Bandicota indica, Niviventer coninga, Ratufa bicolor (Rodentia, Scandentia, Eulipotyphla)

Ctenophthalmidae

(Late Cretaceous)

Neopsylla specialis Apodemus agrarius, Apodemus chevrier, Rattus tanezumi, Niviventer confucianus, Niviventer fulvescen, Eothenomys miletus, Niviventer coxingi, Niviventer confucianus, Rattus losea, Rattus norvegicus (Mainly parasite on the family Muridae.)
Stenischia humilis Niviventer fulvescen, Apodemus agrarius, Rattus norvegicus, Rattus tanezumi, Apodemus chevrier, Eothenomys miletus, Rattus nitidus, Crocidura attenuata, Tupaia glis, Phodopus sungorus, Tscherskia triton, Spermophilus dauricus (Mainly parasite on Rodentia; a very few are parasite on Eulipotyphla and Scandentia.)
Stenischia montanis yunlongensis Apodemus draco, Apodemus chevrieri, Niviventer confucianus, Eothenomys custos, Rattus norvegicus, Anourosorex squamipes, Scaptonyx fusicaudus, Neotetracus sinensis (Rodentia, Eulipotyphla)
Ctenophthalmus yunnanus Neodon, Eothenomys fidelis, Eothenomys custos, Rattus norvegicus, Ochotona thibetana, Apodemus peninsulae (Mainly parasite on Rodentia; a very few are parasite on Lagomorpha)
Ctenophthalmus quadratus Eothenomys miletus, Eothenomys custos, Eothenomys olitor, Apodemus draco, Apodemus latronum, Rattus tanezumi, Rattus nitidus, Berylmys bowersi, Rattus rattus, Mus musculus, Dremomys pernyi, Crocidura attenuate (Mainly parasite on Rodentia; a very few are parasite on Eulipotyphla.)

Leptopsyllidae

(Paleocene to Eocene.)

Paradoxopsyllus custodis Rattus norvegicus, Rattus tanezumi, Eothenomys custos, Rupestes forresti, Niviventer fulvescen (Family Muridae)
Frontopsylla spadix Apodemus sylvaticus, Apodemus agrarius, Niviventer fulvescen, Niviventer confucianus, Rattus losea, Rattus tanezumi, Cricetulus longicaudatus, Allactaga sibirica, Apodemus chevrier etc. (Rodentia)
Frontopsylla diqingensis Apodemus peninsulae, Ochotona thibetana, Apodemus agrarius, Rattus nitidus, Rattus tanezumi, Niviventer eha (Rodentia)
Leptopsylla segnis Mus musculus, Rattus norvegicus, Rattus rattus, Rattus tanezumi, Rattus losea, Niviventer fulvescen, Rattus nitidus, Apodemus agrarius, Apodemus sylvaticus, Tscherskia triton (Family Muridae)

Ceratophyllidae

(Paleocene to Eocene.)

Citellophilus tesquorum Spermophilus undulatus, Spermophilus erythrogenys, Marmota baibacina, Apodemus sylvaticus, Cricetus migratorius, Apodemus sylvaticus, Felis silvestris, Rhombomys opimus, Spermophilus dauricus, Meriones unguiculatus, Meriones meridianus, Myospalax aspalax (Rodentia)
Ceratophyllus wui Aerodramus brevirostris innominate (Family Apodidae)
Nosopsyllus laeviceps Meriones meridianus, Rhombomys opimus, Meriones libycus, Marmota baibacina, Cricetus migratorius, Dipus sagitta, Mus musculus, Cricetulus barabensis, Rattus norvegicus, Meriones tamariscinus, Vulpes vulpes (Mainly parasite on Rodentia; a very few are parasite on Carnivora.)
Ceratophyllus anisus Rattus norvegicus, Rattus tanezumi, Rattus losea, Rattus nitidus, Berylmys bowersi, Apodemus agrarius, Mus musculus, Callosciurus erythraeus, Tupaia belangeri, Crocidura attenuate etc. (Mainly parasite on Rodentia; a very few are parasite on Primates and Carnivora.)
Macrostylophora euteles Dremomys lokriah, Dremomys rufigenis, Dremomys pernyi, Sciurotamias davidianus, Rupestes forresti, Tamiops swinhoei, Callosciurus erythraeus, Hylopetes alboniger, Apodemus agrarius, Tupaia belangeri (Mainly parasite on the family Sciuridae)

Ischnopsyllidae

(Paleocene to Eocene.)

Thaumapsylla breviceps orientalis Rousettus leschenaultii (Pteropodidae)

The divergence time of fleas is referred to Wu et al.1 and Zhu et al.36. The aforementioned host information is based on currently reported data1,37, with occasional and temporary hosts not included in this list.

Since there was only one species from the families Ischnopsyllidae, Hystrichopsyllidae, Pygiopsyllidae and Vermipsyllidae represented in the mitogenome database, an analysis of nucleotide diversity could not be performed. For the analyzable families, the nucleotide diversity (π) and the number of variable sites were as follows: Pulicidae (π = 0.3644; 6723 sites), Ctenophthalmidae (π = 0.3023; 5751 sites), Leptopsyllidae (π = 0.2390; 3733 sites), and Ceratophyllidae (π = 0.2594; 5609 sites) (Fig. 4). In addition, the nad5 and nad6 genes were characterized by greater sequence polymorphism and higher substitution rate (Fig. 4). A significant positive correlation was identified between the mitogenome AT content and evolutionary rate (Fig. 5). Statistical analysis of the Relative Synonymous Codon Usage (RSCU) across Siphonaptera species revealed that UUA was the most frequently used codon (461 times). Furthermore, most of the preferred codons for highly expressed genes ended with A or U (Fig. 6).

Fig. 4.

Fig. 4

Nucleotide diversity analysis of 4 families of Siphonaptera.

Fig. 5.

Fig. 5

Correlation analysis between evolutionary rate and AT content of Siphonaptera.

Fig. 6.

Fig. 6

RSCU cluster analysis of Siphonaptera based on PCGs.

Exploring the evolutionary relationship of the siphonaptera

Using Boreus elegans as the outgroup, a phylogenetic tree was constructed based on 13 PCGs and 2 rRNA gene sequences by Bayesian method (BI) and Maximum likelihood (ML) methods (Fig. 7). The phylogenetic relationships of Vermipsyllidae, Hystrichopsyllidae and Ctenophthalmidae were inconsistent between the BI and ML analyses. In addition to the outgroup, there are two main clades. The topological structure of BI tree was (((Ceratophyllidae + Ischnopsyllidae + Leptopsyllidae) + ((Vermipsyllidae + Hystrichopsyllidae) + Ctenophthalmidae)) + (Pulicidae + Pygiopsyllidae)). The topological structure of ML tree was (((Ceratophyllidae + Ischnopsyllidae + Leptopsyllidae) + (Ctenophthalmidae + Hystrichopsyllidae + Vermipsyllidae)) + (Pulicidae + Pygiopsyllidae)). The clade of Vermipsyllidae, Ctenophthalmidae and Hystrichopsyllidae was low supported(Bpp<0.75, Bsp<61). The results showed that T. b. orientalis and Leptopsylla segnis clustered together as a branch with high support (BPP = 1, BSP = 78).

Fig. 7.

Fig. 7

Phylogenetic relationship of Siphonaptera was reconstructed by Bayesian inference and Maximum likelihood method based on 13 PCGs. (Note:① suggests that the BI and ML topologies are inconsistent.)

Discussion

Among the currently sequenced mitogenomes of Siphonaptera, T. b. orientalis was the only parasite that infests the body surface of bats. The flight capability and specific body structure of bats have likely driven distinct morphological specializations in T. b. orientalis: (1) Adaptation to host environment: studies have shown that prolonged habitation in lightless environments leads to eye degeneration in species38. Correspondingly, the T. b. orientalis lacks pigment in its compound eyes, an adaptation consistent with the cave-dwelling and nocturnal habits of its host; (2) Attachment and Stability: the small size, flattened body, and resilient cuticle of fleas facilitate movement through the host’s pelage. We hypothesize that the well-developed pronotal comb (bearing 34–38 spines) in T. b. orientalis serves as a key adaptive trait, enabling the flea to maintain a firm grip on the host, particularly during flight. This is analogous to the rapid evolution of body size and color in feather lice in response to host plumage39; (3) Sensory Perception: the club (the third antenna segment) serves as the primary olfactory organ in fleas, playing a critical role in detecting host chemical signals and mechanical stimuli40. The pygidium, located at the abdominal terminus, is involved in tactile and chemosensory perception, aiding in host location and skin attachment41. The well-developed club and pygidium in T. b. orientalis are therefore essential for host localization and survival. In summary, these morphological characteristics collectively enhance the stability of its attachment to the bat’s body and improve its survival capacity within the unique ecological niche provided by its volant host.

To explore the molecular level of adaptive evolution in T. b. orientalis, we analyzed its mitogenome structure, codon usage bias, and evolutionary rate. The gene arrangement was consistent with that of the putative arthropod ancestor (Drosophila yakuba)42. We identified 27 high-frequency codons (predominantly A- or U-ending) in its mitogenome. As codon usage bias is a key regulator of gene expression that influences transcription, translation, and protein folding43,44. Our results indicate that natural selection plays a crucial role in shaping the molecular evolution of T. b. orientalis. Furthermore, our phylogenetic analysis revealed a general evolutionary trend: early-diverging flea lineages are predominantly polyxenous, whereas later-diverging lineages are primarily pleioxenous or monoxenous This pattern suggests a history of concerted evolution and increasing host specificity within Siphonaptera. The reasons for the high diversity of flea species parasitizing certain hosts, such as R. tanezumi and R. norvegicus, warrant further study. We also investigated factors influencing mitogenome evolutionary rates across Siphonaptera. A significant positive correlation was found between evolutionary rate and AT content, identifying base composition as a putative driver of rate variation. Notably, the subspecies Ctenocephalides felis felis exhibited a higher evolutionary rate (0.8423) than the nominate species Ctenocephalides felis (0.7999), raising the question of whether subspecies-level taxa generally display lower evolutionary rates—a hypothesis requiring future validation. Finally, previous studies have proposed nad5 and nad6 as superior markers to the traditionally used cox1 for species differentiation and population genetics in eukaryotes45. In line with this, we observed high sequence polymorphism and substitution rates in nad5 and nad6, suggesting their strong potential as molecular markers for resolving phylogenetic controversies within Siphonaptera.

Compared to most holometabolous insects, fleas exhibit a relatively complex taxonomic hierarchy that includes superfamily, family, subfamily, tribe, genus, species, and subspecies46. The topologies of the phylogenetic trees constructed using the two methods were largely congruent, differing only in the unstable positions of the families Ctenophthalmidae, Vermipsyllidae, and Hystrichopsyllidae—a finding consistent with previous studies47,48. The taxonomic status of Ctenophthalmidae remains particularly problematic, as it contains approximately one-quarter of all known flea species and has historically served as a “catch-all” category for taxa that are difficult to assign to other families46. One notable result was the clustering of Macrostylophora euteles (Ceratophyllidae) and Paradoxopsyllus custodis (Leptopsyllidae) into a highly supported clade (BPP = 1, BSP = 100). Further sequence alignment revealed an exceptionally high mitochondrial DNA sequence similarity of 98.8% between them (98.6% for cox1, 98.8% for cob, 99.5% for rrnL, and 99.8% for rrnS). This high genetic similarity suggests that the sequences deposited under these two names in GenBank may belong to the same species, indicating a potential misidentification. This hypothesis is surprising given that the two taxa are considered morphologically distinct and unlikely to be conspecific, highlighting a critical discrepancy between molecular and morphological data that warrants further investigation. As sequencing technologies continue to advance and more flea mitogenomes become available, expanding the taxonomic sampling will be crucial for significantly improving the resolution and robustness of the flea phylogenetic tree.

Conclusion

We present the first mitochondrial genome of T. b. orientalis and provide a comprehensive analysis of its morphology and genomic features. Our results indicate that natural selection is the dominant force shaping codon usage bias in this species. We observed an evolutionary trend from polyxenous early-diverging lineages toward pleioxenous or monoxenous later-diverging lineages. We propose that host specificity, ecological niche, and dispersal capacity are key factors underlying the variation in evolutionary rates among flea species. Furthermore, our phylogenetic analyses confirm the unstable taxonomic positions of the families Ctenophthalmidae, Vermipsyllidae, and Hystrichopsyllidae. This study lays a foundation for understanding the evolution and phylogeny of the family Ischnopsyllidae.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1 (458.9KB, pdf)

Acknowledgements

We thank the Technology Division of Shenzhen Huitong Biotechnology Co., Ltd. for its support in genomics and data analysis.

Author contributions

Ju Pu, Xiao-xia Lin and Wen-ge Dong designed and conducted the research. Xiao-bin Huang, Xian-zheng Zhang, and Xiao-yan Zheng provided the biological samples. Ju Pu and Xiao-xia Lin assembled and annotated the mitogenome. All authors read and approved the final manuscript.

Funding

We acknowledge funding support from the National Natural Science Foundation of China (NO. 32260152 to Wen-Ge Dong).

Data availability

The nucleotide sequences of the T. b. orientalis mitogenome were deposited in GenBank (https://www.ncbi.nlm.nih.gov/) under accession number PP973737.

Declarations

Competing interests

The authors declare no competing interests.

Statement

All methods were performed in accordance with the relevant guidelines and regulations.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Zhiying, L. Fauna Sinica Insecta Siphonaptera 1st edn, 328–370 (Science, 1986).
  • 2.Yan, H. et al. Host selection and influencing factors of parasitic fleas on the body surface of desert rodents, inner Mongolia, China. Int. J. Parasitol. Parasites Wildl.25, 100993. 10.1016/j.ijppaw.2024.100993 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Pawęska, J. T., Storm, N., Jansen van Vuren, P., Markotter, W. & Kemp, A. Attempted transmission of Marburg virus by Bat-Associated fleas thaumapsylla Breviceps Breviceps (Ischnopsyllidae: Thaumapsyllinae) to the Egyptian rousette Bat (Rousettus aegyptiacus). Viruses1610.3390/v16081197 (2024). [DOI] [PMC free article] [PubMed]
  • 4.Andrianiaina, A. F., Andry, S., Kettenburg, G., Ranaivoson, H. C. & Lacoste, V. Diversity and seasonality of ectoparasite burden on two species of Madagascar fruit bat, Eidolon Dupreanum and rousettus Madagascariensis. Parasit. Vectors. 18, 302. 10.1186/s13071-025-06805-z (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Leon-Noreña, M., Tauxe, G. M. & Riffell, J. A. Olfactory preferences and chemical differences of fruit scents for Aedes aegypti mosquitoes. BioRxiv10.1101/2025.02.05.636686 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Padilla, C. J., Martin, J. T., Cain, J. W. & Abiotic and demographic drivers of flea parasitism on deer mice in a recovering mixed-conifer forest a decade postfire. J. Parasitol.110, 375–385. 10.1645/23-45 (2024). [DOI] [PubMed]
  • 7.Yadav, P. D. et al. Detection of Nipah virus RNA in fruit Bat (Pteropus giganteus) from India. Am. J. Trop. Med. Hyg.87, 576–578. 10.4269/ajtmh.2012.11-0416 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Yadav, P. D. et al. Nipah virus sequences from humans and bats during Nipah Outbreak, Kerala, India, 2018. Emerg. Infect. Dis.25, 1003–1006. 10.3201/eid2505.181076 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Špitalská, E., Ševčík, M., Peresh, Y. Y. & Benda, P. Bartonella in Bat flies from the Egyptian fruit Bat in the middle East. Parasitol. Res.123, 144. 10.1007/s00436-024-08165-6 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Mounolou, J. C. & Mitochondrial DNA: an advance in eukaryotic cell biology in the 1960s. Biol. Cell.97, 743–748. 10.1042/bc20040128 (2005). [DOI] [PubMed] [Google Scholar]
  • 11.Ballard, J. W. The mitochondrial genome: mutation, selection and recombination. Curr. Opin. Genet. Dev.11, 667–672. 10.1016/s0959-437x(00)00251-3 (2001). [DOI] [PubMed] [Google Scholar]
  • 12.Yuan, B. & He, G. The first complete mitochondrial genome of the genus laelaps with novel gene arrangement reveals extensive rearrangement and phylogenetics in the superfamily dermanyssoidea. Exp. Appl. Acarol.93, 515–535. 10.1007/s10493-024-00943-2 (2024). [DOI] [PubMed] [Google Scholar]
  • 13.Steinberg, S. Structural compensation in atypical mitochondrial tRNAs. Nat. Struct. Biol.1, 507–510. 10.1038/nsb0894-507 (1994). [DOI] [PubMed] [Google Scholar]
  • 14.Stevens, J. R. & West, H. Mitochondrial genomes of the sheep blowfly, Lucilia sericata, and the secondary blowfly, Chrysomya megacephala. Med. Vet. Entomol.22, 89–91. 10.1111/j.1365-2915.2008.00710.x (2008). [DOI] [PubMed] [Google Scholar]
  • 15.Pan, M. et al. Characterization of mitochondrial genome of Chinese wild mulberry silkworm, Bomyx Mandarina (Lepidoptera: Bombycidae). Sci. China C Life Sci.51, 693–701. 10.1007/s11427-008-0097-6 (2008). [DOI] [PubMed] [Google Scholar]
  • 16.Li, X., Ogoh, K., Ohba, N. & Liang, X. Mitochondrial genomes of two luminous beetles, rhagophthalmus lufengensis and R. ohbai (Arthropoda, Insecta, Coleoptera). Gene392, 196–205. 10.1016/j.gene.2006.12.017 (2007). [DOI] [PubMed] [Google Scholar]
  • 17.Meng, G., Li, Y. & Yang, C. MitoZ: a toolkit for animal mitochondrial genome assembly, annotation and visualization. Nucleic Acids Res.47, 1–7. 10.1093/nar/gkz173 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Genomics (2013).
  • 19.Li, H. & Durbin, R et al. The sequence alignment/map format and samtools. Bioinformatics (Oxford England)25, 2078–2079. 10.1093/bioinformatics/btp352 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Kearse, M. et al. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics (Oxford England). 28, 1647–1649. 10.1093/bioinformatics/bts199 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res.25, 3389–3402. 10.1093/nar/25.17.3389 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Chan, P. P., Lin, B. Y. & Mak, A. J. tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res.49, 9077–9096. 10.1093/nar/gkab688 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Laslett, D. ARWEN: a program to detect tRNA genes in metazoan mitochondrial nucleotide sequences. Bioinformatics (Oxford England)24, 172–175. 10.1093/bioinformatics/btm573 (2008). [DOI] [PubMed] [Google Scholar]
  • 24.Giorgi, F. M. & Ceraolo, C. The R language: an engine for bioinformatics and data science. Life (Basel)12. 10.3390/life12050648 (2022). [DOI] [PMC free article] [PubMed]
  • 25.Kumar, S. & Tamura, K. Molecular evolutionary genetics analysis software for microcomputers. Comput. Appl. Biosci.10, 189–191. 10.1093/bioinformatics/10.2.189 (1994). [DOI] [PubMed] [Google Scholar]
  • 26.Librado, P. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics (Oxford England)25, 1451–1452. 10.1093/bioinformatics/btp187 (2009). [DOI] [PubMed] [Google Scholar]
  • 27.Ronquist, F. MrBayes 3: bayesian phylogenetic inference under mixed models. Bioinformatics (Oxford England)19, 1572–1574. 10.1093/bioinformatics/btg180 (2003). [DOI] [PubMed] [Google Scholar]
  • 28.Stamatakis, A. & RAxML-VI -HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics (Oxford England). 22, 2688–2690. 10.1093/bioinformatics/btl446 (2006). [DOI] [PubMed] [Google Scholar]
  • 29.Katoh, K., Misawa, K. & Kuma, K. MAFFT: a novel method for rapid multiple sequence alignment based on fast fourier transform. Nucleic Acids Res.30, 3059–3066. 10.1093/nar/gkf436 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ranwez, V., Harispe, S. & Delsuc, F. MACSE: multiple alignment of coding sequences accounting for frameshifts and stop codons. PLoS One6, e22594. 10.1371/journal.pone.0022594 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Talavera, G. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst. Biol.56, 564–577. 10.1080/10635150701472164 (2007). [DOI] [PubMed] [Google Scholar]
  • 32.Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. TrimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics (Oxford England)25, 1972–1973. 10.1093/bioinformatics/btp348 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Xia, X. & DAMBE6. New tools for microbial genomics, phylogenetics, and molecular evolution. J. Hered.108, 431–437. 10.1093/jhered/esx033 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods. 14, 587–589. 10.1038/nmeth.4285 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Nguyen, L. T., Schmidt, H. A. & Haeseler, A. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol.32, 268–274. 10.1093/molbev/msu300 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Zhu, Q., Hastriter, M. W., Whiting, M. F. & Dittmar, K. Fleas (Siphonaptera) are Cretaceous, and evolved with theria. Mol. Phylogenet. Evol.90, 129–139. 10.1016/j.ympev.2015.04.027 (2015). [DOI] [PubMed] [Google Scholar]
  • 37.Lewis, R. E. Résumé of the siphonaptera (Insecta) of the world. J. Med. Entomol.35, 377–389. 10.1093/jmedent/35.4.377 (1998). [DOI] [PubMed] [Google Scholar]
  • 38.Leys, R., Cooper, S. J. & Strecker, U. Regressive evolution of an eye pigment gene in independently evolved eyeless subterranean diving beetles. Biol. Lett.1, 496–499. 10.1098/rsbl.2005.0358 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Bush, S. E., Villa, S. M., Altuna, J. C., Johnson, K. P. & Shapiro, M. D. Host defense triggers rapid adaptive radiation in experimentally evolving parasites. Evol. Lett.3, 120–128. 10.1002/evl3.104 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Cardoso Vde, A. Scanning electron microscopy studies of sensilla and other structures of the head of polygenis (Polygenis) tripus (Siphonapera: Rhopalopsyllidae). Micron37, 557–565. 10.1016/j.micron.2005.12.001 (2006). [DOI] [PubMed] [Google Scholar]
  • 41.Grenacher, S., Kröber, T., Guerin, P. M. & Vlimant, M. Behavioural and chemoreceptor cell responses of the tick, Ixodes ricinus, to its own faeces and faecal constituents. Exp. Appl. Acarol.25, 641–660. 10.1023/a:1016145805759 (2001). [DOI] [PubMed] [Google Scholar]
  • 42.Clary, D. O. & Wolstenholme, D. R. The mitochondrial DNA molecular of Drosophila yakuba: nucleotide sequence, gene organization, and genetic code. J. Mol. Evol.22, 252–271. 10.1007/bf02099755 (1985). [DOI] [PubMed] [Google Scholar]
  • 43.Baeza, M., Sepulveda, D. & Cifuentes, V. Codon usage bias in yeasts and its correlation with gene expression, growth temperature, and protein structure. Front. Microbiol.15, 1414422. 10.3389/fmicb.2024.1414422 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Wright, G., Rodriguez, A., Li, J., Milenkovic, T. & Emrich, S. J. CHARMING: harmonizing synonymous codon usage to replicate a desired codon usage pattern. Protein Sci.31, 221–231. 10.1002/pro.4223 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Suleman, Muhammad, N. et al. Mitochondrial genomes of two eucotylids as the first representatives from the superfamily microphalloidea (Trematoda) and phylogenetic implications. Parasit. Vectors14, 48. 10.1186/s13071-020-04547-8 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Whiting, M., Whiting, A. & Hastriter, M. A molecular phylogeny of fleas (Insecta: Siphonaptera): origins and host associations. Cladistics24, 677–707. 10.1111/j.1096-0031.2008.00211.x (2008). [Google Scholar]
  • 47.Zhang, Y., Fu, Y. T., Yao, C., Deng, Y. P. & Nie, Y. Mitochondrial phylogenomics provides insights into the taxonomy and phylogeny of fleas. Parasit. Vectors. 15, 223. 10.1186/s13071-022-05334-3 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Chen, B. et al. Complete mitochondrial genome of ctenophthalmus quadratus and stenischia humilis in China provides insights into fleas phylogeny. Front. Vet. Sci.10, 1255017. 10.3389/fvets.2023.1255017 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1 (458.9KB, pdf)

Data Availability Statement

The nucleotide sequences of the T. b. orientalis mitogenome were deposited in GenBank (https://www.ncbi.nlm.nih.gov/) under accession number PP973737.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES