Skip to main content
BMC Genomics logoLink to BMC Genomics
. 2025 Mar 24;26:293. doi: 10.1186/s12864-025-11486-0

Comparative mitogenomic analysis of Chinese cavefish Triplophysa (Cypriniformes: Nemacheilidae): novel gene tandem duplication and evolutionary implications

Shuang Song 1,2,#, Jianhan Cao 1,2,#, Hongmei Xiang 2, Zhixiao Liu 1, Wansheng Jiang 1,2,
PMCID: PMC11934697  PMID: 40128668

Abstract

Background

Cavefish exhibit significant morphological changes that result in trade-offs in metabolic requirements and energy utilization in perpetual darkness. As cellular “powerhouses”, mitochondria play crucial roles in energy metabolism, suggesting that mitochondrial genes have likely experienced selective pressures during cavefish evolution.

Results

This study presents the first assembly of the complete mitogenome of Triplophysa yangi, a typical cavefish species in China. The mitogenome is 17,068 bp long, marking the longest recorded for the genus Triplophysa, and includes 13 protein-coding genes (PCGs), 2 rRNAs, 25 tRNAs, and a noncoding control region. An ~ 500 bp insertion between ND2 and WANCY regions was observed, comprising a large intact tandem repeat unit (A’-N’-OL’-C’) flanked by two unannotated sequences (U1/U2). The evolutionary origin of this repeat unit may involve either in situ duplication events with subsequent functional divergence—where neofunctionalization, subfunctionalization, or pseudogenization drove differential mutation rates between paralogs—or alternatively, horizontal acquisition from exogenous genetic material that became functionally integrated into the ancestral T. yangi mitogenome through co-option mechanisms. Phylogenetic analyses revealed two major clades within Triplophysa—epigean and hypogean lineages—consistent with previous classifications, while cave-restricted species exhibited signs of parallel evolution within the hypogean lineage. Selective pressure analysis indicated that the hypogean lineage (cave-dwelling groups, II & III) have a significantly increased ratio of nonsynonymous to synonymous substitution rates (ω) compared to the epigean lineage (surface-dwelling group, I), suggesting a combination of adaptive selection and relaxed functional constraints in cave-dwelling species.

Conclusions

The duplication of tRNAs in T. yangi and the potential positive selection sites identified in Triplophysa cavefish further indicated adaptive evolution in mitochondrial PCGs in response to extreme subterranean conditions.

Supplementary information

The online version contains supplementary material available at 10.1186/s12864-025-11486-0.

Keywords: Cave fish, Mitogenome, Phylogeny, Positive selection, Karst, China

Background

Cavefish, a specialized group that spends much or all of their lives in subterranean river habitats [1], has emerged as excellent models in developmental and evolutionary biology [2]. Over 230 cavefish species have been discovered worldwide, providing unique insights into how organisms adapt to extreme environments [3]. Notable morphological changes in cavefish include regressive traits, such as eye- and pigmentation loss, which arise convergently across different evolutionary lineages. Additionally, constructive sensory adaptations, such as increased neuromasts and taste buds, are frequently observed [4, 5]. Adaptive evolution in cavefish also involves metabolic changes, including enhanced fat-synthesis pathways [6] and the regulation of sugar metabolism and antioxidant mechanisms [7]. Overall, these changes represent trade-offs in metabolic requirements and energy utilization during cavefish adapt to perpetual darkness [8].

The karst landscape in Southwest China, covering approximately 2.4 million square kilometers, is home to over 150 cavefish species, representing the most diverse cavefish fauna on the earth [9]. Among the cavefish genera in China, the hypogean lineage of Triplophysa loaches ranks as the second largest group, surpassed only by Sinocyclocheilus cyprinids (approximately 40 vs. 70 species [10, 11]). Like Sinocyclocheilus cavefish [12], the hypogean lineage of Triplophysa exhibits remarkable morphological diversity, ranging from semi-cave-dwelling morphs with reduced eyes and pigmentation to fully cave-restricted morphs that are usually blind and white. Notably, some unusual morphological traits have also evolved in the hypogean lineage of Triplophysa. Recently, we described a new species, Triplophysa yangi, collected from a subterranean river in Shizong County, Yunnan, China [13]. This species features extraordinarily enlarged swim bladder chambers that protrude beyond the bilateral body wall, resembling a fish with a kind of “flotation device” (Fig. 1A). We proposed that this unique trait represents a novel troglomorphic adaptation in Triplophysa cavefish, in addition to the typical characteristics of eye reduction and lack of pigmentation.

Fig. 1.

Fig. 1

Live photos of T. yangi (A, showing the extraordinarily enlarged swim bladder chambers that protrude beyond the bilateral body wall resembling a fish with " flotation device”) and gene map of its newly sequenced mitogenome (B)

Mitochondria, frequently termed cellular “powerhouses” or “energy factories”, drive critical metabolic processes—most notably ATP synthesis via oxidative phosphorylation (OXPHOS) to meet cellular energy demands. Although mitochondrial sequences have been widely used in phylogenetic studies owing to their simple structure, conserved coding regions, rapid evolutionary rates, and maternal inheritance patterns [14, 15], molecular adaptation mechanisms in mitochondrial OXPHOS genes remain understudied [8]. Notably, while nuclear DNA encodes ~ 80% of OXPHOS enzymes (compared to mitochondrial DNA’s 13 subunits), these dual-origin components exhibit coordinated evolution and functional integration to maintain respiratory efficiency [16]. Intriguingly, energy-intensive species like bats demonstrate elevated selection pressures on mitochondrial-encoded OXPHOS genes relative to their nuclear counterparts. Nevertheless, both mitochondrial and nuclear OXPHOS genes experience greater selective pressures than nuclear-coded nonrespiratory genes [17].

As a unique biological group thriving in harsh environments devoid of light and with limited food, cavefish exhibit a range of metabolic changes distinct from their surface counterparts [6, 7]. It is reasonable to speculate that cavefish mitochondrial OXPHOS genes have undergone specific changes under evolutionary pressures in dark environments to facilitate unique energy strategies through the respiratory chain. Our previous comparative analysis of the mitogenomes of Sinocyclocheilus indicated that cave-dwelling species accumulated more nonsynonymous mutations in their mitochondrial PCGs than surface-dwelling species [8]. Another study involving 44 Triplophysa cavefish species also showed greater selective pressures in cavefish compared to non-cavefish species [18]. Since the first mitogenome of the hypogean lineage of Triplophysa was sequenced in 2012 (Triplophysa rosa [19]), an increasing number of mitogenomes have been sequenced and reported in GenBank, providing an excellent opportunity to examine the differing selective pressures on mitochondrial PCGs within this unique group.

In this study, we first assembled and annotated the complete mitogenome of the typical cave-dwelling species, T. yangi, and conducted a detailed examination of its characteristics. We then integrated this mitogenome with all available Triplophysa mitogenomes and performed clustering analysis using Principal Component Analysis (PCA) and average nucleotide identity (ANI) based on data from 49 Triplophysa species. Additionally, we reconstructed a phylogenetic tree using the mitochondrial PCG dataset and identified potential signals of positive selection in the mitochondrial PCGs of cave-dwelling species compared to their surface-dwelling counterparts. This study aims to provide insights into the mitogenomic evolution of Triplophysa cavefish that thrive in subterranean environments.

Results

Structures and characteristics of the mitogenome of T. yangi

The complete mitogenome of T. yangi was 17,068 bp in length, composed of 13 typical vertebrate PCGs, 2 rRNAs, 25 tRNAs, and a noncoding control region (Fig. 1B). Most genes were encoded on the heavy (H) strand, with the exception of ND6 and eleven tRNA genes encoded on the light (L) strand. Among the PCGs, only ND1 and COI used GTG as their start codon, while all other start codons were ATG. The stop codons in the PCGs included: ND1, ND2, COI, ATP6, ATP8, ND4L, ND5, and ND6, which ended with TAA; ND3, which ended with TAG; and COII, COIII, and CYTB, which used an incomplete T (--), while ND4 employed another incomplete TA (-) (Table 1).

Table 1.

Mitochondrial genome organization of T. Yangi

Gene Position Length(bp) Codon Intergenic nucleotide Strand Anticodon
From to Start Stop
tRNAPhe 1 69 69 0 H AAG
12 S rRNA 70 1018 949 2 H
tRNAVal 1021 1092 72 20 H CAU
16 S rRNA 1113 2768 1656 0 H
tRNALeu 2769 2843 75 6 H AAU
ND1 2844 3818 975 GTG TAA 0 H
tRNAIle 3825 3896 72 –2 H UAG
tRNAGln 3895 3965 71 1 L GUU
tRNAMet 3967 4035 69 0 H UAC
ND2 4036 5082 1047 ATG TAA 66 H
tRNAAla 5149 5217 69 1 L CGU
tRNAAsn 5219 5289 71 2 L UUG
OL 5292 5322 31 –2 H
tRNACys 5321 5385 65 206 L ACG
tRNATrp 5592 5661 70 2 H ACU
tRNAAla 5664 5731 68 1 L CGU
tRNAAsn 5733 5805 73 2 L UUG
OL 5808 5838 31 –2 H
tRNACys 5837 5897 61 –1 L ACG
tRNATyr 5897 5964 68 1 L AUG
COI 5966 7516 1551 GTG TAA 0 H
tRNASer 7517 7587 71 1 L AGU
tRNAAsp 7589 7660 72 12 H CUG
COII 7673 8363 691 ATG T(--) –1 H
tRNALys 8363 8438 76 1 H UUU
ATPase8 8440 8607 168 ATG TAA –10 H
ATPase6 8598 9281 684 ATG TAA –1 H
COIII 9281 10,064 784 ATG T(--) 0 H
tRNAGly 10,065 10,138 74 0 H
ND3 10,139 10,489 351 ATG TAG –2 H
tRNAArg 10,488 10,557 70 0 H GCU
ND4L 10,558 10,854 297 ATG TAA –7 H
ND4 10,848 12,229 1382 ATG TA(-) 0 H
tRNAHis 12,230 12,298 69 0 H GUG
tRNASer 12,299 12,365 67 1 H UCG
tRNALeu 12,367 12,438 72 0 H GAU
ND5 12,439 14,277 1839 ATG TAA –4 H
ND6 14,274 14,795 522 ATG TAA 0 L
tRNAGlu 14,796 14,864 69 4 L CUU
CYTB 14,869 16,009 1141 ATG T(--) 0 H
tRNAThr 16,010 16,082 73 –2 H UGU
tRNAPro 16,081 16,150 70 5 L GGU
D-Loop 16,156 16,898 743 170 H

Notably, the mitogenome of T. yangi contained 25 tRNA genes, distinguishing it from all other known species within the genus Triplophysa, which typically have 22 tRNAs. This increase was due to three duplicate copies of tRNAAla (A’), tRNAAsn (N’), and tRNACys (C’). Additionally, an OL (Origin of Light Strand Replication) copy (OL’) was also duplicated between tRNAAsn (N’) and tRNACys (C’), resulting in a large intact tandem repeat unit (A’-N’-OL’-C’, Fig. 2A) that maintained the same order as the original sequence (A-N-OL-C). The original unit (A-N-OL-C) was distinguished from the duplicated one (A’-N’-OL’-C’) based on parsimony and similarity: the original unit adhere to the conserved mitochondrial gene arrangement in Triplophysa species, and their sequences exhibit higher identity to orthologous genes in the closely related species. Interestingly, two unannotated sequences were located at the both ends of this A’-N’-OL’-C’ repeat unit. The unannotated forward flanking sequence (U1) was 66 bp in length and located between the stop codon of the original ND2 gene and the start of the A’-N’-OL’-C’ repeat unit. The unannotated backward flanking sequence (U2) was 206 bp, positioned between the end of the A’-N’-OL’-C’ repeat unit and the start of the original tRNATrp. Both U1 and U2 did not match any full-coverage sequences when analyzed using BLAST against the NCBI “nucleotide collection (nr/nt)” database, even with the least rigorous algorithm. However, the last third of U2 (approximately 55 bp) matched the ND2 gene of other Triplophysa species with about 80% similarity.

Fig. 2.

Fig. 2

Gene rearrangement in the mitogenome of T. yangi. Notes: (A) The typical mitochondrial arrangement in Triplophysa; (B) The fragment insertion and gene duplication of T. yangi. (C) Sequence alignments and similarities between the original and duplicated copies of T. yangi. (D) Secondary structures of the original and duplicated copies of tRNAs. Abbreviations: A’: duplicated tRNAAla; N’: duplicated tRNAAsn; C’: duplicated tRNACys; OL’: duplicated OL; U1: the unannotated forward flanking sequence; U2: the unannotated backward flanking sequence

Sequence similarities of the identified tRNAs and OL repeats were calculated after alignment with the original copies in the mitogenome of T. yangi. The duplicated tRNAAla (A’), tRNAAsn (N’), OL’, and tRNACys (C’) showed similarities of 91.43%, 87.67%, 93.77%, and 82.35% with their corresponding original copies (A, N, OL, and C) in T. yangi, respectively (Fig. 2B). Further analysis of the secondary structure of tRNAs indicated that the duplicated tRNAs (A’, N’, and C’) exhibited structural differences relative to the original copies (A, N, and C), particularly significant for tRNAAsn (N’ vs. N, Fig. 2C).

Sequence characteristics analysis of Triplophysa species

The mitogenome base composition of T. yangi was as follows: T (27.5%), C (25.7%), A (30.4%), and G (16.4%), resulting in a high A + T content of 57.9%. The overall AT-skew was positive (0.05), while the GC-skew was negative (–0.22) (Table 2). Among the 13 PCGs, ND2, COII, ATP8, ND4, and ND5 exhibited positive AT-skews, whereas the others showed slight negative values, with ND6 displaying a marked decrease. All PCGs presented negative GC-skews, except for ND6, which showed a relatively high positive value (Fig. 3A). The RSCU values indicated that Leu was encoded by the greatest number of synonymous codons (n = 6), while Val, Ser1, Pro, Thr, Ala, Arg, and Gly were encoded by four codons each, and all remaining amino acids were encoded by only two codons (Fig. 3B).

Table 2.

Base compositions (in percentages) of the mitogenomes that used for phylogenetic analysis

Species Total length (bp) T (%) C (%) A (%) G (%) A + T content (%) AT-skew GC-skew Accession number
1 T. aliensis 16,565 28.5 25.3 27.8 18.5 56.3 –0.01 –0.16 KJ739868
2 T. alticeps 16,572 28.8 25.1 28.2 17.9 57.0 –0.01 –0.17 MZ325251
3 T. angeli 16,569 28.6 25.8 27.0 18.6 55.6 –0.03 –0.16 KT213584
4 T. anterodorsalis 16,567 28.6 25.7 27.4 18.4 56.0 –0.02 –0.17 KT213585
5 T. baotianensis 16,576 27.6 25.5 30.8 16.1 58.4 0.05 –0.23 MT992550
6 T. bleekeri 16,568 28.6 25.8 27.1 18.5 55.7 –0.03 –0.16 JX135578
7 T. brevicauda 16,572 28.2 25.6 27.9 18.3 56.1 –0.01 –0.17 KT213588
8 T. chondrostoma 16,571 28.3 25.3 28.3 18.1 56.6 0.00 –0.17 KT213589
9 T. cuneicephala 16,568 28.8 25.2 28.2 17.8 57.0 –0.01 –0.17 KY945352
10 T. dalaica 16,576 28.2 25.6 28.3 17.9 56.5 0.00 –0.18 KT213590
11 T. dorsalis 16,576 28.4 25.6 28.2 17.8 56.6 0.00 –0.18 KT213591
12 T. erythraea 16,572 27.0 26.0 31.2 15.8 58.2 0.07 –0.24 PQ040451
13 T. fengshanensis 16,607 27.0 25.9 31.3 15.8 58.3 0.07 –0.24 OQ998929
14 T. grahami 16,566 28.5 25.0 29.2 17.5 57.7 0.01 –0.18 PP114297
15 T. hsutschouensis 16,571 28.4 25.6 27.3 18.7 55.7 –0.02 –0.16 KT213592
16 T. huapingensis 16,570 27.5 25.3 31.5 15.7 59.0 0.07 –0.23 OQ998930
17 T. jianchuanensis 16,569 27.2 26.6 28.3 17.8 55.5 0.02 –0.20 OQ603602
18 T. labiata 16,573 28.4 25.6 28.1 17.8 56.5 –0.01 –0.18 OQ559481
19 T. lixianensis 16,570 28.5 25.4 27.8 18.4 56.3 –0.01 –0.16 KT966735
20 T. longipectoralis 16,609 26.8 26.2 31.1 15.9 57.9 0.07 –0.24 OQ998928
21 T. longliensis 16,570 27.3 25.5 31.4 15.8 58.7 0.07 –0.23 OQ998931
22 T. markehenensis 16,569 28.7 25.3 28.2 17.8 56.9 –0.01 –0.17 KT213594
23 T. microps 16,571 28.3 25.5 28.0 18.3 56.3 –0.01 –0.16 KT213595
24 T. moquensis 16,571 28.6 25.3 28.4 17.7 57.0 0.00 –0.18 KT213597
25 T. nandanensis 16,604 27.0 25.9 31.2 15.9 58.2 0.07 –0.24 OQ998932
26 T. nanpanjiangensis 16,558 28.2 24.7 31.3 15.8 59.5 0.05 –0.22 OQ274895
27 T. nasobarbatula 16,605 26.8 26.1 31.1 15.9 57.9 0.07 –0.24 MT361978
28 T. nujiangensa 16,570 28.2 25.5 28.1 18.1 56.3 0.00 –0.17 KT213598
29 T. orientalis 16,570 28.3 25.8 27.7 18.1 56.0 –0.01 –0.18 KT213599
30 T. pappenheimi 16,572 28.7 25.0 28.7 17.5 57.4 0.00 –0.18 KT213600
31 T. pseudostenrua 16,638 28.6 25.0 28.8 17.6 57.4 0.00 –0.17 KT213601
32 T. robusta 16,570 28.4 25.3 28.2 18.0 56.6 0.00 –0.17 KM406486
33 T. rosa 16,585 27.3 25.3 31.8 15.6 59.1 0.08 –0.24 JF268621
34 T. scleroptera 16,570 28.5 25.4 28.2 17.8 56.7 –0.01 –0.18 KT213602
35 T. sellaefer 16,574 28.8 25.2 28.1 18.0 56.9 –0.01 –0.17 KT213603
36 T. siluroides 16,571 28.7 25.0 28.8 17.5 57.5 0.00 –0.18 JQ663847
37 T. stenura 16,571 28.4 25.4 27.9 18.3 56.3 –0.01 –0.16 KY851112
38 T. stewarti 16,567 28.3 25.5 27.8 18.5 56.1 –0.01 –0.16 KT213605
39 T. stoliczkai 16,568 28.8 25.2 28.1 17.9 56.9 –0.01 –0.17 KT213604
40 T. strauchii 16,590 28.5 25.4 28.3 17.8 56.8 0.00 –0.18 KP297875
41 T. tenuis 16,571 28.2 25.7 27.5 18.6 55.7 –0.01 –0.16 KT224363
42 T. tianeensis 16,573 27.1 25.9 30.5 16.4 57.6 0.06 –0.22 OQ998933
43 T. tibetana 16,573 28.3 25.7 26.9 19.1 55.2 –0.03 –0.15 KT224364
44 T. ulacholica 16,568 28.5 25.4 28.1 18.0 56.6 –0.01 –0.17 KT259194
45 T. venusta 16,574 26.9 26.9 27.8 18.4 54.7 0.02 –0.19 KT008666
46 T. wuweiensis 16,681 28.2 25.7 28.0 18.1 56.2 0.00 –0.17 KT224365
47 T. xiangxiensis 16,598 26.8 26.3 30.8 16.0 57.6 0.07 –0.24 KT751089
48 T. yangi* 17,068 27.5 25.7 30.4 16.4 57.9 0.05 –0.22 PQ356185*
49 T. zhenfengensis 16,567 27.6 25.5 30.5 16.4 58.1 0.05 –0.22 OQ998934
50 Homatula potanini# 16,571 26.3 26.9 30.2 16.6 56.5 0.07 –0.24 KP749475#

*, the sequence obtained in this study; #, the sequence used as the outgroup

Fig. 3.

Fig. 3

The characteristics of the mitogenome of T. yangi within the context of other congeners. Notes: Pictures show the GC and AT skews (A) and relative synonymous codon usage (RSCU) of T. yangi (B), the Ka/Ks ratio of 13 PCGs among 29 species of Triplophysa (C) and the nucleotide diversities according different groups (D)

The dataset for further analysis comprised 49 mitogenome sequences, including 48 downloaded from NCBI and one obtained from this study. Total sequence lengths ranged from 16,558 to 17,068 bp, with A + T content exceeding 50% across all sequences. Among these, ten species exhibited positive AT-skew patterns, while the remainder showed negative values. All 49 species displayed negative GC-skew patterns (Table 2). After removing stop codons from each PCG, the final aligned length of the 13 concatenated PCGs was 11,400 bp. The Ka/Ks ratios for all 13 PCGs were less than 1, with the highest ratio (0.108) in ATP8 and the lowest (0.024) in COII. ATP8, ND2, and ND4 were found to have relatively fast evolutionary rates, while COI, COII, and COIII displayed slower rates (Fig. 3C). A 25-bp sliding window analysis of these PCGs revealed nucleotide diversity (Pi) variabilities across gene regions and species groups, with the highest Pi values in the surface-dwelling fish group (I), followed by the semi-cave-dwelling group (II), and then the cave-restricted group (III) (Fig. 3D).

Sequence similarity analysis of Triplophysa species

The Kimura-2-parameter (K2P) distances of the COI sequences among 49 Triplophysa species ranged from 7.32 to 18.63%, while those of the concatenated PCGs ranged from 8.22 to 24.72%. T. yangi exhibited the smallest genetic distance with T. baotianensis based on both COI and concatenated PCGs analyses (Table S1). The PCA plot provided a reduced-dimensional view of the sequence data, capturing major variations among the 49 mitogenomes of Triplophysa. It demonstrated that hypogean lineage species cluster together, separating from the epigean lineage; the epigean lineage appeared to be further divided into two subgroups according to the sequence variations (Fig. 4 A). ANI analysis also significantly distinguished hypogean from epigean lineages, while the two groups we defined (II and III) in the hypogean lineage mixed together (Fig. 4B). Within the hypogean lineage, T. yangi exhibited the highest ANI value with T. zhenfengensis (93.88%) and the lowest value with T. longliensis (87.27%) (Fig. 4C). Additionally, correlation analyses indicated a generally negative relationship between ANI and genetic distance (R² = 0.81, P < 0.001) (Fig. 4D).

Fig. 4.

Fig. 4

The sequence clusters and similarities analysis based on PCGs of Triplophysa. Notes: (A) The PCA plot determined by K-means clustering of 49 mitogenomes in Triplophysa (green, cave-dwelling groups; pink and red, surface-dwelling groups); (B) ANI plot of 49 mitogenomes in Triplophysa; (C) ANI plot and values within the 14 species in cave-dwelling groups (II & III). (D) Correlation plot between the phylogenetic distance and ANI values

Phylogenetic analysis of Triplophysa

The phylogenetic trees derived from both BI and ML analyses exhibited identical topological structures, differing only in supporting values. Two major clades were identified with high support, confirming the established phylogenetic classification of epigean and hypogean lineages in Triplophysa [10], corresponding to Clade A (epigean) and Clade B (hypogean) in this study (Fig. 5). All species classified as group I based on morphological traits clustered within Clade A, while species in groups II and III formed Clade B. Notably, species in groups II and III intermixed without exhibiting mutually monophyletic structures. T. yangi displayed a sister group relationship with T. zhenfengensis and T. baotianensis, subsequently clustering with T. nanpanjiangensis to form an independent subclade. Interestingly, cave-restricted species from group III branched into several different clades within Clade B, indicating pervasive parallel evolution within the hypogean lineage.

Fig. 5.

Fig. 5

The phylogenetic relationship within Triplophysa based on 49 species. Notes: The numbers around the nodes are the bootstrap values and posterior probabilities from BI and ML methods. The species information used is listed in Table 2. Branches marked in black, green, and red indicate the species assigned into the surface-dwelling fish group (I), semi-cave-dwelling fish group (II), and cave-restricted fish group (III), respectively. The characteristics of eyes and body color, as well as the branch-wise Ka/Ks ratios are color-marked on the right, and the pictures of one representative species in each defined group are also provided

Selective pressures analysis on the mitogenomes of Triplophysa

The PAML branch test revealed an ω ratio of 0.0584 for the PCGs across all examined Triplophysa species, indicating overall constrained selection pressure. However, likelihood ratio tests (LRTs) indicated that both the two-ratio (M2) and three-ratio models (M3) provided a better fit than the one-ratio model (M0) (P = 0.000), suggesting variable ω ratios among certain groups. The three-ratio model was superior to one of the two-ratio models (M3 vs. M2 − 1, P = 0.000), but not significantly different from the other (M3 vs. M2 − 2, P = 0.062) (Table 3). These findings indicated that the group assignments in both M3 and M2 − 2 were similarly effective compared to M2 − 1. The ω ratios estimated under the free-ratio model (M1) further supported diverse selection pressures across branches, revealing significantly higher values for cave-dwelling species (II & III) compared to surface-dwelling species (I) (mean ω: 0.105 (II & III) vs. 0.064 (I); P = 0.001). Interestingly, T. yangi we sequenced here, as one typical species in the cave-restricted group, has the highest ω value of 0.4 among all the Triplophysa species examined in this study (Fig. 5). These results collectively suggested distinct ω ratios between surface-dwelling and cave-dwelling groups, with higher ω values in the latter. However, RELAX analysis indicated that cave-dwelling groups (II & III) experienced significant relaxation of selection compared to the surface-dwelling group (I), with K = 0.48 (P = 0.000).

Table 3.

Selection pressures estimation on PCGs of Triplophysa under branch model by codeml in PAML

Models Code lnL Parameter estimates Models compared 2∆L p-value
One-ratio M0 –122918.7780 ω = 0.04198
Free-ratio M1 –122711.2116 ——
Two-ratio (null) M2 − 1 –122892.9577 ω0 = 0.04071, ω1 = 0.08528 M2–1 vs. M0 51.641 0
Two-ratio (null) M2 − 2 –122858.5190 ω0 = 0.03825, ω1 = 0.07264 M2–2 vs. M0 120.518 0
Three-ratio (null) M3 –122856.7836 ω0 = 0.03825, ω1 = 0.06838, ω2 = 0.08447 M3 vs. M0 123.989 0
Three-ratio (null) M3 –122856.7836 ω0 = 0.03825, ω1 = 0.06838, ω2 = 0.08447 M3 vs. M2–1 72.348 0
Three-ratio (null) M3 –122856.7836 ω0 = 0.03825, ω1 = 0.06838, ω2 = 0.08447 M3 vs. M2–2 3.471 0.062
Two-ratio (positive) M2 − 2 (p) –124664.3930 ω0 = 0.03681, ω1 = 1 M2–2 (p) vs. M2–2 –3,611.748 0
Three-ratio (positive) M3 (p) –123289.4480 ω0 = 0.03791, ω1 = 0.06526, ω2 = 1 M3 (p) vs. M3 –865.329 0

According to the PAML site models, M8 performed better than M7 based on LRTs (M8 vs. M7, P = 0.000). Among 3,800 codon sites analyzed, 19 were identified under positive selection in the site model, while 67 sites were detected in the branch-site model. Seven sites from the site model and 13 from the branch-site model exhibited high Bayesian Empirical Bayes (BEB) values (> 0.95). Additionally, 18 and 27 sites were identified as under positive selection by FEL and MEME analysis, respectively. Five codon sites were identified by at least two methods, yielding a total of 31 codon sites considered potential positive selection sites with high credibility (Table 4). These sites were distributed across eight genes: ND1 (5), ND2 (6), ATP6 (2), ND3 (1), ND4 (4), ND5 (10), ND6 (1), and CYTB (2).

Table 4.

Positive selection sites of high credibility on the PCGs of Triplophysa identified by codeml in PAML and HyPhy, and the amino acid properties changes identified by TreeSAAP

No. AA positions Gene names Codeml in PAML HyPhy TreeSAAP
Site model, PP value (P < 0.05) Branch-site model, PP value (P < 0.05) FEL, P (P < 0.05) MEME, P (P < 0.05) Radical changes* Total number
1 10 ND1 0.999** 0.0127 pK’ 1
2 111 ND1 1.000** pK’ 1
3 159 ND1 0.974* pK’ 1
4 161 ND1 0.966* pK’ 1
5 173 ND1 0.567 pK’; Ra; 2
6 85 ND2 0.998** pK’; α m 2
7 152 ND2 0.994** pK’; αc; Rα 3
8 206 ND2 0.94 Pα; pK’; 2
9 228 ND2 0.903 Pα; pK’; 2
10 241 ND2 0.982* Pα; Pβ; Br;pK’; Ra; Ht 6
11 318 ND2 1.000** Pα; pK’; pHi; αm; Rα 5
12 29 ATP6 0.949 pK’ 1
13 195 ATP6 0.999** pK’ 1
14 87 ND3 0.933 Pα;pK’; αc; 3
15 91 ND4 0.672 Pα; c; pK’; αm 4
16 189 ND4 0.644 pK’; αc; 2
17 194 ND4 0.574 pK’; αc; 2
18 388 ND4 0.672 pK’ 1
19 2 ND5 0.863 0.01 c; pK’; h; R α 4
20 33 ND5 0.980* Pα; pHi; αc; αm 4
21 35 ND5 0.936 Pα; pK’; pHi; αc; αm 5
22 39 ND5 0.991** Pα; pK’; pHi; αc; αm 5
23 120 ND5 0.844 Pα; pK’; αc; 3
24 275 ND5 0.718 pK’; α c 2
25 277 ND5 0.861 pK’; α c 2
26 506 ND5 0.519 0.04 Pα; Br; pK’; αc; Rα 5
27 521 ND5 0.923 Pα; pK’; Mv; αc; Rα; Hp 6
28 523 ND5 0.794 Pα; pK’; Rα; Hp 4
29 119 ND6 0.777 pK’ 1
30 306 CYTB 0.986* 0.02 Pβ; pK’; F; Rα; Ht 5
31 360 CYTB 0.999** 0.009 0.00 pK’; H p 2

AA, amino acid; PP, posterior probabilities from Bayes empirical Bayes; *, Physicochemical amino acid properties available in TreeSAAP: αc: Power to be at the C-terminal; αm: Power to be at the middle of alpha-helix; Br: Buriedness; c: Composition; F: Mean r.m.s. fluctuation displacement; h: Hydropathy; Hp: Surrounding hydrophobicity; Ht: Thermodynamic transfer hydrophobicity; Mv: Molecular volume; Mw: Molecular weight; Pα: α- helical tendencies; Pβ: β-structure tendencies; pHi: Isoelectric point; pK’: Equilibrium Constant of ionization for COOH; Ra: Solvent accessible reduction ratio.

Amino acid changes and structural analysis of positive selection sites

All 31 potential positive selection sites of high credibility underwent radical substitutions in physicochemical properties, as determined by TreeSAAP (Table 4). Eleven sites exhibited four or more types of radical changes in amino acid properties. The positively selected sites identified in the branch-site models likely reflected the differences between cave-dwelling and surface-dwelling groups, as they were designated as foreground and background lineages. These sites were mapped onto the secondary and 3D structures of corresponding homologous proteins (Fig. 6). Most sites were located within functional domains of α-helices, particularly near or on the junction sites of α-helices and loop areas, which were probably crucial for the conformational stability of the relevant proteins.

Fig. 6.

Fig. 6

Positive selection sites highlighted in the crystal structure of ND1 (A), ND2 (B), ATP6 (C), ND3 (D), ND4 (E), ND5 (F), ND6 (G) and CYTB (H) based on the homologous protein. Notes: The sites with yellow color indicate the positions of potential positive selection sites of cave-dwelling species groups of Triplophysa under the branch-site model in PAML

Discussion

Mitogenome characteristics and gene duplications of T. yangi

Mitochondrial DNA is a powerful tool in molecular and evolutionary studies due to its advantages over complex nuclear DNA, playing crucial roles in reconstructing phylogenetic relationships, analyzing population genetics, and examining selective pressures [8, 20, 21]. In this study, we successfully sequenced and assembled the mitogenome of T. yangi, a newly identified cavefish exhibiting novel troglomorphic characteristics, with extraordinarily enlarged swim bladder chambers resembling a kind of “flotation device” (Fig. 1). The mitogenome length of 17,068 bp in T. yangi is the longest recorded among all Triplophysa species (Table 2). This length is attributed to an additional sequence inserted between ND2 and the WANCY region, comprising a large intact tandem repeat unit (A’-N’-OL’-C’) along with two unannotated flanking sequences (U1 & U2), totaling approximately 500 bp (Table 1; Fig. 2).

Typically, mitogenomes are compact with conserved gene order, and gene duplications in fish are rare [22]. Most duplications in fish species have been found around the control region, such as in Muraenesox cinereus [23], Garra cyprinids [24], Antarctic notothenioid fishes [25], and Epinephelus groupers [22]. In this study, we identified a new duplication pattern in the mitochondrial WANCY region, characterized by a core duplication unit with a large intact tandem repeat (A’-N’-OL’-C’) (Fig. 2). The fragments in this tandem repeat showed relatively low sequence similarities (86.15–93.55%) with the original copies (A, N, OL, C) of T. yangi, generally less than the genetic similarities between T. yangi and its sister species T. baotianensis at the COI gene (92.68%) and concatenated PCGs (91.78%). It suggested that the tandem repeat (A’-N’-OL’-C’) likely originated through one of two evolutionary pathways: (1) In situ duplication followed by long-term functional divergence (neofunctionalization or subfunctionalization) or pseudogenization, resulting in differential mutation rates between paralogous regions; (2) Horizontal acquisition from an exogenous source, possibly another Triplophysa species, with subsequent integration and functional co-option in the ancestral T. yangi mitogenome. However, current evidence cannot conclusively determine the predominant evolutionary mechanism driving this structural variation. Prioritizing integrated genomic and functional analyses will be essential to elucidate the underlying mitogenome plasticity in the future.

Gene rearrangements and duplications in mitogenomes may confer evolutionary advantages. Miya & Nishida [26] proposed a link between tRNA rearrangements and deep-sea adaptation, based on findings in Gonostoma gracile. Minhas et al. [25] discussed extensive duplications and rearrangements in Antarctic notothenioids, suggesting a role of mitochondrial gene duplication in cold adaptation, while He et al. [22] reported similar tRNAAsp gene duplications in groupers, inferring lineage-specific adaptations. Unlike nuclear gene duplications, which are known to generate novel functions for adaptive evolution [27], the functional implications of mitochondrial gene duplications remain less understood [22]. Given mitochondria’s crucial role in the pathway of OXPHOS, any stable gene rearrangements may confer benefits for species adapting to complex and changing environments. As a typical cave-restricted species living in perpetual darkness, T. yangi likely faced distinct energy and metabolic demands. Thus, the unique intact tandem repeat pattern (A’-N’-OL’-C’) in its mitogenome may contribute somehow to its cave adaptation.

Phylogenetic implications of Triplophysa

Phylogenetic relationships within the genus Triplophysa have garnered significant interest due to their species diversity and wide distribution across the Qinghai-Tibetan Plateau. However, reconstructing the complete phylogenetic picture within this largest genus of Cypriniforms has been challenging due to limited species coverage and insufficient molecular data. Feng et al. [28] conducted a phylogenetic analysis of Triplophysa stoliczkae using multi-locus genes, revealing significant discrepancies between analyses based solely on mitochondrial or nuclear genes. This suggests that potential gene flow events in the evolutionary history of T. stoliczkae may be better resolved by integrating multiple genetic markers. Wang et al. [29] performed a phylogenetic analysis within Triplophysa and reported the new mitogenome of T. labiata, identifying four subclades within the species dataset they used. They also uncovered a sister group relationship between T. labiata and T. dorsalis. Similarly, Wang et al. [30] sequenced the mitogenome of T. bombifrons, revealing the four main subclade relationships and the sister group relationship between T. bombifrons and T. tenuis. These studies underscored the importance of mitogenomic sequences in reconstructing phylogenetic relationships within the Triplophysa genus, providing a robust framework for future research.

The phylogenetic relationships reconstructed in our study (Fig. 5) align closely with the latest analysis based on mitogenomes by Zhang et al. [18]. Our study incorporated two additional species, T. yangi and T. erythraea, the latter of which was also a cave-restricted species whose mitogenomic characteristics have been reported in another study [31]. The two major clades revealed in our analysis confirmed the previous classification of epigean and hypogean lineages in Triplophysa [10]. Notably, in the hypogean lineage, cave-restricted species appeared in multiple distinct branches, indicating pervasive parallel evolution, similar to patterns observed in cavefishes of Sinocyclocheilus [8].

Despite our phylogenetic tree including a substantial number of Triplophysa species with available mitogenomic data, the overall phylogeny of the genus remains unresolved. The species coverage in this study is still limited: approximately 48 out of about 180 species (26.7%) in the genus and 14 out of about 40 species (35%) in the hypogean lineage. Given this limited coverage, we did not estimate divergence times, although such estimates could provide insights into the biogeographic history of this fascinating group. Additionally, divergence time estimates can vary significantly depending on the calibration strategy used, as seen in previous estimates of the divergence time of the most recent common ancestor (MRCA) of Triplophysa species [28, 32], where various calibration points were employed due to the lack of fossil records.

Estimation of the selective pressures within Triplophysa

Compared to the surface-dwelling fish group (I), both the semi-cave-dwelling (II) and cave-restricted fish groups (III) exhibited significantly higher mean ω (Ka/Ks) ratios for the 13 mitochondrial PCGs (Table 3). It suggested that the cave-dwelling groups (II and III) have experienced reduced purifying selection efficacy, resulting in the accumulation of more nonsynonymous mutations. A high ω value can arise from either positive selection, which fixes beneficial nonsynonymous mutations, or relaxed functional constraints, which reduce the effectiveness of purifying selection and allow deleterious mutations to become fixed [33]. While it is challenging to distinguish between these scenarios based solely on ω values, the RELAX analysis indicated that relaxed functional constraints were likely the primary contributors to the elevated ω ratios in the cave-dwelling groups (II and III). However, positive selection may also contribute, particularly at the specific potential positive selection sites identified in this study (Table 4). The harsh physiological conditions in cave environments might impose strong selective pressures on energy metabolism genes, including mitochondrial OXPHOS genes. Furthermore, cavefish were often susceptible to population bottlenecks due to restricted migration, resulting in a small effective population size (Ne). This demographic constraint can significantly increase the ω ratio as well. Thus, the higher ω values in cave-dwelling groups (II and III) may reflect a complex interplay between ecological and physiological selective pressures and demographic restrictions that lead to genetic drift, which in turn may relax selection.

Positive selection was often observed over short evolutionary periods and typically affected only a few sites, making it challenging to be detected amidst the ongoing purifying selection at most sites within a gene [34, 35]. To address this, we utilized PAML’s site and branch-site models, alongside HyPhy tools FEL and MEME, to identify codon sites potentially under positive selection. By emphasizing convergent results from these approaches, we identified high-credibility potential positive selection sites (Table 4). Our findings revealed evidence of signatures of positive selection at specific amino acid positions in eight mitochondrial OXPHOS genes: ND1 (5), ND2 (6), ATP6 (2), ND3 (1), ND4 (4), ND5 (10), ND6 (1), and CYTB (2). All identified sites underwent radical changes in amino acid properties, suggesting that the extensive parallel evolution of cave-dwelling fishes in Triplophysa involved significant genetic adaptations to extreme subterranean environments across the extensive Karst areas of South China.

Conclusions

This study has presented the first assembly of the complete mitogenome of Triplophysa yangi, which is 17,068 bp in length, making it the longest recorded for the genus Triplophysa. Compared to the mitogenomes of other Triplophysa, about 500 bp additional sequence was identified from T. yangi, comprising a large intact tandem repeat unit (A’-N’-OL’-C’) and two unannotated flanking sequences (U1 and U2). Its evolutionary origin may involve either in situ duplication events with subsequent functional divergence, or horizontal acquisition from exogenous genetic materials. The cave-restricted species in the hypogean lineages of Triplophysa exhibited signs of parallel evolution within the hypogean lineage. Selective pressure analysis indicated that the hypogean lineage (cave-dwelling groups, II & III) have significantly higher nonsynonymous/synonymous substitution ratios (ω) compared to the epigean lineage (surface-dwelling group, I). The duplication of tRNAs of T. yangi and the potential positive selection sites identified in Triplophysa cavefish further indicate adaptive evolution in mitochondrial PCGs in response to extreme subterranean conditions.

Materials and methods

Sampling and sequencing of T. yangi

The specimen of T. yangi used in this study (voucher number JWS20221148) was collected in December 2022 from a subterranean tributary of the Nanpanjiang River drainage in Wulong Township, Shizong County, Yunnan Province, China. The fish was euthanized before handling, using a solution of MS-222 (Macklin, Shanghai, China) at a concentration of 40 to 50 mg/L. Muscle tissue was then collected and DNA was extracted using the DNeasy Blood and Tissue Kit (Qiagen, Hilden, Germany). A DNA library was prepared using the Agencourt AMPure XP-Medium Kit (Beckman Coulter, USA) and the AxyPrep Mag PCR Cleanup Kit (Axygen, Corning, USA). High-throughput paired-end sequencing was performed on the MGISEQ2000 platform (Complete Genomics and MGI Tech, Shenzhen, China), generating approximately 44 Gb of raw reads with a read length of 150 bp.

Mitogenome assembly and structural analysis of T. yangi

The complete mitogenome of T. yangi was firstly assembled using MitoZ [36], and further confirmed by NOVOPlasty [37] and MitoFinder [38]. Initial annotation of the assembled mitogenome was conducted using the online server MITOS2 (available at http://mitos2.bioinf.uni-leipzig.de/index.py [39]). Further annotation and gene map plotting were performed with the web application MitoFish (available at http://mitofish.aori.u-tokyo.ac.jp [40]). The secondary structure of tRNAs was predicted using tRNAscanSE 1.21 [41]. Nucleotide composition and codon usage of PCGs were computed using MEGA 11.0 [42]. Strand asymmetry was assessed using the formulas AT-skew = [A– T] / [A + T] and GC-skew = [G– C] / [G + C] [43].

Sequence characteristics and similarity analysis of Triplophysa species

All available mitogenomes of Triplophysa were downloaded from GenBank, and one representative sequence was selected for each species for subsequent analysis. The final dataset comprised 49 Triplophysa species, including the one sequenced in this study. Based on the classification of cavefish in Sinocyclocheilus [8], Triplophysa species were categorized into three major groups according to morphological characteristics: (I) surface-dwelling fishes with normal eyes and typical coloration; (II) semi-cave-dwelling fishes with reduced eyes and partial loss of pigmentation; (III) cave-restricted fishes, lacking eyes or possessing only tiny eye dots, exhibiting white coloration (albinism). Relative synonymous codon usage (RSCU), the ratio of nonsynonymous substitution rates (Ka) to synonymous substitution rates (Ks), and nucleotide diversity (Pi) were calculated using DnaSP 6 [44]. Genetic distances were estimated with MEGA using the Kimura-2 parameter (K2P) [45]. Subsequently, PCA was conducted on the 49 mitogenomes using a specialized Python script for mitochondrial DNA sequences [46]. Average nucleotide identity (ANI) values were determined based on phylogenetic tree classification through pairwise comparisons using fastANI with the parameter “--minFraction 0.8” [47]. Correlation analyses of phylogenetic distance and ANI values from the complete mitogenomes were conducted using the MRM package in R (v4.3.1) [48].

Phylogenetic analysis

Using Homatula potanini as an outgroup, phylogenetic analysis was conducted on the mitogenomes of 49 Triplophysa species. The 13 PCGs were concatenated and aligned using the CLUSTALW algorithm in MEGA [42]. Phylogenetic analyses were performed using maximum likelihood (ML) and Bayesian inference (BI), with partitioning schemes and nucleotide substitution models selected based on the Akaike Information Criterion (AIC) using PartitionFinder 2 [49]. ML analysis was executed with RaxML 8.0.2 [50], conducting 10 runs with random additional sequences and generating bootstrap values based on 1,000 rapid bootstrap replicates. BI analysis was performed with MrBayes 3.2.6 [51], running for 2,000,000 generations and sampling every 1,000 generations. Posterior probabilities (PP) were calculated to produce a consensus tree after discarding the first 25% of samples as burn-in.

Selective pressures analysis

In molecular evolution, the ratio of nonsynonymous substitution (Ka) to synonymous substitution (Ks) per site, denoted as ω (Ka/Ks), reflects the selective pressures acting on a gene during its evolution. When ω > 1, the gene shows signs of positive selection; ω = 1 indicates neutral selection; and ω < 1 suggests purifying selection. The codon-based maximum likelihood (codeml) method, implemented in the PAML 4 package [52], was used to estimate ω values. First, branch models assessed ω values among groups based on the three categories defined above, allowing different branches to have distinct ω. The one-ratio model (M0), proposing a single ω for all branches, served as the null hypothesis of no adaptive evolution. Next, the two-ratio model (M2), permitting different ω values for background and foreground branches, identified groups of interest (M2 − 1: group III vs. groups I & II; M2 − 2: groups II & III vs. group I). The three-ratio model (M3), allowing independent ω values for groups I, II, and III, explored selective pressures across species. Additionally, the free-ratio model (M1), estimating specific ω values for each branch, examined variation in selective pressures across all groups. Differences in ω values among the three groups were statistically tested using the Wilcoxon rank sum test in SPSS 24.0 (SPSS Inc., Chicago, IL). To quantify potential positive selection probability across sequence sites, we implemented paired site models (M7 and M8), allowing ω to vary among sites (M7: beta [0 < ω < 1]; M8: beta & ω [ω > 1]) to identify positive selection at each site. Finally, branch-site models were employed to examine selective pressures, allowing each lineage and site to have its own ω value. This approach facilitates detection of selective pressure magnitudes on foreground branches and identification of positively selected sites. The paired comparison models used were Model A vs. Model A null. Likelihood ratio tests (LRTs) evaluated significance between comparative models by calculating twice the log-likelihood difference (2∆L) based on the chi-square distribution, with degrees of freedom (df) representing the difference in free parameters between models.

In addition to positively selected sites identified by the site and branch-site models from PAML, we conducted further identification using the Fixed Effects Likelihood (FEL) and Mixed Effects Model of Evolution (MEME) methods available on the HyPhy online platform Datamonkey [53]. FEL uses the maximum likelihood approach to infer Ka and Ks at each site, assuming constant selection pressure throughout the phylogeny. Positive selection was inferred when the likelihood ratio test yielded p < 0.1. In contrast, MEME employs a mixed-effects ML method to examine whether individual sites were under positive selection without needing to specify branches a priori. Candidate sites for positive selection were identified when β+ > α, with a significant likelihood ratio test at p < 0.1.

Additionally, to assess changes in substitution ratios potentially resulting from relaxed functional constraints within specific branches, we utilized RELAX software [54] to evaluate natural selection strength among different groups. RELAX begins by fitting a codon model with three ω classes to the entire phylogeny (null model) and assesses for relaxed or intensified selection by introducing ‘k’ as a parameter of selection intensity, where k ≥ 0 and k = Log ωtest/Log ωreference. A significant result with k > 1 or k < 1 indicates intensified or relaxed selection strength along the test branches, respectively.

Amino acid changes and structural analyses

The TreeSAAP program [55] was used to examine whether the positively selected sites identified in PAML and HyPhy exhibited changes in amino acid physicochemical properties at the protein level. TreeSAAP assessed the impact of natural selection based on 31 structures and physicochemical properties by measuring goodness-of-fit values. A change in amino acid properties within the range of 6 to 8 indicated radical alterations, suggesting potential positive selection [56]. To further investigate whether the positively selected sites were located in critical functional domains of the protein, we downloaded homologous protein structures of Triplophysa mitogenomes from the UniProt website (http://www.uniprot.org/). We highlighted the positively selected sites detected by PAML, HyPhy, and TreeSAAP on their three-dimensional structures using PyMOL software (Schrödinger, New York, USA).

Electronic supplementary material

Below is the link to the electronic supplementary material.

Acknowledgements

The authors would like to acknowledge Mr. Hongfu Yang for his help in sample collection.

Author contributions

WJ conceived the project and designed scientific objectives. SS, JC and WJ prepared the fish samples. SS, JC, and HX conducted the genome assembly, annotation and bioinformatics analysis. SS and CJ prepared the first manuscript. WJ improved the first manuscript. HX and ZL participated in discussions and revised the manuscript. All authors read and approved the final manuscript.

Funding

This work was supported by National Natural Science Foundation of China (32060128, 32160241) and the Ph.D start-up fund in Jishou University.

Data availability

The final complete mitogenome, along with annotated information, has been deposited in GenBank under accession number PQ356185 (https://www.ncbi.nlm.nih.gov/nuccore/PQ356185). All the analyses and findings of this study were based on this sequence and other sequences available in GenBank.

Declarations

Ethics approval and consent to participate

The collection and sampling of the specimens were approved by the Ethics Committee of Jishou University, according to the “3R principle” (Reduction, Replacement, and Refinement) that required by National Ministry of Science and Technology (No. 398 [2006]). All the procedures of animal treatment were also complied with the guidance of the Code of Practice for the Housing and Care of Animals and Wildlife Protection Act of China.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Shuang Song and Jianhan Cao contributed equally to this work.

References

  • 1.Ma L, Zhao YH, Yang JX. Cavefish of China. In: Encyclopedia of Caves. Edited by T WWCDP, 3rd ed. edn. London: Academic Press. 2019: 237–254.
  • 2.Jeffery WR, Ma L, Zhao YH. Cavefish as biological models in the laboratory and in the wild. Zool Res. 2023;44(4):834–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Maldonado E, Rangel-Huerta E, Rodriguez-Salazar E, Pereida-Jaramillo E, Martínez-Torres A. Subterranean life: behavior, metabolic, and some other adaptations of Astyanax cavefish. J Exp Zool B Mol Dev Evol. 2020;334(7–8):463–73. [DOI] [PubMed] [Google Scholar]
  • 4.ML SDN. Sensory adaptations of fishes to subterranean environments. Bioscience. 2013;63(4):274–83. [Google Scholar]
  • 5.Jeffery WR. Astyanax surface and cave fish morphs. Evodevo. 2020;11:14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Xiong S, Wang W, Kenzior A, Olsen L, Krishnan J, Persons J, Medley K, Peuß R, Wang Y, Chen S, et al. Enhanced lipogenesis through Pparγ helps cavefish adapt to food scarcity. Curr Biol. 2022;32(10):2272–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Medley JK, Persons J, Biswas T, Olsen L, Peuß R, Krishnan J, Xiong S, Rohner N. The metabolome of Mexican cavefish shows a convergent signature highlighting sugar, antioxidant, and Ageing-Related metabolites. Elife. 2022;11:e74539. [DOI] [PMC free article] [PubMed]
  • 8.Jiang WS, Li J, Xiang HM, Sun C, Chang JB, Yang JX. Comparative analysis and phylogenetic and evolutionary implications of mitogenomes of Chinese Sinocyclocheilus cavefish (Cypriniformes: Cyprinidae). Zool Res. 2023;44(4):779–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ma L, Yang JX, Lei FK, Xu MZ, Zhao YH, Jeffery WR. Protection and exploration of the scientific potential of Chinese cavefish. Zool Res. 2023;44(4):675–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Liu F, Zeng Z-X, Gong Z. Two new hypogean species of Triplophysa (Cypriniformes: Nemacheilidae) from the river Yangtze drainage in Guizhou, China. J Vertebrate Biology. 2022;71:22062. [Google Scholar]
  • 11.Jiang WS, Li J, Lei XZ, Wen ZR, Han YZ, Yang JX, Chang JB. Sinocyclocheilus sanxiaensis, a new blind fish from the three Gorges of Yangtze river provides insights into speciation of Chinese cavefish. Zoological Res. 2019;40(6):552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Borowsky R, Cavefishes. Curr Biol. 2018;28(2):R60–4. [DOI] [PubMed] [Google Scholar]
  • 13.Cao J, Song S, Yi W, Xiang H, Yang H, Li J, Jiang W. Two New Cavefish Species of Triplophysa (Cypriniformes: Nemacheilidae) from the Karst Subterranean Rivers of Yunnan, China. Zoological Journal of the Linnean Society 2025. in press.
  • 14.Wang JX, Lan XY, Luo QH, Gu ZR, Zhou Q, Zhang MY, Zhang YX, Jiang WS. Characterization, Comparison of Two New Mitogenomes of Crocodile Newts Tylototriton (Caudata: Salamandridae), and Phylogenetic Implications. Genes (Basel) 2022, 13(10):1878. [DOI] [PMC free article] [PubMed]
  • 15.Xiang H, Zhou Q, Li W, Shu J, Gu Z, Jiang W. Insights into phylogenetic positions and distribution patterns: complete mitogenomes of two sympatric Asian horned toads in Boulenophrys (Anura: Megophryidae). Ecol Evol. 2024;14(7):e11687. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Das J. The role of mitochondrial respiration in physiological and evolutionary adaptation. BioEssays. 2006;28(9):890–901. [DOI] [PubMed] [Google Scholar]
  • 17.Shen YY, Liang L, Zhu ZH, Zhou WP, Irwin DM, Zhang YP. Adaptive evolution of energy metabolism genes and the origin of flight in bats. Proc Natl Acad Sci U S A. 2010;107(19):8666–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zhang J, Shu L, Peng Z. Adaptive evolution of mitochondrial genomes in Triplophysa cavefishes. Gene. 2024;893:147947. [DOI] [PubMed] [Google Scholar]
  • 19.Wang J, Tang Q, Wang Z, Zhang Y, Wu Q, Peng Z. The complete mitogenome sequence of a cave loach Triplophysarosa (Teleostei, balitoridae, Nemacheilinae). Mitochondrial DNA. 2012;23(5):366–8. [DOI] [PubMed] [Google Scholar]
  • 20.Lan X, Wang J, Zhang M, Zhou Q, Xiang H, Jiang W. Molecular identification of Acrossocheilus jishouensis (Teleostei: Cyprinidae) and its complete mitochondrial genome. Biochem Genet. 2024;62(2):1396–412. [DOI] [PubMed] [Google Scholar]
  • 21.Zhou Q, Xiang HM, Zhang MY, Liu Y, Gu ZR, Lan XY, Wang JX, Jiang WS. Two complete mitochondrial genomes of Leptobrachium (Anura: Megophryidae: Leptobrachiinae): Characteristics, Population Divergences, and Phylogenetic Implications. Genes (Basel). 2023;14(3):768. [DOI] [PMC free article] [PubMed]
  • 22.He H, Gao Z, Hu Z, Cai L, Huang Y, Meng Z, Liang R. Comparative characterization and phylogenetic analysis of complete mitogenome of three taxonomic confused groupers and the insight to the novel gene tandem duplication in Epinephelus. Front Mar Sci. 2024;11:1450003. [Google Scholar]
  • 23.Zhang K, Zhu K, Liu Y, Zhang H, Gong L, Jiang L, Liu L, Lü Z, Liu B. Novel gene rearrangement in the mitochondrial genome of Muraenesox cinereus and the phylogenetic relationship of anguilliformes. Sci Rep. 2021;11(1):2411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Zhang C, Zhang K, Peng Y, Zhou J, Liu Y, Liu B. Novel gene rearrangement in the mitochondrial genome of three Garra and insights into the phylogenetic relationships of labeoninae. Front Genet. 2022;13:922634. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Minhas BF, Beck EA, Cheng CC, Catchen J. Novel mitochondrial genome rearrangements including duplications and extensive heteroplasmy could underlie temperature adaptations in Antarctic notothenioid fishes. Sci Rep. 2023;13(1):6939. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Miya M, Nishida M. Organization of the mitochondrial genome of a Deep-Sea fish, Gonostoma gracile (Teleostei: Stomiiformes): First Example of Transfer RNA Gene Rearrangements in Bony Fishes. Mar Biotechnol (NY). 1999;1(5):416–26. [DOI] [PubMed] [Google Scholar]
  • 27.Kondrashov FA. Gene duplication as a mechanism of genomic adaptation to a changing environment. Proc Biol Sci. 2012;279(1749):5048–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Feng C, Tang Y, Liu S, Tian F, Zhang C, Zhao K. Multiple convergent events created a nominal widespread species: Triplophysa stoliczkae (Steindachner, 1866) (Cobitoidea: Nemacheilidae). BMC Evol Biol. 2019;19(1):177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Wang C, Luo S, Yao N, Wang X, Song Y, Chen S. A comprehensive analysis of Triplophysa labiata (Kessler, 1874) mitogenome and its phylogenetic implications within the triplophysa genus. Genes (Basel). 2023;14(7):1356. [DOI] [PMC free article] [PubMed]
  • 30.Wang X, Song Y, Xie H, Zi F, Chen S, Luo S. Complete mitogenome of the Triplophysa bombifrons: comparative analysis and phylogenetic relationships among the members of triplophysa. Genes (Basel). 2023;14(1):128. [DOI] [PMC free article] [PubMed]
  • 31.Song J, Song S. Mitochondrial genome and phylogenetic analysis of Triplophysa erythraea (in Chinese). Chin J Zool. 2024;59(4):588–96. [Google Scholar]
  • 32.Zhong H, Sun Y, Wu H, Li S, Shen Z, Yang C, Wen M, Chen P, Gu Q. Pleistocene climate and geomorphology drive the evolution and phylogeographic pattern of Triplophysa robusta (Kessler, 1876). Front Genet. 2022;13:955382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Li Y, Zhang R, Liu S, Donath A, Peters RS, Ware J, Misof B, Niehuis O, Pfrender ME, Zhou X. The molecular evolutionary dynamics of oxidative phosphorylation (OXPHOS) genes in hymenoptera. BMC Evol Biol. 2017;17(1):269. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Yang M, Gong L, Sui J, Li X. The complete mitochondrial genome of Calyptogena marissinica (Heterodonta: veneroida: Vesicomyidae): Insight into the deep-sea adaptive evolution of vesicomyids. PLoS ONE. 2019;14(9):e0217952. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zhang J, Nielsen R, Yang Z. Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evol. 2005;22(12):2472–9. [DOI] [PubMed] [Google Scholar]
  • 36.Meng G, Li Y, Yang C, Liu S. MitoZ: a toolkit for animal mitochondrial genome assembly, annotation and visualization. Nucleic Acids Res. 2019;47(11):e63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Dierckxsens N, Mardulyn P, Smits G. NOVOPlasty:de Novoassembly of organelle genomes from whole genome data. Nucleic Acids Res. 2016,45(4):e18. [DOI] [PMC free article] [PubMed]
  • 38.Allio R, Schomaker-Bastos A, Romiguier J, Prosdocimi F, Nabholz B, Delsuc F. MitoFinder: efficient automated large-scale extraction of mitogenomic data in target enrichment phylogenomics. Mol Ecol Resour. 2020;20(4):892–905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Donath A, Jühling F, Al-Arab M, Bernhart SH, Reinhardt F, Stadler PF, Middendorf M, Bernt M. Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes. Nucleic Acids Res. 2019;47(20):10543–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Zhu T, Sato Y, Sado T, Miya M, Iwasaki W. MitoFish, mitoannotator, and mifish pipeline: updates in 10 years. Mol Biol Evol. 2023;40(3):msad035. [DOI] [PMC free article] [PubMed]
  • 41.Chan PP, Lin BY, Mak AJ, Lowe TM. tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res. 2021;49(16):9077–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Tamura K, Stecher G, Kumar S. MEGA11: molecular evolutionary genetics analysis version 11. Mol Biol Evol. 2021;38(7):3022–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Perna NT, Kocher TD. Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes. J Mol Evol. 1995;41(3):353–8. [DOI] [PubMed] [Google Scholar]
  • 44.Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, Sánchez-Gracia A. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017;34(12):3299–302. [DOI] [PubMed] [Google Scholar]
  • 45.Kimura M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980;16(2):111–20. [DOI] [PubMed] [Google Scholar]
  • 46.Homchan S, Kutanan W, Gupta YM. A reproducible workflow for assembling the mitochondrial genome of Acheta domesticus (Orthoptera: Gryllidae). Ecol Evol. 2024;14(7):e11696. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Richter M, Rosselló-Móra R, Oliver Glöckner F, Peplies J. JSpeciesWS: a web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics. 2016;32(6):929–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Wei ZF, Ta KW, Zhang NN, Liu SS, Meng L, Liu KQ, Cai CY, Peng XT, Shao CW. Molecular phylogenetic relationships based on mitochondrial genomes of novel deep-sea corals (Octocorallia: Alcyonacea): Insights into slow evolution and adaptation to extreme deep-sea environments. Zool Res. 2024;45(1):215–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Lanfear R, Frandsen PB, Wright AM, Senfeld T, Calcott B. PartitionFinder 2: new methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses. Mol Biol Evol. 2017;34(3):772–3. [DOI] [PubMed] [Google Scholar]
  • 50.Stamatakis A. Using RAxML to infer phylogenies. Curr Protoc Bioinf. 2015;51:6.14.11–16.14.14. [DOI] [PubMed]
  • 51.Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Höhna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP. MrBayes 3.2: efficient bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61(3):539–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24(8):1586–91. [DOI] [PubMed] [Google Scholar]
  • 53.Weaver S, Shank SD, Spielman SJ, Li M, Muse SV, Kosakovsky Pond SL. Datamonkey 2.0: A modern web application for characterizing selective and other evolutionary processes. Mol Biol Evol. 2018;35(3):773–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Wertheim JO, Murrell B, Smith MD, Kosakovsky Pond SL, Scheffler K. RELAX: detecting relaxed selection in a phylogenetic framework. Mol Biol Evol. 2015;32(3):820–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Woolley S, Johnson J, Smith MJ, Crandall KA, McClellan DA. TreeSAAP: selection on amino acid properties using phylogenetic trees. Bioinformatics. 2003;19(5):671–2. [DOI] [PubMed] [Google Scholar]
  • 56.Tian R, Losilla M, Lu Y, Yang G, Zakon H. Molecular evolution of globin genes in gymnotiform electric fishes: relation to hypoxia tolerance. BMC Evol Biol. 2017;17(1):51. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The final complete mitogenome, along with annotated information, has been deposited in GenBank under accession number PQ356185 (https://www.ncbi.nlm.nih.gov/nuccore/PQ356185). All the analyses and findings of this study were based on this sequence and other sequences available in GenBank.


Articles from BMC Genomics are provided here courtesy of BMC

RESOURCES