Abstract
The long-tailed ground squirrel (Urocitellus undulatus) is primarily distributed in various regions, including Xinjiang and Heilongjiang in China, as well as in Siberia, Mongolia, the Russian Federation, and Kazakhstan. In this study, we report the complete mitochondrial genomes of U. undulatus. The genome consists of 16,456 base pairs, including 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and a control region. The phylogenetic relationship indicates a closer evolutionary relationship between U. undulatus and U. richardsonii. These findings provide a foundation for the taxonomic identification, phylogenetic evolution, and mitochondrial genome research of U. undulatus.
Keywords: Mitochondrial genome, Urocitellus undulatus, Sciuridae, phylogeny
Graphical Abstract
Introduction
The long-tailed ground squirrel (Urocitellus undulatus Pallas, 1778) belongs to the rodent family Sciuridae, within the Urocitellus genus. It is distributed in various regions, including China (Xinjiang, Heilongjiang), Siberia, Mongolia, Russia, and Kazakhstan (McLean et al. 2018). These squirrels predominantly inhabit the sparse tree grasslands and vast prairie areas at the edge of the Gobi Desert. They particularly prefer areas with well-drained, loamy soils, which make it easier for them to dig their burrows (Ricankova et al. 2006; Kryštufek and Vohralík 2013). As an omnivorous species, U. undulatus primarily feeds on vegetation found in the grasslands, including grass, seeds, and roots. However, to obtain additional protein, they also eat insects (Smith et al. 2010). Like other species of the family Sciuridae, the burrows of U. undulatus are divided into resident and temporary burrows. The resident burrows are more complex in structure and differ between summer and winter, whereas the temporary burrows have simpler tunnels and are primarily used for escaping from predators.
Due to the wide distribution and large population of U. undulatus, it was classified as a “Least Concern” species by the International Union for Conservation of Nature (IUCN) in 2016. However, as a reservoir for many pathogens (Demina et al. 2017; Li et al. 2019a), accurate identification of U. undulatus is crucial. Mitochondrial genomes are commonly employed in species identification and evolutionary studies due to their conserved gene content, maternal inheritance, and compact molecular structure (Tatarenkov and Avise 2007; Gao et al. 2017). In this study, the mitochondrial genome of U. undulatus was determined and annotated, providing a valuable reference for future research on this species, as well as establishing a basis for the classification and evolutionary analysis of the Sciuridae family.
Materials and methods
Sample collection and DNA extraction
In June 2024, a male adult squirrel was collected as a specimen after natural death in Qinghe County (89°47′E, 45°00′N), Xinjiang Uygur Autonomous Region, China. The specimen’s dorsal fur was gray-brown with white spots, and its claws were gray-brown. Diagnostic characteristics confirmed the specimen as U. undulatus (Rodentia: Sciuridae) (Figure 1). After collection, the specimen was first disinfected with 75% ethanol. A small sterile muscle sample was then taken from its leg and preserved at −20 °C for further analysis. Voucher specimens have been preserved in the Dali University Biological Herbarium (voucher number DLU 240589) (URL: http://www.dali.edu.cn/jcyxy/xkpt/jcyxsyjxzx/6431.htm). Contact person: Xing Yang, yang08220013@163.com. Genomic DNA was extracted following the protocol provided by the TIANamp Genomic DNA Kit (Tiangen, Beijing, China) as recommended by the manufacturer.
Figure 1.
Species reference image of Urocitellus undulatus collected from Qinghe County, photographed by Qingsong Sun.
Sequence, assembly, and annotation analysis
DNA samples were processed by Harbin Botai Biological Co., Ltd. using the Illumina NovaSeq 6000 platform. Libraries were prepared with a whole-genome shotgun approach, featuring 350 bp insert fragments. Paired-end (PE) sequencing was conducted with 150 bp reads at both ends of the DNA fragments. Raw sequencing reads were first assessed for quality metrics using FastQC (Brown et al. 2017). To ensure robust downstream analyses, raw data were filtered to remove low-quality sequences, yielding high-quality clean reads. The high-quality sequencing data were assembled using SPAdes v3.14.1 (Bankevich et al. 2012), with multiple k-mer values applied to further optimize the assembly. Mitochondrial protein-coding genes (PCGs), transfer RNA genes (tRNAs), and rRNA genes (rRNAs) were annotated using MITOS (Bernt et al. 2013). Genome architecture was visualized as a circular map via Organellar Genome DRAW v1.2 (Greiner et al. 2019). The AT and GC content of each gene were analyzed using DNAStar software, and the AT-Skew and GC-Skew are computed using these equations: AT skew = (A − T)/(A + T) and GC skew = (G − C)/(G + C) (Perna and Kocher 1995).
Phylogenetic analysis
Phylogenetic relationships between U. undulatus and other closely related species were inferred using maximum likelihood (ML) and Bayesian inference (BI) methods. Downloaded the 13 PCGs from 21 species from NCBI, followed by sequence trimming and alignment using MEGA v 11.0. The ML method was executed in MEGA v 11.0 with 1,000 bootstrap replicates (Tamura et al. 2021). The most appropriate model was inferred to be GTR+G + I based on ModelFinder. The BI analysis was performed using MrBayes version 3.2.7, involving 1,000,000 iterations, with a sample taken every 1000 generations, and the initial 25% of the trees were excluded as burn-in (Ronquist and Huelsenbeck 2003). The phylogenetic tree was viewed and modified using FigTree software.
Results
The mitochondrial genome of U. undulatus spans 16,456 bp in total length (GenBank accession number: PQ720778) and is illustrated in Figure 2, with a mean coverage of trimmed sequencing data at 1735.90× (Figure S1). The mitogenome comprises the typical 37 genes found in animals, consisting of 13 PCGs, 2 rRNAs, 22 tRNAs, plus a control region (Figure 2). Nucleotide composition was adenine (32.33%), thymine (30.92%), guanine (12.64%), and cytosine (24.11%). The mitogenome consists of 63.25% A + T content and 36.75% G + C content. In the analysis of AT-skew and GC-skew for the entire mitochondrial genome, the obtained values were 0.022 and −0.312, respectively (Table S1). Two rRNA genes (rrnL and rrnS), 14 tRNAs (trnV, trnF, trnL1, trnM, trnD, trnW, trnI, trnR, trnG, trnK, trnH, trnS, trnL, and trnT) genes and 11 PCGs (nad1-5, cox1-3, atp6, atp8, cob and nad4L) were situated on the majority stand. The minority stand contained a PCG (nad6), and eight tRNA genes (trnC, trnY, trnN, trnQ, trnA, trnS2, trnP and trnE) (Table S2). The 13 PCGs had a sequence length of 11,398 bp, comprising 69.26% of the entire mitochondrial genome. Nucleotide composition was adenine (30.88%), thymine (32.47%), guanine (12.44%), and cytosine (24.71%). The PCGs consist of 62.85% A + T content and 37.15% G + C content. Start codons for all PCGs were ATN, but different protein coding genes correspond to different termination codons; of which 6 stop codons were TAA (cox1, cox2, atp8, atp6, nad4L, and nad5), 2 stop codons were AGA (nad6 and cob), 2 incomplete stop codons were TA(A) (nad1 and nad3) and 3 incomplete stop codons were T(AA) (nad2, cox3 and nad4). tRNA lengths vary from 74 bp to 59 bp, with trnL-TTA being the extended sequence at 74 base pairs while trnS-AGC is the shortest at 59 base pairs. The control region, spanning 1,006 bp, is positioned between trnF and trnP. The rrnL and rrnS are located on either side of trnV, with lengths of 1,572 bp and 967 bp, respectively.
Figure 2.
The mitochondrial genome map of U. undulatus.
Phylogenetic trees were constructed by concatenating 13 protein gene sequences from 21 species and using ML and BI methods (Figure 3). Myoxus glis (Gliridae), Castor canadensis, and C. fiber (Castoridae), and as outgroups to confirm the phylogenetic relationship of Sciuridae. In the BI and ML trees, U. undulatus was sister to that of U. richardsonii with high bootstrap support (1.00 in BI and 100% in ML). This phylogenetic analysis provides useful genetic resources for revealing the evolution of the family Sciuridae.
Figure 3.
Phylogenetic relationships of the Sciuridae family inferred using BI and ML analyses based on 13 PCGs of mitogenomes. The following sequences were used: Cynomys gunnisoni gunnisoni (Streich et al. 2019), Cynomys leucurus and Cynomys ludovicianus (Li et al. 2016), Ictidomys tridecemlineatus (Zhang et al. 2016b), Urocitellus undulatus (this study), Urocitellus richardsonii and Callospermophilus lateralis (Zhang et al. 2016a), Spermophilus alashanicus (Zhao et al. 2024), Spermophilus xanthoprymnus (Unpublished), Spermophilus citellus (Krystufek et al. 2009), Spermophilus taurensis (Matrosova et al. 2023), Marmota flaviventris (Unpublished), Marmota vancouverensis (Hao and Cao 2019), Marmota baibacina (Unpublished), Marmota himalayana (Li et al. 2019b), Marmota sibiri (Unpublished), Tamias sibiricus (Yoon et al. 2015), Tamiops swinhoei (Xu et al. 2016), Myoxus glis (Xu et al. 2016), Castor canadensis and Castor fiber (Horn et al. 2011).
Discussion and conclusion
In this study, the entire mitochondrial genome of U. undulatus was sequenced. Consistent with most metazoan mitochondrial genomes, it contains 37 functional genes (Boore 1999). However, some previously published animal mitochondrial genomes show differences from other metazoan genomes, as their mitochondrial genomes lack the atp8 gene (Yamasaki et al. 2012; Hao et al. 2024). The length and gene arrangement of the U. undulatus mitochondrial genome are consistent with those of other Sciuridae family species. All protein-coding genes utilize the standard mitochondrial start codons (ATG, ATA, ATT), and it has been observed that the genes nad1, nad2, nad3, nad4, and cox3 possess incomplete stop codons. This phenomenon has also been found in other Sciuridae family species, such as S. taurensis and M. himalayana (Li et al. 2019b; Matrosova et al. 2023). However, these incomplete stop codons will eventually form complete stop codons through post-transcriptional polyadenylation modifications (Beckenbach and Stewart 2009; Chen et al. 2012; Donath et al. 2019). The content of AT is higher than that of GC, which is common in mitochondrial studies of some animals (Hassanin et al. 2005).
Two methods were utilized to construct the phylogenetic tree of the Sciuridae family, and the species within the family clustered together with high confidence. The results show that U. undulatus and U. richardsonii are closely related, with species within the same genus (Urocitellus) clustering together. This study fills the gap in the mitochondrial genome of U. undulatus, lays the foundation for the systematics and evolution of the genus Urocitellus, providing valuable resources for the species evolution, taxonomy, species identification, and population genetics research of the Sciuridae family.
Supplementary Material
Acknowledgments
The study was created by Zimeng Liu and Lei Chen, who also wrote the manuscript. The analysis and interpretation of the data were carried out by Yuran Pang, Fengyan Zhang, Tangxin Liu, and Yayun He. Contributions to the collection and identification of U. undulatus were made by Xing Yang, Xinhui Zhang, Jiaxin Wang, and Xu He. The interpretation of experimental data, the critical revision of important knowledge content, and the final approval of the version to be published are the responsibility of Qingsong Sun, and all authors agree to be accountable for all aspects of the work.
Funding Statement
Jilin city science and technology innovation development plan project(Grant No. 20230103009).
Ethical approval
The study was approved by the institutional review boards of Jilin Agricultural Science and Technology University and Dali University, China. The collection of rodent muscle tissue was performed following the guidelines provided by Dali University under reference number DLU 240589. The field research complies with the relevant regulations of Qinghe County, Xinjiang Uygur Autonomous Region.
Disclosure statement
No potential conflict of interest was reported by the authors.
Data availability statement
The data supporting the findings of this investigation may be found at https://www.ncbi.nlm.nih.gov/ under the reference number PQ720778. The associated BioProject, SRA, and Bio-Sample numbers are PRJNA1216044, SRR32134887, and SAMN46424648, respectively.
References
- Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, et al. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19(5):455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beckenbach AT, Stewart JB.. 2009. Insect mitochondrial genomics 3: the complete mitochondrial genome sequences of representatives from two neuropteroid orders: a dobsonfly (order Megaloptera) and a giant lacewing and an owlfly (order Neuroptera). Genome. 52(1):31–38. doi: 10.1139/g08-098. [DOI] [PubMed] [Google Scholar]
- Bernt M, Donath A, Jühling F, Externbrink F, Florentz C, Fritzsch G, Pütz J, Middendorf M, Stadler PF.. 2013. MITOS: improved de novo metazoan mitochondrial genome annotation. Mol Phylogenet Evol. 69(2):313–319. doi: 10.1016/j.ympev.2012.08.023. [DOI] [PubMed] [Google Scholar]
- Boore JL. JNar. 1999. Animal mitochondrial genomes. Nucleic Acids Res. 27(8):1767–1780. doi: 10.1093/nar/27.8.1767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown J, Pirrung M, McCue LA.. 2017. FQC Dashboard: integrates FastQC results into a web-based, interactive, and extensible FASTQ quality control tool. Bioinformatics. 33(19):3137–3139. doi: 10.1093/bioinformatics/btx373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen M, Tian L-L, Shi Q-H, Cao T-W, Hao J-S. JZr. 2012. Complete mitogenome of the Lesser Purple Emperor Apatura ilia (Lepidoptera: nymphalidae: apaturinae) and comparison with other nymphalid butterflies. Dongwuxue Yanjiu. 33(2):191–201. doi: 10.3724/SP.J.1141.2012.02191. [DOI] [PubMed] [Google Scholar]
- Demina TV, Tkachev SE, Kozlova IV, Doroshchenko EK, Lisak OV, Suntsova OV, Verkhozina MM, Dzhioev YP, Paramonov AI, Tikunov AY, et al. 2017. Comparative analysis of complete genome sequences of European subtype tick-borne encephalitis virus strains isolated from Ixodes persulcatus ticks, long-tailed ground squirrel (Spermophilus undulatus), and human blood in the Asian part of Russia. Ticks Tick Borne Dis. 8(4):547–553. doi: 10.1016/j.ttbdis.2017.03.002. [DOI] [PubMed] [Google Scholar]
- Donath A, Jühling F, Al-Arab M, Bernhart SH, Reinhardt F, Stadler PF, Middendorf M, Bernt M.. 2019. Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes. Nucleic Acids Res. 47(20):10543–10552. doi: 10.1093/nar/gkz833. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao Y, Zhang Y, Yang X, Qiu J-H, Duan H, Xu W-W, Chang Q-C, Wang C-R.. 2017. Mitochondrial DNA evidence supports the hypothesis that Triodontophorus species belong to cyathostominae. Front Microbiol. 8:1444. doi: 10.3389/fmicb.2017.01444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greiner S, Lehwark P, Bock R.. 2019. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 47(W1):W59–w64. doi: 10.1093/nar/gkz238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hao Z, Cao Y.. 2019. The Complete mitochondrial genome of Marmota vancouverensis (Vancouver Island Marmot). Mitochondrial DNA B Resour. 4(2):3151–3152. doi: 10.1080/23802359.2019.1668308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hao Y, Li B, Ma L, Xu M, Niu P, Bu Y.. 2024. The complete mitochondrial genome of Cylicocyclus ultrajectinus (Ihle, 1920). Mitochondrial DNA B Resour. 9(11):1518–1521. doi: 10.1080/23802359.2024.2427110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hassanin A, Léger N, Deutsch J.. 2005. Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of metazoa, and consequences for phylogenetic inferences. Syst Biol. 54(2):277–298. doi: 10.1080/10635150590947843. [DOI] [PubMed] [Google Scholar]
- Horn S, Durka W, Wolf R, Ermala A, Stubbe A, Stubbe M, Hofreiter M.. 2011. Mitochondrial genomes reveal slow rates of molecular evolution and the timing of speciation in beavers (Castor), one of the largest rodent species. PLoS One. 6(1):e14622. doi: 10.1371/journal.pone.0014622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krystufek B, Bryja J, Buzan EV.. 2009. Mitochondrial phylogeography of the European ground squirrel, Spermophilus citellus, yields evidence on refugia for steppic taxa in the southern Balkans. Heredity (Edinb). 103(2):129–135. doi: 10.1038/hdy.2009.41. [DOI] [PubMed] [Google Scholar]
- Kryštufek B, Vohralík VJL. series nova. 2013. Taxonomic revision of the Palaearctic rodents (Rodentia). Part 2. Sciuridae: urocitellus. Marmota and Sciurotamias. 44:27–138. [Google Scholar]
- Li Y, Gu C, Yang J, Wei Y, Wang M, Nan X.. 2019b. The mitochondrial genome of Himalayan marmot, Marmota himalayana (Rodentia: sciuridae) using next-generation sequencing. Mitochondrial DNA Part B, Resources. 4(1):1181–1182. doi: 10.1080/23802359.2019.1591185. [DOI] [Google Scholar]
- Li L-L, Liu M-M, Shen S, Zhang Y-J, Xu Y-L, Deng H-Y, Deng F, Duan Z-J.. 2019a. Detection and characterization of a novel hepacivirus in long-tailed ground squirrels (Spermophilus undulatus) in China. Arch Virol. 164(9):2401–2410. doi: 10.1007/s00705-019-04303-z. [DOI] [PubMed] [Google Scholar]
- Li B, Yu D, Cheng H, Storey KB, Zhang J.. 2016. The complete mitochondrial genomes of Cynomys leucurus and C. ludovicianus (Rodentia: sciuridae). Mitochondrial DNA A DNA Mapp Seq Anal. 27(5):3295–3296. doi: 10.3109/19401736.2015.1015010. [DOI] [PubMed] [Google Scholar]
- Matrosova VA, Gündüz İ, Ermakov OA, Demirtaş S, Simonov E.. 2023. The complete mitochondrial genome of endemic Taurus ground squirrel Spermophilus taurensis (Rodentia: sciuridae). RusJTheriol. 22(2):97–101. doi: 10.15298/rusjtheriol.22.2.02. [DOI] [Google Scholar]
- McLean BS, Nyamsuren B, Tchabovsky A, Cook JA.. 2018. Impacts of late quaternary environmental change on the long-tailed ground squirrel (Urocitellus undulatus) in Mongolia. Zool Res. 39(5):364–372. doi: 10.24272/j.issn.2095-8137.2018.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perna NT, Kocher TD.. 1995. Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes. J Mol Evol. 41(3):353–358. doi: 10.1007/bf00186547. [DOI] [PubMed] [Google Scholar]
- Ricankova V, Fric Z, Chlachula J, Stastna P, Faltynkova A, Zemek FJ. Joz. 2006. Habitat requirements of the long‐tailed ground squirrel (Spermophilus undulatus) in the southern Altai. J Zool. 270(1):1–8. doi: 10.1111/j.1469-7998.2006.00136.x. [DOI] [Google Scholar]
- Ronquist F, Huelsenbeck JP.. 2003. MrBayes 3: bayesian phylogenetic inference under mixed models. Bioinformatics. 19(12):1572–1574. doi: 10.1093/bioinformatics/btg180. [DOI] [PubMed] [Google Scholar]
- Smith AT, Lunde D, Hoffmann RS, Xie Y.. 2010. A guide to the mammals of China. Princeton, NJ: Princeton University Press. [Google Scholar]
- Streich SP, Keepers KG, Griffin KA, Kane NC, Martin AP. JMDPB. 2019. The complete mitochondrial genome of Gunnison’s prairie dog subspecies (Cynomys gunnisoni gunnisoni) and phylogenetic relationship within the genus Cynomys. Mitochondrial DNA Part B, Resources. 4(1):397–398. doi: 10.1080/23802359.2018.1547157. [DOI] [Google Scholar]
- Tamura K, Stecher G, Kumar S.. 2021. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol Biol Evol. 38(7):3022–3027. doi: 10.1093/molbev/msab120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tatarenkov A, Avise JC.. 2007. Rapid concerted evolution in animal mitochondrial DNA. Proc Biol Sci. 274(1619):1795–1798. doi: 10.1098/rspb.2007.0169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu P, Li Y, Guo Y, Cheng S, Lei P.. 2016. Complete mitochondrial genome of the Tamiops swinhoei (Rodentia: sciuridae). Mitochondrial DNA A DNA Mapp Seq Anal. 27(3):2257–2258. doi: 10.3109/19401736.2014.984169. [DOI] [PubMed] [Google Scholar]
- Yamasaki H, Ohmae H, Kuramochi T.. 2012. Complete mitochondrial genomes of Diplogonoporus balaenopterae and Diplogonoporus grandis (Cestoda: diphyllobothriidae) and clarification of their taxonomic relationships. Parasitol Int. 61(2):260–266. doi: 10.1016/j.parint.2011.10.007. [DOI] [PubMed] [Google Scholar]
- Yoon KB, Cho JY, Park YC.. 2015. Complete mitochondrial genome of a chipmunk species, Tamias sibiricus (Rodentia: sciuridae) in Korea. Mitochondrial DNA. 26(5):749–750. doi: 10.3109/19401736.2013.848352. [DOI] [PubMed] [Google Scholar]
- Zhang L, Huang Y, Storey KB, Yu D, Zhang JY.. 2016a. Complete mitochondrial genomes of Callospermophilus lateralis and Urocitellus richardsonii (Rodentia: sciuridae). Mitochondrial DNA B Resour. 1(1):359–360. doi: 10.1080/23802359.2016.1168716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang L, Storey KB, Yu DN, Hu Y, Zhang JY.. 2016b. The complete mitochondrial genome of Ictidomys tridecemlineatus (Rodentia: sciuridae). Mitochondrial DNA A DNA Mapp Seq Anal. 27(4):2608–2609. doi: 10.3109/19401736.2015.1041117. [DOI] [PubMed] [Google Scholar]
- Zhao Y, Liang J, Li J, Zhang Z, Sun Y, Liu F, Zhang X, Liang Y, Teng L, Liu Z, et al. 2024. First complete mitochondrial genome of the Alashan ground squirrel (Spermophilus alashanicus) (Rodentia: sciuridae) from Ningxia, China. Mitochondrial DNA B Resour. 9(1):148–152. doi: 10.1080/23802359.2024.2305406. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data supporting the findings of this investigation may be found at https://www.ncbi.nlm.nih.gov/ under the reference number PQ720778. The associated BioProject, SRA, and Bio-Sample numbers are PRJNA1216044, SRR32134887, and SAMN46424648, respectively.