Skip to main content
Mitochondrial DNA. Part B, Resources logoLink to Mitochondrial DNA. Part B, Resources
. 2026 Jan 24;11(2):290–294. doi: 10.1080/23802359.2026.2619330

Characterization of the complete chloroplast genome of Xantolis weimingii Huan C. Wang et Feng Yang et al. 2024 (Sapotaceae, Chrysophylloideae) and its phylogenetic implications

Dongrou Li a, Shaoyun Liu b, Jiangquan Wang b, Feng Yang b,
PMCID: PMC12833893  PMID: 41602093

Abstract

Xantolis weimingii is a species recently described in 2024. In this study, we reported the complete chloroplast (cp) genome of X. weimingii, representing the first assembly of the genome of a Xantolis species. The chloroplast genome of X. weimingii was 158,458 bp in size, comprising a large single-copy (LSC) region of 87,778 bp, a small single-copy (SSC) region of 18,516 bp, and two inverted repeat (IR) regions of 26,082 bp each. A total of 114 distinct genes were annotated, including 75 of which are protein-coding genes, 30 tRNA genes, 4 rRNA genes, and five open reading frames. The Maximum Likelihood (ML) phylogenetic analysis showed that X. weimingii was sister to other members of the subfamily Chrysophylloideae with strong support.

Keywords: Xantolis weimingii, Chrysophylloideae, chloroplast genomics, phylogenetic analysis

Introduction

Xantolis Raf. (Sapotaceae, Chrysophylloideae) comprises approximately 14 species, five of which occur in China, distributed from the eastern Himalayas to the Philippines in tropical Asia (van Royen 1957; Li 1987; Li and Pennington 2019). Morphologically, the genus is characterized by its obvious spines, lanceolate lobes of calyx and corolla, and aristate staminodes (Swenson and Anderberg 2005). Recent studies based on molecular data have demonstrated that the phylogenetic placement of Xantolis within the subfamily Chrysophylloideae remains unresolved, with conflicting topologies reported in previous analyses (Anderberg and Swenson 2003; Swenson and Anderberg 2005; Swenson et al. 2008, 2023; Bartish et al. 2011).

To date, only a handful of chloroplast genome sequences have been reported for Sapotaceae species (Liu et al. 2019; Niu et al. 2020; Zheng et al. 2020), and no complete chloroplast genome of any Xantolis species has been assembled. For poorly studied genera like Xantolis, chloroplast genome data can provide additional informative loci to improve the accuracy of phylogenetic inference. Xantolis weimingii Huan C. Wang et Feng Yang et al. (2024, Figure 1), a recently described species (Yang et al. 2024), is threatened by the ongoing clear-cutting and habitat destruction with only four populations discovered to date. In this study, we first reported the chloroplast genome of X. weimingii, which will offer a useful resource for future conservation genetics research.

Figure 1.

Figure 1.

Picture of the collected sample of Xantolis weimingii. Photographed by Huanchong Wang on 17 September 2017. The young branches, leaves, petioles, pedicels, sepals and fruit of X. weimingii are covered with ferruginous arachnoid-lanate indumentum. The fruit has persistent sepals, a beak at the apex, and a persistent style.

Materials and methods

Leaves of X. weimingii were collected from the living plants (Figure 1), from the mountain behind Fawu village, Dalongtan, Eshan County, Yuxi, Yunnan Province, China (sample collection site coordinates: 24°30’14.17ʺN, 102°03′46.60ʺE). The voucher specimen (contact: Feng Yang; yfde828@qq.com) was deposited at the Herbarium of Yunnan University (YUKU) under the voucher number ES2450.

Total genomic DNA was extracted from silica-dried leaves using a TIANGEN plant genomic DNA extraction kit (TIANGEN Biotech., Beijing, China) following the manufacturer’s protocols. Sequencing was performed on an Illumina HiSeq 6000 sequencing platform (Illumina, CA, USA) at Beijing Novogene Technology Co., Ltd. (Tianjin, China), generating 3 Gb of raw data for X. weimingii. The raw reads were processed in Trimmomatic v.0.32 (Bolger et al. 2014) to remove adapters, low-quality reads and reads containing unknown bases. The chloroplast genome of X. weimingii was assembled using GetOrganelle v1.7.7.1 (Jin et al. 2020). The assembled plastome sequence was annotated using the online annotation tools GeSeq (Tillich et al. 2017) and CPGAVAS2 (Shi et al. 2019), validated with CPGView (Liu et al. 2023), and adjusted with Geneious Prime 2020.0.3., ensuring annotation accuracy. Finally, the sequence was submitted to GenBank of NCBI. The circular chloroplast genome map of X. weimingii was also drawn using CPGview (Liu et al. 2023).

To clarify the phylogenetic position of X. weimingii, a total of 32 cpDNA related to X. weimingii from Sapotaceae, were used as ingroups. Diospyros morrisiana Hance was designated as the outgroup. All sequences were aligned using MAFFT v.7.490 (Katoh and Standley 2013) with the default parameters, and the aligned dataset was cleaned using TrimAl v.1.4.1 (Capella‐Gutierrez et al. 2009) to remove poorly aligned regions. Then we performed the maximum-likelihood (ML) phylogenetic analysis using IQ-TREE2 v. 2.1.3 (Nguyen et al. 2015) with branch support estimated using 1,000 replicates of the ultrafast bootstrapping algorithm (UFboot) (Minh et al. 2013), as well as the Shimodaira-Hasegawa-like approximate likelihood-ratio test (SHaLRT, Guindon et al. 2010). The phylogenetic trees were visualized in FigTree v.1.4.4 (Rambaut 2013).

Results

The chloroplast genome of X. weimingii was 158,458 bp in size with a 427.16× average depth of coverage (Supplemental Figure S1). It featured a circular and typical quadripartite structure, and contained a large single-copy (LSC, 87,778 bp) region, a small single-copy (SSC, 18,516 bp) region, and two inverted repeat (IR, 26,082 bp each) regions (Figure 2). The percentage of GC in the whole genome was 36.9%, and in LSC, SSC, and IR regions were  34.7%, 30.2%, and 42.9%, respectively. The chloroplast genome of X. weimingii had a total of 114 distinct genes, including 75 protein-coding genes (PCGs) and 30 tRNA genes, 4 rRNA genes and 5 open reading frames (ORFs) genes (Table 1). There were 15 genes, including 9 PCGs and 6 tRNA genes, with two exons and 3 genes (ycf3, clpP, and rps12) with three exons. Of the PCGs, 11 were cis-spliced genes, 9 of which contained one intron, and 2 contained two introns (Supplemental Figure S2). In addition, the location of the three exon regions of the trans-spliced gene rps12 were identified (Supplemental Figure S3).

Figure 2.

Figure 2.

The chloroplast genome map of Xantolis weimingii. Genes inside and outside the circle are transcribed clockwise and counter-clockwise. The different colored boxes in the outermost circle show the genes. The inner circle has a grey area indicating the GC content, while the quadripartite structure (LSC, SSC, IRA, and IRB) is shown on the inner circle accordingly.

Table 1.

List of genes was annotated in the chloroplast genome of Xantolis weimingii.

Category Gene group Gene name Unique genes number
Photosynthesis Subunits of photosystem I psa A, psa B, psaC, psaI, psaJ 5
  Subunits of photosystem II psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ 15
  Subunits of NADH dehydrogenase ndhA*, ndhB* (2), ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK 11
  Subunits of cytochrome b/f complex petA, petB*, petD*, petG, petL, petN 6
  Subunits of ATP synthase atpA, atpB, atpE, atpF*, atpH, atpI 6
  Large subunit of rubisco rbcL 1
Self-replication Proteins of large ribosomal subunit rpl14, rpl16*, rpl2* (2), rpl20, rpl22, rpl23 (2), rpl32, rpl33, rpl36 9
  Proteins of small ribosomal subunit rps11, rps12** (2), rps14, rps15, rps16*, rps18, rps19, rps2, rps3, rps4, rps7 (2), rps8 12
  Subunits of RNA polymerase rpoA, rpoB, rpoC1*, rpoC2 4
  Ribosomal RNAs rrn16 (2), rrn23 (2), rrn4.5 (2), rrn5 (2) 4
  Transfer RNAs trnA-UGC* (2), trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnG-GCC, trnG-UCC*, trnH-GUG, trnI-CAU (2), trnI-GAU* (2), trnK-UUU*, trnL-CAA (2), trnL-UAA*, trnL-UAG, trnM-CAU, trnfM-CAU, trnN-GUU (2), trnP-UGG, trnQ-UUG, trnR-ACG (2), trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-GAC (2), trnV-UAC*, trnW-CCA, trnY-GUA 30
Other genes Maturase matK 1
  Protease clpP** 1
  Envelope membrane protein cemA 1
  Acetyl-CoA carboxylase accD 1
  c-type cytochrome synthesis gene ccsA 1
  Translation initiation factor infA 1
Genes of unknown function Conserved hypothetical chloroplast ORF ycf1 (2)#, ycf2 (2), ycf3**, ycf4, ycf15 (2) 5

Notes. Gene *: Gene with one intron; Gene **: Gene with two introns; Gene #: One pseudogene identified; Gene (2): Number of copies of multi-copy genes.

The phylogenetic relationships of 33 samples were well resolved in this study using the chloroplast genome sequences (Figure 3). All 33 samples of Sapotaceae form a monophyletic clade with strong support (UFboot = 100%, SH-aLRT = 100%). The phylogenomic tree recovered three well-supported subfamily clades: the Sarcospermatoideae clade, the Sapotoideae clade, and the Chrysophylloideae clade. Xantolis weimingii occupied a basal position within Chrysophylloideae with strong support (UFboot = 100%, SH-aLRT = 100%).

Figure 3.

Figure 3.

Molecular phylogenetic tree of 33 samples of Sapotaceae based on plastome sequences with maximum-likelihood analysis. Ultrafast bootstrap and Shimodaira-Hasegawa-like test values are marked near the nodes (UFboot/SH-aLRT). Nodes with 100 bootstrap percentages and 100 Shimodaira-Hasegawa-like test values are not marked on the tree. Xantolis weimingii is highlighted by using red colored text. Diospyros morrisiana (GenBank: OP485441) was used as outgroup. The following sequences were used: Aningeria altissima (GenBank: MZ274143, Mascarello et al. 2021), Argania spinosa (GenBank: MK533159), Autranella congolensis (GenBank: MZ274145, Mascarello et al. 2021), Baillonella toxisperma (GenBank: MZ274146, Mascarello et al. 2021), Chrysophyllum cainito (GenBank: MT435527, Zheng et al. 2020), Chrysophyllum roxburghii (GenBank: MW801247), Gambeya africana (GenBank: MZ274138, Mascarello et al. 2021), Gambeya gigantea (GenBank: MZ274139, Mascarello et al. 2021), Gambeya lacourtiana (GenBank: MZ274140, Mascarello et al. 2021), Gambeya subnuda (GenBank: MZ274141, Mascarello et al. 2021), Madhuca hainanensis (GenBank: MT909828), Manilkara zapota (GenBank: MK790101, MN295595, Li et al. 2019; Liu et al. 2019), Manilkara kauki (GenBank: PV010369), Mimusops coriacea (GenBank: MW846242), Mimusops elengi (GenBank: OK458682), Planchonella grandifolia (GenBank: ON881598), Pouteria adolfi-friedericii (GenBank: MZ274142, Mascarello et al. 2021), Pouteria aningeri (GenBank: MZ274144, Mascarello et al. 2021), Pouteria caimito (GenBank: MT712131, MN065160, Yang et al. 2019), Pouteria campechiana (GenBank: KX426215, Jo et al. 2016, MH018545, Niu et al. 2018), Pouteria viridis (GenBank: OP650217), Sarcosperma laurinum (GenBank: MW801278), Sarcosperma pedunculatum (GenBank: ON881457), Sideroxylon wightianum (GenBank: ON881480, MG719834), Synsepalum dulcificum (GenBank: MT723946, Niu et al. 2020), Tieghemella africana (GenBank: MZ274147, Mascarello et al. 2021), Tieghemella heckelii (GenBank: MZ274148, Mascarello et al. 2021), Vitellaria paradoxa (GenBank: MK953548, Wang et al. 2019), Xantolis weimingii (GenBank: PV548271, in this study).

Discussion and conclusion

In this study, the chloroplast genome of X. weimingii was assembled and annotated, representing the first genomic resource not only for this rare species but also for the genus Xantolis. The chloroplast genome exhibits the typical quadripartite structure, with its length, GC content, gene composition and count showing highly similar to those of previously reported Sapotaceae species (Niu et al. 2020; Zheng et al. 2020), reflecting strong evolutionary conservation of plastid genomes in this family.

The phylogenetic results indicated that X. weimingii is the basal clade of Chrysophylloideae, and Xantolis represents an early-diverging lineage within the subfamily.  This finding can provide critical genomic evidence for resolving the ambiguous phylogenetic position of the genus. It was consistent with previous phylogenetic studies based on molecular markers (Anderberg and Swenson 2003; Swenson and Anderberg 2005; Swenson et al. 2008), but with greatly increased resolution. Overall, these results will provide effective molecular information for further studies on germplasm evaluation and molecular phylogeny of Xantolis.

Supplementary Material

Figure S1. Clean reads coverage depth map of Xantolis weimingii.tif

Acknowledgements

We thank Dr. Huanchong Wang for providing the picture of Xantolis weimingii for this study.

Funding Statement

The author(s) reported there is no funding associated with the work featured in this article.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Ethical approval

Given that Xantolis weimingii has been assessed as Critically Endangered (CR) in accordance with the IUCN Red List Guidelines, we comply with the International Union for Conservation of Nature (IUCN) policies research involving species at risk of extinction, the Convention on Biological Diversity, the Convention on the Trade in Endangered Species of Wild Fauna and Flora, and the Regulations of the People’s Republic of China on Wild Plants Protection in the study.

Data availability statement

The genome sequence data supporting this study’s findings are openly available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov/, reference number PV548271. The associated BioProject, SRA, and Bio-Sample numbers are PRJNA1254621, SRR33283946 and SAMN48116441, respectively.

References

  1. Anderberg AA, Swenson U.. 2003. Evolutionary lineages in Sapotaceae (Ericales): a cladistic analysis based on ndhF sequence data. Int J Plant Sci. 164(5):763–773. 10.1086/376818 [DOI] [Google Scholar]
  2. Bartish IV, Antonelli A, Richardson JE, Swenson U.. 2011. Vicariance or long-distance dispersal: historical biogeography of the pantropical subfamily Chrysophylloideae (Sapotaceae). Journal of Biogeography. 38(1):177–190. 10.1111/j.1365-2699.2010.02389.x [DOI] [Google Scholar]
  3. Bolger AM, Lohse M, Usadel B.. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30(15):2114–2120. 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T.. 2009. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 25(15):1972–1973. 10.1093/bioinformatics/btp348 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Guindon S, et al. 2010. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 59(3):307–321. 10.1093/sysbio/syq010 [DOI] [PubMed] [Google Scholar]
  6. Jin JJ, et al. 2020. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21(1):241. 10.1186/s13059-020-02154-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Jo SJ, Kim HW, Kim YK, Cheon SH, Kim KJ.. 2016. The first complete plastome sequence from the family Sapotaceae, Pouteria campechiana (Kunth) Baehni. Mitochondrial DNA B Resour. 1(1):734–736. 10.1080/23802359.2016.1233469 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Katoh K, Standley DM.. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30(4):772–780. 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Li SG. 1987. Xantolis Raf. In: Li SG, editor. Flora Reipublicae Popularis Sinicae. Vol. 60(1). Science Press. p. 64–67. [Google Scholar]
  10. Li SG, Pennington TD.. 1996. Sapotaceae A. L. Jussieu. In: Wu ZY, Raven PH, Hong DY, editors. Flora of China. Vol. 15. Science Press. p. 205–214. [Google Scholar]
  11. Li SX, Zhang XF, Wang HX, Zhu ZX, Wang HF.. 2019. Complete plastome sequence of Manilkara zapota (L.) P. Royen (Sapotaceae). Mitochondrial DNA B Resour. 4(2):3114–3115. 10.1080/23802359.2019.1667915 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Liu J, et al. 2019. The complete chloroplast genome sequence of Manilkara zapota (Linn.) van Royen. Mitochondrial DNA B Resour. 4(2):2127–2128. 10.1080/23802359.2019.1623122 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Liu S, et al. 2023. CPGView: a package for visualizing detailed chloroplast genome structures. Mol Ecol Resour. 23(3):694–704. 10.1111/1755-0998.13729 [DOI] [PubMed] [Google Scholar]
  14. Mascarello M, et al. 2021. Genome skimming reveals novel plastid markers for the molecular identification of illegally logged African timber species. PLoS One. 16(6):e0251655. 10.1371/journal.pone.0251655 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Minh BQ, Nguyen MA, von Haeseler A.. 2013. Ultrafast approximation for phylogenetic bootstrap. Mol Biol Evol. 30(5):1188–1195. 10.1093/molbev/mst024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ.. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 32(1):268–274. 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Niu YF, et al. 2018. The complete chloroplast genome of tropical and sub-tropical fruit tree Lucuma nervosa (Sapotaceae). Mitochondrial DNA B Resour. 3(1):440–441. 10.1080/23802359.2018.1457995 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Niu YF, Ni SB, Liu J.. 2020. Complete chloroplast genome of Synsepalum dulcificum D.: a magical plant that modifies sour flavors to sweet. Mitochondrial DNA B Resour. 5(3):3070–3071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Rambaut A. 2013. FigTree v1.4. Molecular evolution, phylogenetics and epidemiology. University of Edinburgh, Institute of Evolutionary Biology. [Google Scholar]
  20. Shi L, et al. 2019. CPGAVAS2, an Integrated Plastome Sequence Annotator and Analyzer. Nucleic Acids Res. 47(W1):W65–W73. 10.1093/nar/gkz345 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Swenson U, Anderberg AA.. 2005. Phylogeny, character evolution, and classification of Sapotaceae (Ericales). Cladistics. 21(2):101–130. 10.1111/j.1096-0031.2005.00056.x [DOI] [PubMed] [Google Scholar]
  22. Swenson U, et al. 2023. Reassessment of generic boundaries in Neotropical Chrysophylloideae (Sapotaceae): eleven reinstated genera and narrowed circumscriptions of Chrysophyllum and Pouteria. Taxon. 72(2):307–359. 10.1002/tax.12894 [DOI] [Google Scholar]
  23. Swenson U, Richardson JE, Bartish IV.. 2008. Multi-gene phylogeny of the pantropical subfamily Chrysophylloideae (Sapotaceae): evidence of generic polyphyly and extensive morphological homoplasy. Cladistics. 24(6):1006–1031. 10.1111/j.1096-0031.2008.00235.x [DOI] [PubMed] [Google Scholar]
  24. Tillich M, et al. 2017. GeSeq-versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 45(W1):W6–W11. 10.1093/nar/gkx391 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. van Royen P. 1957. Revision of the Sapotaceae of the Malaysian area in a wider sense. VI. Xantolis Rafifinesque. Blumea. 8:207–234. [Google Scholar]
  26. Wang Y, Yuan X, Chen Z, Luo T.. 2019. The complete chloroplast genome sequence of Vitellaria paradoxa. Mitochondrial DNA B Resour. 4(2):2403–2404. 10.1080/23802359.2019.1636727 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Yang D, Qiu Q, Xu L, Xu Y, Wang Y.. 2019. The complete chloroplast genome of Pouteria caimito. Mitochondrial DNA B Resour. 4(2):2824–2825. 10.1080/23802359.2019.1660249 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Yang F, et al. 2024. Xantolis weimingii (Sapotaceae), a new species from the Yuanjiang River basin, Yunnan, southwest China. PhytoKeys. 246:251–263. 10.3897/phytokeys.246.119516 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Zheng C, Liu ZY, Liu J.. 2020. The complete chloroplast genome sequence of Chrysophyllum cainito, a semidomesticated species. Mitochondrial DNA B Resour. 5(3):2199–2200. 10.1080/23802359.2020.1768964 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1. Clean reads coverage depth map of Xantolis weimingii.tif

Data Availability Statement

The genome sequence data supporting this study’s findings are openly available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov/, reference number PV548271. The associated BioProject, SRA, and Bio-Sample numbers are PRJNA1254621, SRR33283946 and SAMN48116441, respectively.


Articles from Mitochondrial DNA. Part B, Resources are provided here courtesy of Taylor & Francis

RESOURCES