Skip to main content
Mitochondrial DNA. Part B, Resources logoLink to Mitochondrial DNA. Part B, Resources
. 2019 Jul 12;4(2):2403–2404. doi: 10.1080/23802359.2019.1636727

The complete chloroplast genome sequence of Vitellaria paradoxa

Yi Wang 1,, Xiaolong Yuan 1, Zhonghua Chen 1, Ting Luo 1
PMCID: PMC7687629  PMID: 33365563

Abstract

Vitellaria paradoxa is an important economical tree in Africa. In this study, the complete chloroplast genome (cpDNA) sequence of V. paradoxa was determined from Illumina HiSeq pair-end sequencing data. The cpDNA is 159,725 bp in length, contains a large single copy region (LSC) of 88,720 bp and a small single copy region (SSC) of 19,338 bp, which were separated by a pair of inverted repeat (IR) regions of 25,981 bp. The genome contains 130 genes, including 85 protein-coding genes, 8 ribosomal RNA genes, and 38 transfer RNA genes. The overall GC content of the whole genome is 36.7% and the corresponding values of the LSC, SSC, and IR regions are 34.6, 30.0, and 42.9%, respectively. Further phylogenomic analysis showed that V. paradoxa clustered together with Sideroxylon wightianum.

Keywords: Vitellaria paradoxa, chloroplast, Illumina sequencing, phylogenetic analysis


Vitellaria paradoxa is an important economical tree in Africa (Maranz et al. 2004). Vitellaria paradoxa yields a high-stearate fat, called Shea butter, that is extracted from the large seeds. Shea butter can be used as a valuable product in the food and cosmetic industries (Wei et al. 2019). Vitellaria paradoxa was introduced into China from Ghana and planted in Yuanjiang, Yunnan province, in 1964 (Yang and Wang 2018). The Shea nuts and butter of V. paradoxa are important export commodities and play a major role in the local economy and diet, the fruit pulp is also widely consumed (Maranz et al. 2003). Recently, the researcher reported that the leaf and bark of V. paradoxa showed antimicrobial activities (Olasunkanmi et al. 2017). However, there have been no genomic studies on V. paradoxa.

Herein, we reported and characterized the complete V. paradoxa plastid genome (MK953548). One V. paradoxa individual (specimen number: 2018040131) was collected from Yuanjiang, Yunnan Province of China (23°19′23″ N, 101°30′19″ E). The specimen is stored at Yunnan Academy of Forestry Herbarium. DNA was extracted from its fresh leaves using DNA Plantzol Reagent (Invitrogen, Carlsbad, CA, USA).

Paired-end reads were sequenced by using Illumina HiSeq system (Illumina, San Diego, CA, USA). In total, about 29.9 million high-quality clean reads were generated with adaptors trimmed. Aligning, assembly, and annotation were conducted by CLC de novo assembler (CLC Bio, Aarhus, Denmark), BLAST, GeSeq (Tillich et al. 2017), and GENEIOUS v 11.0.5 (Biomatters Ltd, Auckland, New Zealand). To confirm the phylogenetic position of V. paradoxa, other nine species of order Ericales from NCBI were aligned using MAFFT v.7 (Katoh and Standley 2013) and maximum likelihood (ML) bootstrap analysis was conducted using RAxML (Stamatakis 2006); bootstrap probability values were calculated from 1000 replicates. Cornus capitata (MG524990) and Alangium alpinum (MG525003) were served as the out-group.

The complete V. paradoxa plastid genome is a circular DNA molecule with the length of 159,725 bp, contains a large single copy region (LSC) of 88,720 bp and a small single copy region (SSC) of 19,338 bp, which were separated by a pair of inverted repeat (IR) regions of 25,981 bp. The overall GC content of the whole genome is 36.7% and the corresponding values of the LSC, SSC, and IR regions are 34.6, 30.0, and 42.9%, respectively. The genome contains 130 genes, including 85 protein-coding genes, 8 ribosomal RNA genes, and 38 transfer RNA genes. Phylogenetic analysis showed that V. paradoxa clustered together with Sideroxylon wightianum (Figure 1). The determination of the complete plastid genome sequences provided new molecular data to illuminate the Ericales evolution.

Figure 1.

Figure 1.

The maximum-likelihood tree based on the 9 chloroplast genomes of order Ericales. The bootstrap value based on 1000 replicates is shown on each node.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  1. Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30:772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Maranz S, Wiesman Z, Garti N. 2003. Phenolic constituents of shea (Vitellaria paradoxa) kernels. J Agric Food Chem. 21:6268–6273. [DOI] [PubMed] [Google Scholar]
  3. Maranz S, Wiesman Z, Bisgaard J, Bianchi G. 2004. Germplasm resources of Vitellaria paradoxa based on variations in fat composition across the species distribution range. AgroFor Syst. 1:71–76. [Google Scholar]
  4. Olasunkanmi OO, Akinpelu DA, Adeniyi PO, Ajayi OF, Aso JO, Olorunmola FO. 2017. Investigations into antibacterial, phytochemical and antioxidant properties of Vitellaria paradoxa (Gaertn.) Stem Bark Extracts. J Pharm Res Int. 2:1–17. [Google Scholar]
  5. Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 22:2688–2690. [DOI] [PubMed] [Google Scholar]
  6. Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, Greiner S. 2017. GeSeq- versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 45:W6–W11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Wei YJ, Ji BY, Siewers V, Xu DY, Halkier BA, Nielsen J. 2019. Identification of genes involved in Shea butter biosynthesis from Vitellaria paradoxa fruits through transcriptomics and functional heterologous expression. Appl Microbiol Biotechnol. 9:1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Yang CR, Wang YJ. 2018. Investigation report of Shea tree cultivating in Yunnan. Chinese Wild Plant Res. 37:68–72. [Google Scholar]

Articles from Mitochondrial DNA. Part B, Resources are provided here courtesy of Taylor & Francis

RESOURCES