Abstract
The first complete chloroplast genome (cpDNA) sequence of Canarium album was determined from Illumina HiSeq pair-end sequencing data in this study. The cpDNA is 163,347 bp in length, contains a large single-copy (LSC) region of 87,838 bp and a small single-copy (SSC) region of 13,935 bp, which were separated by a pair of inverted repeats (IRs) regions of 30,787 bp. The genome contains 129 genes, including 84 protein-coding genes, 8 ribosomal RNA genes, and 37 transfer RNA genes. The overall guanine-cytosine (GC) content of the whole genome is 37.5%, and the corresponding values of the LSC, SSC, and IR regions are 35.8%, 32.6%, and 41.1%, respectively. Further, phylogenomic analysis showed that C. album clustered together with Boswellia sacra.
Keywords: Canarium album, chloroplast, Illumina sequencing, phylogenetic analysis
Canarium album (Chinese olive) is a fruit tree belonging to genus Canarium with the Burseraceae family, and whose natural distribution in the southeast area of China (Kuo et al. 2015). C. album has more than two thousand years of cultivation history in China (Wei et al. 2019). Canarium album has also been introduced to other Asian tropical and semi-tropical regions (He and Xia 2011). The fruit of C. album has relatively low oil contents, and are generally processed in the food industry to beverages, candy, and confections (Ssonko and Xia 2005). Their dried fruits are also a Chinese traditional medicine used for the treatment of faucitis, stomatitis, hepatitis, and toxicosis (Ding 1999). However, there has been no genomic studies on C. album.
Herein, we reported and characterized the complete C. album plastid genome (MN106250). One C. album individual (specimen number: 201805041) was collected from Puwen, Yunnan Province of China (23°76′18″N, 101°28′37″E). The specimen is stored at Yunnan Academy of Forestry Herbarium, Kunming, China, and the accession number is YAFH0012735. DNA was extracted from its fresh leaves using DNA Plantzol Reagent (Invitrogen, Carlsbad, CA).
Paired-end reads were sequenced by using Illumina HiSeq system (Illumina, San Diego, CA). In total, about 30.1 million high-quality clean reads were generated with adaptors trimmed. Aligning, assembly, and annotation were conducted by CLC de novo assembler (CLC Bio, Aarhus, Denmark), BLAST, GeSeq (Tillich et al. 2017), and GENEIOUS version 11.0.5 (Biomatters Ltd, Auckland, New Zealand). To confirm the phylogenetic position of C. album, other four species of family Burseraceae from NCBI were aligned using MAFFT version 7 (Katoh and Standley 2013) and maximum likelihood (ML) bootstrap analysis was conducted using RAxML (Stamatakis 2006); bootstrap probability values were calculated from 1000 replicates. Pistacia weinmaniifolia (MF630953) was served as the out-group.
The complete C. album plastid genome is a circular DNA molecule with the length of 163,347 bp, contains a large single-copy (LSC) region of 87,838 bp and a small single-copy (SSC) region of 13,935 bp, which were separated by a pair of inverted repeats (IRs) regions of 30,787 bp. The overall guanine-cytosine (GC) content of the whole genome is 37.5%, and the corresponding values of the LSC, SSC, and IR regions are 35.8%, 32.6%, and 41.1%, respectively. The plastid genome contained 129 genes, including 84 protein-coding genes, 8 ribosomal RNA genes, and 37 transfer RNA genes. Phylogenetic analysis showed that C. album clustered together with Boswellia sacra (Figure 1). The determination of the complete plastid genome sequences provided new molecular data to illuminate the Burseraceae evolution.
Disclosure statement
No potential conflict of interest was reported by the authors.
References
- Ding BP. 1999. Pharmacology of qingguo pills on relieving cough. Chin Tradit Pat Med. 21:27–28. [Google Scholar]
- He Z, Xia W. 2011. Microwave-assisted extraction of phenolics from Canarium album L. and identification of the main phenolic compound. Nat Prod Res. 25(2):85–92. [DOI] [PubMed] [Google Scholar]
- Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30(4):772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuo CT, Liu TH, Hsu TH, Lin FY, Chen HY. 2015. Antioxidant and antiglycation properties of different solvent extracts from Chinese olive (Canarium album L.) fruit. Asian Pac. J Trop Med. 8:1013–1021. [DOI] [PubMed] [Google Scholar]
- Ssonko UL, Xia WS. 2005. Processing technology for cloudy Chinese olive juice beverage. Chin J Food Industry. 2:5–7. [Google Scholar]
- Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 22(21):2688–2690. [DOI] [PubMed] [Google Scholar]
- Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, Greiner S. 2017. GeSeq-versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 45(W1):W6–W11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wei XX, Lai RL, Chen J, Pan SL, Wu RJ. 2019. Studies on genetic diversity on inflorescence phenotypic characteristics of Canarium album germplasm resource. J Trop Subtrop Bot. 27:1–10. [Google Scholar]