Abstract
Camellia leyeensis Chang & Y. C. Zhong is a plant belonging to the genus Camellia. To determine its correct taxonomic status and better understand its molecular phylogenetic and genetic diversity, we studied the chloroplast genome of this species. Here, we report and characterize the complete chloroplast (cp) genome of C. leyeensis by using Illumina paired-end sequencing data. The chloroplast genome was determined to be 157,063 bp in length with a GC content of 37.30%. The genome contained 136 genes, including 91 protein-coding genes, 37 tRNA genes, and 8 rRNA genes. The sequence contained a large single-copy region (LSC, 86,661 bp), a small single-copy region (SSC, 18,284 bp), and two inverted repeat sequences (IRs, 26,059 bp, each). The GC content of the IR regions (42.96%) was higher than that of the SSC region (30.6%) and LSC region (35.31%). Phylogenetic analysis suggested that Camellia leyeensis is closely related to C. anlungensis with 96% bootstrap support. This chloroplast genome would be helpful for the phylogeny and conservation of Camellia.
Keywords: Camellia leyeensis, chloroplast genome, phylogenetic analysis
The genus Camellia of the family Theaceae comprises approximately 120 species, most of which have high economic and ornamental value (e.g. Camellia. sinensis (L.) O. Ktze. and Camellia oleifera Abel.). These species are divided into 20 sections based on morphological characteristics and genetic relationships. Approximately 97 species of Camellia are distributed in southern and southwestern China (76 endemic) (Ming and Bartholomew 2007). Camellia leyeensis Chang et Y. C. Zhong was first reported as a new species in 1991 belonging to Sect. Tuberculata, Camellia genus. It is different from species in other sections of Camellia in terms of its morphological features, specifically, having tuberculate capsules. This species, which commonly grows atop mountains with evergreen broad-leaved forests, is mainly distributed in the Ya-an Forest Farm, Leye County, Guangxi, with Pinus massoniana Lamb. and Celtis sinensis Pers and other shrubs as companion species. This taxon is similar to Camellia rhytidocarpa Chang et Liang. However, it can be distinguished from C. rhytidocarpa by its elliptical leaves and small capsules. Current studies on C. leyeensis have focused only on its leaf anatomy and micromorphological characteristics and biogeography (Lu et al. 2008). However, there are few reports of other studies on C. leyeensis. In this study, for the first time, we reported the chloroplast genome sequence of C. leyeensis provided useful genomic resources not only for the exchange of information between different species, but also for its phylogenetic and evolutionary studies.
Total DNA was isolated from fresh leaves of a C. leyeensis individual collected from its type locality, which is Ya-an Forest Farm, Leye County, Guangxi Province (24.85.833 N, 106.29.208 E, alt. 684 m). The specimen and extracted DNA were deposited at the Biodiversity Research Laboratory, Forestry College of Guizhou University (http://fc.gzu.edu.cn) under voucher number LZ20210701 (collected by Zhi Li and Xu Xiao, lizhighz@163.com and 1758257279@qq.com). The guidelines established by the Administration of Affairs Concerning Plant Experimentation state that approval from the Science and Technology Bureau of China and the Department of Wildlife Administration is not necessary when the plant in question are neither rare nor near extinction (first- or second-class state protection level). Therefore, approval was not required for the experiments described in this paper. Total genomic DNA was extracted from fresh leaves by using the modified CTAB procedure of Doyle and Doyle (1987) and sequenced on the Illumina NovaSeq 6000 platform (Illumina, San Diego, CA). Genome sequences were screened and assembled with SPAdes v.3.5.0 (Lapidus A. et al. 2014).
The complete cp genome of C. leyeensis (GenBank accession OK046127) is 157,063 bp in size, with a GC content of 37.30%, similar to the chloroplast genome of most Camellia genus plants (Li et al. 2018; Yayan et al. 2020). The annotated complete cp genome contains 136 genes, including 91 protein-coding genes (PCGs), 37 transfer RNAs (tRNAs), and 8 ribosomal RNAs (rRNAs), accounting for 66.91%, 27.21%, and 5.88% of all annotated genes, respectively. The sequence is composed of a large single-copy region (LSC, 86,661 bp), a small single-copy region (SSC, 18,284 bp), and two inverted repeat sequences (IRs, 26,059 bp, each). The GC content of the IR regions (42.96%) is higher than that of the SSC region (30.6%) and LSC region (35.31%).
To perform a phylogenomic analysis, 29 Theaceae family species plastomes were obtained from GenBank for maximum-likelihood (ML) phylogenetic analysis, and Polyspora hainanensis (Hung T. Chang) C. X. Ye ex B. M. Bartholomew & T. L. Ming, was used as an outgroup. The alignment was performed using MAFFT (Katoh and Standley 2013). The phylogenetic tree was built using RAxML8.1.5 (Stamatakis 2014) with bootstrap values set to 1000. The results showed that C. leyeensis is sister to C. anlungensis with 96% bootstrap support (Figure 1).
Author contributions
Field work, Conceived and designed the experiments: Zhi Li and Xu Xiao; Performed the experiments: Jiangtao Lu and Guoyuan Yang; Analyzed the data and wrote the manuscript: Xu Xiao and Zhi Li.
Funding Statement
This project was supported by grants from the Science and Technology Plan Project of Guizhou Province (Qian Ke He Ji Chu -ZK[2022] general 072), Talent Introduction Research Project of Guizhou University [Grant No. 56 (2019)] and Characteristic Forestry Industry Scientific Research Project of Guizhou (2020-06).
Disclosure statement
The authors declare that there are no conflicts of interest regarding the publication of this article. The authors alone are responsible for the content and writing of the paper.
Data availability statement
The data used to support the findings of this study are available from the corresponding author upon request and in GenBank at [https://www.ncbi.nlm.nih.gov/genbank/] (accession number OK046127). The associated BioProject, SRA, and Bio-Sample numbers are PRJNA765056, SRR16037759, and SAMN21543270, respectively.
References
- Doyle J. 1987. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 19:11–15. [Google Scholar]
- Lu HF, Jiang B, Shen ZG, Shen JB, Peng QF, Cheng CG.. 2008. Comparative leaf anatomy, FTIR discrimination and biogeographical analysis of Camellia section Tuberculata (Theaceae) with a discussion of its taxonomic treatments. Plant Syst Evol. 274(3-4):223–224. [Google Scholar]
- Li W, Shi X, Guo W, Banerjee AK, Zhang Q, Huang Y.. 2018. Characterization of the complete chloroplast genome of Camellia granthamiana (Theaceae), a vulnerable species endemic to China. Mitochondrial DNA B Resour. 3(2):1139–1140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ming TL, Bartholomew B.. 2007. Theaceae. In Wu ZY and PH Raven, editors. Flora of China. Vol. 12. St. Louis: Science Press, Beijing and Missouri Botanical Garden Press. [Google Scholar]
- Stamatakis, A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 30(9):1312–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh, K., & Standley, D. M.. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular biology and evolution, 30(4), 772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yayan Z, Jie X, Gang W, Congjun Y, Yang L, Xiaoyong D.. 2020. Characterization of the complete chloroplast genome of Camellia anlungensis. Mitochondrial DNA Part B. 5(1):873–874. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data used to support the findings of this study are available from the corresponding author upon request and in GenBank at [https://www.ncbi.nlm.nih.gov/genbank/] (accession number OK046127). The associated BioProject, SRA, and Bio-Sample numbers are PRJNA765056, SRR16037759, and SAMN21543270, respectively.