Abstract
Nekemias cantoniensis (Hook. et Arn.) Planch 1873 is a woody vine species native to South and Southwest China that is rich in flavonoids and also displays excellent pharmacological activities. The purpose of this study was to characterize the complete chloroplast (cp) genome of N. cantoniensis using Illumina pair-end sequencing data. In summary, the complete cp genome of N. cantoniensis exhibits a quadripartite structure with a length of 162,655 base pairs, including a large single-copy (LSC) region of 89,341 base pairs, a small single-copy (SSC) region of 19,076 base pairs, and two inverted repeats (IRs) regions of 27,119 base pairs. The overall GC content of the genome is 37.41%, while the corresponding values for the LSC, SSC, and IR regions are 34.75%, 32.89%, and 43.02%, respectively. The genome contains 137 genes, of which 87 are protein coding, 36 are tRNA coding, and eight are rRNA coding. Maximum-likelihood phylogenetic analyses revealed that N. cantoniensis was clustered with N. grossedentata.
Keywords: Chloroplast genome, small single-copy, phylogenetic relationship, maximum likelihood
Species of Nekemias cantoniensis (Hook. et Arn.) Planch 1873 are woody vines native to South and Southwest China, such as Guangdong, Guangxi, Yunnan, Hainan, and Hong Kong (Editorial Committee of Flora of China 2007). The stem tips and tender leaves of N. cantoniensis can be used to make Tengcha tea, which has a health benefit due to its flavonoid content (Gui et al. 2015; Li et al. 2021). For hundreds of years, it has been used to treat colds, fevers, sore throats, gangrenous hepatitis, and boils (Wu et al. 2014). In recent years, pharmacological research has shown that N. cantoniensis and its extract have a wide range of pharmacological effects, including antioxidant, antibacterial, liver protection, blood lipid reduction, blood glucose reduction, anti-inflammatory, analgesic, and anti-tumor effects (Gao et al. 2017). The basis for these pharmacological effects is believed to be primarily the presence of secondary metabolites in N. cantoniensis, primarily flavonoids, such as dihydromyricetin, myricetin, myricetin, etc. (Chen et al. 2019). In this study, high-throughput sequencing technology was used to sequence, assemble and annotate the complete chloroplast (cp) genome of N. cantoniensis. The structural characteristics of N. cantoniensis cp genome were statistically analyzed, and the genetic relationship between N. cantoniensis and some genera of Vitaceae was accurately located, providing a theoretical basis for the conservation and sustainable utilization of N. cantoniensis resources.
The fresh leaves of N. cantoniensis were collected from Guangzhou, Guangdong, China (113°45′E, 23°40′N, altitude: 139 m) (Figure 1(A,B)). The scientific committee of the Guangdong Academy of Agricultural Sciences approved the procedure for plant collection. A specimen was deposited at the Guangdong of Agro-biological Gene Research Center (http://multi-omics.agrogene.ac.cn/, contact person: Yongjian Luo, and email:851022933@qq.com) under the voucher number 20210303001. Total genomic DNA was extracted from fresh young leaves of N. cantoniensis using a modified CTAB method and quantified according to Allen et al. (2006). We constructed paired-end libraries with insert sizes of 150 bp (our study protocol was approved by the ethics review board of Hubei Minzu University, exemption number 2022031055). Beijing Microread Inc. (Beijing, China) sequenced the complete genome of N. cantoniensis using the Illumina HiSeq 2500 platform, resulting in 3.43 Gb of raw reads. SRR16608263 is the project accession number for the raw sequence data deposited in NCBI SRA. According to Bolger et al. (2014), the raw data were then filtered using Trimmomatic Version 0.38 with default settings. GetOrganelle was used to assemble the complete genome of cp according to Jin et al. (2020) and annotated by CPGAVAS2 (http://www.herbalgenomics.org/cpgavas2) according to Shi et al. (2019). A genome annotation for each of the cp species was submitted to GenBank (OK662571) after being reviewed and adjusted manually.
The whole cp genome of N. cantoniensis was 162,655 bp in length and demonstrated a typical angiosperm circular cp structure, with a large single-copy (LSC) region of 89,341 bp, a small single-copy (SSC) region of 19,076 bp, and a pair of inverted repeat (IR) regions of 54,238 bp (Figure 2). The total GC content of the genome of N. cantoniensis cp is 37.41% (LSC, 34.75%; SSC, 32.89%; IR, 43.02%). GC content and gene order were comparable to those of Nekemias grossedentata (Gu et al. 2020). There were 131 genes in the cp genome, including 87 protein-coding genes, 36 tRNA genes, and eight rRNA genes. There were introns in 12 protein-coding genes (rps16, rps12, rpoC1, rpl2, rpl16, petD, petB, pafI, ndhB, ndhA, clpP1, and atpF) and six tRNA genes (trnK-UUU, trnR-UCC, trnL-UAA, trnV-UAC, trnI-GAU, and trnA-UGC). Sixty-eight SSRs were identified, 61 of which were mono-nucleotides (A/C/T, 88.52%), eight were di-nucleotides (AT/TA, 9.84%), and one was a tetra-nucleotide repeat (TTAA, 1.64%) (Figure 3).
To confirm the phylogenetic position of N. cantoniensis within the Vitaceae family, we generated a maximum-likelihood (ML) phylogenetic tree using 17 cp genomes from the GenBank collection. Using MAFFT V7.309, several sequence alignments were done (Katoh and Standley 2013). A ML phylogenetic tree was generated based on a data matrix of a concatenation of 76 protein-coding sequences, implemented with RAxML v8 (Nguyen et al. 2015).The results confirmed that N. cantoniensis was clustered with N. grossedentata. This study extends our comprehension of cp genome evolution in Nekemias.
Author contributions
In this research, Yongjian Luo is the experimental designer and executor. He has completed the data analysis and the first draft of the paper. Ru Wang and Qing Li have contributed to the experimental design and the analysis of experimental results. Jun Liu and Zhijun Deng have been responsible for supervising the experimental design, data analysis, and the writing and revision of the paper. The final version of the manuscript was read and approved by all authors.
Funding Statement
The research in this paper was supported by the National Natural Science Foundation of China [31860073, 81303169, 31871716, 31371715], the Open Fund of Hubei Key Laboratory of Biologic Resources Protection and Utilization [PT012212], and the Science and Technology Program of Guangdong Province, China [2020B121201008, 2020B020209003, 2018B020202004]. The research in this paper was also supported by the Agricultural Science and Technology Cooperation Project of Foshan; Science and Technology Program of Guangzhou.
Disclosure statement
All authors declare no potential conflicts of interest.
Data availability statement
In support of the findings of this study, the genome sequence data are openly available in GenBank of the NCBI at https://www.ncbi.nlm.nih.gov/ under accession no. OK662571. Specifically, the associated BioProject, SRA, and Bio-Sample numbers are PRJNA775800, SRR16608263, and SAMN22627476, respectively.
References
- Allen GC, Flores-Vergara MA, Krasynanski S, Kumar S, Thompson WF.. 2006. A modified protocol for rapid DNA isolation from plant tissues using cetyltrimethylammonium bromide. Nat Protoc. 1(5):2320–2325. [DOI] [PubMed] [Google Scholar]
- Bolger AM, Lohse M, Usadel B.. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30(15):2114–2120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen ML, Chen J, Li G.. 2019. Experimental study on anti-hepatitis B virus activity of petroleum ether extraction of Ampelopsis cantoniensis (Hook.et.Arn.) Pl. in vitro. Hubei J TCM OCT. 14(10):3–5. [Google Scholar]
- Editorial Committee of Flora of China. 2007. Flora of China (Vol. 12). Beijing: Science Press. [Google Scholar]
- Gao Q, Ma R, Chen L, Shi S, Cai P, Zhang S, Xiang H.. 2017. Antioxidant profiling of vine tea (Ampelopsis grossedentata): off-line coupling heart-cutting HSCCC with HPLC-DAD-QTOF-MS/MS. Food Chem. 225:55–61. [DOI] [PubMed] [Google Scholar]
- Gu L, Zhang N, Feng C, Yi Y, Yu ZW.. 2020. The complete chloroplast genome of Ampelopsis grossedentata (Hand.-Mazz.) W. T. Wang (Family: Vitaceae) and its phylogenetic analysis. Mitochondrial DNA B Resour. 5(3):2423–2424. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gui C, Huang J, Cheng PP, Fang Y, Xia Y, Zhang XQ. 2015. Screen of anti-tumor active substances in vitro from Ampelopsis megalophylla. Chinese Tradit Patent Med. 37(7):1411–1416. (in Chinese) [Google Scholar]
- Jin JJ, Yu WB, Yang JB, Song Y, dePamphilis CW, Yi TS, Li DZ.. 2020. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21(1):241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM.. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30(4):772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li YH, Pu TZ, Zhou PN, Fang JH, Zhang XQ, Gong L. 2021. Cloning and bioinformatics analysis of flavonoid-3’-hydroxylase (F3’H) gene in Ampelopsis megalophylla. Mol Plant Breed. 19(18):5984–5993. (in Chinese) [Google Scholar]
- Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ.. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 32(1):268–274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shi L, Chen H, Jiang M, Wang L, Wu X, Huang L, Liu C.. 2019. CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res. 47(W1):W65–W73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu XX, Huang RM, Xu ZF, Qiu SX.. 2014. Chemical constituents from vine stems of Ampelopsis cantoniensis. Nat Prod Res Dev. 26(11):1771–1774. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
In support of the findings of this study, the genome sequence data are openly available in GenBank of the NCBI at https://www.ncbi.nlm.nih.gov/ under accession no. OK662571. Specifically, the associated BioProject, SRA, and Bio-Sample numbers are PRJNA775800, SRR16608263, and SAMN22627476, respectively.