Abstract
Paris polyphylla is a medicinal plant commonly used in southwest of China. In this study, we sequenced the complete chloroplast (cp) genome sequence of P. polyphylla to investigate its phylogenetic relationship in the genus Paris. The chloroplast genome of P. polyphylla was 163,533 bp in length with 37.1% overall GC content, including a large single copy (LSC) region of 84,272 bp, a small single copy (SSC) region of 12,899 bp and a pair of inverted repeats (IRs) of 33,181 bp. The cp genome contained 114 genes, including 79 protein coding genes, 30 tRNA genes, and 4 rRNA genes. The phylogenetic analysis indicated P. polyphylla was closely related to P. marmorata.
Keywords: Paris polyphylla, chloroplast, Illumina sequencing, phylogeny
Paris is a very complex genus of the Melanthiaceae family, which comprises approximately 24 species in the world. Most of them are widespread in Europe and East Asia (Li 1998; Ji et al. 2006). There are 19 species in China (Ji et al. 2006). The roots of plants within this genus have been widely used in traditional Chinese medicine for thousands of years (Jiangsu New Medical College 1977). Among these species, P. polyphylla is a unique species widely distributed in southwest China. The roots of this species have been frequently used as the “Paridis Rhizoma” (Namely “Chong-Lou”) for local medicine in these regions, owing to its analgesic, hemostatic, anti-tumor, and anti-inflammatory activities (Jiangsu New Medical College 1977). However, up to now for such medicinal plant, many studies have mainly explored its genetic diversity (ISSR and RAPD) (Zhang et al. 2004; He et al. 2007), the genome information of P. polyphylla is too little published in GenBank, so that insufficient comprehensive genomic resource is conducted for it. At present, we report the chloroplast genome sequence of P. polyphylla and find its internal relationships within the genus Paris, which can provide basic data for further research of genus Paris species useful genome resource in China.
Fresh and clean leave materials of P. polyphylla were collected from Yunlong county, Yunnan, China (N25°53′40.48″, E99°17′16.92″), and the plant materials and a voucher specimen (No. TAE03) were deposited at Tourism and Culture College of Yunnan University (Lijiang). Total genomic DNA was extracted using the improved CTAB method (Doyle 1987; Yang et al. 2014), and sequenced with Illumina Hiseq 2500 (Novogene, Tianjin, China) platform with pair-end (2 × 300 bp) library. The raw data was filtered using Trimmomatic v.0.32 with default settings (Bolger et al. 2014). Then paired-end reads of clean data were assembled into circular contigs using GetOrganelle.py (Jin et al. 2018) with Paris rugosa (No. NC_038170) as reference. Finally, the cpDNA was annotated by the Dual Organellar Genome Annotator (DOGMA; http://dogma.ccbb.utexas.edu/) (Wyman et al. 2004) and tRNAscan-SE (Lowe and Chan 2016) with manual adjustment using Geneious v. 7.1.3 (Kearse et al. 2012).
The circular genome map was generated with OGDRAW v.1.3.1 (Greiner et al. 2019). Then the annotated chloroplast genome was submitted to the GenBank under the accession number MN518849. The total length of the chloroplast genome was 163, 533 bp, with 37.1% overall GC content. With typical quadripartite structure, a pair of inverted repeats (IRs) of 33,181 bp was separated by a small single copy (SSC) region of 12,899 bp and a large single copy (LSC) region of 84,272 bp. The cp genome contained 114 genes, including 79 protein coding genes, 30 tRNA genes, and 4 rRNA genes. Of these, 22 genes were duplicated in the inverted repeat regions, 8 protein-coding genes, and 6 tRNA genes contain one intron, while three genes (ycf3, rps12 and clpP) have two introns.
To investigate its taxonomic status, a total of 14 cp genome sequences of the genus Paris species were downloaded from the NCBI database used for phylogenetic analysis. After using MAFFT V.7.149 for aligning (Katoh and Standley 2013), jModelTest v.2.1.7 (Darriba et al. 2012) was used to determine the best-fitting model (GTR + G) for the chloroplast genomes. Then Maximum likelihood (ML) analysis was performed by RaxML v.8.2.4 (Stamatakis 2014) with 1000 bootstrap replicates, and two Liliaceae species (Fritillaria cirrhosa: KF769143 and Lilium bakerianum: NC_035592) were used as outgroups. The results showed that P. polyphylla was closely related to P. marmorata (Figure 1). Meanwhile, the phylogenetic relationship in the genus Paris was consistent with previous studies and this will be useful data for developing markers for further studies.
Figure 1.
Maximum likelihood (ML) tree of 14 species within the genus Paris based on the plastomes using two Liliaceae species as outgroups.
Disclosure statement
The authors are highly grateful to the published genome data in the public database. The authors declare no conflicts of interest and are responsible for the content.
References
- Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30(15):2114–2120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Darriba D, Taboada GL, Doallo R, Posada D. 2012. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 9(8):772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doyle J. 1987. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 19:11–15. [Google Scholar]
- Greiner S, Lehwark P, Bock R. 2019. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 47(W1):W59–W64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- He J, Yang BY, Chen SF, Gao LM, Wang H. 2007. Assessment of genetic diversity of Paris polyphylla (Trilliaceae) by ISSR Markers. Acta Botanica Yunnanica. 29(4):388–392. [Google Scholar]
- Ji Y, Fritsch PW, Li H, Xiao T, Zhou Z. 2006. Phylogeny and classification of Paris (Melanthiaceae) inferred from DNA sequence data. Ann Bot. 98(1):245–256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiangsu New Medical College. 1977. Dictionary of Chinese materia medical (in Chinese). Shanghai: Science and Technology Press of Shanghai. [Google Scholar]
- Jin JJ, Yu WB, Yang JB, Song Y, Yi TS, Li DZ. 2018. GetOrganelle: a simple and fast pipeline for de novo assembly of a complete circular chloroplast genome using genome skimming data. bioRxiv. 1–11. [Google Scholar]
- Katoh K, Standley D. 2013. MAFFT multiple sequence alignment software version improvements in performance and usability. Mol Biol Evol. 30(4):772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, et al. 2012. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 28(12):1647–1649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H. 1998. The phylogeny of the genus Paris L In Li H, editor. The genus Paris L (In Chinese). Beijing: Science Press; p. 8–65. [Google Scholar]
- Lowe TM, Chan PP. 2016. tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 44(W1):W54–W57. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 30(9):1312–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wyman SK, Jansen RK, Boore JL. 2004. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 20(17):3252–3255. [DOI] [PubMed] [Google Scholar]
- Yang JB, Li DZ, Li HT. 2014. Highly effective sequencing whole chloroplast genomes of angiosperms by nine novel universal primer pairs. Mol Ecol Resour. 14:1024–1031. [DOI] [PubMed] [Google Scholar]
- Zhang JY, Yu H, Zhang SG, Ding CC. 2004. RAPD variation within and among four populations of Paris polyphylla. Biodivers Sci. 12(5):517–522. [Google Scholar]