Abstract
Torilis scabra (Thunb.) DC. is widely distributed in China and Japan and has been introduced to North America. In this study, the complete chloroplast genome sequence of the T. scabra was obtained by de novo assembly using the NGS data. The chloroplast genome of T. scabra was 157,855 bp in length and divided into four distinct regions, such as large single-copy region (85,362 bp), small single-copy region (17,993 bp), and a pair of inverted repeat regions (27,250 bp). The genome annotation predicted a total of 127 genes, including 82 protein-coding genes, 37 tRNA genes, and 8 rRNA genes. Phylogenetic analysis with reported chloroplast genomes showed that T. scabra has a close genetic relationship with Anthriscus cerefolium.
Keywords: Torilis scabra (Thunb.) DC., complete chloroplast genome, Apiaceae
Torilis scabra (Thunb.) DC. is an annual or perennial herb that belongs to the family Apiaceae. It usually grows in mixed forests on mountain slopes or in valleys, roadsides, especially in disturbed areas on hillsides at an altitude of 250–2400 meters. Torilis scabra is widely distributed in China and has reputed medicinal value (She et al. 2005). The fruit shape of Torilis is similar to other genus of the Apiaceae, such as the Sanicula (Chen et al. 2018). Some confusion on identification exists within the Torilis genus with similar species frequently misidentified in herbaria and the literature (Antonio et al. 2014). Thus, it shows that our understanding of this genus is still insufficient and more effective molecular methods are needed to determine the phylogenetic relationships. The chloroplast genome structure is relatively conservative and the base mutation rate is moderate, which has been widely used phylogenetic study for various plant groups (Li et al. 2017). Here, we first report the complete chloroplast genome of T. scabra, which provides more data for the phylogenetic and evolutionary relationships of this genus.
The fresh and healthy leaves of T. scabra were collected from Yongchun county (25°26′12′′N 117°53′27′′E), Fujian Province, China. Voucher specimens were deposited in Hiaoqiao University Herbarium (00004531). Total genomic DNA was extracted by Plant Genomic DNA Kit (Sangon Biotech, Shanghai, China). Paired-end reads were sequenced by using Illumina Hiseq Platform (Illumina, San Diego, CA, USA). Approximately, 10 Gb of paired-end (150 bp) sequence data were randomly extracted from the total sequencing output and used as input for NOVOPlasty (Dierckxsens et al. 2017) to assemble the plastid genome. The plastid genome of Pterygopleurum neurophyllum (GenBank accession number: NC_033345) was used as the seed sequence. The genes in chloroplast genome were predicted using Geneious version 11.0.4 (Kearse et al. 2012) and corrected manually. The phylogenetic tree was constructed by maximum likelihood method with 1000 bootstrap replicates was inferred using MEGA7.0 (Kumar et al. 2016) from alignments created by the MAFFT (Katoh et al. 2002) using plastid genomes of 15 species.
The complete chloroplast genome of T. scabra (GenBank accession number: MN105615) was 157,855 bp in total sequence length, which was separated into four distinct regions such as large single-copy (LSC) region was 85,362 bp, small single-copy (SSC) region was 17,993 bp, and a pair of inverted repeat regions are 27,250 bp in each length. Overall, GC contents of T. scabra chloroplast genomes were 37.40%. The chloroplast genome detected a total of 127 genes including 82 protein-coding genes, 37 tRNA genes, and 8 rRNA genes.
To determine the phylogenetic position of T. scabra, a phylogenetic analysis was carried out among 13 complete chloroplast genomes of Apiaceae from NCBI. The phylogenetic analysis showed that T. scabra is closely related to A. cerefolium and is distinct from S. chinensis (Figure 1). This complete chloroplast genome can provide new evidence for taxonomy and can be further used for population genomic studies, genetic engineering research in Torilis.
Acknowledgments
The authors thank Professor Yan Yu (College of Life Sciences, Sichuan University) and Huang min for help with sequence analysis.
Disclosure statement
The authors declare no conflicts of interest and are responsible for the content.
References
- Antonio D, Stephen JD, Caroline AM, Kristine MA. 2014. North-East, North-Central, Mid-Atlantic United States and Southern Canada: Japanese Hedgeparsley (Torilis japonica)—a new invasive species in the United States. Invas Plant Sci Mana. 7:553–560. [Google Scholar]
- Chen ZX, Downie SR, Wang QZ. 2018. Review on the taxonomy, origin, and evolution of Sanicula (Umbelliferae). Plant Sci J. 36:136–143. [Google Scholar]
- Dierckxsens N, Mardulyn P, Smits G. 2017. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45:e18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Misawa K, Kuma K, Miyata T. 2002. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30:3059–3066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, et al. . 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 28:1647–1649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumar S, Stecher G, Tamura K. 2016. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol Biol Evol. 33:1870. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y, Lü GH, Zhang XN, He XM. 2017. Chloroplast genome structure and variation analysis of Brassicaceae species. Acta Bot Boreal Occident Sin. 37:1090–1101. [Google Scholar]
- She ML, Pu FD, Pan ZH, Watson MF, Cannon JFM, Holmes-Smith I, Kljuykov EV, Phillippe LR, Pimenov MG. 2005. Apiaceae (Umbelliferae) Vol. 14 In: Wu ZY, Raven PH, editors. Flora of China. Beijing: Science Press; St. Louis: Missouri Botanical Garden Press; p. 28. [Google Scholar]