Abstract
Gentiana apiata N. E. Brown (Gentianaceae) is a perennial herb plant and only grows in Qinba Mountains in China. Here, we first characterized the complete nucleotide sequence of chloroplast (cp) genome of G. apiata via Illumina next generation sequencing platform. The complete chloroplast genome of G. apiata was 144,274 bp in length, comprising of a large single copy (LSC) region of 77,353 bp, a small single copy (SSC) region of 17,009 bp, and two inverted repeat regions (IRs) of 24,956 bp. The cp genome contains 127 genes, including 82 protein-coding genes, 35 tRNA, eight rRNA genes, and two pseudogenes. Phylogenetic analysis based on 18 cp genome sequences showed that G. apiata closely related to congeneric species.
Keywords: Gentiana apiata, chloroplast genome, phylogenetic tree
Gentiana apiata N. E. Brown (Gentianaceae) is a perennial medicinal herb that only grows in Qinba Mountains in China. It is commonly known as Qinlinglongdan or Zhulingcao in Chinese medicine, and the whole plant has been used as Chinese folk medicine for the treatment of inappetence, hepatic injury disease, and gynaecopathia disease (Zhou et al. 2004). However, little research on this species has been found. In plant, chloroplast (cp) DNA provided valuable phylogenetic information, owning to its conserved genome structures and comparatively high substitution rates (Wu and Ge 2012). In this study, we characterized the complete nucleotide sequence of chloroplast genome of G. apiata based on the Illumina next-generation sequencing technology.
Fresh samples of G. apiata were collected from Qingling Mountains in China (N34.0054°, E107.8123°). The genomic DNA was isolated from samples using the modified CTAB method (Doyle and Doyle 1987). DNA sample and voucher specimen (No. GALZH2015722) of G. apiata were deposited in the Northwest University Museum (NUM). We conducted the next generation sequencing in Novogene Bioinformatics Technology Co., Ltd (Beijing, China). The high-quality reads were obtained after pair-read sequencing. Then, raw reads trimmed and assembled with the program MITObim v 1.8 (Hahn et al. 2013) using Anemone trullifolia (NC_039456) as reference. The genome annotation was performed using the online program Dual Organellar Genome Annotator (DOGMA, Wyman et al. 2004) (http://ogdraw.mpimp-golm.mpg.de/). Eventually, the complete and annotated cp genome sequence of G. apiata has been submitted to GenBank (Accession number: MN049985). Finally, we drew the circular plastid genome maps using the program OGDRAW (Lohse et al. 2013).
The cp genome of G. apiata is 144,274 bp size, which consist of a large single copy (LSC) region of 77,353 bp, a small single copy (SSC) region of 17,009 bp and two inverted repeat regions (IRs) of 24,956 bp. In total, this cp genome of G. apiata contains 127 genes, including 82 protein-coding genes, 35 tRNA genes, eight rRNA genes, and two pseudogenes (ycf1 and rps19). Among all the genes, 11 genes (atpF, ndhA, ndhB, petB, rpl2, rpl16, rpoC1, tRNA-Leu, tRNA-Val, tRNA-Ile, tRNA-Ala) contain a single intron and three genes (ycf3, clpP, rps12) contained two introns. The overall GC content of G. apiata plastome is 37.8%, while the corresponding values of LSC, SSC, and IR regions are 35.6%, 31.4%, and 43.3%, respectively.
A total of 18 cpDNA sequences were used to infer phylogenetic position of G. apiata. All cpDNA were aligned using the software MAFFT (Katoh and Standley 2013) with the default parameters. The phylogenetic analysis was conducted using the program MEGA7 (Kumar et al. 2016) with 1000 bootstrap replicates. As showed in the phylogenetic tree (Figure 1), the genus Gentiana formed a monophyletic clade with high bootstrap supported value (100%), suggesting G. apiata was closely related to other 14 species of Gentiana.
Figure 1.
Phylogenetic tree based on 18 complete chloroplast genome sequences.
Disclosure statement
No potential conflict of interest was reported by the authors.
References
- Doyle JJ, Doyle JL. 1987. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 19:11–15. [Google Scholar]
- Hahn C, Bachmann L, Chevreux B. 2013. Reconstructing mitochondrial genomes directly from genomic next-generation sequencing reads-a baiting and iterative mapping approach . Nucleic Acids Res. 41:e129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Boil Evol. 30:772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumar S, Stecher G, Tamura K. 2016. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 33:1870–1874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lohse M, Drechsel O, Kahlau S, Bock R. 2013. Organellar Genome DRAW–a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets. Nucleic Acids Res. 41:W575–W581. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu ZQ, Ge S. 2012. The phylogeny of the BEP clade in grasses revisited: evidence from the whole-genome sequences of chloroplasts. Mol Phylogenet Evol. 62:573–578. [DOI] [PubMed] [Google Scholar]
- Wyman SK, Jansen RK, Boore JL. 2004. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 20:3252–3255. [DOI] [PubMed] [Google Scholar]
- Zhou L, Wang N, Yang WX, Zhao HS, Tian P. 2004. The chemical consitituents of Gentiana apiata N. E. Br. Chinese J Org Chem. 32:89–92. [Google Scholar]

