Abstract
Rubus eucalyptus Focke belong to genus Rubus in the family Rosaceae, the fresh fruits of which can be used for medicine and cosmetic. The plastome of R. eucalyptus is 155,672 bp in length, with one large single copy region of 85,277 bp, one small single copy region of 18,864 bp, and two inverted repeat (IR) regions of 25,748 bp. It contains 130 genes, including 86 protein-coding genes, 8 ribosomal RNA, and 37 transfer RNA. Phylogenetic tree shows that R. eucalyptus was at the basal of Rosaceae. The published plastome within Rubus provides significant insight for elucidating the phylogenetic relationship of taxa within tribe Rosaceae.
Keywords: Rubus eucalyptus, Rosaceae, chloroplast genome, phylogenetic tree
Rubus eucalyptus Focke belong to genus Rubus in the family Rosaceae, distributed in Guizhou, Sichuan, Shanxi, Hubei and Yunnan provinces in China (Robertson 1974; Thompson 1995). The fresh Fruits can be used for medicine and cosmetic. Because of the complexity of their inter- and intraspecific morphological variations, species delimitation in genus Rubus is quite difficult (Alice and Campbell 1999). A well-resolved phylogeny based on sufficient molecular markers is essential to understanding the relationship between species, and the efficient utilization and improvement of these wild Rubus species as crop. In this study, we reported the complete chloroplast genome of R. eucalyptus, a wild species widespread in subtropical and tropical China, as a resource for future studies on the taxonomy of Rubus.
Fresh leaves of R. eucalyptus were collected from Xiaozhaizigou National Reserve (Mianyang, Sichuan, China;coordinates: 103°45′E, 31°50′N). Dried and kept in silica gel for DNA extraction, and then stored in the Herbarium of Key laboratory in Ningxia Forestry Institute with the accession number of NX190512. Total genomic DNA was extracted with a modified CTAB method (Doyle and Doyle 1987). First, we obtained 10 million high quality pair-end reads for R. eucalyptus, and after removing the adapters, the remained reads were used to assemble the complete chloroplast genome by NOVOPlasty (Dierckxsens et al. 2017). The complete chloroplasts genome sequence of Rubus coreanus was used as a reference. Plann v1.1 (Huang and Cronk 2015) and Geneious v11.0.3 (Kearse et al. 2012) were used to annotate the chloroplast genome and correct the annotation. The completecp genome sequence has been submitted to GenBank (accession number MN013402).
The R. eucalyptus cp genome is 155,672 bp in length, exhibits a typical quadripartite structural organization, consisting of a large single copy (LSC) region of 85,277 bp, two inverted repeat (IR) regions of 25,796 bp and a small single copy (SSC) region of 18,864bp. The cp genome contains 130 complete genes, including 86 protein-coding genes (86 PCGs), eight ribosomal RNA genes (four rRNAs), and 37 tRNA genes (20 tRNAs). Most genes occur in a single copy, while 15 genes occur in double, including all rRNAs (4.5S, 5S, 16S, and 23S rRNA), 5 tRNAs (trnA-UGC, trnL-CAA, trnN-GUU, trnR-ACG, and trnV-GAC), and 6 PCGs (rps7, rps12, rpl2, rpl23, ndhB, ycf2). The overall AT content of cp DNA is 62.9%, while the corresponding values of the LSC, SSC, and IR regions are 65.1%, 69.1%, and 57.2% respcetively.
In order to further clarify the phylogenetic position of Rubus eucalyptus, plastome of six representative Rosaceae species were obtained from NCBI to construct the plastome phylogeny, with Malus transitoria as an outgroup. All the sequences were aligned using MAFFT v.7.313 (Katoh and Standley 2013) and maximum likelihood phylogenetic analyses were conducted using RAxML v.8.2.11 (Stamatakis 2014). The phylogenetic tree shows that all species of Rosaceae were clustered together one clade. while R. eucalyptus was at the basal of Rosaceae (Figure 1). This study provides a platform for future molecular phylogenetics of species in tribe Rosaceae.
Figure 1.
Phylogenetic relationships of Rosaceae species using whole chloroplast genome. GenBank accession numbers: Fragaria virginiana (KY085911.1), Geum rupestre (NC_037392.1), Malus transitoria (MK098838.1), Potentilla indic (NC_041178.1), Rosa banksiae (NC_042194.1), Sanguisorba tenuifolia (NC_042223.1).
Disclosure statement
No potential conflict of interest was reported by the authors.
References
- Alice L, Campbell C. 1999. Phylogeny of Rubus (rosaceae) based on nuclear ribosomal DNA internal transcribed spacer region sequences. Am J Bot. 86:81–97. [PubMed] [Google Scholar]
- Dierckxsens N, Mardulyn P, Smits G. 2017. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45:e18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doyle JJ, Doyle JL. 1987. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 19:11–15. [Google Scholar]
- Huang DI, Cronk Q. 2015. Plann: a command-line application for anno-tating plastome sequences. Appl Plant Sci. 3:1500026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30:772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, et al. 2012. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 28:1647–1649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robertson K. 1974. The genera of Rosaceae in the southeastern United States. J Arnold Arboretum. 55:352–360. [Google Scholar]
- Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 30:1312–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thompson M. 1995. Chromosome numbers of Rubus species at the National Clonal Germplasm Repository. HortScience. 30:1447–1452. [Google Scholar]

