Abstract
Quercus myrsinifolia Blume is an evergreen oak tree species native to East Asia, and is also one of the dominant trees of subtropical evergreen broad-leaved forests. In this study, we sequenced and analysed the complete chloroplast (cp) genome of the species. The circular genome is 160,803 bp in size, consisting of two copies of inverted repeat (IR) regions of 25,840 bp, one large single-copy (LSC) region of 90,223 bp, and one small single-copy (SSC) region of 18,900 bp. It encodes a total of 114 unique genes, including 80 protein-coding genes, 30 tRNA genes, and four rRNA genes. Phylogenetic analysis based on 30 cp genome sequences indicated that Q. myrsinifolia was among the members of section Cyclobalanopsis, and was most closely related to Q. sichourensis.
Keywords: chloroplast genome, Fagaceae, phylogeny, Quercus myrsinifolia
Quercus myrsinifolia Blume is an evergreen oak tree species native to East Asia. It is widely distributed across China, Japan, Korea, Laos, Thailand, and Vietnam, and grows at an altitude ranging from 200 m to 2500 m above sea level. As one of the dominant trees of subtropical evergreen broad-leaved forests, it is of ecological importance in maintaining mountain ecosystem, and providing habitats and foods for various species, including endangered animals like Tibetan macaque (Macaca thibetana) (You et al. 2013). It is also an important commercial tree species for producing excellent timber. Previous phylogenomic studies have shown that Q. myrsinifolia belongs to the compound trichome base (CTB) lineage within Quercus section Cyclobalanopsis (Deng et al. 2018). However, the plastid genome sequence of the species remains unknown. In this study, we report the complete chloroplast (cp) genome of Q. myrsinifolia based on Illumina paired-end sequencing data.
Fresh young leaves were sampled from an adult tree growing at the Huangshan Mountain, Anhui Province, China (30.10°N,118.17°E). The voucher specimen (accession number YL20190417008) was preserved at the Herbarium of Nanjing Forestry University (HNFU). Total DNA extraction and whole genome sequencing on the Illumina Hiseq X Ten platform were conducted by Nanjing Genepioneer Biotechnologies Inc. (Nanjing, China). A total of 28,699,387 clean reads were produced and then used for the de novo assembly with NOVOplasty 2.7.2 (Dierckxsens et al. 2016). Gene annotation was performed using the CpGAVAS pipeline (Liu et al. 2012).
The complete cp genome of Q. myrsinifolia (GenBank accession number MN199025) is a circular molecule of 160,803 bp in length, consisting of two copies of IR (25,840 bp) separated by the LSC (90,223 bp) and SSC (18,900 bp) regions. The overall GC content was 36.89%, while the corresponding values of the LSC, SSC, and IR regions were 34.74%, 31.09%, and 42.77%, respectively. The cp genome encoded a total of 133 genes, of which 114 were unique and 19 were duplicated in the IR regions. The 114 unique genes contained 80 protein-coding genes, 30 tRNA genes, and four rRNA genes. Fifteen genes contained introns, 12 of which (six protein-coding genes and six tRNA genes) contained one intron, and three of which (rps12, ycf3, and clpP) contained two introns.
To identify the phylogenetic position of Q. myrsinifolia, we reconstructed a maximum-likelihood (ML) tree using cp genome sequences of 24 Fagaceae species and six outgroups from Juglandaceae and Betulaceae (Figure 1). Three locally collinear blocks (LCBs) were identified and a matrix of 82,255 bp was generated by the HomBlocks pipeline (Bi et al. 2018). Using PartitionFinder 2 (Lanfear et al. 2016), the TVM + G model and the TVM + I + G model were chosen for the subset of LCBs 1-2 (10,255 bp) and the subset of LCB 3 (72,000 bp), respectively. The IQ-tree 1.6.8 program (Nguyen et al. 2014) was used to perform the ML analyses, with branch support values estimated through 20,000 ultrafast bootstrap replicates. Our results indicated that Q. myrsinifolia was among the members of section Cyclobalanopsis, and was most closely related to Q. sichourensis with 99% bootstrap support.
Figure 1.
The maximum-likelihood (ML) phylogenetic tree reconstructed by IQ-tree 1.6.8 (Nguyen et al. 2014) based on cp genome sequences of 24 Fagaceae species and six outgroups from Juglandaceae and Betulaceae. The bootstrap support value is labelled for each node. Black bars show two subgenera (Cerris and Quercus) and five sections (Cerris, Ilex, Cyclobalanopsis, Quercus, and Lobatae) of Quercus (Denk et al. 2017).
Disclosure statement
No potential conflict of interest was reported by the authors.
References
- Bi G, Mao Y, Xing Q, Cao M. 2018. HomBlocks: a multiple-alignment construction pipeline for organelle phylogenomics based on locally collinear block searching. Genomics. 110:18–22. [DOI] [PubMed] [Google Scholar]
- Deng M, Jiang XL, Hipp AL, Manos PS, Hahn M. 2018. Phylogeny and biogeography of East Asian evergreen oaks (Quercus section Cyclobalanopsis; Fagaceae): insights into the Cenozoic history of evergreen broad-leaved forests in subtropical Asia. Mol Phylogenet Evol. 119:170–181. [DOI] [PubMed] [Google Scholar]
- Denk T, Grimm GW, Manos PS, Deng M, Hipp AL. 2017. An updated infrageneric classification of the oaks: review of previous taxonomic schemes and synthesis of evolutionary patterns In: Gil-Pelegrín E, Peguero-Pina JJ, Sancho-Knapik D, editors. Oaks physiological ecology. Exploring the functional diversity of genus Quercus L. Cham: Springer; p. 13–38. [Google Scholar]
- Dierckxsens N, Mardulyn P, Smits G. 2016. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45:18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lanfear R, Frandsen PB, Wright AM, Senfeld T, Calcott B. 2016. PartitionFinder 2: new methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses. Mol Bio Evol. 34:772–773. [DOI] [PubMed] [Google Scholar]
- Liu C, Shi L, Zhu Y, Chen H, Zhang J, Lin X, Guan X. 2012. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences. BMC Genomics. 13:715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. 2014. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 32:268–274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- You SY, Yin HB, Zhang SZ, Ji TY, Feng XM. 2013. Food habits of Macaca thibetana at Mt. Huangshan, China]. J Biol. 30:64–67. [Chinese]. [Google Scholar]

