Abstract
Gaillardia pulchella, an annual Asteraceae herb native to tropical America, is widely cultivated in China for its vibrant blooms and long flowering period. The species exhibited notable biological activities, including antioxidant and anti-inflammatory effects. We presented the first complete chloroplast genome of G. pulchella, spanning 151,778 bp with an overall GC content of 37.7%, and containing 128 genes (85 protein-coding genes, 35 tRNA genes, and eight rRNA genes). Phylogenetic analysis revealed its close relationship with Galinsoga parviflora, Tagetes erecta, and Flaveria bidentis. This study provided valuable genomic insights for evolutionary studies, and potential pharmaceutical applications of G. pulchella.
Keywords: Gaillardia pulchella, Helenieae, chloroplast genome, phylogenetic analysis
Introduction
Gaillardia pulchella is an annual herbaceous plant belonging to the Asteraceae family, which is native to tropical America and is widely cultivated in China as an ornamental species. In addition to their horticultural value, G. pulchella has been reported to contain bioactive compounds, including sesquiterpene derivatives (Kupchan et al. 1966; Yu et al. 1988) and phenolic compounds such as flavones (Moharram et al. 2017). These compounds exhibit various pharmacological properties, including antioxidant, anti-inflammatory, and hepatoprotective effects (Inayama et al. 1984; Harimaya and Inayama 1990).
Despite its medicinal potential, comprehensive genomic investigations of G. pulchella remain scarce. Although several nuclear and chloroplast genome markers (e.g. ndhF and trnL-trnF) are accessible in GenBank, a complete chloroplast genome has not been documented to date. Chloroplast genomes are instrumental in phylogenetic reconstruction, species identification, and evolutionary analysis owing to their conserved structure and maternal mode of inheritance. Accordingly, the present study aims to sequence and annotate the complete chloroplast genome of G. pulchella, thereby supporting future research on its genetic diversity, evolutionary affiliations, and prospective applications in medicine and agriculture.
Materials and methods
The fresh leaves used for sequencing were obtained from Peony District, Heze City, Shandong Province, China (35°18’55.37′’N, 115°30’38.77′’E) (Figure 1), and a sample was deposited in the Heze University Herbarium (contact person: Hongqin Li, 463056627@qq.com) under specimen number HZ20220809.
Figure 1.
Field photograph of Gaillardia pulchella. Location: 35°18’55.37"N, 115°30’38.77"E. Photographer: Liqiang Wang. Key morphological features: Entire plant covered with soft hairs. Leaves alternate, variable in shape (oblong, lanceolate, or spatulate); leaf margins entire or pinnately lobed at base; terminal capitulum with long peduncles; ligulate flowers yellow with brownish-purple bases; flower apices with 2–3 teeth; achenes hairy at base. Panel A: whole plant panorama, panel B: top-down view of the capitulum.
Whole genomic DNA was extracted using the Plant Genomic DNA Kit (Tiangen Biotech, Beijing, China). The extracted DNA was fragmented into about 300 bp short-insert fragments to construct sequencing libraries, which were subsequently sequenced on the Illumina NovaSeq 6000 platform at Wuhan Benagen Technology Company Limited, Wuhan, China. Trimmomatic (v0.35) (Bolger et al. 2014) with default settings was employed to filter raw reads by removing adapters and low-quality bases. Following quality control, about 24 GB of clean reads were assembled using GetOrganelle (v1.7.1) (Jin et al. 2020). The complete chloroplast genome was annotated using the online tool CPGAVAS2 (Shi et al. 2019) and manually refined using Apollo (Pontius 2018). The annotated chloroplast genome was subsequently submitted to GenBank under the accession number OR124734. The CPGview tool was applied to visualize the circular genome map of the newly assembled chloroplast genome (Liu et al. 2023).
To investigate the phylogenetic position of G. pulchella, the chloroplast genomes of 18 species of the Asteraceae were downloaded from the GenBank database. In addition, the outgroup was Nymphaea tetragona. The entire chloroplast genome sequence was aligned using MAFFT software with default parameters (https://mafft.cbrc.jp/alignment/software/) (Katoh and Standley 2016). A maximum-likelihood (ML) phylogenetic tree was subsequently constructed using IQ-TREE (v2.0) (Nguyen et al. 2015) with the best-fit model of TVM+G4, along with 1000 bootstrap replicates.
Results
The G. pulchella chloroplast genome sequence spans 151,778 bp and displays a typical quadripartite structure. Compared to the chloroplast genome of Arabidopsis thaliana, a structural rearrangement has occurred in this species’chloroplast genome (Figure S1). The genome comprises two IR regions of 25,002 bp each, separated by a large LSC region of 83,527 bp and a small SSC region of 18,247 bp (Figure 2). The chloroplast genome exhibits variable GC content, with an overall GC content of 37.7%. The assembled complete chloroplast genome shows an average sequencing depth of 915.55×, with a minimum of 182× and a maximum of 1375× (Figure S2). The IR regions have the highest GC content at 43.1%, followed by the LSC and SSC regions at 35.8% and 31.2%, respectively. The G. pulchella chloroplast genome is predicted to encode 128 genes, including 85 protein-coding genes (PCGs), eight rRNA genes, and 35 tRNA genes. Within the IR regions, six unique PCGs (rps12, rps7, rpl2, rpl23, ndhB, and ycf2), seven unique trnA genes (trnM, trnL, trnV, trnE, trnA, trnR, and trnN), and four unique rRNA genes (rrn16S, rrn23S, rrn4.5S, rrn5S) were identified. Across the chloroplast genome, ten PCGs (rps16, rpoC1, atpF, petB, petD, rpoA, rpl16, rpl2, ndhB, and ndhA) each contain one intron, while two PCGs (ycf3, clpP) contain two introns. In addition, five tRNA genes (trnH-GUG, trnK-UUU, trnQ-UUG, trnS-GCU, and trnC-GCA, trnD-GUC, and trnY-GUA) contain one intron. The rps12 gene was subjected to trans-splicing. The structures of the cis-splicing and trans-splicing PCG genes are shown in Figure S3 and Figure S4, respectively. The lengths of the protein-coding genes, tRNA genes, and rRNA genes in the chloroplast genome are 2,8456 bp, 3,831 bp, and 9,273 bp, accounting for 18.7%, 2.5%, and 6.1% of the total genome length, respectively.
Figure 2.
The complete chloroplast genome of Gaillardia pulchella. The map contains six tracks. From the center outward, the first track displays the dispersed repeats. The second track shows long tandem repeats as short blue bars. The third track shows short tandem repeats or microsatellite sequences as short bars with different colors. The small single-copy (SSC), inverted repeat (IRA and IRB), and large single-copy (LSC) regions are shown on the fourth track. The GC content along the genome is plotted on the fifth track. The genes are shown on the sixth track. Genes belonging to different functional groups are color-coded. Genes on the inside and outside of the map are transcribed in clockwise and counterclockwise directions, respectively.
Phylogenetic analysis revealed that G. pulchella clustered into a monophyletic group with Galinsoga parviflora, Tagetes erecta, and Flaveria bidentis (Figure 3). This clade received maximum bootstrap support (100%), confirming its close evolutionary relationship.
Figure 3.
Maximum-likelihood phylogeny of Gaillardia pulchella and its close relatives using whole-genome sequences. The bootstrap values based on 1000 replicates are shown on each node in the phylogenetic tree. The 19 species were downloaded from GenBank. Taraxacum mongolicum KU736961 (Kim et al. 2016), Chrysanthemum indicum MH165290 (Ma et al. 2020), Chrysanthemum lavandulifolium MH165287 (Ma et al. 2020), Chrysanthemum vestitum MH165288 (Ma et al. 2020), Flaveria bidentis MK836182 (Yang et al. 2019), Leucanthemum maximum MN518843 (Chen et al. 2019), Tagetes erecta MN462588 (Song et al. 2019), Galinsoga parviflora MK737938 (Zhang et al. 2019), Gaillardia pulchella OR124734 (this study), Calendula officinalis OP161555 (Zhang et al. 2024), Calendula officinalis OQ515449, Nymphaea tetragona MT107634 (outgroup) (Sun et al. 2021). Aster indicus PV564080, Lactuca indica ON782501, Erigeron acris OR124733, Aster altaicus KX352465 (Park et al. 2017), Chrysanthemum zawadzkii OP697855, Cichorium intybus MK569377 (Yang et al. 2019).
Conclusions and discussion
This study provides the first detailed characterization of the genome of G. pulchella, revealing a typical annular tetrad structure measuring 151,778 bp and comprising 128 predicted genes. A structural rearrangement event was identified within the chloroplast genome. Phylogenetic analysis demonstrated that G. pulchella, Galinsoga parviflora, Tagetes erecta, and Flaveria bidentis form a robustly supported monophyletic clade within the Asteraceae. This clustering is consistent with recent molecular phylogenetic investigations based on chloroplast DNA markers (e.g. ndhF and trnL-trnF) and nuclear ribosomal ITS sequences, which consistently place these genera within the Heliantheae alliance—a lineage defined by radiate capitula and conserved mechanisms regulating floral symmetry (Chen et al. 2018). The chloroplast genome analysis of G. pulchella further reinforces its taxonomic placement within the Asteraceae. Accordingly, the phylogenetic results offer a reliable depiction of the evolutionary position of G. pulchella.
Supplementary Material
Acknowledgments
All authors contributed to the manuscript. Liqiang Wang was primarily responsible for the study design. Hongqin Li identified the species and collected samples. Liqiang Wang assembled and annotated the chloroplast genome. Jicheng Yuan and Guiyuan Wu analyzed the chloroplast genome structure and Hongqin Li drafted the manuscript. All authors approved the final version for publication and agreed to be accountable for all aspects of the work.
Funding Statement
This work was supported by the Shandong Provincial Natural Science Foundation [ZR2021MC136].
Ethical statement
The Gaillardia pulchella specimen is not an endangered species. It requires no specific permissions or licenses. Research on this species, including the collection of plant material, was conducted following the guidelines provided by Heze University in this study.
Disclosure statement
No potential conflict of interest was reported by the authors. Guiyuan Wu is affiliated with the Production Department of Shandong Buchang Pharmaceuticals Co., Ltd. The authors declare no other competing financial or non-financial interests related to this work.
Data availability statement
The complete chloroplast genome sequence of G. pulchella in this study has been submitted to the NCBI database under the accession number OR124734 https://www.ncbi.nlm.nih.gov. The associated BioProject, BioSample, and SRA numbers are PRJNA928567, SAMN48024363, and SRR33185918.
References
- Bolger AM, Lohse M, Usadel B.. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen H, Jiang M, Wang L, You J, Liu C.. 2019. Complete plastome of Leucanthemum maximum, the first in genus Leucanthemum. Mitochondrial DNA B Resour. 5(1):19–20. doi: 10.1080/23802359.2019.1693922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen J, Shen CZ, Guo YP, Rao GY.. 2018. Patterning the Asteraceae capitulum: duplications and differential expression of the flower symmetry CYC2-like genes. Front Plant Sci. 9:551. doi: 10.3389/fpls.2018.00551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harimaya K, Inayama S.. 1990. Structure-activity relationships of pseudoguaianolides isolated from Gaillardia pulchella and their derivatives. Heterocycles. 30(2):993–1008. doi: 10.3987/COM-89-S101. [DOI] [Google Scholar]
- Inayama S, Harimaya K, Hori H, Ohkura T, Kawamata T, Hikichi M, Yokokura T.. 1984. Studies on non-sesquiterpenoid constituents of Gaillardia pulchella. II. Less lipophilic substances, methyl caffeate as an antitumor catecholic. Chem Pharm Bull (Tokyo). 32(3):1135–1141. doi: 10.1248/cpb.32.1135. [DOI] [PubMed] [Google Scholar]
- Jin JJ, Yu WB, Yang JB, Song Y, dePamphilis CW, Yi TS, Li DZ.. 2020. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21(1):241. doi: 10.1186/s13059-020-02154-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM.. 2016. A simple method to control over-alignment in the MAFFT multiple sequence alignment program. Bioinformatics. 31(13):1933–1942. doi: 10.1093/bioinformatics/btv028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim JK, Park JY, Lee YS, Woo SM, Park HS, Lee SC, Kang JH, Lee TJ, Sung SH, Yang TJ, et al. 2016. The complete chloroplast genomes of two Taraxacum species, T. platycarpum Dahlst. and T. mongolicum Hand.-Mazz. (Asteraceae). Mitochondrial DNA Part B: resources. 1(1):412–413. doi: 10.1080/23802359.2016.1176878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kupchan SM, Cassady JM, Kelsey JE, Schnoes HK, Smith DH, Burlingame AL.. 1966. Structural elucidation and high-resolution mass spectrometry of Gaillardin, a new cytotoxic sesquiterpene lactone. J Am Chem Soc. 88(22):5292–5302. doi: 10.1021/ja00974a037. [DOI] [Google Scholar]
- Liu S, Ni Y, Li J, Zhang X, Yang H, Chen H, Liu C.. 2023. CPGView: a package for visualizing detailed chloroplast genome structures. Mol Ecol Resour. 23(3):694–704. doi: 10.1111/1755-0998.13738. [DOI] [PubMed] [Google Scholar]
- Ma Y, Zhao L, Zhang W, Zhang Y, Xing X, Duan X, Hu J, Harris AJ, Liu P, Dai S, et al. 2020. Origins of cultivars of Chrysanthemum—Evidence from the chloroplast genome and nuclear LFY gene. J Syst Evol. 58(6):925–944. doi: 10.1111/jse.12682. [DOI] [Google Scholar]
- Moharram FA, El Dib RAEM, Marzouk MS, El-Shenawy SM, Ibrahim HA.. 2017. New apigenin glycoside, polyphenolic constituents, anti-inflammatory and hepatoprotective activities of Gaillardia grandiflora and Gaillardia pulchella aerial parts. Pharmacogn Mag. 13(Suppl 2):S244–S249. doi: 10.4103/pm.pm_440-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ.. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 32(1):268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park J, Shim J, Won H, Lee J.. 2017. Plastid genome of Aster altaicus var. uchiyamae Kitam., an endangered species of Korean asterids. Journal of Species Research. 6(1):76–90. doi: 10.12651/JSR.2017.6.1.076. [DOI] [Google Scholar]
- Pontius E. 2018. Apollo. Ann Emerg Med. 72(5):616. doi: 10.1016/j.annemergmed.2018.07.001. [DOI] [PubMed] [Google Scholar]
- Shi L, Chen H, Jiang M, Wang L, Wu X, Huang L, Liu C.. 2019. CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res. 47(W1):W65–W73. doi: 10.1093/nar/gkz345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Song Z, Li TQ, Liu ZH, Jiang K, Ou XB.. 2019. The complete chloroplast genome of Tagetes erecta (Asteroideae), a wildly cultivated ornamental plant. Mitochondrial DNA B Resour. 5(1):79–80. doi: 10.1080/23802359.2019.1692704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun C, Chen F, Teng N, Xu Y, Dai Z.. 2021. Comparative analysis of the complete chloroplast genome of seven Nymphaea species. Aquat Bot. 170:103353. doi: 10.1016/j.aquabot.2020.103353. [DOI] [Google Scholar]
- Yang D, Wei Z, Yue C, Xu X, Zhang T, Zhao Q Q., Zhu S.. 2019. The complete chloroplast genome sequence and phylogenetic analysis of Flaveria bidentis (L.) Kuntze, an invasive alien plant species in China. Mitochondrial DNA Part B: resources. 4(1):2021–2022. doi: 10.1080/23802359.2019.1618215. [DOI] [Google Scholar]
- Yang S, Sun X, Wang L, Jiang X, Zhong Q.. 2019. The complete chloroplast genome sequence of Chicory (Cichorium intybus L.). Mitochondrial DNA B Resour. 4(1):1533–1534. doi: 10.1080/23802359.2019.1601524. [DOI] [Google Scholar]
- Yu S, Fang N, Mabry TJ, Abboud KA, Simonsen SH.. 1988. Sesquiterpene lactones from Gaillardia pulchella. Phytochemistry. 27(9):2887–2891. doi: 10.1016/0031-9422(88)80187-5. [DOI] [Google Scholar]
- Zhang N, Huang K, Xie P, Deng A, Tang X, Jiang M, Mo P, Yin H, Huang R, Liang J, et al. 2024. Chloroplast genome analysis and evolutionary insights in the versatile medicinal plant Calendula officinalis L. Sci Rep. 14(1):9662. doi: 10.1038/s41598-024-60455-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Z, Chen Y, Jiang X, Zhu P, Zeng Y, Tang T, Li L.. 2019. Characterization of the complete chloroplast genome sequence of Galinsoga parviflora and its phylogenetic implications. Mitochondrial DNA B Resour. 4(2):2106–2108. doi: 10.1080/23802359.2019.1623109. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The complete chloroplast genome sequence of G. pulchella in this study has been submitted to the NCBI database under the accession number OR124734 https://www.ncbi.nlm.nih.gov. The associated BioProject, BioSample, and SRA numbers are PRJNA928567, SAMN48024363, and SRR33185918.



