Skip to main content
BMC Plant Biology logoLink to BMC Plant Biology
. 2026 Mar 6;26:670. doi: 10.1186/s12870-026-08486-x

Genome-wide identification of the DFR gene family in Dracaena cambodiana and its expression analysis under wound stress

Shuang Li 1, Hongyou Zhao 1, Chunyong Yang 1, Yanfang Wang 1, Yating Zhu 1,2, Qianxia Li 1,3, Ge Li 1, Lixia Zhang 1, Zhaoyou Deng 1, Ling Wang 1, Yanqian Wang 1,
PMCID: PMC13078043  PMID: 41787291

Abstract

Dracaena cambodiana Pierre ex Gagnep. is divided into two distinct groups: the Yunnan clade with soft leaves (D. cambodiana A) and the Hainan clade with hard leaves (D. cambodiana B). This species is the key plant source of dragon’s blood, a well-known traditional Chinese medicine derived from the defensive metabolites of Dracaena species, with flavonoids as its major bioactive components. Dihydroflavonol 4-reductase (DFR) plays a pivotal role in flavonoid biosynthetic pathway. In this study, we identified 19 DFR genes (designated as DcDFRs) from the reference genome of D. cambodiana, which are distributed across six chromosomes and phylogenetically classified into four subfamilies. Comprehensive analyses of their chromosomal localization, sequence homology, gene structure, and phylogenetic relationships revealed two distinct tandemly duplicated gene clusters (TDGCs), designated as TDGCs-1 and TDGCs-2. Notably, TDGCs-1, comprising DcDFR1DcDFR5 and localized on chromosome 5, exhibits conserved tissue-specific co-expression patterns and coordinated transcriptional responses to wound stress across two distinct D. cambodiana accessions (D. cambodiana A and D. cambodiana B). Its expression levels are positively correlated with both the duration of stress induction and flavonoid accumulation, suggesting a critical role in flavonoid metabolism and wound defense responses. Unlike TDGCs-1, TDGCs-2 does not show typical co-expression patterns under wound stress in D. cambodiana A, which implies that the two tandemly duplicated gene clusters have both overlapping and distinct biological functions. This study reveals the secondary metabolic pathways triggered by particular environmental stresses and provides a theoretical foundation for future investigations into the artificial induction technology of dragon’s blood.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12870-026-08486-x.

Keywords: Dihydroflavonol 4-reductase, Tandemly duplicated gene clusters, Flavonoid, Dragon's blood

Introduction

Dracaena cambodiana Pierre ex Gagnep., an ancient monocotyledonous evergreen tree within the genus Dracaena, has two distinct groups: the Yunnan clade with soft leaves (D. cambodiana A) and Hainan clade with hard leaves (D. cambodiana B) [1]. The tree species is mainly distributed in Yunnan, Guangxi, and Hainan Provinces in China, as well as in several Southeast Asian countries [2]. When the trunk of D. cambodiana is damaged by external factors, a red resin-like substance is secreted at the wound site. This substance is called dragon’s blood (also known as “XueJie”) [3] and is of great significance. Dragon’s blood is not only used as a traditional Chinese medicine but it also has extensive international applications. Consequently, D. cambodiana has emerged as an important medicinal plant for the production of dragon’s blood resin. Dragon’s blood has a variety of pharmacological properties, including enhancing blood circulation to dissipate blood stasis, alleviating inflammation and pain, and has the effect of astringing to arrest bleeding, which is used to treat wounds, leucorrhea, fractures, diarrhea and piles, as well as intestinal and stomach ulcers [4]. With the constant advancements in clinical medicine research, dragon’s blood has demonstrated promising efficacy in anti-cancer treatment [5, 6, 7] and the management of cardiovascular diseases [8, 9], and is being explored for its potential as an antidepressant [10, 11, 12]. In the modern industrial realm, owing to its unique color and chemical characteristics, Dragon’s blood is crucial in industries, such as cosmetic and coating industries [13, 14]. Nevertheless, D. cambodiana has a low growth rate and dragon’s blood yield in the natural environment is extremely low, which is not sufficient to meet the ever-increasing global demand. This has led to the overexploitation and destructive harvesting of wild Dracaena resources. In light of this, D. cambodiana and D. cochinchinensis, which are two key plant sources of dragon’s blood, have been enlisted in the “List of Wild Plants under State Key Protection (2021 Edition)” as second-class protected species [15]. Illegal digging, collection, and trading of these plants are explicitly prohibited.

Dragon’s blood is regarded as a defensive metabolite generated by Dracaena trees to counteract biotic and abiotic stresses. Dragon’s blood synthesis is a complex process that encompasses multiple enzymatic reactions and metabolic stages. Modern chemical and pharmacological investigations have demonstrated that the core active components of dragon’s blood comprise flavonoids, terpenoids, and other compounds [16]. Notably, the flavonoid content of dragon’s blood can reach 50–80% [17, 18]. The flavonoid components in dragon’s blood, including the active ingredients loureirin A and B, exhibit a C6-C3-C6 flavonoid skeleton structure that is analogous to that of anthocyanins and other flavonoids [19, 20]. The structure is derived from phenylalanine and malonyl coenzyme A. Chemically, they likely share certain precursor substances and intermediate metabolic steps during biosynthesis.

Dihydroflavonol 4-reductase (DFR), which is encoded by the DFR gene, is a crucial enzyme involved in the metabolism of flavonoids and polyphenols in plants [21, 22] and is essential for the synthesis of plant secondary metabolites. In a wide array of plants, DFR is intricately involved in the biosynthesis of anthocyanins and proanthocyanidins, where it catalyzes the reduction of dihydroflavonols to leucoanthocyanidins. Notably, DFR expression level is strongly correlated with the anthocyanin content. For example, anthocyanin accumulation in plants such as Lycium ruthenicum Murr. [23], Hosta ventricosa [24], Paeonia suffruticosa [25], Brassica oleracea var. acephala [26, 27], is enhanced when the DFR gene is highly expressed, which directly impacts the color of fruits, flowers, and leaves. However, DFRs derived from different plants exhibit different preferences for substrates (dihydrokaempferol [DHK]), dihydroquercetin [DHQ], dihydromyricetin [DHM]), which have been found in plants such as Camellia sinensis [28], Ginkgo biloba [29], Medicago truncatula [30], Rhododendron delavayi [31], and Dryopteris erythrosora [32]. Moreover, the engineered DFR gene in petunia can have a preference for specific substrates, thereby influencing​ anthocyanin synthesis and flower color [33, 34]. The DFR gene exhibits differential expression across various plant tissues and developmental stages. During fruit coloring of strawberry [35] and plum [36, 37], the expression levels of most DFR genes peak at the full fruit coloring stage. The DFR gene is also implicated in plant growth and development. Overexpression of the GbDFR6 gene in Ginkgo biloba affects the growth and development of tobacco plants, leading to alterations in root and leaf morphology, as well as delayed flowering [38]. The expression level of the DFR gene changes dynamically in plants in response to biotic or abiotic stresses, such as bacterial leaf blight stress [39], salt stress [40, 41], drought stress [42], high temperature [43, 44] and sunburn stress [45], the expression level of the DFR, suggesting that it influences plant stress response. Studies have revealed that the regulation of tea [46], jujube [47], and grape [48] flavors is influenced by the DFR gene. Although the DFR gene is involved in numerous secondary metabolic reactions in plants, it exhibits sequence conservation in catalytic mechanisms and basic functions. This conservation not only provides a foundation for studying the functions of DFR genes in different plants but also implies its important position within the regulatory network of plant secondary metabolism.

Wang et al. [49] induced the production of dragon’s blood in D. cochinchinensis using Fusarium proliferatum and the chemical components of the induced dragon’s blood are almost the same as those of the natural dragon’s blood. Zhu et al. [50] artificially induced dragon’s blood production in D. cambodiana by injecting a chemical inducer and analyzed the transcriptome data. The researchers discovered that among the 16 genes annotated as DFR genes, the expression levels of seven DFR genes were considerably upregulated following inducer injection. The upregulation of gene expression was correlated with flavonoid accumulation. Six DFR genes in D. cochinchinensis [51] exposed to wound stress have been shown to be involved in the flavonoid biosynthetic pathway. The expression levels of five of these genes increase substantially after wound stress, which is consistent with the flavonoid accumulation. According to Yang et al., high expression levels of two DFR genes have been detected after six months of wound induction in D. cochinchinensis [52]. Genomic data analysis of D. cochinchinensis has revealed that DFR genes are enriched in the flavonoid biosynthesis and metabolic pathways [53]. Generally, a tandemly duplicated gene cluster (TDGCs) in plants contains three or more homologous genes, while a biosynthetic gene cluster (BGCs) contains three or more nonhomologous enzyme genes associated with the same secondary metabolic pathway [54]. This evidence suggests that the DFR gene is involved in flavonoid biosynthesis in dragon’s blood by catalyzing specific reaction steps, thereby converting precursor substances into the effective components present in dragon’s blood resin.

The identification and functional characterization of DFR genes in this plant species are associated with substantial knowledge gaps due to the scarcity of genomic resources of D. cambodiana. These gaps impeded our capacity to regulate the synthesis of dragon’s blood and boost its production via biotechnological approaches. Comprehending the evolutionary diversity and expression dynamics of DFR genes in Dracaena plants is crucial for enhancing our understanding of the biosynthesis of medicinally valuable flavonoids and the adaptive responses of the plant to environmental challenges. Therefore, this study aimed to address the gaps by conducting a systematic analysis of the DFR gene family in D. cambodiana. Leveraging the published genomic data and bioinformatics techniques, we identified 19 DFR homologous genes, which are distributed on six chromosomes, including two tandem duplication gene clusters that characterized their structural features, and evaluated their expression patterns under normal and wound stress conditions. The findings of this study will enhance our understanding of flavonoid biosynthesis in plant species of the genus Dracaena, thereby forming a theoretical foundation for elucidating the mechanisms by which DFR genes regulate dragon’s blood synthesis, as well as offer novel perspectives for increasing dragon’s blood production and improving the stress tolerance capacity of these endangered plants through human intervention.

Results

Identification and subcellular localization of DcDFR proteins

To elucidate the evolution of the DFR family in D. cambodiana, a BLAST analysis was conducted using the DFR protein sequence from Oryza sativa and Arabidopsis thaliana (Fig. 1A). A total of 19 DcDFR genes were identified and named DcDFR1-DcDFR19 after removing redundant and incomplete sequences (Table S1). Further amino acid sequence alignment revealed that the NADP-binding domain [31] was present in all DcDFRs except DcDFR1, DcDFR3, DcDFR4, DcDFR9, and DcDFR11 (Figure S1). Analysis of the physicochemical properties of DcDFR proteins (Table 1) revealed that the lengths of amino acids (Len) ranged between 208 and 545, molecular weights (MW) ranged from 22.52 to 61.25 kD, and the theoretical isoelectric points (pI) ranged from 5.21 to 8.55. All DcDFR proteins were stable except DcDFR1, DcDFR2, DcDFR3, DcDFR4, and DcDFR5. The aliphatic index (AI) values and grand average of hydrophobicity (GR) values ranged from 75.75 to 103.37, -0.282 to 0.13, respectively. The subcellular localization (SL) results revealed that DcDFR proteins were distributed in the chloroplast, cytoplasm, and nucleus. Among the 19 DFR proteins, DcDFR18 was localized in the nucleus, while DcDFR13 and DcDFR15 were predicted to be secreted proteins.

Fig. 1.

Fig. 1

Evolutionary and structural characterization of DcDFR genes in D. cambodiana. A Venn diagram illustrating BLASTP results of OsDFR and AtDFR. B Three-dimensional (3D) structure models of DcDFR proteins. C Distribution of DcDFR genes on D. cambodiana chromosome. TDGCs, tandemly duplicated gene clusters. D Intraspecies collinearity: the red line represents collinear gene pairs of DcDFR genes, while the green line represents collinear gene pairs between DcDFR and undetected genes. E Interspecies collinearity: the blue line represents collinear relationships between D. cambodiana and Oryza sativa, and between D. cambodiana and Arabidopsis thaliana

Table 1.

Physicochemical properties of DcDFR proteins

Gene Len (aa) MW (kD) pI II AI GR SL
DcDFR1 243 27.62 8.55 60.89 83.09 -0.062 Chloroplast
DcDFR2 365 40.93 6.86 49.8 79.32 -0.159 Chloroplast
DcDFR3 239 26.91 7.03 52.82 81.59 -0.054 Chloroplast
DcDFR4 240 27.13 6.66 47.23 82.46 -0.028 Chloroplast
DcDFR5 335 37.64 7.99 43.53 80.3 -0.127 Chloroplast
DcDFR6 320 35.62 5.74 37.83 84.38 -0.224 Cytoplasm
DcDFR7 326 35.38 5.76 37.59 89.82 -0.12 Chloroplast
DcDFR8 337 36.98 6.02 29.47 88.78 -0.151 Chloroplast
DcDFR9 252 27.46 5.26 37.7 103.37 0.269 Cytoplasm
DcDFR10 313 34.81 6.61 29.94 91.53 -0.103 Chloroplast
DcDFR11 208 22.52 5.21 33.66 94.71 0.13 Chloroplast
DcDFR12 545 61.25 5.99 32.41 91.71 -0.211 Cytoplasm
DcDFR13 352 39.25 5.89 36.94 90.54 -0.115 Secreted
DcDFR14 326 35.77 6.4 23.67 86.41 -0.136 Chloroplast
DcDFR15 335 37.30 6.16 33.1 94.87 0.004 Secreted
DcDFR16 322 35.98 5.7 31.93 91.4 -0.18 Cytoplasm
DcDFR17 293 32.12 8.3 31.14 91.77 -0.039 Chloroplast
DcDFR18 402 44.19 6.19 38.55 75.75 -0.282 Nucleus
DcDFR19 339 37.54 7.64 36.89 93.78 0.029 Chloroplast

Len Length of amino acids, MW Molecular weight, pI theoretical isoelectric points, II Instability index, AI Aliphatic index, GR Grand average of hydrophobicity, SL Subcellular localization

Secondary structure and three-dimensional structure models of the DcDFR gene family

Analysis of the secondary structure proportion of the DFR gene family in D. cambodiana (Table 2) revealed that 19 family members were predominantly composed of alpha helices, accounting for 36.18–48.02% and random coils, which ranged from 33.04 to 46.44%. DcDFR9 had the highest percentage of alpha helices at 48.02% and DcDFR3 had the highest percentage of random coils at 46.44%. The proportion of extended strands ranged from 8.65% (DcDFR11) to 15.62% (DcDFR6) and that of beta turns ranged from 2.40% (DcDFR11) to 8.55% (DcDFR19). The SWISS-MODEL software was employed to predict the three-dimensional (3D) structure models of the 19 DcDFR proteins. According to the predicted three-dimensional structural model of the DFR proteins, the structural patterns of most family members exhibited considerable similarities (Fig. 1B). The similarity of the advanced structures of proteins suggests that they have the same functions.

Table 2.

Secondary structure of DcDFR proteins

Gene Alpha Helix Extended Strand Beta Turn Random Coil
DcDFR1 39.51% 12.35% 4.53% 43.62%
DcDFR2 38.63% 13.97% 6.30% 41.10%
DcDFR3 38.08% 10.88% 4.60% 46.44%
DcDFR4 41.32% 11.16% 4.55% 42.98%
DcDFR5 41.79% 13.73% 6.57% 37.91%
DcDFR6 40.94% 15.62% 6.56% 36.88%
DcDFR7 40.49% 15.03% 6.44% 38.04%
DcDFR8 40.36% 13.95% 5.64% 40.06%
DcDFR9 48.02% 12.30% 3.97% 35.71%
DcDFR10 41.85% 15.34% 6.07% 36.74%
DcDFR11 46.15% 8.65% 2.40% 42.79%
DcDFR12 43.67% 13.03% 6.06% 37.25%
DcDFR13 40.34% 13.35% 6.53% 39.77%
DcDFR14 40.49% 14.42% 5.83% 39.26%
DcDFR15 42.09% 14.33% 5.37% 38.21%
DcDFR16 40.99% 14.60% 5.90% 38.51%
DcDFR17 36.18% 13.99% 7.17% 42.66%
DcDFR18 43.78% 10.45% 4.48% 41.29%
DcDFR19 44.25% 14.16% 8.55% 33.04%

Chromosome localization and collinearity of DcDFR gene family

Based on the chromosome annotation data of D. cambodiana, a total of 19 DcDFR genes were unevenly dispersed across six chromosomes (Fig. 1C). Chromosome 2 had the highest number of DcDFRs genes (six). Both chromosomes 5 and 6 contained five DcDFR genes each. Conversely, chromosomes 3, 10, and 11 consisted of a solitary DcDFR gene. Notably, there were five DFR genes on both chromosomes 5 (DcDFR15) and 2 (DcDFR912, 16) that were located within the same region. The distance between these genes ranged from 2942 base pairs to 66,739 base pairs (Table S1). These results suggest that there are two tandemly duplicated gene clusters (TDGCs) of the DFR gene in the D. cambodiana genome; however, further research is required to verify the findings. We temporarily named the DFR gene cluster on chromosome 5 as TDGCs-1 (DcDFR15) and the DFR gene cluster on chromosome 2 as TDGCs-2 (DcDFR912, 16).

To comprehensively understand the evolutionary progression of the DcDFR gene family in D. cambodiana, an in-depth investigation into gene duplication events was conducted. The investigation included both within D. cambodiana itself, and between D. cambodiana and the model plants O. sativa and A. thaliana. A total of five pairs of collinear genes were identified in D. cambodiana through intraspecific collinearity analysis (Fig. 1D). These pairs were located on chromosomes 2, 6, 10, and 19. The identified gene pairs were DcDFR8/DcDFR14, DcDFR8/DcDFR17, and DcDFR14/DcDFR17, as well as two additional gene pairs involving DcDFR7, DcDFR10. The presence of five pairs of collinear genes indicates that the DFR gene family of D. cambodiana has undergone multiple gene duplication events. However, no collinear genes were found within TDGCs-1 or TDGCs-2.

The interspecific collinearity analysis detected 16 collinear gene pairs between D. cambodiana and O. sativa. In contrast, only three collinear gene pairs were identified between D. cambodiana and A. thaliana (Fig. 1E and Table S2). The high collinearity of the DFR gene family observed between D. cambodiana and O. sativa indicates that the gene family has a high degree of conservation among monocotyledonous plants, while the significant differences observed between D. cambodiana and A. thaliana indicate variations in the gene family differentiation pathways between monocotyledonous and dicotyledonous plants.

Gene structure and conserved domains of DcDFR gene family

An analysis was conducted on the genetic structure and conserved motifs of the DcDFR gene family using the D. cambodiana genome database (Fig. 2A, B and C). MEME software was used to predict 10 distinct motifs and TBtools was used to analyze conservation of the DcDFR proteins. The lengths of the conserved motifs ranged from 15 to 50 amino acids (Table 3) and were unevenly distributed among the DcDFR proteins (Fig. 2A). Notably, motif 3 was predominantly located at the N-terminus of most proteins. The most prevalent motifs among the 19 DcDFR proteins were motifs 1 (18 DcDFRs), 2 (18 DcDFRs), 7 (18 DcDFRs) followed by motifs 4 (16 DcDFRs), 3 (14 DcDFRs), 8 (14 DcDFRs), 5 (12 DcDFRs), and 6 (12 DcDFRs). Motif 9 was uncommon (11 DcDFRs). DcDFR12 had the highest number of motifs (16), whereas DcDFR19 had the lowest number of motifs (2). Generally, the presence of diverse conserved motifs could potentially contribute to the functional differentiation among DcDFR proteins (Fig. 2B). The PLN02650 superfamily domain comprised eight motifs, namely motifs 1, 2, 4, 5, 7, 8, 9, and 10. Alterations, such as replacement or loss of these motifs, could lead to changes in the protein function. The FR_SDR_e domain and AR_FR_like_1_SDR_e domain, both belonging to the Short-chain Dehydrogenase/Reductase (SDR) superfamily, contained motifs 2 and 3. The results suggest that these two motifs are highly conserved within the SDR superfamily and are crucial for maintaining the structural and functional integrity of proteins within this family.

Fig. 2.

Fig. 2

Comprehensive structural and functional characterization of the DFR gene family in D. cambodiana. A-B The conserved motif and domain of the DcDFR gene family. C The intron–exon structure of the DcDFR genes. D Predicted structures of cis-acting elements in the promoter region of the DcDFR gene family

Table 3.

Specific conserved motifs of the DcDFR gene family in D. cambodiana

graphic file with name 12870_2026_8486_Tab3_HTML.jpg

Analysis of the structural characteristics of these genes (Fig. 2C) revealed that over 94% (18 out of 19 DcDFRs) of the gene family members contained 3–5 introns and 4–6 exons. The finding indicates that the vast majority of DcDFR genes follow the specific intron–exon combination pattern during splicing. Notably, DcDFR12 deviated from the typical pattern by adopting an alternative splicing mechanism and possessing the highest number of introns (9) and exons (10) within the DcDFR gene family.The differences observed in protein motifs, domains, and gene structures indicate that the DFR gene family of D. cambodiana has certain characteristics of evolutionary complexity and functional diversity. Remarkably, five genes within the TDGCs-1 gene cluster were relatively similar in terms of motifs, domains, and gene structures; however, significant differences were observed among genes in the TDGCs-2 cluster.

Prediction of cis-acting elements of DcDFR gene promoter

Cis-acting elements, which are transcription factor binding sites, play a pivotal role in regulating the initiation of gene transcription. Various cis-acting elements were involved in responses to light, phytohormones, abiotic stress, and plant development (Fig. 2D). A total of 23 cis-acting elements that are functionally associated with light responsiveness were identified (Fig. 2D and Table S3). Specifically, each DcDFR gene contained 5–16 light-responsive elements, including the G-box, Box 4, TCCC-motif, I-box, and GT1-motif, which are known for their roles in light-related transcriptional regulation [55]. Six cis-acting elements were associated with the plant development process. These elements were involved in seed-specific regulation, meristem expression, endosperm expression, zein metabolism regulation, flavonoid biosynthesis gene regulation, and MYBHv1 binding site. The flavonoid biosynthetic gene regulation element was only detected in DcDFR17 (Table S3). Nine cis-acting elements were related to phytohormones, such as elements responsive to gibberellin, abscisic acid, methyl jasmonate, salicylic acid, and auxin. All members of the DcDFR gene family shared this category of phytohormone-related elements, except DcDFR3. Furthermore, all promoters of DcDFR genes contained cis-acting elements involved in abiotic stress response regulation. These cis-acting elements included anaerobic induction, low-temperature responsive, drought-inducibility, wound-responsive, circadian control, as well as defense- and stress-responsive cis-elements. However, the wound-responsive and defense and stress responsive gene regulation cis-elements were only detected in DcDFR10 and DcDFR19, respectively (Table S3). Dragon’s blood synthesis is closely associated with light, hormone, and stress responses, and secondary metabolic processes during Dracaena plant development. Overall, the results of this study showed that the promoter region of the DFR gene in D. cambodiana contains response elements for these processes, indicating that the DFR gene is one of the key factors influencing dragon’s blood synthesis.

Phylogenetic tree construction and prediction of expression characteristics of DFR gene family

To elucidate the classification and evolutionary relationships within the DFR gene family in D. cambodiana, a phylogenetic tree was constructed using 19 proteins (DcDFR1–19) from D. cambodiana and 6 proteins (AtDFR1–6) from A. thaliana (Table S1). According to the results, 25 DFR proteins from the two species were grouped into four subfamilies (Ⅰ–Ⅳ) with 4, 8, 2, and 11 members (Fig. 3). Three subfamilies (Ⅰ, Ⅲ, and Ⅳ) comprised DFR genes from both D. cambodiana and A. thaliana, while subfamily Ⅱ consisted of only two DFR genes. The DFR genes within the TDGCs-1 and TDGCs-2 gene clusters were grouped into the same branch, indicating that the genes within these clusters are evolutionarily homologous.

Fig. 3.

Fig. 3

Phylogenetic tree and prediction of the expression characteristics of the DFR gene family. Red triangles represent the DFR proteins from A. thaliana. A. thaliana leaves were collected from the entire rosette after transitioning to flowering and the stems were from the second internode

To further explore the functions of the DFR gene family, gene expression data of D. cambodiana were obtained from the NCBI database and those of A. thaliana were obtained from the ePlant website. We selected roots, stems, and leaves to generate a heatmap of gene expression data, which was then combined with the phylogenetic tree (Fig. 3). Gene expression analysis revealed that DcDFR15 and DcDFR17 were not expressed in the roots, stems, and leaves of D. cambodiana. However, the expression of the remaining 17 DcDFR genes was upregulated in either the roots or leaves. The expression levels of nine DFR genes (DcDFR1, DcDFR2, DcDFR4, DcDFR5, DcDFR6, DcDFR8, DcDFR11, DcDFR13, and DcDFR14) in D. cambodiana root tissues were higher than those in stem and leaf tissues. The expression levels of five DFR genes (DcDFR7, DcDFR9, DcDFR10, DcDFR12, and DcDFR16) in leaf tissues were higher than those in root and stem tissues. Moreover, a substantial variation was observed in the expression patterns of genes belonging to the same subfamily. The results indicate that the expression of most DFR genes in D. cambodiana has a certain degree of tissue specificity and some of the genes that are sensitive to environmental signals are located in the root and leaf tissues. Genes within the TDGCs-1 and TDGCs-2 clusters were co-expressed, although each had tissue-specific expression. Specifically, TDGCs-1 was co-expressed in roots and TDGCs-2 was co-expressed in leaves.

Expression of DcDFRs in different tissues of D. cambodiana A

D. cambodiana A represents the Yunnan clade, characterized by soft leaves [1] (Fig. 4A). A previous study conducted by our research team revealed that dragon’s blood accumulates both in the roots and damaged stems of D. cambodiana A, and its content in the bark is higher than that in the xylem [56]. Moreover, the study revealed that the expression level of DFR was high in roots and damaged stems. In this study, the expression patterns of the DFR gene family (Fig. 4B and 4C) were analyzed using the same samples used in the previous study [56]. Taking the expression level of the DcDFR gene in young leaves (YL) as a reference, a bar graph that quantitatively depicts the relative gene expression levels across diverse tissues was constructed. The results showed that the expression levels of eight DFR gene family members, namely DcDFR1, DcDFR2, DcDFR3, DcDFR4, DcDFR5, DcDFR13, DcDFR18, and DcDFR19 in the root xylem (XR) were high. The expression levels of four DRF gene family members, namely DcDFR7, DcDFR10, DcDFR12, and DcDFR16 in YL were high. However, the expression levels of DcDFR9, DcDFR11, DcDFR15, and DcDFR17 were not detected in all samples. The variation in gene expression patterns indicates functional divergence among members of the DcDFR gene family, suggesting that DcDFR1, DcDFR2, DcDFR3, DcDFR4, DcDFR5, DcDFR13, DcDFR18, and DcDFR19 could be involved in the biosynthesis of dragon’s blood in D. cambodiana A. The qPCR results showed that genes within the TDGCs-1 and TDGCs-2 clusters were co-expressed and tissue-specific, which is consistent with the predicted results. However, DcDFR9 and DcDFR11 in the TDGCs-2 cluster were not expressed.

Fig. 4.

Fig. 4

Morphology and gene expression in D. cambodiana A tissues. A Morphology of D. cambodiana A tree. B Gene expression levels were normalized according to the column scales and visualized in a heatmap. YL, young leaves; SL, senescent leaves; BUS, bark of the undamaged stems; XUS, xylem of the undamaged stems; BR, bark of the roots; XR, xylem of the roots. C The relative expression levels of DcDFR genes in different tissues. Error bars indicate the standard deviation of three biological replicates; different letters indicate significant differences at P < 0.05 by one-way ANOVA

DcDFR expression and flavonoid content in D. cambodiana A stem after wound induction

To further explore whether the DFR gene is involved in the biosynthetic reaction of dragon’s blood, undamaged and healthy D. cambodiana A stems were selected for wound induction (Fig. 5A). Wound color gradually intensified and turned brown as the induction time progressed. The stems that had been subjected to wound induction for nine days were longitudinally sectioned and distinct brown substances were observed to have formed around the wound. Similarly, methanol extract color changed from light yellow to dark yellow. High performance liquid chromatography (HPLC) analysis revealed that the contents of compounds in the stem increased substantially after nine days of wound induction and loureirin A was detected in the samples after nine days (Fig. 5B and 5C). The total flavonoid content in samples was determined after 0 and 9 d of wound induction (Fig. 5D). The results showed that the total flavonoid content after 9 d of wound induction was significantly (P < 0.05) higher than that after 0 d of wound induction.

Fig. 5.

Fig. 5

Multidimensional analysis of stem responses to wound induction in D. cambodiana A. A The changes in stems and methanol extracts from stems. B-C HPLC analysis of loureirin A/B content in stems after 0, 3, 6, and 9 d of wound induction. SC, standard compounds; “a” and “b” represent loureirin A and B, respectively and the black triangle represents loureirin A peak. The values are means ± standard deviation (SD) of three biological replicates. D The flavonoid content of stems after 0 and 9 d of wound induction. Error bars represent the SD of three biological replicates, P = 0.0198; * denotes P < 0.05 by Student’s test. E Variations in the expression levels of DcDFR genes in D. cambodiana A stems after 0, 3, 6, and 9 d of wound induction. Error bars represent the SD of three biological replicates; different letters indicate significant differences at P < 0.05 by one-way ANOVA. F Gene expression levels were normalized according to the scale of rows and visualized in a heatmap

The expression levels of all DcDFR genes in D. cambodiana A following wound induction were analyzed (Fig. 5E and 5F). The results demonstrated that the expression levels of DcDFR9, DcDFR11, and DcDFR15 were undetectable. The expression levels of the other DcDFR genes varied during the wound induction period (0–9 d). The expression levels of DcDFR1, DcDFR2, DcDFR3, DcDFR4, DcDFR5, DcDFR12, and DcDFR16 increased with an increase in the induction time and their expression patterns were largely consistent when compared to those of the control samples (0 d). The expression levels of DcDFR7, DcDFR8, DcDFR13, DcDFR14, DcDFR18, and DcDFR19 initially decreased and subsequently increased after wounding. These results indicate that DcDFR15 could be potential genes involved in resin formation in D. cambodiana A after wound induction. Genes within the TDGCs-1 and TDGCs-2 clusters were co-expressed in tree stems under wound stress and their expression levels increased with the prolongation of trauma period. However, the expression of DcDFR10 in the TDGCs-2 cluster was suppressed under wound stress. DcDFR9 and DcDFR11 in the TDGCs-2 cluster were not expressed. In addition, DcDFR17 was not expressed in various tissues of D. cambodiana A, but was expressed under wound stress, indicating that DcDFR17 could be associated with wound stress resistance.

Expression of DcDFR genes and dragon’s blood content in different tissues of D. cambodiana B

Research has revealed that D. cambodiana A and D. cambodiana B are two branches of D. cambodiana [1]. To gain a more in-depth understanding of the expression patterns of DcDFR genes in D. cambodiana, D. cambodiana B tissue samples were collected (Fig. 6A). The extract from BR exhibited a red color (Fig. 6A). Loureirin A and B were only detected in BR samples and loureirin A content was significantly (P < 0.01) higher than that of loureirin B (Fig. 6B). From the chromatogram (Fig. 6C), we observed that flavonoids in the methanol extract from BR were more abundant. A comprehensive analysis of the expression levels of DcDFR genes in the samples was performed (Fig. 6D and 6E). The results revealed that the expression patterns of the DcDFR gene family in different D. cambodiana B tissues varied substantially. The expression levels of DcDFR9, DcDFR11, DcDFR15, and DcDFR17 were not detected, which was consistent with the observation made in D. cambodiana A. However, the expression patterns of the other genes varied between D. cambodiana A and D. cambodiana B tissues. Eight genes, DcDFR1, DcDFR2, DcDFR3, DcDFR4, DcDFR5, DcDFR10, DcDFR12, and DcDFR16, had the highest relative expression levels in senescent leaves (SL). DcDFR1, DcDFR2, DcDFR3, DcDFR4, and DcDFR5 exhibited similar expression patterns, with the second-highest expression levels being observed in the bark of roots (BR) and the lowest in the xylem of undamaged stems (XUS). DcDFR13 and DcDFR19 exhibited relatively high expression levels in XUS. The expression levels of DcDFR7 and DcDFR8 were the highest in the barks of undamaged stems (BUS); DcDFR6 had the highest expression level in the xylem of roots (XR); DcDFR18 had the highest expression level in YL; and DcDFR14 had the highest expression level in the bark of roots (BR). The DFR genes within the TDGCs-1 and TDGCs-2 clusters were co-expressed and tissue-specific in D. cambodiana B. However, DcDFR9 and DcDFR11 within the TDGCs-2 cluster were not expressed in D. cambodiana B tissues. Based on the qRT-PCR and HPLC results, dragon’s blood resin contains a large quantity of flavonoids; therefore, we speculated that DcDFR1, DcDFR2, DcDFR3, DcDFR4, DcDFR5, DcDFR7, DcDFR8, and DcDFR14 genes could be involved in dragon’s blood resin synthesis in D. cambodiana B.

Fig. 6.

Fig. 6

Multidimensional analysis of D. cambodiana B tissues. A Morphology of D. cambodiana B tree and characteristics of different tissues, including the morphology of fresh samples, dried powder samples, and methanol extracts of different tissues. YL, young leaves; SL, senescent leaves; BUS, bark of the undamaged stems; XUS, xylem of the undamaged stems; BR, bark of the roots; XR, xylem of the roots. B The contents of loureirin A and B in different tissues. The values are means ± standard deviation (SD) of three biological replicates. C Chromatograms of methanol extracts from different D. cambodiana B tissues. “a” and “b” at the position of the black triangle represent loureirin A and B, respectively. D The relative expression levels of DcDFR genes in different D. cambodiana B tissues. Error bars represent the SD of three biological replicates; different letters indicate significant differences at P < 0.05 by one-way ANOVA. E Gene expression levels were normalized according to the scale of rows and visualized in a heatmap

Expression of DcDFR genes and flavonoid content in D. cambodiana B stems after wound induction

The undamaged and healthy stems of D. cambodiana B were selected for wound induction (Fig. 7A). The color of the wound and methanol extracts was initially similar to those of D. cambodiana A, then gradually intensified, and eventually turned brown as induction time increased. The compound contents increased significantly after 9 d of wound induction (Fig. 7B), but loureirin A and B were not detected in the samples. The results showed that the total flavonoid content at 9 d of wound induction was significantly higher than that at 0 d of wound induction (Fig. 7C).

Fig. 7.

Fig. 7

Variations in gene expression and flavonoid content in D. cambodiana B stems after wound induction. A Variations in the colors of stems and methanol extracts from stems. B HPLC analysis results of methanol extracts from stems 0, 3, 6, and 9 d after wound induction. SC, standard compounds; “a” and “b” represent loureirin A and B, respectively. C Flavonoid content of D. cambodiana stems after 0 and 9 d of wound induction. Error bars represent the standard deviation (SD) of three biological replicates, P = 0.0057; ** denote P < 0.01 by Student’s test. D Variations in DcDFR gene expression levels in D. cambodiana stems after 0, 3, 6, and 9 d of wound induction. Error bars represent the SD of three biological replicates; different letters indicate significant differences at P < 0.05 by one-way ANOVA. E Gene expression levels were normalized according to the scale of rows and visualized in a heatmap

The analysis results of DcDFR gene expression patterns in D. cambodiana B following wound induction are presented in Fig. 7D and 7E. The expression levels of three genes, namely DcDFR9, DcDFR11, and DcDFR15 were not detected after wound induction. The expression levels of DcDFR1, DcDFR2, DcDFR3, DcDFR4, and DcDFR5 generally increased with an increase in wound induction time and exhibited similar expression patterns, which is consistent with the observation made in D. cambodiana A. The expression levels of three genes, namely DcDFR7, DcDFR10, and DcDFR14, decreased significantly after wound induction. The expression levels of DcDFR6, DcDFR8, DcDFR12, DcDFR13, DcDFR16, DcDFR17, DcDFR18, and DcDFR19 initially decreased and then increased after wound induction. These results indicate that DcDFR15 genes are potential genes involved in resin formation in D. cambodiana B after wound induction. DFR genes within the TDGCs-1 and TDGCs-2 clusters were co-expressed in D. cambodiana B stems under wounding stress. However, unlike D. cambodiana A, the expression of genes within the TDGCs-2 cluster was downregulated.

Discussion

The DFR gene plays a key role in modulating anthocyanin synthesis, plant growth, and development [19]. The gene also enhances substantially improve plant resistance to various biotic and abiotic stresses, such as high temperature, drought, salinity, and cold [57, 58]. To date, the DFR gene family has been identified and characterized in numerous plant species, including strawberry [35], rapeseed [59], apple [60], and rice [61]. Nevertheless, a comprehensive and in-depth research on the DFR gene family in the Dracaena genus has not been conducted.

During the evolutionary course of diverse species, gene families commonly undergo tandem duplication or large-scale segmental duplication events, which directly impact the expansion of gene families [62]. According to a previous study, D. cochinchinensis, a close relative of D. cambodiana, has experienced two whole-genome duplication (WGD) events [53]. The karyotypes of D. cambodiana and D. cochinchinensis are 2n = 40 [63, 64]. Therefore, we hypothesized that the WGD event could be a primary driving force behind the expansion and evolution of the DFR gene family in D. cambodiana. The reference genome of D. cambodiana used in this study was a haploid genome (haplotypes A and B). Based on the assembly information, haplotype A database, which had a larger size, a higher assembly rate, and more annotated protein-coding genes, was selected as the reference genome. Five pairs of segmental duplication genes, three pairs of segmental duplication genes among the 19 DFR genes, and two gene pairs, DcDFR7 and DcDFR10 exhibited collinearity with two genes that have not been confirmed to be DFR genes, which were located on chromosomes 19 and 6, respectively (Fig. 1D). A prediction based on the published whole-genome of D. cochinchinensis revealed 31,619 protein-coding genes [53] and GeneID predicted that the D. cambodiana genome contained 53,700 genes [65]. However, the number of protein-coding genes in the selected haplotype A was fewer than that in the published genome. The finding suggests that using a single haploid genome for gene family research still has certain limitations when compared to conducting research within the whole genome context. Therefore, a further comprehensive screening of the DFR gene family within the whole genome context (haplotypes A and B) should be performed to identify new functional DFR genes.

The phenomenon of homologous genes with high sequence similarity existing in clusters is extremely common in the plant genome [66], but relatively few gene clusters involved in secondary metabolic pathways have been identified. In this study, a comprehensive investigation of the DFR gene family in D. cambodiana was conducted. A total of 19 DFR genes, which were distributed on six chromosomes, including 2 DFR gene clusters, were identified from D. cambodiana (Fig. 1C). DFR is a key enzyme gene involved in the flavonoid biosynthetic pathway. The common characteristics of secondary metabolic gene clusters in plants are co-expression, co-regulation, and coordinated transcription [67, 68]. In this study, DcDFR genes within the TDGCs-1 cluster were located in close proximity on chromosome 5 (Fig. 1C) and they exhibited coordinated transcription and co-expression patterns in different D. cambodiana tissues under wounding stress (Figs. 5E and 7D), which demonstrates that they form a gene cluster. The open reading frame sequences of five DcDFR genes in the TDGCs-1 cluster were highly homologous to 74.15% (Figure S2 and Table S1) and their intron–exon structures, as well as the conserved motifs and domains of the encoded proteins were extremely similar (Fig. 2A, 2B and 2C). Phylogenetic analysis also showed that they clustered into one clade (Fig. 3). Therefore, TDGCs-1 is highly likely to be a homologous tandemly duplicated gene cluster (TDGCs) formed by tandem duplication events of the DFR gene in D. cambodiana. In addition, genes in the TDGCs-2 (Locus on chromosome 2 of D. cambodiana) cluster exhibited gene cluster characteristics, such as close genomic locations, co-expression, and clustering together in evolutionary analysis. The physical proximity of genes on the genome can enhance co-expression characteristics of gene clusters [68]. Therefore, we hypothesize that it is also a tandemly duplicated gene cluster. Plant gene clusters are generated through gene duplication, neofunctionalization, and dynamic genomic recombination during evolution, and they exhibit the characteristic of independent transcription [67, 69]. Although the five DFR genes in the TDGCs-2 cluster showed low sequence similarity, conserved motifs and domains of encoded proteins, and intron–exon structures, two pseudogenes were not expressed, which could be due to gene recombination and neofunctionalization during the gene cluster formation. For example, a TDGCs cluster composed of five O-methyltransferase (OMT) genes was identified in the biosynthetic pathway of polymethoxyflavones (PMFs) in citrus, with two of the genes being nonfunctional pseudogenes. Comparative genomic and syntenic analyses have revealed that the OMT cluster can be duplicated from CreOMT6 and it influences the genetic basis of PMF biosynthesis in mandarins through neofunctionalization [54].

TDGCs-2 is a gene cluster composed of three DFR genes and two pseudogenes, with the two pseudogenes being derived from the same transcript (Fig. 1C and Table S1). In different tissues of D. cambodiana A/B, DFR genes within both TDGCs-1 and TDGCs-2 clusters were co-expressed. However, under wounding stress, the TDGCs-2 cluster failed to maintain this co-expression pattern specifically in D. cambodiana A (Figs. 5E and 7D). This phenomenon deviates from the typical characteristic of co-expressed gene clusters, indicating functional differentiation within the cluster. Cis-element analysis revealed that the promoter of DcDFR10 contains more abundant light-, hormone-, and abiotic stress-responsive elements compared to other members of TDGCs-2, which provides a mechanistic insight. Specifically, under normal conditions, a shared core regulatory module mediates the co-expression of cluster members; whereas under wounding stress, DcDFR10 integrates additional environmental signals through its expanded cis-elements, leading to the decoupling of its expression from that of other cluster members. We hypothesize that following the formation of the TDGCs-2 cluster in the ancestral species of Dracaena, functional differentiation of this gene cluster occurred due to divergent positive selection pressures among species within the genus Dracaena [70]. The altered expression pattern of DcDFR10 is not attributed to gene dysfunction but rather represents an evolutionary adaptation. Through the expansion of its cis-regulatory landscape, DcDFR10 has acquired an enhanced capacity to fine-tune its expression in response to specific environmental cues. Under special circumstances, individual genes within the TDGCs-2 cluster may deviate from the typical co-expression pattern [71, 72]. Ultimately, such regulatory plasticity may enhance the plant’s ability to adapt to complex and varying stress conditions. Generally, an incompletely co-regulated gene cluster is a multifunctional biosynthetic gene cluster that contains at least two partially overlapping gene clusters [73, 74, 75]. Therefore, further research is required to determine whether there are other flavonoid biosynthesis genes associated with the TDGCs-2 gene cluster. If they exist, it will be conducive to determine the unique flavonoid biosynthetic pathways involved in defense response in Dracaena.

Oxidative burst, a hallmark early plant defense response, rapidly induces substantial accumulation of reactive oxygen species (ROS) following wound stress. During dragon’s blood formation in D. cochinchinensis, ROS functions as a central signaling molecule that activates downstream transcription factors, upregulates the flavonoid biosynthetic pathway, and promotes flavonoid accumulation [53]. The DcDFR17 gene exhibits a strict wound-specific expression pattern, with no expression detected in healthy tissues. Its promoter region contains three antioxidant response elements (AREs) and one flavonoid biosynthetic regulatory element (MBSI) (Fig. 2D and Table S2). AREs are recognized by transcription factors activated under oxidative stress, indicating that DcDFR17 expression is likely regulated by wound-induced ROS signaling. Concurrently, the MBSI element may participate in modulating flavonoid metabolism. We hypothesize that this gene not only contributes to flavonoid-mediated defense responses and dragon’s blood synthesis but also aids in maintaining redox homeostasis. Injury-induced ROS signals activate DcDFR17 expression via these cis-regulatory elements, thereby promoting flavonoid accumulation and enhancing the plant’s pathogen defense and ROS scavenging capacities. Thus, DcDFR17 may serve as a key integrator of stress signals and metabolic regulation, specifically participating in the formation of dragon’s blood in D. cambodiana. Dragon’s blood predominantly comprises flavonoids, which are a highly conserved class of compounds derived from the 2-phenylchromone nucleus [20]. Loureirin A, B, C, and D belong to the chalcone family and represent the primary active constituents, as well as indicator compounds for the quality control of dragon’s blood [76, 77]. According to a previous study, during the induction of dragon’s blood, the synthesis timings of loureirin A, C, and D precede that of loureirin B and the distribution patterns of flavonoid compounds vary at different time points following wounding stress [52]. Chalcones and flavones are mainly enriched in the short term and early to middle phases (within10 d) post wounding, whereas flavonols and isoflavones are predominantly concentrated in the later part of the middle period [52]. This implies that the genes implicated in flavonoid biosynthesis may also exhibit temporal disparities. In this study, the expression levels of five genes DcDFR15 (TDGCs-1) in D. cambodiana A and B were initially substantially upregulated and then remained unchanged until 9 d after wound stress induction. Conversely, the expression levels of most of the other DcDFR genes decreased considerably 3 d after wound stress, with some showing signs of recovery at 9 d after wound stress induction. This phenomenon could be attributed to functional divergence among the 19 DFR genes. As the stress duration increased, the genes involved in the middle and later stages of flavonoid biosynthesis restored their catalytic activity. DcDFR15 (TDGCs-1) could be involved in the specific flavonoid biosynthetic pathway in Dracaena species under wound stress and the final metabolites are associated with the defensive responses of Dracaena to wound stress. In plants, gene clusters formed by biosynthetic genes are typically involved only in the metabolic pathway of one or a class of compounds, rather than broader metabolic networks [78]. Loureirin A, B, C, and D are specific plant metabolites in the genus Dracaena. However, further studies should be conducted to determine whether the TDGCs-1 and TDGCs-2 gene clusters in D. cambodiana are involved in the metabolism of loureirin A, B, C, and D.

In summary, 19DFRgenes were identified from the reference genome ofD. cambodianaand the genes were distributed on six chromosomes, including two tandemly duplicated gene clusters, TDGCs-1 and TDGCs-2. The TDGCs-1 cluster exhibited varying co-expression patterns in different D. cambodianaA and B tissues andstems after wound stress, suggesting its involvement in the special flavonoid metabolism and defense response in D. cambodianaexposed to wound stress. However, genes within the TDGCs-2 cluster did not follow the typical co-expression pattern whenD. cambodianaA was subjected to wound stress, which could be caused by functional divergence of the gene cluster under positive selection pressure.

Conclusion

In this study, 19 DFR genes were identified from the reference genome of D. cambodiana and the genes were distributed on six chromosomes, including two tandemly duplicated gene clusters, TDGCs-1 and TDGCs-2. The TDGCs-1 cluster exhibited varying co-expression patterns in different D. cambodiana A and B tissues and stems after wound stress, suggesting its involvement in the special flavonoid metabolism and defense response in D. cambodiana exposed to wound stress. However, genes within the TDGCs-2 cluster did not follow the typical co-expression pattern when D. cambodiana A was subjected to wound stress, which could be caused by functional divergence of the gene cluster under positive selection pressure.

Materials and methods

Plant materials

The plant materials used in this study were sourced from six healthy 30-year-old trees, including three D. cambodiana A trees (Fig. 4A) and three D. cambodiana B trees (Fig. 6A) that were both cultivated at the Yunnan Branch of the Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Jinghong (22.0058°, 100.7885°), Yunnan, China, and all plants were formally identified by Professor Haitao Li. Roots, stems, and leaves were collected from the two types of trees [56]. For the wound induction experiment, holes were punched on the stems of trees. Each hole measured approximately 5 mm × 5 mm and the spacing between two adjacent holes was approximately 5 cm. Samples were collected at 0, 3, 6, and 9 d after wound induction. All samples were collected with three biological replicates. Subsequently, the samples were immediately frozen in liquid nitrogen and stored at -80℃ to preserve their integrity for subsequent analysis.

Data sources

The protein sequences of A. thaliana [35] and O. sativa [61] were downloaded from TAIR (http://www.arabidopsis.org/) and NCBI database (https://www.ncbi.nlm.nih.gov/), respectively. The associated genome assembly and gene annotation files were downloaded from Ensembl Plants website (https://plants.ensembl.org/, accessed on 24 February, 2025). Genome data, annotation files, and RNA-Seq data of D. cambodiana were obtained from Genome Warehouse in the National Genomics Data Center (NGDC), Beijing Institute of Genomics, Chinese Academy of Sciences/China National Center for Bioinformation (https://ngdc.cncb.ac.cn/), under the BioProject accession number PRJCA021348 [64].

Identification and characterization of DFR gene family in D. cambodiana

We used the AtDFR and OsDFR protein sequences to screen the D. cambodiana database using BLASTP (blast-2.15.0+) [79]. The screening led to the identification of 34 DFR proteins (Fig. 1A). Thereafter, TBtools (v2.302) was used to extract the sequences of the 34 proteins [80]. A further screening of these proteins was based on the annotation files and the functional domain (FR_SDR_e) available on the Conserved Domain Database (CDD) of the NCBI website (https://www.ncbi.nlm.nih.gov/cdd/, accessed on 16 January, 2025) [81], as well as the conserved protein-family domain (Pfam_pf01370) from the Pfam website (https://pfam.xfam.org/, accessed on 10 January, 2025) [82]. Ultimately, a total of 19 DFR genes of D. cambodiana were successfully identified. The ExPASy ProtParam online tool (https://web.expasy.org/protparam/, accessed on 21 February, 2025) was used to analyze the amino acid length, molecular weight, theoretical isoelectric point, aliphatic index, and grand average of hydrophobicity of DcDFR proteins [83]. The secondary structural features of the DcDFR protein family were analyzed using the SOPMA algorithm (http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_sopma.html, accessed on 24 February, 2025) [84], and the 3D structure models of the DcDFR proteins were predicted using SWISS-MODEL (https://swissmodel.expasy.org/, accessed on 11 February, 2025) [85].

Subcellular localization, chromosome localization, and collinearity analysis

WoLF PSORT (https://wolfpsort.hgc.jp/, accessed on 28 February, 2025) was used to predict the subcellular localization of DcDFR proteins [86]. Gene Location Visualize from GTF/FFF function in TBtools was used to map the distribution of genes on chromosomes. DNA and annotation files of A. thaliana, O. sativa, and D. cambodiana were downloaded. Interspecies and intraspecies collinearity were generated using One Step MCScanX and Advanced Circos functions integrated in TBtools.

Analysis of gene structure, conserved motifs, and conserved domains of DcDFR gene family

MEME (http://meme-suite.org/, accessed 8 March, 2025) was used to analyze the conserved motifs of DcDFR proteins, with the number of motifs being set to 10 [87]. The CDD of the NCBI (https://www.ncbi.nlm.nih.gov/cdd/, accessed on 23 February, 2025) was used to analyze conserved domains. Gene structure, conserved motifs, and conserved domains of DcDFR genes were visualized using Gene Structure View (Advanced) function in TBtools.

Analysis of Cis-acting elements in DcDFR gene promoter region

The Gtf/Gff3 Sequence Extract function in TBtools was used to extract the 2000-bp upstream region of the coding sequences (CDs) as the gene promoter region. The promoter sequences were submitted to the PlantCARE database (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/, accessed 4 March, 2025) for the analysis of cis-acting elements within the promoter region. The resulting data were visualized using the Simple BioSequence Viewer function in TBtools.

Evolutionary analysis and tissue-specific expression of DcDFR gene family

MEGA-X was used to construct a phylogenetic tree for the DFR genes in A. thaliana and D. cambodiana [88]. The full-length sequences of DFR proteins were aligned using ClustalW and a phylogenetic tree was constructed using the neighbor-joining (NJ) method with a bootstrap value set at 1000. Gene expression data for A. thaliana and D. cambodiana were downloaded from the Ensembl Plants (http://plants.ensembl.org/index.html, accessed on 24 February, 2025) and NGDC (https://ngdc.cncb.ac.cn/, accessed on 23 February, 2025) databases, respectively. Gene expression data for three tissues, including root, stem, and leaf tissues were imported into iTOL (https://itol.embl.de/, accessed 12 March, 2025) and combined with a phylogenetic tree. Afterward, iTOL was used to beautify the phylogenetic tree [89].

RNA extraction and preparation of cDNA test samples

Total RNA was extracted from plant samples using an RNAprep Pure Plant Plus Kit (DP441, Tiangen Biotech, Beijing, China), according to the manufacturer’s instructions. The quality of the extracted RNA was assessed using 1.5% agarose gel electrophoresis and its concentration was determined using a micro-spectrophotometer (NanoDrop One, Thermo Fisher Scientific, USA). The cDNA library was constructed using high quality RNAs and PrimeScript FAST RT Reagent Kit with gDNA Eraser (RR092A, TaKaRa Bio Inc., Shiga, Japan).

Quantitative RT-PCR (qRT-PCR) analysis

Gene-specific primers (Table S4) for qRT-PCR were designed using Primer Premier 6.0 (PREMIER Biosoft International, Palo Alto, CA, USA) and the actin gene was used as the reference gene. qRT-PCR was performed on a CFX96 Touch Real-Time PCR Detection System (Bio-Rad Laboratories Inc., Hercules, CA, USA) using TB Green Premix Ex Taq II (RR820A, TaKaRa Bio Inc., Shiga, Japan). The PCR program was set as follows: initial denaturation at 95 °C for 30 s, followed by 40 cycles of 5 s at 95 °C for denaturation, and 30 s at 60 °C for annealing and extension. Gene expression levels were calculated using the 2−∆∆Ct method. GraphPad Prism 8.0 (GraphPad Software Inc., La Jolla, CA, USA) was used to generate graphs.

Extraction of flavonoids

Fresh samples were dried at 60 °C overnight, ground into powder, and sieved through a 60-mesh sieve. Thereafter, 0.1 g of each sample was weighed, 15 mL of methanol was added, and ultrasonic extraction was performed at 25 °C for 15 min. The extraction process was repeated three times. Subsequently, the solvent extracts were combined and methanol was removed using a rotary evaporator at 45°C [90]. The residue was eluted with methanol and the volume was adjusted to 2 mL. The solution was then passed through a 0.22 μm filter membrane in preparation for loureirin A and B, and total flavonoid analysis.

Loureirin A and B content analysis

HPLC was used to analyze Loureirin A and B contents. HPLC was performed at 25 °C using a Shimadzu LC-2030 C System (Shimadzu, Kyoto, Japan) coupled with a ZORBAX SB-C18 column (5 μm, 4.6 mm × 250 mm; Agilent Technologies, Santa Clara, CA, USA) and a DAD detector. The mobile phase consisted of a gradient of 30% acetonitrile–0.3% acetic acid in water (A) and acetonitrile (B). The initial composition was 100% (A), which changed to 80% (A) at 5 min and was held at this ratio until after 30 min. The flow rate was set at 1.0 mL/min, the injection volume was 10 µL, and the compounds were detected at a wavelength of 278 nm [90]. The standard compounds, loureirin A and B, were obtained from the National Institute for Food and Drug Control (Beijing, China). The concentration/absorbance linear regression equations for loureirin A and B were y=38276x-8635.7 (R2 = 0.9999, 0.05–100 µg/mL) and y=24312x + 131.67 (R2 = 0.9999, 0.08–10.0 µg/mL), respectively. The contents of loureirin A and B were calculated and expressed as µg per gram of dry sample (µg/g). Origin 2024 (OriginLab Corp., Northampton, MA, USA) was used to plot chromatograms.

Total flavonoid content analysis

The total flavonoid content was determined using the methods described in Pharmacopoeia of the People’s Republic of China [91]. Briefly, 1 mL of the methanol extract was transferred into a test tube and double-distilled water was added to make the volume up to 5 mL. Thereafter, 0.3 mL of 5% sodium nitrite solution was added to the test tube, the solution was mixed thoroughly, and left to stand for 6 min. Afterward, 0.3 mL of 10% aluminum nitrate solution was added, the solution was shaken thoroughly, and 4.4 mL of 4% sodium hydroxide solution was added after 6 min. The solution was left to stand for 15 min and absorbance was measured at a wavelength of 510 nm. As a standard compound, rutin was obtained from the National Institute for Food and Drug Control (Beijing, China) and the concentration/absorbance linear regression equation was y = 0.0114x + 0.0031 (R2 = 0.9995, 5–30 µg/mL). The total flavonoid content was calculated and expressed as mg per gram of dry sample (mg/g). GraphPad Prism 8.0 (GraphPad Software Inc., La Jolla, CA, USA) was used to generate graphs.

Identification of tandemly duplicated genes clusters

Tandemly duplicated genes clusters (TDGCs) or clusters of tandemly duplicated genes have traditionally been identified subjectively as genomic neighborhoods containing several gene duplicates in close proximity [92]. Generally, tandemly duplicated gene clusters contain homologous genes [54]. In this study, we defined TDGCs as homologous genes located on the same chromosome with an intervening distance of no more than 10-kb between adjacent genes.

Statistical analysis

All data are presented as means ± standard deviation (SD) and the results are derived from three independent replicates. Unpaired Student’s t-test was performed to compare two groups and analysis of variance was performed to compare several groups. All statistical analyses were performed using IBM SPSS Statistics 22.0 (IBM Corp., Armonk, NY, USA). Statistical significance was set at *P < 0.05, **P < 0.01.

Supplementary Information

Supplementary Material 1. (445.2KB, pdf)
12870_2026_8486_MOESM2_ESM.xlsx (23.5KB, xlsx)

Supplementary Material 2. Table S1: DFR gene family information of Arabidopsis thaliana, Oryza sativa and Dracaena cambodiana, related to Figure 1.

12870_2026_8486_MOESM3_ESM.xlsx (10.8KB, xlsx)

Supplementary Material 3. Table S2: Collinear pairs genes of D. cambodiana with O. sativa or A. thaliana, related to Figure 1E.

12870_2026_8486_MOESM4_ESM.xlsx (14.9KB, xlsx)

Supplementary Material 4. Table S3: Predicted cis-acting element structures in promoter region of DcDFR gene family, related to Figure 2D.

12870_2026_8486_MOESM5_ESM.xlsx (11.9KB, xlsx)

Supplementary Material 5. Table S4. Primer sequences of qRT-PCR, related to Figure 4–7.

Acknowledgements

This work was financially supported by Yunnan Science and Technology Talents and Platform Program (202105AD160054), the Selection Special Programme of Yunnan Province High-level Technological Talents and Innovative Teams (202405AS350020), the Yunnan Key Research and Development Program (202403AK140014), the Xishuangbanna Prefecture Science and Technology Plan Project (202401001).

Authors’ contributions

Shuang Li: Conceptualization, performed most of the experiments, writing – original draft. Hongyou Zhao, Chunyong Yang, Yanfang Wang: assisted in some experiments, methodology and data curation, formal analysis. Yating Zhu, Qianxia Li, Ge Li, Lixia Zhang, Zhaoyou Deng, Ling Wang: Visualization and resources. Yanqian Wang: Conceptualization, writing – review and editing, project administration, funding acquisition.

Data availability

The data presented in this study are available on request from the corresponding author.

Declarations

Ethics approval and consent to participate

All plant materials comply with local institutional guidelines and legislation.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Xin Y, Tan Y, Xin P, Li H, Zhang S, Tang J, et al. Taxonomical clarification of Dracaena cambodiana Pierre ex Gagnep., the source plant of Chinese Resina Draconis. Plant Sci J. 2024;42(5):572–81. [Google Scholar]
  • 2.Zheng D, Xie L, Wang Y, Zhang Z, Zhang W. Research advancesin drangon’s blood plants in China. Chin Wild Plant Resour. 2009;28(6):15–20. [Google Scholar]
  • 3.Cai X, Xu Z. Research on the plant resources of domestic dragon’s blood. Acta Bot Yunnanica. 1979;1(2):1–10. [Google Scholar]
  • 4.Gupta D, Bleakley B, Gupta RK. Dragon’s blood: botany, chemistry and therapeutic uses. J Ethnopharmacol. 2008;115:361–80. [DOI] [PubMed] [Google Scholar]
  • 5.Chen X, Zhao Y, Yang A, Tian Y, Pang D, Sun J, et al. Chinese dragon’s blood EtOAc extract inhibits liver cancer growth through downregulation of Smad3. Front Pharmacol. 2020;11:669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zhang L, Ke W, Zhao X, Lu Z. Resina Draconis extract exerts anti-HCC effects through METTL3‐m6A‐Survivin axis. Phytother Res. 2022;36:2542–57. [DOI] [PubMed] [Google Scholar]
  • 7.Hong Y, Sun X, Lu L. Loureirin B inhibits cervical cancer development by blocking PI3K/AKT signaling pathway: Network pharmacology analysis and experimental validation. Appl Biochem Biotechnol. 2024;196:8587–604. [DOI] [PubMed] [Google Scholar]
  • 8.Yang P-X, Fan X-X, Liu M-X, Zhang X-Z, Cao L, Wang Z-Z, et al. Longxuetongluo Capsule alleviate ischemia/reperfusion induced cardiomyocyte apoptosis through modulating oxidative stress and mitochondrial dysfunction. Phytomedicine. 2024;134:155993. [DOI] [PubMed] [Google Scholar]
  • 9.Li C, Zhang Y, Wang Q, Meng H, Zhang Q, Wu Y, et al. Dragon’s blood exerts cardio-protection against myocardial injury through PI3K-AKT-mTOR signaling pathway in acute myocardial infarction mice model. J Ethnopharmacol. 2018;227:279–89. [DOI] [PubMed] [Google Scholar]
  • 10.Yang L, Ran Y, Quan Z, Wang R, Yang Q, Jia Q, et al. Pterostilbene, an active component of the dragon’s blood extract, acts as an antidepressant in adult rats. Psychopharmacol Biol Approach. 2019;236:1323–33. [DOI] [PubMed] [Google Scholar]
  • 11.Yang Y, Mouri A, Lu Q, Kunisawa K, Kubota H, Hasegawa M, et al. Loureirin C and xanthoceraside prevent abnormal behaviors associated with downregulation of brain derived neurotrophic factor and AKT/mTOR/CREB signaling in the prefrontal cortex Induced by chronic corticosterone exposure in mice. Neurochem Res. 2022;47:2865–79. [DOI] [PubMed] [Google Scholar]
  • 12.Kunisawa K, Shan J, Lu Q, Yang Y, Kosuge A, Kurahashi H, et al. Loureirin C and xanthoceraside attenuate depression-like behaviors and expression of interleukin-17 in the prefrontal cortex Induced by chronic unpredictable mild stress in mice. Neurochem Res. 2022;47:2880–9. [DOI] [PubMed] [Google Scholar]
  • 13.Nguyen JK, Masub N, Jagdeo J. Bioactive ingredients in Korean cosmeceuticals: Trends and research evidence. J Cosmet Dermatol. 2020;19:1555–69. [DOI] [PubMed] [Google Scholar]
  • 14.Haji A, Shahmoradi GF, Mohammadi L. Dyeing of polyamide 6 fabric with new bio-colorant and bio-mordants. Environ Sci Pollut Res. 2022;30:37981–96. [DOI] [PubMed] [Google Scholar]
  • 15.National Forestry and Grassland Administration, Ministry of Agriculture and Rural Affairs of the People’s Republic of China. List of Wild Plants under State Key Protection. 2021, https://www.gov.cn/zhengce/zhengceku/2021-09/09/content_5636409.htm.
  • 16.Fan J-Y, Yi T, Sze-To C-M, Zhu L, Peng W-L, Zhang Y-Z, et al. A systematic review of the botanical, phytochemical and pharmacological profile of Dracaena cochinchinensis, a plant source of the ethnomedicine Dragon’s. Blood Molecules. 2014;19:10650–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Helal IE, Elsbaey M, Zaghloul AM, Mansour E. -S.S. A unique C-linked chalcone-dihydrochalcone dimer from Dracaena cinnabari resin. Nat Prod Res. 2019;35:2558–63. [DOI] [PubMed] [Google Scholar]
  • 18.Sun J, Liu J-N, Fan B, Chen X-N, Pang D-R, Zheng J, et al. Phenolic constituents, pharmacological activities, quality control, and metabolism of Dracaena species: A review. J Ethnopharmacol. 2019;244:112138. [DOI] [PubMed] [Google Scholar]
  • 19.Liu W, Feng Y, Yu S, Fan Z, Li X, Li J, et al. The flavonoid biosynthesis network in plants. Int J Mol Sci. 2021;22:12824. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Nabavi SM, Šamec D, Tomczyk M, Milella L, Russo D, Habtemariam S, et al. Flavonoid biosynthetic pathways in plants: Versatile targets for metabolic engineering. Biotechnol Adv. 2020;38:107316. [DOI] [PubMed] [Google Scholar]
  • 21.Kang Y, Peng J, Cai W, Liu W, Luo J, Wu Q. Research progress on flavonoid metabolic synthesis pathway and related function genes in medicinal plants. Chin Tradit Herb Drugs. 2014;45(9):1336–41. [Google Scholar]
  • 22.Zhao Y, Yang X, Zhao X, Zhong Y. Research progress on regulation of plant flavonoids biosynthesis. Sci Technol Food Ind. 2021;42(21):454–63. [Google Scholar]
  • 23.Qi Y, Wei H, Gu W, Shi W, Jiang L, Deng L, et al. Transcriptome profiling provides insights into the fruit color development of wild Lycium ruthenicum Murr. from Qinghai–Tibet Plateau. Protoplasma. 2021;258:33–43. [DOI] [PubMed] [Google Scholar]
  • 24.Qin S, Liu Y, Cui B, Cheng J, Liu S, Liu H. Isolation and functional diversification of dihydroflavonol 4-Reductase gene HvDFR from Hosta ventricosa indicate its role in driving anthocyanin accumulation. Plant Signal Behav. 2022;17(1):e2010389. [DOI] [PMC free article] [PubMed]
  • 25.Luan Y, Chen Z, Tang Y, Sun J, Meng J, Tao J, et al. Tree peony PsMYB44 negatively regulates petal blotch distribution by inhibiting dihydroflavonol-4-reductase gene expression. Ann Bot. 2023;131:323–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Feng X, Zhang Y, Wang H, Tian Z, Xin S, Zhu P. The dihydroflavonol 4-reductase BoDFR1 drives anthocyanin accumulation in pink-leaved ornamental kale. Theor Appl Genet. 2020;134:159–69. [DOI] [PubMed] [Google Scholar]
  • 27.Zhang Y, Feng X, Liu Y, Zhou F, Zhu P. A single-base insertion in BoDFR1 results in loss of anthocyanins in green-leaved ornamental kale. Theor Appl Genet. 2022;135:1855–65. [DOI] [PubMed] [Google Scholar]
  • 28.Ruan H, Shi X, Gao L, Rashid A, Li Y, Lei T, et al. Functional analysis of the dihydroflavonol 4-reductase family of Camellia sinensis: exploiting key amino acids to reconstruct reduction activity. Hortic Res. 2022;9:uhac098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Hua C, Linling L, Shuiyuan C, Fuliang C, Feng X, Honghui Y et al. Molecular cloning and characterization of three genes encoding dihydroflavonol-4-reductase from Ginkgo biloba in anthocyanin biosynthetic pathway. PLoS ONE. 2013;8(8):e72017. [DOI] [PMC free article] [PubMed]
  • 30.Xie DY, Jackson LA, Cooper JD, Ferreira D, Paiva NL. Molecular and biochemical analysis of two cDNA clones encoding dihydroflavonol-4-reductase from Medicago truncatula. Plant Physiol. 2004;134:979–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Sun W, Zhou N, Wang Y, Sun S, Zhang Y, Ju Z, et al. Characterization and functional analysis of RdDFR1 regulation on flower color formation in Rhododendron delavayi. Plant Physiol Biochem. 2021;169:203–10. [DOI] [PubMed] [Google Scholar]
  • 32.Chen X, Liu W, Huang X, Fu H, Wang Q, Wang Y et al. Arg-type dihydroflavonol 4-reductase genes from the fern Dryopteris erythrosora play important roles in the biosynthesis of anthocyanins. PLoS ONE. 2020;15(5):e0232090. [DOI] [PMC free article] [PubMed]
  • 33.Haselmair-Gosch C, Miosic S, Nitarska D, Roth BL, Walliser B, Paltram R, et al. Great cause-small effect: undeclared genetically engineered orange petunias harbor an inefficient dihydroflavonol 4-reductase. Front. Plant Sci. 2018;9:149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Vainio J, Mattila S, Abdou SM, Sipari N, Teeri TH. Petunia dihydroflavonol 4-reductase is only a few amino acids away from producing orange pelargonidin-based anthocyanins. Front Plant Sci. 2023;14:1227219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Chen L-Z, Tian X-C, Feng Y-Q, Qiao H-L, Wu A-Y, Li X, et al. The genome-wide identification of the dihydroflavonol 4-reductase (DFR) gene family and its expression analysis in different fruit coloring stages of strawberry. Int J Mol Sci. 2024;25:9911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Chen L, Wang X, Cui L, Li Y, Liang Y, Wang S, et al. Transcriptome and metabolome analyses reveal anthocyanins pathways associated with fruit color changes in plum (Prunus salicina Lindl). PeerJ. 2022;10:e14413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Liao L, Li Y, Lan X, Yang Y, Wei W, Ai J, et al. Integrative analysis of fruit quality and anthocyanin accumulation of plum cv. ‘Cuihongli’ (Prunus salicina Lindl.) and its bud mutation. Plants. 2023;12:1357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Ni J, Zhang N, Zhan Y, Ding K, Qi P, Wang X, et al. Transgenic tobacco plant overexpressing ginkgo dihydroflavonol 4-reductase gene GbDFR6 exhibits multiple developmental defects. Front Plant Sci. 2022;13:1066736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Jan R, Aaqil Khan M, Asaf S, Lubna, Park J-R, Lee I-J, Kim K-M. Flavonone 3-hydroxylase relieves bacterial leaf blight stress in rice via overaccumulation of antioxidant flavonoids and induction of defense genes and hormones. Int J Mol Sci. 2021;22:6152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Zhang L, Liu Y, Zhang Z, Fang S. Physiological response and molecular regulatory mechanism reveal a positive role of nitric oxide and hydrogen sulfide applications in salt tolerance of Cyclocarya paliurus. Front Plant Sci. 2023;14:1211162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Wang Y, Jiang W, Li C, Wang Z, Lu C, Cheng J, et al. Integrated transcriptomic and metabolomic analyses elucidate the mechanism of flavonoid biosynthesis in the regulation of mulberry seed germination under salt stress. BMC Plant Biol. 2024;24:132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Wang Y, Zhang M, Li X, Zhou R, Xue X, Zhang J, et al. Overexpression of the wheat TaPsb28 gene enhances drought tolerance in transgenic Arabidopsis. Int J Mol Sci. 2023;24:5226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Wang H, Xu W, Zhang X, Wang L, Jia S, Zhao S, et al. Transcriptomics and metabolomics analyses of Rosa hybrida to identify heat stress response genes and metabolite pathways. BMC Plant Biol. 2024;24:874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Wang F, Li Z, Wu Q, Guo Y, Wang J, Luo H, et al. Floral response to heat: a study of color and biochemical adaptations in purple chrysanthemums. Plants. 2024;13:1865. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Long Y, Shen C, Lai R, Zhang M, Tian Q, Wei X, et al. Transcriptomic and metabolomic analysis reveals the potential roles of polyphenols and flavonoids in response to sunburn stress in chinese olive (Canarium album). Plants. 2024;13:2369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Majumder A, Kanti Mondal S, Mukhoty S, Bag S, Mondal A, Begum Y, et al. Virtual screening and docking analysis of novel ligands for selective enhancement of tea (Camellia sinensis) flavonoids. Food Chem :X. 2022;13:100212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Shen B, Zhang Z, Shi Q, Du J, Xue Q, Li X. Active compound analysis of Ziziphus jujuba cv. Jinsixiaozao in different developmental stages using metabolomic and transcriptomic approaches. Plant Physiol Biochem. 2022;189:14–23. [DOI] [PubMed] [Google Scholar]
  • 48.Pei D, Ren Y, Yu W, Zhang P, Dong T, Jia H, et al. The roles of brassinosteroids and methyl jasmonate on postharvest grape by regulating the interaction between VvDWF4 and VvTIFY 5 A. Plant Sci. 2023;336:111830. [DOI] [PubMed] [Google Scholar]
  • 49.Wang X-H, Zhang C, Yang L-L, Gomes-Laranjo J. Production of dragon’s blood in Dracaena cochinchinensis plants by inoculation of Fusarium proliferatum. Plant Sci. 2011;180:292–9. [DOI] [PubMed] [Google Scholar]
  • 50.Zhu J-H, Cao T-J, Dai H-F, Li H-L, Guo D, Mei W-L, et al. De Novo transcriptome characterization of Dracaena cambodiana and analysis of genes involved in flavonoid accumulation during formation of dragon’s blood. Sci Rep. 2016;6:38315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Sun H, Song M, Zhang Y, Zhang Z. Transcriptome profiling reveals candidate flavonoid-related genes during formation of dragon’s blood from Dracaena cochinchinensis (Lour.) S.C.Chen under conditions of wounding stress. J Ethnopharmacol. 2021;273:113987. [DOI] [PubMed] [Google Scholar]
  • 52.Liu Y, Gao S, Zhang Y, Zhang Z, Wang Q, Xu Y, et al. Transcriptomics and metabolomics analyses reveal defensive responses and flavonoid biosynthesis of Dracaena cochinchinensis (Lour.) S. C. Chen under wound stress in natural conditions. Molecules. 2022;27:4515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Xu Y, Zhang K, Zhang Z, Liu Y, Lv F, Sun P, et al. A chromosome-level genome assembly for Dracaena cochinchinensis reveals the molecular basis of its longevity and formation of dragon’s blood. Plant Commun. 2022;3:100456. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Peng Z, Song L, Chen M, Liu Z, Yuan Z, Wen H et al. Neofunctionalization of an OMT cluster dominates polymethoxyflavone biosynthesis associated with the domestication of citrus. PNAS. 2024;121(14):e2321615121. [DOI] [PMC free article] [PubMed]
  • 55.Fu J, Xu Q. Transcriptional regulatory mechanisms of carotenoid and anthocyanin metabolism in response to light signaling in plants. J Huazhong Agric Univ. 2021;40(1):1–11. [Google Scholar]
  • 56.Wang Y, Li S, Yang C, Wang Y, Peng J, Li G, et al. Concentration and gene expression analyses of dragon’s blood flavonoids in different tissues of Dracaena cochinchinensis. CERNE. 2022;28:e–102945. [Google Scholar]
  • 57.Zhu S, Zheng M, Tian T, Yu J, Zhao Q, Huang S, et al. Research progress on plant secondary metabolism and regulation. Plant Physiol J. 2023;59(12):2188–216. [Google Scholar]
  • 58.Ge S, Zhang X, Han W, Li Q, Li X. Research progress on plant flavonoids biosynthesis and their anti-stress mechanism. Acta Hortic Sin. 2023;50(1):209–24. [Google Scholar]
  • 59.Qian X, Zheng W, Hu J, Ma J, Sun M, Li Y, et al. Identification and expression analysis of DFR Gene family in Brassica napus L. Plants. 2023;12:2583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Li W-F, Gao J, Ma Z-H, Hou Y-J, Li X, Mao J, et al. Molecular evolution and expression assessment of DFRs in apple. Chem Biol Technol Agric. 2023;10:98. [Google Scholar]
  • 61.Wang J, Zhang C, Li Y. Genome-wide identification and expression profiles of 13 key structural gene families involved in the biosynthesis of rice flavonoid scaffolds. Genes. 2022;13:410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Cannon SB, Mitra A, Baumgarten A, Young ND, May G. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 2004;4(10):1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Zhao H, Li S, Yang C, Li G, Wang Y, Peng J, et al. FISH-based karyotype analyses of four Dracaena species. Cytogenet Genome Res. 2021;161:272–7. [DOI] [PubMed] [Google Scholar]
  • 64.Chen B-Z, Li D-W, Wang W-J, Xin Y-X, Wang W-B, Li X-Z, et al. Chromosome-level and haplotype-resolved genome assembly of Dracaena cambodiana (Asparagaceae). Sci Data. 2024;11:873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Bandapalli OR, Ding X, Mei W, Huang S, Wang H, Zhu J et al. Genome survey sequencing for the characterization of genetic background of Dracaena cambodiana and its defense response during dragon’s blood formation. PLoS ONE. 2018;13(12):e0209258. [DOI] [PMC free article] [PubMed]
  • 66.Richard G-F, Kerrest A, Dujon B. Comparative Genomics and Molecular Dynamics of DNA Repeats in Eukaryotes. Microbiol Mol Biol Rev. 2008;72(4):686–727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Nützmann H-W, Scazzocchio C, Osbourn A. Metabolic gene clusters in eukaryotes. Annu Rev Genet. 2018;52:159–83. [DOI] [PubMed] [Google Scholar]
  • 68.Ghanbarian AT, Hurst LD. Neighboring genes show correlated evolution in gene expression. Mol Biol Evol. 2015;32:1748–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Liu Z, Cheema J, Vigouroux M, Hill L, Reed J, Paajanen P, et al. Formation and diversification of a paradigm biosynthetic gene cluster in plants. Nat Commun. 2020;11:5354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Sue M, Nakamura C, Nomura T. Dispersed benzoxazinone gene cluster: molecular characterization and chromosomal localization of glucosyltransferase and glucosidase genes in wheat and rye. Plant Physiol. 2011;157:985–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Takos AM, Knudsen C, Lai D, Kannangara R, Mikkelsen L, Motawia MS, et al. Genomic clustering of cyanogenic glucoside biosynthetic genes aids their identification in Lotus japonicus and suggests the repeated evolution of this chemical defence pathway. Plant J. 2011;68:273–86. [DOI] [PubMed] [Google Scholar]
  • 72.Matsuba Y, Nguyen TTH, Wiegert K, Falara V, Gonzales-Vigil E, Leong B, et al. Evolution of a complex locus for terpene biosynthesis in solanum. Plant Cell. 2013;25:2022–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Matsuba Y, Zi J, Jones AD, Peters RJ, Pichersky E. Biosynthesis of the diterpenoid lycosantalonol via nerylneryl diphosphate in Solanum lycopersicum. PLoS ONE. 2015;10:e0119302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Wang Q, Hillwig ML, Okada K, Yamazaki K, Wu Y, Swaminathan S, et al. Characterization of CYP76M5–8 indicates metabolic plasticity within a plant biosynthetic gene cluster. J Biol Chem. 2012;287:6159–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Okada A, Shimizu T, Okada K, Kuzuyama T, Koga J, Shibuya N, et al. Elicitor induced activation of the methylerythritol phosphate pathway toward phytoalexins biosynthesis in rice. Plant Mol Biol. 2007;65:177–87. [DOI] [PubMed] [Google Scholar]
  • 76.Zhao H, Chen Z. An HPLC-ESI-MS method for analysis of loureirin A and B in dragon’s blood and application in pharmacokinetics and tissue distribution in rats. Fitoterapia. 2013;86:149–58. [DOI] [PubMed] [Google Scholar]
  • 77.Su X-Q, Song Y-L, Zhang J, Huo H-X, Huang Z, Zheng J, et al. Dihydrochalcones and homoisoflavanes from the red resin of Dracaena cochinchinensis (Chinese dragon’s blood). Fitoterapia. 2014;99:64–71. [DOI] [PubMed] [Google Scholar]
  • 78.Qiao X, Houghton A, Reed J, Steuernagel B, Zhang J, Owen C, et al. Comprehensive mutant chemotyping reveals embedding of a lineage­specific biosynthetic gene cluster in wider plant metabolism. PNAS. 2025;122(12):e2417588122. [DOI] [PMC free article] [PubMed]
  • 79.Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, et al. Gapped BLAST and PSI - BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, et al. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol Plant. 2020;13(2):1194–202. [DOI] [PubMed] [Google Scholar]
  • 81.Marchler–Bauer A, Bryant SH. CD–Search: protein domain annotations on the fly. Nucleic Acids Res. 2004;32:W327–31. (Web Server issue). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Mistry J, Chuguransky S, Williams L, Qureshi M, Salazar GA et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 2021;49(D1):D412-D419. [DOI] [PMC free article] [PubMed]
  • 83.Gasteiger E, Hoogland C, Gattiker A, Duvaud S, Wilkins MR, et al. Protein identification and analysis tools on the ExPASy server. In: Walker JM, editor. The proteomics protocols handbook. Totowa: Humana; 2005. pp. 571–607. [Google Scholar]
  • 84.Geourjon C, Deléage G. SOPMA: significant improvements in protein secondary structure prediction by consensus prediction from multiple alignments. Comput Appl Biosci. 1995;11:681–4. [DOI] [PubMed] [Google Scholar]
  • 85.Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46(W1):W296–303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Horton P, Park KJ, Obayashi T, Fujita N, Harada H, et al. WoLF PSORT: Protein localization predictor. Nucleic Acids Res. 2007;35(Web Server issue):W585–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Bailey TL, Johnson J, Grant CE, Noble WS. The MEME Suite. Nucleic Acids Res. 2015;43(W1):W39–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol Biol Evol. 2018;35(6):1547–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Letunic I, Bork P. Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 2024;52(W1):W279–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Cui JL, Wang CL, Guo SX, Xiao PG, Wang ML. Stimulation of dragon’s blood accumulation in Dracaena cambodiana via fungal inoculation. Fitoterapia. 2013;87:31–6. [DOI] [PubMed] [Google Scholar]
  • 91.Chinese Pharmacopoeia Commission. Pharmacopoeia of the People’s Republic of China, vol. 1. Beijing: China Medical Science Press; 2020. p. 191–2.
  • 92.Ortiz JF, Rokas A, CTDGFinder:. A Novel Homology-Based Algorithm for Identifying Closely Spaced Clusters of Tandemly Duplicated Genes. Mol Biol Evol. 2016;34(1):215–29. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1. (445.2KB, pdf)
12870_2026_8486_MOESM2_ESM.xlsx (23.5KB, xlsx)

Supplementary Material 2. Table S1: DFR gene family information of Arabidopsis thaliana, Oryza sativa and Dracaena cambodiana, related to Figure 1.

12870_2026_8486_MOESM3_ESM.xlsx (10.8KB, xlsx)

Supplementary Material 3. Table S2: Collinear pairs genes of D. cambodiana with O. sativa or A. thaliana, related to Figure 1E.

12870_2026_8486_MOESM4_ESM.xlsx (14.9KB, xlsx)

Supplementary Material 4. Table S3: Predicted cis-acting element structures in promoter region of DcDFR gene family, related to Figure 2D.

12870_2026_8486_MOESM5_ESM.xlsx (11.9KB, xlsx)

Supplementary Material 5. Table S4. Primer sequences of qRT-PCR, related to Figure 4–7.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.


Articles from BMC Plant Biology are provided here courtesy of BMC

RESOURCES