Abstract
Ginseng, a valuable Chinese medicinal herb, is renowned worldwide for its effectiveness in alleviating certain conditions and promoting overall health. In this study, we performed weighted gene co-expression network analysis (WGCNA) on the accumulation of essential saponins under the influence of 13 essential environmental factors (including air temperature, air bottom temperature, surface mean temperature, soil temperature, surface shortwave radiation, soil moisture, soil water content, rainfall, total precipitation, elevation, soil type, soil pH, and soil water potential). We identified a total of 40 transcript modules associated with typical environmental factors and the accumulation of essential saponins. Among these, 18 modules were closely related to the influence of typical environmental factors, whereas 22 modules were closely related to the accumulation of essential saponins. These results were verified by examining the transcriptome, saponin contents, environmental factor information and the published data and revealed the regulatory basis of saponin accumulation at the transcriptome level under the influence of essential environmental factors. We proposed a working model of saponin accumulation mediated by the transcriptional regulatory network that is affected by typical environmental factors. An isomorphic white-box neural network was constructed based on this model and the predicted results of the white-box neural network correlated with saponin accumulation. The effectiveness of our correlation-directed graph in predicting saponin contents was verified by bioinformatics analysis based on results obtained in this study and transcripts known to affect the biosynthesis of saponin Rb1. The directed graph represents a useful tool for manipulating saponin biosynthesis while considering the influence of essential environmental factors in ginseng and other medicinal plants.
Introduction
In plants, the investigation of how environmental factors affect the production of secondary metabolites is an ongoing pursuit due to their close relationships with plant growth and development [1]. Plant secondary metabolites are usually defense compounds responding to various biotic and abiotic stresses [2]. Secondary metabolites produced by medicinal plants are important beneficial compounds for many reasons, and their effects on human health have garnered significant attention [3]. Among these, ginsenosides from Panax ginseng, a traditional Chinese medicine, are of particular interest due to their health-promoting and disease-alleviating properties [4–6]. Ginsenoside Rb1, a highly valued natural compound, has garnered extensive attention and investigation for its diverse range of biological activities and medicinal properties. This compound plays a pivotal role in various physiological processes, such as safeguarding the cardiovascular system [7–10], regulating the central nervous system [11–16], modulating immune responses [17–20], enhancing skeletal health [21], promoting metabolism [22], and exhibiting anti-diabetic effects [17]. Notably, ginsenoside Rg1 has been shown to prevent depression-like behaviors and neuronal structural plasticity induced by chronic stress [23].
Understanding the relationship between the biosynthesis of ginsenosides and environmental factors has been a hot research area, and recent developments in transcriptome analysis have brought about new opportunities for such studies. Many studies have explored how environmental factors affect ginsenoside accumulation. However, it is still unclear how the expression of certain genes (including CYP716A52v2, CYP716A53v2, DS, β-AS, and PgSE2) can affect the accumulation of certain secondary metabolites under the influences of environmental factors. Park et al. reported the effects of overexpression and transcriptome interference of the ginsenoside biosynthesis gene CYP716A53v2 on the composition of ginsenosides [24]. The key enzyme Squalene Synthase (SS), which mediates the biosynthesis of ginsenosides, was cloned by Liang et al. [25]. Han et al. investigated the regulation of ginsenoside and phytosterol biosynthesis by RNA interferences of squalene epoxidase genes (PgSQE1 and PgSQE2) [26], and Kim et al. analyzed ginsenoside profiles and the expression of related genes in Panax ginseng Meyer [27].
The relationship between typical saponin content and environmental factors was also examined at the macro level. For example, Chen et al. analyzed the correlation between ginsenoside content and ecological factors in Panax ginseng [28]; Hou et al. studied how ecological factors affect the contents of secondary metabolites in medicinal plants including Illumination, photo quality, temperature, moisture and soil factor (soil texture, soil mass elements, soil trace elements and soil microorganism) [29], and Jia et al. specifically investigated the correlation between ginsenoside content and ecological factors in ginseng [30].
Previous research has mainly explored the association between gene regulatory networks and saponin accumulation at a local mesoscopic level. This approach has revealed intricate relationships among environmental factors, transcriptome expression profiles, and saponin accumulation and spatial–temporal characteristics at the macro and micro levels. Wang et al. investigated the correlation between ginsenoside content in 15 tissues and the expression of six ginsenoside biosynthesis-related genes [31]. Other studies have mainly examined the effects of relevant enzymes on saponin accumulation and the relationship between saponin biosynthesis regulatory networks and saponin accumulation [24–26, 32–41]. Thus, investigators have examined saponin biosynthesis at macro, micro, and local mesoscopic levels. Nonetheless, few integrative studies have explored the relationships among transcriptional regulatory networks, environmental factors, and saponin accumulation at the mesoscopic level.
In this study, we explored the relationships between the ginseng transcriptome and the accumulation of essential saponins under the influence of essential environmental factors. Using weighted gene co-expression network analysis (WGCNA), we constructed a transcriptome expression correlation matrix based on known data. We identified a set of highly correlated transcripts that were significantly associated with the accumulation of essential saponins. Our findings offer new insights into the regulatory mechanisms of ginsenoside biosynthesis and provide a basis for further studies on the interactions between environmental factors and metabolite accumulation.
Materials and methods
Data sources and data pre-processing
Transcriptome data and saponin contents in samples collected from seven different ginseng-growing regions in Jilin Province, China, at different growth stages, were provided by the Jilin Province Ginseng Gene Resource Development and Utilization Engineering Research Center. Climate data were obtained from the Chinese National Meteorological Data Center (CRA40LAND, China Global Land Surface Reanalysis 40-year Product). The transcriptome sequencing data corresponded to 42 ginseng samples, each composed of 14 tissues. Samples with low expression levels were filtered out. Refer to Fig 1 for workflow of the study (refer to S1 Fig for a more detailed workflow).
Fig 1. Workflow of the study.
The growth stages of ginseng can be classified as the vigorous growth stage, normal growth stage, and aging stage (reference). The first to fourth years of ginseng growth, which are defined as the vigorous growth stage, are characterized by longitudinal and root growth. Longitudinal growth rate of the second year is 300%–400% higher than that of the first year, growth rate of the third year is 200% higher than that of the second year, and growth rate of the fourth year is 100% higher than that of the third year. During the 5–8 year period, the root system exhibits a normal growth rate―the rate at 5–6 years is about 90%–50% of that at 4 years, and the rate at 7–8 years is about 30%–20% of that at 5–6 years. This stage is characterized by lateral root growth. The aging stage occurs after the ninth year, and the growth rate of the ninth year is only about 10% higher than that of the eighth year [42]. At the fourth year, ginseng undergoes a transition from vigorous growth to normal growth, which involves both vertical and lateral root growth, and roots of the plant have more functional activity. At this stage, the response of ginseng to changes in external ecological factors is more comprehensive, and saponin accumulation increases rapidly, which better reflects the response of saponin accumulation for ginseng to environmental factors. Therefore, most analyses in this study were carried out on fourth-year ginseng, and ginseng samples from other age groups were also included for model training and further data analysis.
We sequenced the mRNAs extracted from 14 different tissues of four-year-old ginseng plants using the Illumina HiSeqTM2000 with a 100-nucleotide PE (paired-end) module. These 14 tissues were fiber root, leg root, main root epidermis, main root cortex, rhizome, arm root, stem, leaf peduncle, leaflet pedicel, leaf blade, fruit peduncle, fruit pedicel, fruit flesh, and seed. Clean reads were extracted from raw reads using the SOAPnuke1.5.0 software (https://github.com/ BGI-flexlab/SOAPnuke) by BGI (Hong Kong, China) [43] by removing the adapters and low-quality reads (Q ≤ 10, P < 0.05). The clean reads were used to assemble unigenes using the Trinity software [44, 45], and quantify the expressions of individual transcripts using the RSEM (RNA-seq by Expectation Maximization) software [56]. The transcriptome assembled using all clean reads from the 14 tissues was used as the reference (S1 Table).
WGCNA analysis
The WGCNA package (version 1.70–3) in R (https://cran.rstudio.com/ bin/windows/base version 4.0.5) was downloaded using the BiocManager package (version 1.30.10) and used to construct a co-expression network for transcriptome analysis [46, 47]. First, the goodSamples-Genes function in WGCNA was used to filter the expression matrix and remove unsatisfying transcripts and samples that contained missing data or exhibited zero variance. Second, sample clustering analysis was performed using the flashClust tool in R to detect outliers. Third, pairwise comparisons of transcripts were performed by calculating the Pearson correlation coefficient (PCC) of the expression matrix. Finally, the appropriate soft threshold (β) was selected using the pickSoftThreshold function to ensure the generation of a scale-free network. The adjacency matrix was then constructed using the power function:
where Sij represents the Pearson correlation coefficient between transcripts i and j, and aij represents the adjacency relationship between the two transcripts. The topological overlap matrix (TOM) was constructed using the adjacency function:
where is the product’s sum of the adjacency coefficients of the nodes connected by transcripts i and j, and k is the sum of the adjacency coefficients between the given transcript and all other transcripts in the weighted network. TOM is used to calculate the dissimilarity measure (1-TOM), and transcripts are assigned to modules using the dynamic cut tree method [48] based on the similarity of expression profiles [49]. The minimum number of transcripts per co-express module was set to 30.
We identified transcript co-expression modules (TCEMs) that exhibited expression patterns similar to those associated with the 13 essential ecological factors and the contents of essential saponins including Rb1, Rb2, Rb3, Rc, Rd, Re, Rf, Rg1, Rg2, and total saponins (TS) determined by liquid chromatography [50]. First, principal component analysis was used to describe featured gene (eigengene) by the first principal component in each module, which corresponded to unified featured gene expression profile within each module. The module membership (MM) corresponds to the correlation between the featured gene and each individual transcript within the same module. Second, gene significance (GS) was assigned to the linear relationship between transcripts expression profiles of specific module and the phenotypes (saponins profiles and environmental factors), which was expressed as logarithm of the P-value of the individual transcript significance. If the GS was stably correlated with the module membership (MM), then it was defined as the correlation between the module featured gene (eigengene) and the phenotype.Finally, we constructed a directed graph reflecting the relationship between ecological factors and saponin accumulation.
To ensure the validity of the results, the related directed graph, which was required to use published methods and data, was tested to determine whether it was compatible with existing knowledge in the field. The effectiveness of the network structure model was assessed using saponin Rb1. The correlations between known transcripts related to saponin Rb1 contents and TCEMs identified by the directed graph were validated. Transcripts possibly related to saponin Rb1 biosynthesis were determined by Gene Ontology analysis [51], and the distribution of co-expression modules in the directed graph was studied using bioinformatics methods. The compatibility of the related directed graph with the existing knowledge in the field was further tested by examining the relative position of corresponding modules in the directed graph and their association with the saponin Rb1 node.
Validity of the combination of every non-associated module for effective saponin synthesis was verified using the chi-square method. First, we set the null hypothesis H0, where the number of different combination categories was unrelated to the effectiveness of Rb1 synthesis. Second, we determined the degrees of freedom (DF) using the following equation:
| (1) |
where a and b are the numbers of categories corresponding to two different test conditions (the number of TCEMs combination categories and combinations of the target effect), with a representing the number of combination categories of related or unrelated TCEMs, and b representing the number of frequency point categories related or not related to the target effect. The selected significance level α was 0.05. Third, the experimental conditions NAMC and nNAMC are the number of combinations of associated and unassociated modules calculated based on the transcript number of relevant modules, respectively. The initial observation data for the contingency table were constructed, and the theoretical (expected) frequency of the chi-square was calculated. Finally, the chi-square analysis was performed, and P values were calculated. Based on the results of chi-square analysis, the dependency of the number of different combinatorial categories on the effectiveness of identifying ginsenoside Rb1 biosynthesis-related transcripts was determined.
In this study, a white-box neural network model was constructed based on a directed graph. Unlike traditional neural networks, this white-box model had an explicitly interpretable biological structure. The model was trained using typical ecological factors and published data of saponin accumulation in samples of the same developmental stages [52]. The white-box neural network model simulated the accumulation of essential saponins under specific ecological factors, and its consistency was validated with known cases.
Results
Data pre-processing
A total of 248,993 mRNA-seq data points in each ginseng sample were used for subsequent analyses. After filtering out samples that contained missing data or zero variance, 73,640 data points were removed. Using the Euclidean distance method, the samples were clustered into two groups, and hierarchical clustering was performed, with all samples clustering under a critical height threshold (43000), except for an outlier (S26), which was identified and removed (Fig 2A).
Fig 2. Weighted gene co-expression network analysis (WGCNA).
(A) Clustering dendrogram of samples based on their Euclidean distance for identifying outliers. (B) Sample dendrogram and environment factor heatmap. Using the association analysis of effective samples and corresponding environmental factors, the correlations between the similarity of clustered sample features and the change rule of environmental factors provide the basic law for the subsequent in-depth analysis. (C) Soft threshold values were enumerated to determine whether the R2 value characteristic of scale-free distribution was met, so as to obtain the best candidate soft threshold value (β), and the corresponding degree of average connection was combined to confirm this value. (D) Dendrogram of transcripts that shows the co-expression modules defined by WGCNA (designated by different colors). (E) Visualization of the eigengene network that represents the relationships among the TCEMs weight. (F) Correlations among transcripts aggregating within modules. (G) Combination of the scatter plot, correlation coefficients, and significance levels.*, P < 0.05; **, P < 0.01; ***, P < 0.001. (H) Correlation between modules and essential environmental factors.
Identification of co-expression modules by the WGCNA
Based on an expression matrix composed of 175,353 mRNA-seq data points, ginsenoside profiles, and environmental factors, a network was constructed using WGCNA, and clustering analysis was performed to check the quality of the data obtained from the 42 ginseng samples. These correlations were confirmed by analyses of hierarchical clustering, heat maps, and adjacency relationships (Fig 2A–2E). In terms of ginsenoside accumulation, the featured gene of module ME7 was positively correlated with Re (Rcor = 0.62, P = 2 × 10−5), whereas that of module ME12 was positively correlated with Rf (Rcor = 0.68, P = 9 × 10−7) (S4 Fig). Thirteen climate variables were applied in WGCNA (Fig 2B). To create a scale-free network, the soft threshold β was set to 5, with a decision coefficient of 0.9 and an average connectivity close to 0 (Fig 2C). TCEMs exhibiting similar expression patterns were grouped into the same module, and modules with a height difference less than 0.25 were merged. This process yielded 103 co-expression modules, with the gray module containing transcripts that could not be assigned to any other module. Some of the modules exhibited stable correlations between their featured genes and other modules (Fig 2F, 2G) or environmental factors. For example, temperature had a significant impact on ginseng growth, with the featured gene of module ME29 negatively correlated with the average surface temperature (Rcor = -0.62, P = 2 × 10−5), whereas the featured gene of module ME69 was positively correlated with this environmental factor (Rcor = 0.66, P = 2 × 10−6). In addition, ginseng growth was highly dependent on altitude, with the characteristic genes of module ME29 positively correlated with elevation (Rcor = 0.63, P = 9 × 10−6). Soil pH was also closely related to ginseng growth, with the featured gene of module ME85 positively correlated with soil pH (Rcor = 0.53, P = 3 × 10−4) (Fig 2H, S2 and S3 Figs). Our analysis indicates that some modules may be closely associated with essential environmental factors, whereas others may promote ginsenoside accumulation. As associations may be direct or indirect, further analysis is required to understand the relationship between these modules and ginsenoside content.
Relationships among essential ecological factors, saponin types, and co-expression modules
In this study, the relationships among essential ecological factors, saponin types, and co-expression modules were examined. For each ecological factor, the correlation coefficient served as a guide to construct a directed graph that connected any two nodes corresponding to the current node and successor node (typical ecological factor, TCEM or saponin type) with the correlation coefficient as the edge weight (Fig 2F–2H). In addition, the layer-by-layer exploration method was used to push forward the correlation network until the corresponding typical saponin node was reached, yielding a data-related directed graph of typical environmental factors, transcriptional regulatory networks, and saponin accumulation. Based on stable biological correlation properties among modules, we postulated that many changes in transcript levels were the result of combined action of multiple environmental factors. By factor analysis, the cumulative contribution rate of the variance between the precursor node and the current module was greater than 80%, which served as the threshold for filtering edges. Candidate precursor nodes were sorted in descending order of contribution of variance. On the other hand, an edge with an absolute correlation coefficient greater than 0.4 between the two nodes corresponding to the current node and successor node was used as another candidate condition. The ecological factors, transcriptional regulatory network nodes, and saponin trait fusion networks were macroscopic in expression (Fig 3A). Further indirect and direct relationships between environmental factors and essential saponins have been obtained through the visualization tool Cytoscape (Fig 3B).
Fig 3. Association diagram of ecological factors, saponin profiles, and TCEMs.
(A) Correlations of ecological factors and essential saponins with TCEMs. (B) Correlations among ecological factors, essential saponins, and grouped TCEMs.
Positive test
Genes that were recognized as closely related to saponin Rb1 biosynthesis were used to test the effectiveness of the model (Table 1). In the positive test, we validated the correlation between the directed graph of saponin biosynthesis regulatory networks affected by environmental factors and the transcripts closely corresponding to saponin Rb1 biosynthesis. The results clearly showed a direct correlation between the transcript clusters corresponding to saponin Rb1 and TCEMs including these associated transcripts with saponin Rb1 biosynthesis (Figs 4 and 5). In addition, indirectly associated modules that are consistent with general domain knowledge should also be considered.
Table 1. Corresponding transcript levels of genes related to Rb1 biosynthesis in different samples.
| RNA no. | Gene name | RNA transcriptome expression levels (TPMa) | ||||
|---|---|---|---|---|---|---|
| Sample ID (partial) | ||||||
| S1 | S2 | S3 | S4 | S5 | ||
| R59489 | CYP716A53v2_1 (PgCYP137) | 75.46 | 34.91 | 26.16 | 95.97 | 46.3 |
| R119018 | CYP716A52v2_3 (PgCYP311) | 62.21 | 15.2 | 32.65 | 78.32 | 24.22 |
| R105494 | DS _ 1 | 4.68 | 8.23 | 6.23 | 33.16 | 23.61 |
| R105492 | DS _ 3 | 53.58 | 5.34 | 5.74 | 22.35 | 15.69 |
| R96316 | CYP716A47 1 (PgCYP339) | 17.42 | 10.16 | 5.09 | 25.07 | 15.07 |
| R172551 | β-AS _ 1 | 4.37 | 0 | 4.57 | 0.84 | 0 |
| R72448 | SS _ 1 | 57.97 | 17.85 | 25.64 | 47.21 | 86.91 |
| R71730 | PgSE2 _ 1 | 0.99 | 3.49 | 0 | 3.22 | 1.68 |
| R177267 | PgSE2 _ 4 | 55.45 | 15.3 | 17.07 | 71.99 | 21.71 |
| R207525 | FPS _ 22 | 62.77 | 51.93 | 41.84 | 68.5 | 40.72 |
| R215855 | PgUGT71A27 _ 2 | 93.83 | 86.79 | 63.09 | 144.09 | 46.63 |
aTranscripts per kilobase per million mapped reads
Fig 4. Transcripts closely associated with Rb1 and different modules.
Fig 5. Correlation coefficient, significance level, and corresponding sample scatter plot of transcripts closely associated with Rb1 and different modules.
*, P < 0.05; **, P < 0.01; ***, P < 0.001.
The association between GS and MM of module ME57 was examined (Fig 6). The scatter plot shows the consistency of the statistical characteristics of ME57 module transcripts with transcripts closely related to saponin Rb1. This graph also revealed other transcripts that were closely related to the biosynthesis of saponin Rb1, such as those in the upper right corner of the module, providing references for future research. In addition, the statistical characteristics of ME57 module transcripts showed a stable correlation between the GS value of the saponin Rb1 profiles and the MM value of the ME57 module transcripts. As shown in Fig 6, the transcripts closely related to the module (upper right corner) were also closely related to saponin Rb1 profiles, whereas transcripts not closely related to the module (lower left corner) were also not related to the trait.
Fig 6. Scatter plot of gene significance for Rb1 versus module membership in module ME57.
The statistical characteristics of module ME57 transcripts reflect its biological significance (Fig 6). The homogeneity between the transcriptional regulatory network, ecological factors, saponin profiles, and Rb1-related transcripts is shown in Fig 7. The coverage of correlations between some nodes and edges of the constructed directed graph and several nodes and edges related to saponin Rb1 biosynthesis reflects the ability of the model to interpret Rb1-related data.
Fig 7. Overlap between transcripts known to influence saponin Rb1 biosynthesis and those from the constructed network.
Ordinary node is one that contained in directed graph, and positive node is the transcriptome co-expression module including transcripts that are closely related to saponin Rb1.
Negative test
Chi-square analysis was used for negative validation. The null hypothesis was that the number of different combination categories was independent of the combination of TCEMs related closely to Rb1 biosynthesis. Next, the DF value was calculated (formula 1), and a contingency table was constructed (Table 2).
Table 2. Validity of saponin biosynthesis determined by the chi-square test.
| Combined categories | Frequency | Effective frequency | Invalid frequency | Total |
|---|---|---|---|---|
| NAMCa | 86.28 | 86.28 | 0 | 86.28 |
| nNAMCb | 6684.53 | 0 | 6684.53 | 6684.53 |
| Total | 6684.53 | 86.28 | 6684.53 | 6684.53 |
aNumber of associated module combinations (NAMC)
bNo direct correlation with the number of associated module combinations (nNAMC)
Chi-square analysis showed X2 = 6691.56, DF = 1, and P < 0.01, indicating that the null hypothesis (H0) was not valid, and the different combinatorial types were related to the efficiency of saponin Rb1 biosynthesis. The model’s scientific rigor was verified through both positive and negative tests.
A homomorphic white-box neural network was constructed and trained based on the relationship between the environmental factors and secondary metabolite profiles [37], as well as the association among environmental factors, transcriptional regulatory networks, and saponin accumulation. The white-box neural network reflects the quantitative relationship between the environmental factors and secondary metabolites. It retains the regulatory characteristics of the environmental factors and transcriptional regulatory networks that control saponin biosynthesis. The network was trained and loaded with information on the 13 ecological factors to predict the change in saponin accumulation in ginseng of different growth stages (Fig 8 and S1 Fig).
Fig 8. Comparison of predicted (predict) values with reference (ref) values.
Model evaluation
The predictive performance of the model was evaluated using the residual plot method (Fig 9). The model had an RMSE (root mean square error) of 0.232 and a coefficient of variation of 0.067 (6.7%), which was below the weak variability qualitative standard of 10%. The goodness of fit, R2, was 0.942, exceeding the correlation standard of 0.9. These results indicate that the model is reliable with a high accuracy of prediction.
Fig 9. Sample fit of the residual error offset graph.
Discussion
The quality of medicinal herbs is influenced by external environmental factors, and changes in the external environment led to diverse responses in the biosynthesis of medicinal components through transcriptional regulation. Herbal quality is determined by the interplay between the transcriptome and external environmental factors. In line with findings reported by Xu et al. [53], our results indicate that genes involved in saponin synthesis, such as SS, FPS, and DS, are co-expressed. Consistent with the findings of Yang et al., there is a strong correlation between the expression of genes involved in the synthesis of saponin Rb1, including CYP716A47, β-AS, SS, FPS, and DS, and the variation in Rb1 saponin content [54]. The directed graph generated based on the cumulative variance contribution rate and absolute value of the correlation coefficient, which represents the influence of environmental factors on the transcriptome and saponin synthesis, reveals a strong correlation between the contents of essential saponins (Rg1, Re, Rf, Rb1, Rb3, and Rd) and temperature. These results are in accordance with the study by Jia et al. [30], suggesting that within a certain temperature range, the content of ginsenosides decreases with increasing temperature, and moderate low temperature promotes ginsenoside accumulation. Other studies have examined the influence of temperature on ginsenoside accumulation and the expression of key enzyme genes [55], analyzed the biosynthesis of ginsenosides under different water conditions [56], and investigated the impact of soil characteristics on ginseng growth and quality [57]. These studies have highlighted the interplay between ecological factors and ginsenoside synthesis [53]. Furthermore, transcriptome regulation was investigated to identify transcripts that influence ginsenoside synthesis [58]. These studies have provided profound understanding of the regulatory mechanisms of ginsenoside synthesis and its influencing factors. In this study, we analyzed the influence of 13 essential environmental factors on the accumulation of 10 important ginsenosides over time via the regulation of transcriptional networks. Our research adds to the current understanding of the correlation among transcriptional regulation, ecological factors, and ginsenoside accumulation and provides a useful model to interpret such data.
Based on the existing analysis, taking the synthesis of saponin Rb1 as an example, combined with the correlation digraph of the prediction model, the closely related environmental factors were derived from the Rb1 node: Surface temperature, soil temperature, PH, altitude, ground short-wave radiation and soil water potential are the key environmental factors for the synthesis of this saponin. Simulation and regulation of relevant environmental factors, combined with Rb1 synthesis yield index, provide reference for ginsenoside accumulation. At the same time, with the help of this model, TCEMs modules that have an important impact on the synthesis of specific saponins can be obtained, and transcripts that are highly correlated with the eigenvalues of the modules can be examined from such modules, which can provide guidance for the breeding improvement of their related genes by gene editing technology. For example, combined with ginseng reference genome data [55], it can be seen from the correlation among environmental factors, TCEMs and typical saponins that PH value is positively correlated with ME79 and TempBot (Bottom temperature) is positively correlated with ME7 (Fig 2H). The distribution of gene transcripts related to Rb1 saponins at each node of the constructed prediction digraph was investigated. CYP716A53v2, SS, DS and PgUGT71A27 genes were selected to obtain their gene promoter sequence (2000bp). By using plant promoter analysis website PlantPAN3.0 (plantpan.itps.ncku.edu.tw) and JASPAR (jaspar.genereg.net) for a typical promoter element and its active, The highly active promoter element sequences such as AGGAAAT were identified as potential gene editing sites, and the promoter region CAAT-box was considered to mainly control the frequency of transcription initiation and enhance transcription [56]. Demethylation of its associated histones and bases was adopted to improve the activity of the effect of the corresponding promoter element. According to relevant data [57–61], it can be seen that the expression intensity of corresponding genes can be increased by 0.07 to 46.3 times by optimizing the gene editing measures of promoter effect [62]. Considering that ginseng is a diploid plant, the alleles of ginseng genome are treated as above. The above analysis process was applied to the improvement of the prediction model, the association weights of TCEMs nodes and typical saponins Rb1 were modified in combination with the original association relationship of the TCEMs module of the model, and the output weights of ME79 of the coexpression module of the SS gene corresponding transcript were fine-adjusted separately. The prediction results of the model showed that under the conditions of appropriate environmental factors, Rb1 quantity of saponins from 0.02407 to 0.02498 , daily increases 3.78%, At the same time, the output weight of ME7, the coexpression module of DS gene corresponding to the transcript, was adjusted to increase the amount of saponin Rb1 predicted by the model from 0.02407 to 0.02707 , an increase of 12.5% per day, and the predicted output was improved. At the same time, gene editing effect of different gene promoter on saponin increase is different, and the validity of this prediction model in breeding and gene improvement of ginseng is also indicated, which provides a reference for practical breeding improvement.
This study acknowledges the broad coverage of environmental factors; however, there is limited representation of soil’s primary and trace nutrient characteristics. The collected ginseng samples were sourced exclusively from authentic production regions. Although soil element characteristics are not the primary stress factors, it is necessary to supplement the study by considering a simulation process that closely aligns with actual conditions. Additionally, it should be noted that different parts of ginseng, such as roots, stems, and leaves, may exhibit variations in the accumulation of secondary metabolites. While this paper considers ginseng as a whole plant, which does not affect the simulation results, further refinement is required in order to account for these differences. The identification of co-expressed transcriptional modules (TCEMs) in ginseng transcriptomes, although rigorously grouped using relevant analysis techniques, requires additional exploration when employing gene editing technologies for breeding improvements. Specifically, investigations focus on co-expressed transcript modules associated with specific saponins, targeting their corresponding gene promoters for simulation-based optimization rather than solely optimizing the expression intensity of specific transcripts. While this approximation holds some rationality, further investigation is necessary to elucidate the biological details underlying these phenomena.
In future research, there is a need to further investigate the potential of digital tools to gain a deeper understanding of secondary metabolite metabolism processes in ginseng and other herbs. The aim is to optimize growth conditions and assist in Chinese herbal breeding practices, particularly with ginseng, to enhance the accumulation of active ingredients like ginsenosides. By doing so, the application of knowledge related to plant growth and development mechanisms in Chinese herbal medicine research will be accelerated. To achieve this, additional efforts should be made to confirm the association between the accumulation of active ingredients in ginseng and the effects of environmental factors. This can be accomplished by studying larger populations and incorporating a broader range of environmental factors. Moreover, understanding the role of transcriptional regulatory networks in mediating these effects will be crucial to fully comprehend the complex interactions between environmental factors and active ingredient accumulation. Through these endeavors, the potential for advancing Chinese herbal medicine research and improving herbal breeding practices, such as with ginseng, will be significantly enhanced, leading to more effective utilization of these valuable medicinal resources.
Conclusion
This research conducted involved analyzing the correlations among the ginseng transcriptome, 13 essential environmental factors, and 10 important ginsenoside contents. The main objective was to understand the active influence of environmental factors on ginseng biological processes and their direct and indirect correlation with co-expression modules and typical saponin content. To achieve this, a directed graph was constructed to illustrate the influence of environmental factors on saponin accumulation, which was mediated by the transcriptome regulation network. In addition, a white-box neural network was created to train the network, and the validity of this model was verified. The combination of gene ontology analysis, ginseng promoter sequence simulation optimization, and data analysis were utilized based on the gene editing method. This approach helped to validate the practical application of the prediction model in ginseng breeding and improvement. By integrating various factors and using advanced computational methods, this research provides valuable insights into the complex relationship between environmental factors, transcriptome regulation, and ginsenoside content in ginseng. The prediction model developed in this study has the potential to be a useful tool for optimizing ginseng breeding and enhancing the production of important bioactive compounds, such as ginsenosides, to further improve the quality and effectiveness of ginseng in Chinese herbal medicine practices.
Supporting information
(TIF)
(TIF)
(TIF)
(TIF)
(TIF)
(TIF)
(TIF)
Acknowledgments
We would like to thank A&L Scientific Editing (www.alpublish.com) for its linguistic assistance during the preparation of this manuscript.
Data Availability
Gene expression and phenotype data produced for this research have been deposited in Dryad (https://datadryad.org/stash/share/W_z9Ss63PT3Q52S_2e8k1O18wlZtweYV7xbvRBfkUvU).
Funding Statement
1)the Education Development Plan Project of Jilin Province (No. JJKH20210415KJ,Zhc W), http://jyt.jl.gov.cn, supports study design, data collection and analysis; 2) the Science and Technology Development Plan Project of Jilin Province (No. YDZJ202201ZYTS692,Y T), http://kjt.jl.gov.cn, supports data collection and analysis, decision to publish.
References
- 1.Grierson CS, Barnes SR, Chase MW, Clarke M, Grierson D, Edwards KJ, et al. One hundred important questions facing plant science research. New Phytologist 2011; 192(1):6–12. doi: 10.1111/j.1469-8137.2011.03859.x [DOI] [PubMed] [Google Scholar]
- 2.Dong D, Shi YN, Mou ZM, Chen SY, Zhao DK. Grafting: a potential method to reveal the differential accumulation mechanism of secondary metabolites. Horticulture Research 2022; 9. doi: 10.1093/hr/uhac050 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Abdulhafiz F, Mohammed A, Reduan MFH, Kari ZA, Wei LS, Goh KW. Plant cell culture technologies: A promising alternatives to produce high-value secondary metabolites. Arabian Journal of Chemistry 2022; 15(11):104161. doi: 10.1016/j.arabjc.2022.104161 [DOI] [Google Scholar]
- 4.Cordell GA. Phytochemistry and traditional medicine–A revolution in process. Phytochemistry Letters 2011; 4(4):391–398. doi: 10.1016/j.phytol.2011.05.005 [DOI] [Google Scholar]
- 5.Zhao N, Wang GD, Norris A, Chen XL, Chen F. Studying Plant Secondary Metabolism in the Age of Genomics. Critical Reviews in Plant Sciences 2013; 32(6):369–382. doi: 10.1080/07352689.2013.789648 [DOI] [Google Scholar]
- 6.Fan HJ, Tan ZB, Wu YT, Feng XR, Bi YM, Xie LP, et al. The role of ginsenoside Rb1, a potential natural glutathione reductase agonist, in preventing oxidative stress-induced apoptosis of H9C2 cells. Journal of Ginseng Research 2020; 44(2):258–266. doi: 10.1016/j.jgr.2018.12.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wang Z, Li M, Wu WK, Tan HM, Geng DF. Ginsenoside Rb1 Preconditioning Protects Against Myocardial Infarction After Regional Ischemia and Reperfusion by Activation of Phosphatidylinositol-3-kinase Signal Transduction. Cardiovascular Drugs and Therapy 2008; 22(6):443–452. doi: 10.1007/s10557-008-6129-4 [DOI] [PubMed] [Google Scholar]
- 8.Liu ZP, Song L, Zhang PP, Cao ZZ, Hao J, Tian YJ, et al. Ginsenoside Rb1 exerts antiarrhythmic effects by inhibiting INa and ICaL in rabbit ventricular myocytes. Scientific Reports 2019; 9(1):20425. doi: 10.1038/s41598-019-57010-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Yan X, Xue JR, Wu HJ, Wang SQ, Liu YN, Zheng SD, et al. Ginsenoside-Rb1 Protects Hypoxic- and Ischemic-Damaged Cardiomyocytes by Regulating Expression of miRNAs. Evidence-Based Complementary and Alternative Medicine 2015; 2015:171306. doi: 10.1155/2015/171306 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ke SY, Liu DH, Wu L, Yu XG, Wang M, Shi GY, et al. Ginsenoside Rb1 Ameliorates Age-Related Myocardial Dysfunction by Regulating the NF-κB Signaling Pathway. The American Journal of Chinese Medicine 2020; 48(06):1369–1383. doi: 10.1142/S0192415X20500676 [DOI] [PubMed] [Google Scholar]
- 11.Shoaib RM, Ahsan MZ, Akhtar U, Ahmad KA, Ali U, Deng MY, et al. Ginsenoside Rb1, a principal effective ingredient of Panax notoginseng, produces pain antihypersensitivity by spinal microglial dynorphin A expression. Neuroscience Research 2023; 188:75–87. doi: 10.1016/j.neures.2022.11.003 [DOI] [PubMed] [Google Scholar]
- 12.Liang WY, Liu Y, Zhou K, Jian P, Zhang QN, Chang ZH, et al. Ginsenoside Rb1 prevents lipopolysaccharide-induced depressive-like behavior by inhibiting inflammation and neural dysfunction and F2 elicits a novel antidepressant-like effect: A metabolite-based network pharmacology study. Journal of Ethnopharmacology 2022; 282:114655. doi: 10.1016/j.jep.2021.114655 [DOI] [PubMed] [Google Scholar]
- 13.Liu YS, Li J, Wang X, Liu Y, Zhang C, Chabalala H, et al. Ginsenoside Rb1 attenuates lipopolysaccharide-induced chronic neuroinflammation in mice by tuning glial cell polarization. Journal of Traditional Chinese Medical Sciences 2022; 9(4):383–391. doi: 10.1016/j.jtcms.2022.06.015 [DOI] [Google Scholar]
- 14.Su LJ, Ren YC, Chen Z, Ma HF, Zheng F, Li F, et al. Ginsenoside Rb1 improves brain, lung, and intestinal barrier damage in middle cerebral artery occlusion/reperfusion (MCAO/R) micevia the PPARγ signaling pathway. Chinese Journal of Natural Medicines 2022; 20(8):561–571. doi: 10.1016/S1875-5364(22)60204-8 [DOI] [PubMed] [Google Scholar]
- 15.Lee A, Kwon OW, Jung KR, Song GJ, Yang HJ. The effects of Korean Red Ginseng-derived components on oligodendrocyte lineage cells: Distinct facilitatory roles of the non-saponin and saponin fractions, and Rb1, in proliferation, differentiation and myelination. Journal of Ginseng Research 2022; 46(1):104–114. doi: 10.1016/j.jgr.2021.04.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Shalaby AM, Alnasser SM, Ahmed Khairy D, Alabiad MA, Alorini M, Jaber FA, et al. The neuroprotective effect of ginsenoside Rb1 on the cerebral cortex changes induced by aluminium chloride in a mouse model of Alzheimer’s disease: A histological, immunohistochemical, and biochemical study. Journal of Chemical Neuroanatomy 2023; 129:102248. doi: 10.1016/j.jchemneu.2023.102248 [DOI] [PubMed] [Google Scholar]
- 17.Zhou RR, He D, Zhang HC, Xie J, Zhang SH, Tian XF, et al. Ginsenoside Rb1 protects against diabetes-associated metabolic disorders in Kkay mice by reshaping gut microbiota and fecal metabolic profiles. Journal of Ethnopharmacology 2023; 303:115997. doi: 10.1016/j.jep.2022.115997 [DOI] [PubMed] [Google Scholar]
- 18.Shaukat A, Shaukat I, Rajput SA, Shukat R, Hanif S, Jiang KF, et al. Ginsenoside Rb1 protects from Staphylococcus aureus-induced oxidative damage and apoptosis through endoplasmic reticulum-stress and death receptor-mediated pathways. Ecotoxicology and Environmental Safety 2021; 219:112353. doi: 10.1016/j.ecoenv.2021.112353 [DOI] [PubMed] [Google Scholar]
- 19.Lee YY, Saba E, Irfan M, Kim M, Chan JYL, Jeon BS, et al. The anti-inflammatory and anti-nociceptive effects of Korean black ginseng. Phytomedicine 2019; 54:169–181. doi: 10.1016/j.phymed.2018.09.186 [DOI] [PubMed] [Google Scholar]
- 20.Sung M, Yoon Y, Lee J. Intestinal Lactobacillus species improve immune enhancement by ginsenoside Rb1 and ginsenoside Rb1 increases intestinal Akkermansia proportions. Journal of Functional Foods 2021; 86:104730. doi: 10.1016/j.jff.2021.104730 [DOI] [Google Scholar]
- 21.Hossain MA, Alam MJ, Kim B, Kang CW, Kim JH. Ginsenoside-Rb1 prevents bone cartilage destruction through down-regulation of p-Akt, p-P38, and p-P65 signaling in rabbit. Phytomedicine 2022; 100:154039. doi: 10.1016/j.phymed.2022.154039 [DOI] [PubMed] [Google Scholar]
- 22.Tan SJ, Yu WK, Lin ZL, Chen QY, Shi JL, Dong Y, et al. Anti-inflammatory Effect of Ginsenoside Rb1 Contributes to the Recovery of Gastrointestinal Motility in the Rat Model of Postoperative Ileus. Biological and Pharmaceutical Bulletin 2014; 37(11):1788–1794. doi: 10.1248/bpb.b14-00441 [DOI] [PubMed] [Google Scholar]
- 23.Yu HL, Fan CQ, Yang LJ, Yu SY, Song QQ, Wang P, et al. Ginsenoside Rg1 Prevents Chronic Stress-Induced Depression-Like Behaviors and Neuronal Structural Plasticity in Rats. Cellular Physiology and Biochemistry 2018; 48(6):2470–2482. doi: 10.1159/000492684 [DOI] [PubMed] [Google Scholar]
- 24.Park SB, Chun JH, Ban YW, Han JY, Choi YE. Alteration of Panax ginseng saponin composition by overexpression and RNA interference of the protopanaxadiol 6-hydroxylase gene (CYP716A53v2). Journal of Ginseng Research 2016; 40(1):47–54. doi: 10.1016/j.jgr.2015.04.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Liang Y, Zhao S. Progress in understanding of ginsenoside biosynthesis. Plant Biology 2008; 10(4):415–421. doi: 10.1111/j.1438-8677.2008.00064.x [DOI] [PubMed] [Google Scholar]
- 26.Han JY, In JG, Kwon YS, Choi YE. Regulation of ginsenoside and phytosterol biosynthesis by RNA interferences of squalene epoxidase gene in Panax ginseng. Phytochemistry 2010; 71(1):36–46. doi: 10.1016/j.phytochem.2009.09.031 [DOI] [PubMed] [Google Scholar]
- 27.Kim YJ, Jeon JN, Jang MG, Oh JY, Kwon WS, Jung SK, et al. Ginsenoside profiles and related gene expression during foliation in Panax ginseng Meyer. Journal of Ginseng Research 2014; 38(1):66–72. doi: 10.1016/j.jgr.2013.11.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Chen YR, Han M, Cheng L, Yang LM. Correlation between ginsenoside content and ecological factors in Panax ginseng. Chinese Traditional and Herbal Drugs 2023; 54(8):2588–2595. doi: 10.7501/j.issn.0253-2670.2023.08.026 [DOI] [Google Scholar]
- 29.Hou Y, Ma Y, Zou LS, Liu XH, Chao JG, Gu W. The Effcts of ecological factors on the secondary metabolites in medicinal plants and their research methods. Lishizhen Medicine and Materia Medica Research 2015; 26(1):187–190. doi: 10.3969/j.issn.1008-0805.2015.01.077 [DOI] [Google Scholar]
- 30.Jia GL, Huang LF, Suo FM, Song JY, Xie CX, Sun J. Correlation between ginsenoside contents in Panax ginseng roots and ecological factors and ecological division of ginseng plantation in China. Chinese Journal of Plant Ecology 2012; 36(4):302–312. [Google Scholar]
- 31.Wang KY, Zhang MP, Li C, Jiang SC, Yin R, Sun CY, et al. Correlation of gene expression related to amount of ginseng saponin in 15 tissues and 6 kinds of ginseng saponin biosynthesis. China Journal of Chinese Materia Medica 2015; 40(16):3168–3173. [PubMed] [Google Scholar]
- 32.Wang Y, Choi KD, Yu HS, Jin FX, Im WT. Production of ginsenoside F1 using commercial enzyme Cellulase KN. Journal of Ginseng Research 2016; 40(2):121–126. doi: 10.1016/j.jgr.2015.06.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Koo H, Lee YS, Nguyen VB, Giang VNL, Koo HJ, Park HS, et al. Comparative transcriptome and metabolome analyses of four Panax species explore the dynamics of metabolite biosynthesis. Journal of Ginseng Research 2023; 47(1):44–53. doi: 10.1016/j.jgr.2022.07.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Han JY, Kwon YS, Yang DC, Jung YR, Choi YE. Expression and RNA Interference-Induced Silencing of the Dammarenediol Synthase Gene in Panax ginseng. Plant and Cell Physiology 2006; 47(12):1653–1662. doi: 10.1093/pcp/pcl032 [DOI] [PubMed] [Google Scholar]
- 35.Liu J. Advances in the Biosynthesis Research of Ginsenosides and Key Enzymes. Botanical Research 2014; 03(03):84–90. doi: 10.12677/BR.2014.33013 [DOI] [Google Scholar]
- 36.Liu YN, Geng YY, Zhang SX, Hu BK, Wang JH, He JL. Quantitative analysis and screening for key genes related to tea saponin in Camellia Oleifera Abel. Seeds. Food Bioscience 2022; 49:101901. doi: 10.1016/j.fbio.2022.101901 [DOI] [Google Scholar]
- 37.Chen YR, Han M, Cheng L, Yang LM. Study on the correlation between the content of ginsenosides and ecological factors in Panax ginseng. Chinese herbal medicine 2022; 53(16):1–8. [Google Scholar]
- 38.Zhang JY, Xu XZ, Kuang SB, Cun Z, Wu HM, Shuang SP, et al. Constitutive activation of genes involved in triterpene saponins enhances the accumulation of saponins in three-year-old Panax notoginseng growing under moderate light intensity. Industrial Crops and Products 2021; 171:113938. doi: 10.1016/j.indcrop.2021.113938 [DOI] [Google Scholar]
- 39.Han JY, Kim HJ, Kwon YS, Choi YE. The Cyt P450 Enzyme CYP716A47 Catalyzes the Formation of Protopanaxadiol from Dammarenediol-II During Ginsenoside Biosynthesis in Panax ginseng. Plant and Cell Physiology 2011; 52(12):2062–2073. doi: 10.1093/pcp/pcr150 [DOI] [PubMed] [Google Scholar]
- 40.Tansakul P, Shibuya M, Kushiro T, Ebizuka Y. Dammarenediol-II synthase, the first dedicated enzyme for ginsenoside biosynthesis, in Panax ginseng. FEBS Letters 2006; 580(22):5143–5149. doi: 10.1016/j.febslet.2006.08.044 [DOI] [PubMed] [Google Scholar]
- 41.Choi HS, Koo HB, Jeon SW, Han JY, Kim JS, Jun KM, et al. Modification of ginsenoside saponin composition via the CRISPR/Cas9-mediated knockout of protopanaxadiol 6-hydroxylase gene in Panax ginseng. Journal of Ginseng Research 2022; 46(4):505–514. doi: 10.1016/j.jgr.2021.06.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Guo QS. Cultivation of medicinal plants. 1st ed. Beijing: Higher Education Press; 2004. [Google Scholar]
- 43.Chen YX, Chen YS, Shi CM, Huang ZB, Zhang Y, Li SK, et al. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. GigaScience 2018; 7(1):4689118. doi: 10.1093/gigascience/gix120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature Biotechnology 2011; 29(7):644–652. doi: 10.1038/nbt.1883 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Haas B, Papanicolaou A, Yassour M, Grabherr M, Blood P, Bowden J, et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc 2013; 8(8). doi: 10.1038/nprot.2013.084 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 2008; 9(1):559. doi: 10.1186/1471-2105-9-559 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Lv LJ, Zhang WY, Sun LJ, Zhao AJ, Zhang YJ, Wang LM, et al. Gene co-expression network analysis to identify critical modules and candidate genes of drought-resistance in wheat. PLOS ONE 2020; 15(8):e0236186. doi: 10.1371/journal.pone.0236186 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Zhang B, Horvath S. A General Framework for Weighted Gene Co-Expression Network Analysis. Statistical Applications in Genetics and Molecular Biology 2005; 4(1):17:1–43. doi: 10.2202/1544-6115.1128 [DOI] [PubMed] [Google Scholar]
- 49.Nakamura H, Fujii K, Gupta V, Hata H, Koizumu H, Hoshikawa M, et al. Identification of key modules and hub genes for small-cell lung carcinoma and large-cell neuroendocrine lung carcinoma by weighted gene co-expression network analysis of clinical tissue-proteomes. PLOS ONE 2019; 14(6):e0217105. doi: 10.1371/journal.pone.0217105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.National Pharmacopoeia Commission. Chinese Pharmacopoeia. 1st ed. BeiJing: China Medical Science and Technology Press; 2020. [Google Scholar]
- 51.Xu J, Chu Y, Liao BS, Xiao SM, Yin QG, Bai R, et al. Panax ginseng genome examination for ginsenoside biosynthesis. GigaScience 2017; 6(11). doi: 10.1093/gigascience/gix093 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Lin HM. Effect of ecological factors on ginsenoside content of Panax ginseng and the Expression of key biosynthesis enzymes genes. Jilin; 2016. [Google Scholar]
- 53.Xu S, Zhao CL, Wen GS, Zhang HL, Zeng XL, Liu ZJ, et al. Longitudinal expression patterns of HMGR, FPS, SS, SE and DS and their correlations with saponin contents in green-purple transitional aerial stems of Panax notoginseng. Industrial Crops and Products 2018; 119:132–143. doi: 10.1016/j.indcrop.2018.04.010 [DOI] [Google Scholar]
- 54.Yang LL, Zhang T, Yang LM, Han M. Effects of ecological factors on ginsenosides synthesis and its key enzyme genes expression. Chinese Traditional and Herbal Drugs 2017; 48(20):4296–4305. doi: 10.7501/j.issn.0253-2670.2017.20.026 [DOI] [Google Scholar]
- 55.Wang ZH, Wang XF, Lu TY, Li MR, Jiang P, Zhao J, et al. Reshuffling of the ancestral core-eudicot genome shaped chromatin topology and epigenetic modification in Panax. Nature Communications 2022; 13(1):1902. doi: 10.1038/s41467-022-29561-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Yang P, Duan G, Hu X. Overview of Higher Plant Promoters Research. Molecular Plant Breeding 2018; 16(5):1482–1493. doi: 10.13271/j.mpb.016.001482 [DOI] [Google Scholar]
- 57.Xu N, Wei L, Liu J. Recent advances in the applications of promoter engineering for the optimization of metabolite biosynthesis. World Journal of Microbiology and Biotechnology 2019; 35(2):33. doi: 10.1007/s11274-019-2606-0 [DOI] [PubMed] [Google Scholar]
- 58.Jin LQ, Jin WR, Ma ZC, Shen Q, Cai X, Liu ZQ, et al. Promoter engineering strategies for the overproduction of valuable metabolites in microbes. Applied Microbiology and Biotechnology 2019; 103(21–22):8725–8736. doi: 10.1007/s00253-019-10172-y [DOI] [PubMed] [Google Scholar]
- 59.Cazier AP, Blazeck J. Advances in promoter engineering: Novel applications and predefined transcriptional control. Biotechnology Journal 2021; 16(10):2100239. doi: 10.1002/biot.202100239 [DOI] [PubMed] [Google Scholar]
- 60.Haberle V, Stark A. Eukaryotic core promoters and the functional basis of transcription initiation. Nature Reviews Molecular Cell Biology 2018; 19(10):621–637. doi: 10.1038/s41580-018-0028-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Chu D, Kazana E, Bellanger N, Singh T, Tuite MF, von der Haar T. Translation elongation can control translation initiation on eukaryotic mRNAs. The EMBO Journal 2014; 33(1):21–34. doi: 10.1002/embj.201385651 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Liu Y, Mu QX, Shi YN, Yu B. Metabolic regulation in constructing microbial cell factories. Chinese Journal of Biotechnology 2021; 37(5):1541–1563. doi: 10.13345/j.cjb.200688 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
(TIF)
(TIF)
(TIF)
(TIF)
(TIF)
(TIF)
(TIF)
Data Availability Statement
Gene expression and phenotype data produced for this research have been deposited in Dryad (https://datadryad.org/stash/share/W_z9Ss63PT3Q52S_2e8k1O18wlZtweYV7xbvRBfkUvU).









