Skip to main content
Frontiers in Plant Science logoLink to Frontiers in Plant Science
. 2026 Feb 18;16:1703429. doi: 10.3389/fpls.2025.1703429

Metabolomic analysis of Yunnan cigar tobacco leaves: impact of geography and climate on flavor characteristics and machine learning-based origin traceability

Yuping Wu 1,, Guijuan Zhao 2,3,, Yi Li 2, Guifeng Li 2, Wenyuan Wang 3, Lei Yang 3, Zhonglong Lin 2, Heng Yao 1, Fangchan Jiao 1, Gaokun Zhao 1,*, Yongping Li 1, Guanghai Zhang 1, Meiwei Zhao 4,*, Tao Zhang 3,*, Jin Wang 3,*
PMCID: PMC12957283  PMID: 41789420

Abstract

To investigate how Yunnan's distinctive geographical and climatic conditions shape the unique metabolic profile of its cigar tobacco leaves (CTLs), and to establish a reliable method for origin traceability using machine learning, a non-targeted metabolomics analysis was conducted on 71 CTL samples collected from the Dominican Republic, Indonesia, and Yunnan, including Lincang, Pu’er, and Yuxi within Yunnan. A total of 778 highly reliable metabolites were identified. Influenced by Yunnan's high altitude, large diurnal temperature variation, intense ultraviolet radiation, and relative dryness, its CTLs exhibited characteristic metabolic profiles, with significant enrichment in pathways such as flavone and flavonol biosynthesis and betalain biosynthesis. Elevated levels of polyphenols, indoles, jasmonates, carotenoids, and other compounds were linked to Yunnan CTLs' distinct woody, roasted, and astringent flavor profile. Twelve key biomarkers were selected using Multivariate methods with unbiased variable selection in R (MUVR). Machine learning algorithms—including LDA, LR, GMM, KNN, and SVM—were applied to these biomarkers, achieving highly accurate origin discrimination across national (Yunnan vs. Dominican Republic/Indonesia) and regional (Lincang, Pu’er, Yuxi) scales. Validation results showed a median false classification rate of 0.1 over 100 iterations and an AUC close to 1, confirming the model's high accuracy and robustness for CTLs origin traceability.

Keywords: biomarkers, flavor profile, geographical origin, machine learning, metabolomics

1. Introduction

Cigar, a product of aged and fermented tobacco leaves, has been cherished by smokers worldwide for its unique flavors and aromas (Yao et al., 2024; Wang et al., 2023) Its qualities are attributed to the complex interplay of various chemical compounds formed during the tobacco plant’s growth, harvest, and subsequent processing stages (Li et al., 2021b; Zheng et al., 2022). The unique terroir of different tobacco-growing regions, including soil composition, climate, and agricultural practices, significantly influences the final biochemical profile of the tobacco leaves, thereby contributing to the distinct sensory attributes of cigars (Hu et al., 2023; Yan et al., 2022). Due to the promising geographical and climatic conditions and fermentation process, the current high-quality cigar tobacco is mainly produced in Cuba, the Dominican Republic, Indonesia, the United States, Honduras, etc (Chen et al., 2024). Dominican CTLs are known for their distinctive mellow, smoky, grassy, and smoothness (Chen et al., 2024; Zheng et al., 2022; Geng et al., 2023). Indonesian CTLs were characterized considerably by fruity, and woody, leathery, peppery, and baked aroma (Zheng et al., 2022; Yang et al., 2024). As a high-quality tobacco-producing area in China, the cultivation of cigar tobacco in Yunnan is gradually increasing. Yunnan, mainly in Yuxi, Lincang, Pu’er, have achieved a cigar tobacco cultivation of 1,000 hectares, accounting for more than 50% of China’s.

As the global cigar market expands, there is an increasing demand for more precise and scientific methods to distinguish cigars from different regions based on their chemical composition, offering insight into their origin, quality, and authenticity (Ye et al., 2023). On the other hand, origin authenticity is an important part of product authenticity (Ye et al., 2023). Origin traceability is of great significance for protecting agricultural products’ origin, supervising agricultural products’ circulation, and guaranteeing product quality (Jin et al., 2021). There are many technologies used for origin traceability, such as isotope analysis (Zhi et al., 2023), element analysis (Wang et al., 2021), infrared spectroscopy (Liang et al., 2022), nuclear magnetic resonance (Maestrello et al., 2022), and DNA analysis (Lanubile et al., 2024).

Metabolomics is the comprehensive analysis of small molecules, or metabolites, within a biological sample (Muthubharathi et al., 2021). This technique is based on the premise that the metabolic profile of an organism reflects its genetic makeup, environmental conditions, and physiological state (Pillon et al., 2021). By analyzing the metabolic profile of a sample, it is possible to gain insights into its biological origin and traceability. This approach has emerged as a powerful tool (Li et al., 2021a), allowing researchers to characterize complex biological systems by identifying the metabolites that contribute to specific traits (Muthubharathi et al., 2021; Pillon et al., 2021).

This paper aims to analyze the differences in chemical composition and metabolism of cigar tobacco leaves(CTLs) from Yunnan, Dominica, and Indonesia using metabonomics and reveal the material basis of the flavor profile of CTLs from different geo-origins. To further understand the shaping and influence of geographical climate on the flavor characteristics of CTLs in Yunnan. In addition, metabolites combined with machine learning methods were used to accurately trace the origin of CTLs at different geographical scales, from national to regional levels.

2. Materials and methods

2.1. Sampling

71 cigar leaf samples were collected from dominica (DMNJ, n=16), Indonesia (IDA, n=13), Yunnan, China (CN, n=42). China’s Yunnan cigar tobacco mainly comes from the three major cigar tobacco producing areas in Yunnan, Lincang(LC, n=10), Pu’er (PE, n=12)and Yuxi (YX, n=20). Dominican samples were collected in Cibao Valley regionand Indonesian samples were collected in Sumatra Island. The Yunnan samples were representative of the core cigar-producing regions: Lincang, Pu’er, and Yuxi. Cigars from these three places have the unique style of Yunnan cigars. The varieties of Indonesian samples are BESUKI and SUMATRAL, while the Dominican cigar varieties include HABANO, OLOR, and CONNECTICUT. Varieties of Yunnan samples are YX1, YX2, and YX8, which have been introduced from abroad and improved. All samples were collected during the 2020–2022 production season to minimize seasonal variation.

The following is the processing technology of Yunnan cigars. Referring to the processing technology of Cuban cigars, fresh tobacco leaves, after being harvested, enter the air-curing process, and the specific process is as follows: 1. Withering period. Duration days: 2–4 days, temperature is 25°C-28°C, relative humidity is about 80%. 2. Yellowing period. Duration days: 5–10 days, temperature is 25°C-28°C, relative humidity is about 85%. 3. Browning period. Duration days: 10–15 days, temperature is 28°C ~ 30°C, relative humidity is 85%-80%. 4. Dry tendon period. Duration days: 7–10 days for tobacco planted in shade (7–15 days for tobacco planted in sunshine). Tobacco leaves planted in shade should have a temperature of 30°C-32°C and a relative humidity of 80%-35%. Tobacco leaves planted in the sun require a temperature of 32°C ~ 35°C and a relative humidity of 80% -35%. After that, it enters the fermentation process, and the treatment method refers to “DB53/T1303–2024 Technical Regulations for Agricultural Pile Fermentation of Cigar Tobacco Leaves”. Finally, the storage conditions are the relative humidity 70-75% and temperature 20 ± 2°C.

Prior to analysis, all cigar leaf samples were dried in a constant temperature oven at 40°C for 48 hours until a constant weight was achieved. The dried samples were then ground into powder and passed through a 40-mesh sieve to ensure homogeneity.

2.2. Non-targeted metabolic analysis

Non-targeted metabonomic analysis and subsequent data analysis were undertaken by BGI (Wuhan, 430070, China). The sample extraction and determination methods were similar to those of Li et al. (Li et al., 2024), and some methods were adjusted. After the sample was extracted, d3-Leucine(250ng/ml), 13C9-Phenylalanine(25μm/ml), d5-Tryptophan(12.5μm/ml), 13C3-Progesterone (12.5μm/ml)were added as internal standards, UPLC-MS/MS detection was performed. The off-line data of mass spectrometry were imported into the Compound Discoverer 3.3 (Thermo Fisher Scientific, USA) software, combined with BMDB (BGI metabolome database), mzcCloud database and ChemSpider online database for mass spectrometry data analysis.

2.3. Data analysis

The PCA and PLS-DA were performed using SIMCA 14 (sartorius Stedim, Germany). Analysis of metabolic differential compounds was performed using the Metware Cloud, a free online platform for data analysis (https://cloud.metware.cn). Metaboanalyst 6.0 was used for metabolic pathway enrichment analysis(https://www.metaboanalyst.ca). The volcano plot, cluster heatmap, box plot, RDA etc. are mapped using Metware Cloud. For metabolic indices discovery, we used multivariate methods with Unbiased Variable selection in R package (MUVR) to select potential metabolic indices (Shen et al., 2021; Wang et al., 2023). The algorithm and multi classification ROC curve are also implemented by R package.

3. Results and discussion

3.1. Overview of origins geographical and climatic characteristics

Plants are highly sensitive to the environment in which they grow. Geographic and climate conditions, such as temperature, precipitation, altitude, and soil, have profound impacts on plant growth, development, and metabolic processes (Chaudhary et al., 2022). Table 1 presents the geographical and climatic data of three countries’ main cigar-producing areas. As shown in Table 1, the rainfall in Yunnan, China, has relatively low, the humidity and the temperature is relatively low, the altitude is high, and the temperature difference between day and night is significant; Indonesia has high rainfall and humidity, slight temperature difference between day and night, and the soil is volcanic ash soil; Dominica is red soil with moderate rainfall and other factors similar to Indonesia. Geographic and climate conditions play a critical role in shaping plant metabolism, influencing both primary and secondary metabolite production. Increased ultraviolet radiation associated with high altitude is correlated with enhanced metabolism and accumulation of flavonoids in plants, as reported in previous studies (Pant et al., 2021) A significant temperature difference is beneficial for the accumulation of carbohydrates, which has been confirmed in our previous work (Wu et al., 2023). The research shown that Yunnan’s CTLs has the highest total sugar, reducing sugar, and starch content than Indonesian and Dominican CTLs (Wu et al., 2023). An appropriate temperature is beneficial for plant growth and photosynthesis. And relatively high or low temperatures have a stress effect on plants (Yamori et al., 2022). Red soil is rich in Fe (Zong et al., 2021), volcanic soil is rich in Ca, Mg, Zn and other trace elements (Mulyono et al., 2024; Abe et al., 2020), both of which have important effects on plant metabolism. In conclusion, geographic and climate conditions play a critical role in shaping cigar tobacco metabolism. Moreover, agricultural production practices, fertilization, varieties, and fermentation processes also have significant impacts on the metabolites of cigar tobacco leaves.

Table 1.

The geographic and climate characteristics of cigar tobacco origins (Mulyono et al., 2024; Liang et al., 2021; Delanoy et al., 2022; Kettler, 2022, Jin et al., 2012, Irfan et al., 2023).

Origins Soil Climate Rainfall** (mm) Temperature** (°C) Day-night temperature difference** (°C) Humidity** (%) Elevation (m)
DMNJ(Cibao valley)* red soil tropical marine climate 1200-2000 25-30 3-7 >80 50-900
INA(Sumatra)* volcanic (ash) soil tropical rainforest climate 1600-2200 25-30 3-5 >80 10-700
CN(Yunnan)* red soil subtropical mountain monsoon 400-1500 10-28 10-15 40%-80 1200-2200

*The data is of mainly cigar-producing areas in brackets.

**It’s growing season data.

3.2. Multivariate statistical analysis

High-resolution LC-MS/MS identified 778 highly reliable metabolites, with 494 compounds identified in the positive mode and 284 compounds identified in the negative mode. A total of 433 metabolites (55.5%) were successfully annotated by matching against the PubChem and Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Principal Component Analysis (PCA) is a commonly used unsupervised data dimensionality reduction method. From the PCA shown in Figure 1A, it can be seen that there are significant differences between Yunnan and Indonesian and Dominican CTLs, but Indonesian and Dominican CTLs have a certain degree of overlap, indicating that there may be some similarity between their metabolite fingerprints, making it difficult to distinguish.

Figure 1.

Two scatter plots labeled A and B display data with colored dots representing groups: green for DMNJ, blue for INA, and red for CN. Both plots have elliptical outlines. Plot A shows red dots mostly on the left, green and blue dots on the right. Plot B shows a tighter cluster of red dots towards the center-left, with green and blue dots more spread out. Axes are marked as t[1] and t[2].

The PCA (A) and PLS-DA (B) plots of CTLs form Dominica, Indonesia and China (Yunnan).

And Partial Least Squares Discrimination Analysis (PLS-DA) is a commonly used supervised data dimensionality reduction method. Now, PLS-DA is applied to distinguish CTLs from different origins. As shown in Figure 1B, CTLs from the three regions can be clearly distinguished. The (R2 Y) and (Q2) of the PLS Da model are 0.812 and 0.745, respectively (see Supplementary Figure S1A), indicating good performance in identifying the origin of cigar tobacco leaves. R2 Y and Q2 parameters greater than 0.5 and close to 1 indicate that the model has a high correlation (R2 Y) and predictive ability (Q2) for sample geographic differentiation (Rivera-Pérez et al., 2021).

Furthermore, PLS-DA analysis showed that the number of important variables (VIP values) greater than 1 in the projection reached 493 (as shown in Supplementary Figure S1B). A VIP value >1 indicates that the variable is important to the PLS-DA model. If the independent variable and Y have the same explanatory power, the VIP value will be 1. The higher the VIP value, the greater the contribution of the independent variable to the model. Therefore, the relatively high proportion of VIP>1 (about 39.5%) also reflects the significant similarity in metabolites of cigar tobacco leaves from these three production areas, which to some extent reflects the difficulty in distinguishing CTLs production regions (Zhao et al., 2022).

3.3. Metabolomics analysis

The CTL Metabolomics analysis was performed using the MetaboAnalyst 6.0 (https://www.metaboanalyst.ca/home.xhtml) and the Metware Cloud (https://cloud.metware.cn). The significant differential metabolites are metabolites with P <0.05, fold changes of >2 or <0.5, and VIP>1, and visualized by Volcano plots. The significant differential metabolites were used for clustering analysis in a heatmap, followed by a pathway enrichment analysis with metabolites using Arabidopsis thaliana (thale cress) (KEGG) pathway library. The significantly enriched pathway threshold is -lg(P)>1 and pathway impact>0.25.

3.3.1. Comparison of the metabolic profiles of Dominican and Yunnan CTLs

PCA can distinguish between Dominican and Yunnan CTLs with little overlap, as shown in Figure 2A. Through differential metabolite analysis, 171 significantly different metabolites were identified, with 77 metabolites being significantly up-regulated and 94 metabolites being significantly down-regulated, as shown in the volcano plot in Figure 2B and Supplementary Table S1. Cluster analysis was conducted on 203 differential metabolites, which can significantly distinguish Dominican and Yunnan CTLs, as shown in Figure 2C.

Figure 2.

Chart panel with four parts: A shows a PCA plot with two groups, DMNJ in pink and CN in blue. B is a volcano plot with significant gene expression changes indicated by red and green dots. C presents a heatmap with a gradient from green to red, reflecting differential expression levels. D depicts a pathway impact plot highlighting significant biological pathways, such as flavone and flavonol biosynthesis.

Metabolic profiles of Dominican (DMNJ) and Yunnan (CN) CTLs. (A) Plot of PCA. (B) Volcano plot of differential metabolites. (C) Heatmap of differential metabolites. Red indicates metabolites that were up-regulated and green indicates metabolites that were down-regulated. (D) Pathway enrichment plot. Colors represent the relative degree of the impact of each pathway (X-axis) and statistical significance (Y-axis).

Through differential metabolic analysis(as shown in Supplementary Table S1), it was found that the content of indole compounds such as 3-Indolepropionic acid, Indole-3-lactic acid, trans-3-Indoleacrylic acid was relatively high in Yunnan CTLs(as shown in Figure 3). Indoles are one of the plant growth factors that can promote plant growth, cell division, and root growth (Sun et al., 2023). They can also be used to resist external stress. Therefore, due to the relatively dry weather in Yunnan, cigars require a relatively high content of indoles to promote root growth. However, some may also be in response to high UV radiation stress in Yunnan. In addition, the content of 7-Isomethyljasmonate is also relatively high(as shown in Figure 3). As a plant hormone, the increase in jasmonic acid content may also be related to the relative drought and UV stress in Yunnan (Raza et al., 2021). Furthermore, indoles and jasmonic acid compounds also have a certain odor (Acree et al., 1985; Priolo et al., 2009) and maybe contribute to the flavor of cigars. In Yunnan CTLs, such as Caffeic acid, Sinapinic acid, Gallic acid, Ferulic acid, Matairesinol, Kaempferol, Rutin, and other polyphenols have high content. Polyphenol content is very important for the quality of cigars. Fermentation with polyphenol oxidase (PPO)-producing microbes specifically degrades polyphenols to form quinones and other substances (Meng et al., 2022), which make the cigar less “green/leafy” harshness, more mellow body, darker “cured” aroma and improves cigar aroma richness while reducing miscellaneous/irritating notes (Zhang et al., 2023). Furthermore, during smoking, heat (up to 900°C) transforms precursors. Polyphenols pyrolyze into phenols (e.g., guaiacol via cleavage) (Rigling et al., 2023), detected by olfactory receptors for smoky sensations (Juhlke et al., 2017). The high content of polyphenols in Yunnan CTLs is potentially associated with the intense ultraviolet radiation in the region, consistent with the known role of polyphenols in plant UV stress response (Shen et al., 2022). Some works reported that the presence of polyphenols in wine can increase its astringency and flavor richness (Soares et al., 2017; Gawel, 1998). Therefore, Sinapinic acid with bitterness (Nandasiri et al., 2021), Gallic acid with bitterness, sourness, and astringency (Robichaud and Noble, 1990), as well as the important flavor compound Rutin in tobacco (Chen et al., 2021), may also increase the astringency and flavor of Yunnan tobacco. In addition, the increase in the content of carotenoids such as 1 ‘- Hydroxy-gamma-carotene and 1,1’ - Dihydroxylycopene in Yunnan CTLs may also be related to the high UV intensity in Yunnan (Badmus et al., 2022). Carotenoids are also important flavor compounds (Vogel et al., 2010) and have a certain color (Gordon et al., 1983), therefore, they have an impact on the flavor and color of Yunnan CTLs. The melatonin content of CTLs in Yunnan is 2.5 times that of the Dominican Republic, which is consistent with the potential influence of Yunnan’s intense ultraviolet radiation on the metabolism of cigar tobacco leaves. In Arabidopsis thaliana, melatonin not only acts as an antioxidant to promote UV-B stress resistance, but also regulates expression of several key components of UV-B signalling pathway to protect the plant from UV-B stress (Yao et al., 2021). After UV-B exposure, melatonin concentration increases in Malus hupehensis (Wei et al., 2019), and it can reduce DNA damage in Nicotiana sylvestris plants (Zhang et al., 2012). Therefore, the increase of melatonin may be related to UV radiation in Yunnan.

Figure 3.

Box plots labeled A to E compare the normalized area of various compounds between DMNJ and CN groups. A shows melatonin, B shows 7-Isomethyljasmonate, C shows 3-Indolepropionic acid, D shows trans-3-Indoleacrylic acid, and E shows Indole-3-lactic acid. CN group exhibits higher values than DMNJ across all compounds. Each plot includes outliers.

Box plots labeled A–E compare the normalized area of various compounds between DMNJ and CN groups. A shows melatonin, B shows 7-Isomethyljasmonate, C shows 3-Indolepropionic acid, D shows trans-3-Indoleacrylic acid, and E shows Indole-3-lactic acid. CN group exhibits higher values than DMNJ across all compounds. Each plot includes outliers.

Our previous research has shown that the aroma of Yunnan cigars is mainly woody, roasted, fresh sweet, bean, and scored (Wu et al., 2023). And the high content of Tolylacetate [sweet odor (Liu et al., 2024)], Benzoin (sweet-balsamic odor of vanilla) (Fernandez et al., 2006) and alpha-Ionone [(fruit, floral (Aloum et al., 2020)] maybe have significant contributions to the flavor of Yunnan CTLs. Dominican CTLs were reported tohad aromatics of mellow, smoky, grassy, and smoothness. And Dominican CTLs has relatively high content of Hydrocinnamic acid [floral (Stavrou et al., 2024)], trans-Cinnamic acid [warm spicy and balsamic odour (Miyazawa et al., 1996)]. These flavor compounds may be the basis of the flavor compounds in Dominican CTLs.

The KEGG pathway database is the most commonly used database in the study of metabolic and regulatory pathways. The analysis of metabolic pathways helps to systematically understand the impact of the habitat environment on plants, providing a good reference for further exploring the adaptability of plants to the environment. 68 (40%) of the differential metabolites were successfully annotated onto the KEGG pathway for comparison by importing them into Metaboanalyst. KEGG metabolic pathway enrichment analysis identified 8 metabolic pathways, with 3 significantly enriched pathways Flavone and Flavonol biosynthesis, Phenylpropanoid biosynthesis and Nicotinate and nicotinamide metabolism as shown in Figure 2D. Flavonoids and flavonols are important polyphenols in plant secondary metabolites, belonging to the two major subclasses of flavonoids. Flavor and flavor compounds can absorb ultraviolet light. At the same time, due to the electron-donating effect of phenolic hydroxyl groups, they can eliminate free radicals and reduce oxidative stress. Thus they play a protective role in plant cells under UV radiation stress. They are important regulatory means for plants to cope with UV stress (Shen et al., 2022; Jaakola and Hohtola, 2010). Phenylpropanoid metabolism is one of the core pathways of plant secondary metabolism, which is involved in the production of a series of important plant compounds, including lignin, flavonoids, isoflavones and phenolic acids (Dong and Lin, 2021). Phenylpropanoid metabolism is the upstream metabolic pathway of flavor and flavor metabolism. It is also an important means for plants to cope with UV radiation stress (Rahim et al., 2023). The enrichment of these two pathways is closely correlated with the high intensity of ultraviolet radiation in Yunnan, which may reflect an adaptive metabolic response of CTLs to the local environment.

Nicotinic acid and nicotinamide metabolism are important secondary metabolic pathways in plants. They not only directly regulate the content of alkaloids in tobacco, but also directly participate in the synthesis and circulation of nicotinamide adenine dinucleotide (NAD) and its phosphorylated form (NADP). NAD/NADP is an important redox coenzyme involved in redox reaction, energy metabolism, cell respiration, photosynthesis and carbon fixation (Hashida et al., 2016; Ashihara et al., 2020). The enrichment of nicotinate and nicotinamide metabolism may be the reason for the high nicotine content in Yunnan CTLs, which may be related to geography and climate, and may also be affected by agricultural production, such as the application of nitrogen fertilizer. In addition, the high carbohydrate content of Yunnan CTLs may also be related to nicotinate and nicotinamide metabolism.

3.3.2. Comparison of the metabolic profiles of Indonesian and Yunnan CTLs

PCA can clearly distinguish Indonesian and Yunnan cigars(as shown in Figure 4A). Through differential metabolite analysis, 200 significantly different metabolites were identified, with 107 metabolites being significantly up regulated and 93 metabolites were significantly down regulated. As shown in the volcano plot in Figure 4B and Supplementary Table S2. Cluster analysis was conducted on 161 differential metabolites, which can significantly distinguish CTLs from these two production areas, as shown in Figure 4C.

Figure 4.

Panel image depicting various data visualizations: A) PCA biplot with pink and blue clusters labeled INA and CN. B) Volcano plot showing distribution of log2 fold changes and p-values, with red, green, and gray dots indicating significance. C) Heatmap with a green to red color gradient. D) Scatter plot indicating pathway impacts, highlighting Betalain biosynthesis, Linoleic acid metabolism, and C5-Branched dibasic acid metabolism.

Metabolic profiles of Indonesian (INA) and Yunnan (CN) CTLs. (A) Plot of PCA. (B) Volcano plot of differential metabolites. (C) Heatmap of differential metabolites. Red indicates metabolites that were up-regulated and green indicates metabolites that were down-regulated. (D) Pathway enrichment plot. Colors represent the relative degree of the impact of each pathway (X-axis) and statistical significance (Y-axis).

Differential metabolite analysis showed that the levels of 7-Isomethyljasmonate, trans-3-Indoglycolic acid, and melatonin in CTLs from Yunnan were still higher than those in Indonesia (as shown in Figure 5). This result aligns with the potential association between high ultraviolet radiations in Yunnan on the metabolism of cigar tobacco. Indonesian CTLs were reported to characterize by leather, pepper, and baked aroma. High content of Hydrocannamic acid, trans-Cannamic acid, Methylcinnamate [fruity, sweet-balsamic (Padalia et al., 2017)], Phenyllacetic acid [honey (Szudera-Kończal et al., 2023)] in Indonesian CTLs, which may be the partial material basis of its flavor.

Figure 5.

Box plots comparing the normalized area of three compounds between INA and CN groups. A) Trans-3-Indoleacrylic acid shows higher values in CN. B) Melatonin displays greater variability and higher values in CN. C) 7-Isomethyljasmonate also shows higher values in CN than INA. Outliers are marked with dots.

Box plots comparing the normalized area of three compounds between INA and CN groups. (A) Trans-3-Indoleacrylic acid shows higher values in CN. (B) Melatonin displays greater variability and higher values in CN. (C) 7-Isomethyljasmonate also shows higher values in CN than INA. Outliers are marked with dots.

Comparison of differential metabolites revealed that 65 (33%) were successfully annotated onto the KEGG pathway. KEGG metabolic pathway enrichment analysis identified 20 metabolic pathways, with 3 significantly enriched pathways including Betalain biosynthesis, Linoleic acid metabolism, C5-Branched dibasic acid metabolism, etc., as shown in Figure 4D.

Betalains are hydrosoluble pigments almost wholly restricted to species of the core Caryophyllales and, within the fungal lineage, the Basidiomycetes (Tossi et al., 2021). A large number of Basidiomycetes are detected in Yunnan CTLs, which are relatively higher than Indonesian. The difference in Basidiomycetes content may be one reason for the enrichment of the Betalain biosynthesis pathway (Zheng et al., 2022).

Linoleic acid is a kind of polyunsaturated fatty acid, which plays various important roles in plants. It is not only an important part of membrane lipids, but also produces a series of bioactive signal molecules and secondary metabolites through its metabolic pathway, which plays a key role in plant growth and development and environmental stress response. Linoleic acid can metabolized into jasmonic acid (JA), which plays a key role in the response of plants to biotic and abiotic stresses (Nie et al., 2024). Oxidized lipids produced during linoleic acid metabolism can be used as antioxidant signal molecules. Enhance the antioxidant capacity of plants and protect plant cells from oxidative damage (Mata-Pérez et al., 2017). Linoleic acid metabolism involves plant secondary metabolic pathways, such as flavonoid and alkaloid biosynthesis. These metabolites enhance the environmental adaptability of plants (Wu et al., 2021). T The enrichment of the Linoleic acid metabolism pathway is potentially associated with UV radiation in Yunnan, which may contribute to the plant’s environmental adaptation (Nie et al., 2024).

C5-branched dibasic acid metabolism is an important metabolic pathway in plants, which is mainly involved in stress response, growth regulation and complex regulation of metabolic network. This metabolic pathway involves several key metabolites, such as α - ketoglutarate and branched chain amino acid degradation products, which can provide energy and regulatory molecules for plants (Kumari and Parida, 2018). Chenopodium quinoa roots significantly increased C5-branched dicarboxylic acid metabolism under waterlogging stress, and regulated urea cycle, starch and sucrose metabolism to adapt to hypoxic environment (Guo et al., 2024). Wheat (Triticum aestivum)significantly increased the metabolism of C5 branched chain dicarboxylic acid and the utilization efficiency of mineral nutrients under the condition of nitrogen and zinc supply (Nie et al., 2023). Therefore, the enriched in C5-branched dicarboxylic acid metabolism pathway may be related to the heavy rainfall in Indonesia and volcanic ash soil.

3.4. Redundancy analysis of origin, metabolites and environment of CTLs

In order to clarify the correlation characteristics between the metabolites composition of cigars and environmental factors, the data of metabolites from three origins and environmental factors (altitude, temperature difference between day and night, humidity, etc.) were subjected to redundant analysis (RDA), and the results are shown in Figure 6.

Figure 6.

Scatter plot showing RDA analysis with two axes: RDA1 (nineteen percent) and RDA2 (five point two percent). Red, yellow, and orange dots represent groups CN, DMN, and INA, respectively. Environmental variables like rainfall, humidity, and temperature are indicated by vectors. Dots show clustering and spread across the axes, suggesting group differentiation based on these environmental factors.

Redundancy analysis (RDA) plots of metabolite and environmental factors of CTLs form Dominica, Indonesia and China (Yunnan).

In RDA ordination chart, RDA1 axis explains 19% of metabolomic variation, which is attributable to the sampled environment, and RDA2 axis explains 5.2% of the variation, with a cumulative explanatory rate of 24.2%, suggesting that environmental factors appear to be significant contributors associated with variation of metabolites in cigars. However, Approximately 75% of the variations remain unexplained by the current set of environmental factors, suggesting the importance of factors such as soil physical and chemical properties, agricultural production, post-harvest processing, varieties, and processes. These will be studied systematically in the future work.

The samples in CN group are concentrated on the right side of RDA1 axis, and their metabolites are mainly driven by altitude and temperature difference between day and night. The samples in DMNJ and INA groups gathered on the left side of RDA1 axis, and the metabolite composition was more regulated by humidity and other factors.

The Elevation and Day–Night Temperature Difference arrows point positively towards RDA1, which is a key gradient distinguishing Yunnan CTLs from Dominica/Indonesia. Yunnan CTLs is located in the quadrant where the two arrows point in the same direction, indicating that its metabolic profile covaries with high elevation and large day-night temperature differences. The Temperature and Humidity arrows are roughly along the negative direction of RDA1 or close to the negative direction of RDA2, indicating that Dominica/Indonesia is more affected by higher temperature/higher humidity conditions.

These results indicated that the heterogeneity of the origin environment is closely associated with the differences in metabolite composition of cigars from different origins, suggesting it may be a key contributing factor.

While our RDA and metabolomics profiling reveal strong links between environmental variables and metabolite accumulation, it is important to emphasize that these results reflect statistical associations rather than direct causal evidence. The observed metabolic patterns co-vary with climatic factors like UV radiation and temperature, but further controlled physiological experiments are required to establish the specific causal mechanisms involved.

3.5. Origin traceability

After systematically analyzing the metabonomic characteristics and pathways of CTLs in Dominica, Indonesia and Yunnan, we began to develop origin traceability models by selecting metabolites and classification algorithms that can be used to distinguish CTLs from different countries and cities. At present, it isn’t easy to trace the origin, especially in small regional scale origins. To achieve the traceability of CTLs from different regions at different regional scales, we need to select the appropriate key metabolic indices and classification algorithms.

3.5.1. Screening of metabolic indices

Multivariable methods with unbiased variable selection in the R package (MUVR) are widely used in the screening of biological indicators of metabolic diseases (Shen et al., 2021; Wang et al., 2023) and have been proven to be very effective. We randomly selected 50 samples as the training set and the remaining as the validation set. To improve the classification performance and avoid over fitting and false positives, we use MUVR algorithm based on random forest for multivariate modeling, which can identify Min-, Mid-, and Max-optimal respectively, and perform regression analysis with all-relevant variables.

12 Min-optimal, 77 Mid-optimal, and 497 Max-optimal were screened through the MUVR model, and the corresponding number of error classifications per 100 times were 1, 0, and 1, respectively, show in Supplementary Table S3 and Figure 7. The twelve Min-optimal are the selected metabolic index, they are: Acetylpseudotropine, 3-Methoxysalicylic acid, Monobutyl phthalate, Sinapyl alcohol, Phytyl phosphate, 3-Hydroxyanthranilic acid, 1-Acetylaspidoalbidine, Anacardic acid, Eicosapentaenoic acid ethyl ester, 8-Amino-7-oxononanoate, 2-Amino-3-methoxybenzoic acid, 4-Dodecylbenzenesulfonic acid.

Figure 7.

A line graph illustrating Balanced Error Rate (BER) against the number of variables on a logarithmic scale. Gray lines represent validation segments and repetitions, while a bold black line shows the overall trend, decreasing sharply initially and flattening out. Vertical lines mark specific points: red at around 20 variables ('Min'), green near 100 variables ('Mid'), and blue at about 500 variables ('Max').

Variable selection based on a multivariate methods with unbiased variable selection in R (MUVR) algorithm. [Balanced Error Rate (BER)].

To evaluate the robustness of the 12 index for origin, one-way analysis of variance (ANOVA) Tukey’s test were employed to calculate the differences in index influenced by years and varieties. The P-value results are shown in Supplementary Table S4. As indicated in the table, for years, only 8-Amino-7-oxononanoate and 4-Dodecylbenzenesulfonic acid P-values are less than 0.05 (0.045 and 0.026, respectively), indicating that these two indicators were influenced by the years. Meanwhile, for varieties, only Anacardic acid, 2-Amino-3-methoxybenzoic acid, and 4-Dodecylbenzenesulfonic acid P-values are less than 0.05 (0.045 and 0.026, respectively), suggesting that these three indicators were affected by the varieties. Thus, a small part of 12 biomarkers were influenced by both year and cultivar, but most of them remained robust, demonstrating that they are primarily influenced by the geographical origin.

3.5.2. Selection of algorithm

Based on the above metabolic indices, the commonly used classification algorithms of Linear Discriminant Analysis (LDA), Logistic regression (LR), Gaussian Mixture Module (GMM), K-Nearest Neighbors (KNN), Classification Tree (CT), Artificial Neural Network (ANN) and Support Vector Machine (SVM) are selected for comparison. For 71 samples, 50 CTLs were randomly selected as the training set, and the remaining 21 samples were used as the test set for verification. Then the R software was used for 100 repeated modeling tests. The purpose of using random sampling and 100 repetitions is to ensure that all samples are assigned to the training set and test set with the same probability and frequency.

The result of the error classification rate is shown in Figure 8. The results show that LDA, LR, GMM, KNN and SVM have better classification effect, and the median of 100 false classification rate is about 0.1. The error classification rates of CT and ANN are relatively high, 0.29 and 0.42, respectively. This may be because CT and ANN are suitable for large and complex sample analysis, and small sample size classification is easy to overfit.

Figure 8.

Box plot showing the error distribution of seven models: LDA, LR, GMM, KNN, CT, ANN, and SVM. Each model is represented by a colored box with whiskers and outliers, displaying variations in their error rates.

100 prediction errors of SVM, LDA, LR, GMM, and KNN models for different origins CTLs.

Receiver Operating Characteristic Curve (ROC) is a commonly used method to evaluate the classification effect of algorithm model (Obuchowski and Bullen, 2018). Area under curve (AUC) is defined as the area enclosed by the coordinate axis under the ROC curve (Obuchowski and Bullen, 2018). The closer the AUC is to 1, the better the classification effect of the model. The algorithm was further evaluated, and the Hand-Till multiple class classifications ROC curves (Fawcett, 2006; Hand and Till, 2001) were constructed by using LDA, LR, GMM, KNN and SVM, respectively. The results are shown in Supplementary Figure S2. It can be seen that LDA, LR, GMM, KNN and SVM have a good classification effect in CTLs origin identification, with their AUC near 1. The results are consistent with the previous 100 false classification rate results, which once again confirm the classification accuracy of these algorithms. The indices obtained based on MUVR screening method can accurately trace the origin, whether in large national scale or in small prefecture scale that is difficult to trace, proving that MUVR is robust in the screening of geo-origin characteristic index.

4. Conclusions

In this paper, the CTLs from Dominica, Indonesia and Yunnan have been studied by non-targeted omics. The metabolic pathways of different metabolites in CTLs from Yunnan, Dominica and Indonesia were enriched, and six metabolic pathways such as Flavone and flavonol biosynthesis, Phenylpropanoid biosynthesis and Nicotinate and nicotinamide metabolism, Betalain biosynthesis, Linoleic acid metabolism, C5-Branched dibasic acid metabolism were obtained. This may be related to the geographical and climatic characteristics of Yunnan, such as UV light intensity, relative dryness (drought), large temperature difference between day and night, and may also be related to the fermentation process. The content of polyphenol flavor compounds in Yunnan CTLs is high, which may be one of the material basis of the special flavor of Yunnan cigars. The content of polyphenols was related to the intensity of UV-light, so that the flavor characteristics of Yunnan CTLs had a certain relationship with the high UV radiation in Yunnan. This study provides a comprehensive map of the metabolic differences in CTLs across different origins. However, the current study design is associative; therefore, while the geographical climate appears to shape the metabolic profile, future research employing controlled environmental chambers is necessary to confirm these causal relationships.

Based on hundreds of metabolites obtained from non-targeted metabolism analysis of CTLs, 12 characteristic metabolites related to origin were screened by MUVR method. Through the multiple algorithms modeling and verification analysis of sample training set-test set, it is found that LDA, LR, GMM, KNN and SVM have a very good effect on the origin traceability of CTLs, with the median of 100 false classification rate about 0.1 and the AUC close to 1. MUVR index screening combined with some algorithms, has achieved accurate traceability of CTLs’ origins at different regional scales, can accurately trace the origin of CTLs at different scales, whether at the national level or the more challenging regional level.

This study provides valuable insights into the relationship between the geographical environment of Yunnan and the metabolic characteristics of CTLs, and offers a robust and efficient method for origin tracing of CTLs. It has practical significance for the agricultural production and quality authentication of Yunnan CTLs.

Funding Statement

The author(s) declared that financial support was received for this work and/or its publication. This work was financially supported by the Foundation of the Yunnan Provincial Tobacco Monopoly Bureau(2023530000241001 and 2024530000241014), the Foundation of China National Tobacco Corporation(110202201037(XJ-08)), the National Natural Science Foundation of China (No. 32360115).

Footnotes

Edited by: Yang Bi, Gansu Agricultural University, China

Reviewed by: Wanpeng Xi, Southwest University, China

Muhammad Ali, Peking University, China

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author/s.

Author contributions

YW: Writing – original draft. GZ: Formal analysis, Writing – original draft. YL: Validation, Writing – original draft. GL: Validation, Writing – original draft. WW: Investigation, Writing – original draft. LY: Data curation, Writing – original draft. ZL: Project administration, Writing – original draft. HY: Funding acquisition, Writing – original draft. FJ: Resources, Writing – original draft. GaZ: Resources, Writing – original draft. YPL: Supervision, Writing – original draft. GuZ: Project administration, Writing – original draft. MZ: Supervision, Writing – original draft. TZ: Methodology, Software, Writing – original draft. JW: Conceptualization, Writing – review & editing.

Conflict of interest

GZ, WW, LY, TZ, JW was employed by company China Tobacco Yunnan Industrial Co., Ltd.

The remaining author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1703429/full#supplementary-material

Supplementaryfile1.doc (943.5KB, doc)

References

  1. Abe S. S., Ashida K., Kamil M. I., Karyanto O., Hardjowigeno S., Tawaraya K. (2020). Land use and management effects on volcanic soils in West Sumatra, Indonesia. Geoderma Regional 22, e00308. doi:  10.1016/j.geodrs.2020.e00308 [DOI] [Google Scholar]
  2. Acree T. E., Nishida R., Fukami H. (1985). Odor thresholds of the stereoisomers of methyl jasmonate. J. Agric. Food Chem. 33, 425–427. doi:  10.1021/jf00063a026 [DOI] [Google Scholar]
  3. Aloum L., Alefishat E., Adem A., Petroianu G. (2020). Ionone is more than a violet’s fragrance: A review. Molecules 25, 5822. doi:  10.3390/molecules25245822, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ashihara H., Crozier A., Ludwig I. A. (2020). Plant nucleotide metabolism: Biosynthesis, degradation, and alkaloid formation (Hoboken, NJ: John Wiley & Sons Ltd; ). [Google Scholar]
  5. Badmus U. O., Crestani G., Cunningham N., Gohir W., Jaremko M., Edel K. H., et al. (2022). UV radiation induces specific changes in the carotenoid profile of Arabidopsis thaliana. Biomolecules 12, 1879. doi:  10.3390/biom12121879, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Chaudhary P., Chaudhary A., Agri U., Khatoon N., Singh A. (2022). “ Recent trends and advancements for agro-environmental sustainability at higher altitudes,” in Survival strategies in cold-adapted microorganisms. Eds. Goel R., Soni R., Suyal D. ( Springer, Singapore: ), 425–435. [Google Scholar]
  7. Chen J., He X., Zhang X., Sun Y., Li Y., Wang C., et al. (2021). The applicability of different tobacco types to heated tobacco products. Ind. Crops Prod. 168, 113579. doi:  10.1016/j.indcrop.2021.113579 [DOI] [Google Scholar]
  8. Chen X., Wang P., Ai S., Abliz B., Du J., Liu H., et al. (2024). Response of cigar tobacco (Nicotiana tabacum L.) varieties to low potassium stress and dissecting biochemical determinants of low potassium tolerance mechanism. J. Crop Health 76, 1249–1262. doi:  10.1007/s10343-024-00983-z [DOI] [Google Scholar]
  9. Delanoy R., Espinosa C. M., Herrera Y. (2022). Heavy metals in agricultural soils of San Francisco de Macorís and La Vega, Dominican Republic. J. Geosci. Environ. Prot. 10, 54–65. doi:  10.4236/gep.2022.1010004 [DOI] [Google Scholar]
  10. Dong N. Q., Lin H. X. (2021). Contribution of phenylpropanoid metabolism to plant development and plant–environment interactions. J. Integr. Plant Biol. 63, 180–209. doi:  10.1111/jipb.13054, PMID: [DOI] [PubMed] [Google Scholar]
  11. Fawcett T. (2006). An introduction to ROC analysis. Pattern Recognit. Lett. 27, 861–874. doi:  10.1016/j.patrec.2005.10.010 [DOI] [Google Scholar]
  12. Fernandez X., Castel C., Lizzani-Cuvelier L., Loiseau A. M., Perichet C., Arnaudo J. F. (2006). Volatile constituents of benzoin gums: Siam and Sumatra, part 3. Fast characterization with an electronic nose. Flavour Fragr. J. 21, 439–446. doi:  10.1002/ffj.1603 [DOI] [Google Scholar]
  13. Gawel R. (1998). Red wine astringency: a review. Aust. J. Grape Wine Res. 4, 74–95. doi:  10.1111/j.1755-0238.1998.tb00137.x [DOI] [Google Scholar]
  14. Geng Z., He P., Gao H., Duan S., Liu Y., Hu W., et al. (2023). Aroma precursors of cigars from different tobacco parts and origins, and their correlations with sensory characteristics. Front. Plant Sci. 14. doi:  10.3389/fpls.2023.1264739, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gordon H. T., Bauernfeind J. C., Furia T. E. (1983). Carotenoids as food colorants. CRC. Crit. Rev. Food Sci. Nutr. 18, 59–97. doi:  10.1080/10408398309527350, PMID: [DOI] [PubMed] [Google Scholar]
  16. Guo Y., Wang Q., Xie H., Wu Y., Qiao G. (2024). Analysis of root response mechanism of quinoa seedlings to waterlogging stress based on metabolome. J. Plant Growth Regul. 43, 1–14. doi:  10.1007/s00344-023-11169-8 [DOI] [Google Scholar]
  17. Hand D. J., Till R. J. (2001). A simple generalisation of the area under the ROC curve for multiple class classification problems. Mach. Learn. 45, 171–186. doi:  10.1023/A:1010920819831 [DOI] [Google Scholar]
  18. Hashida S. N., Itami T., Takahara K., Hiraga S., Fujita M., Konishi R., et al. (2016). Increased rate of NAD metabolism shortens plant longevity by accelerating developmental senescence in Arabidopsis. Plant Cell Physiol. 57, 2427–2439. doi:  10.1093/pcp/pcw154, PMID: [DOI] [PubMed] [Google Scholar]
  19. Hu W., Cai W., Jia Y., He P., Gao H., Duan S., et al. (2023). Sensory attributes, chemical and microbiological properties of cigars aged with different media. Front. Bioeng. Biotechnol. 11. doi:  10.3389/fbioe.2023.1294667, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Irfan M., Safrina S., Koriyanti E., Suryatmojo H., Mizuno K. (2023). Effects of climate anomaly on rainfall, groundwater depth, and soil moisture on peatlands in South Sumatra, Indonesia. J. Groundw. Sci. Eng. 11, 81–88. doi:  10.26599/JGSE.2023.9280008 [DOI] [Google Scholar]
  21. Jaakola L., Hohtola A. (2010). Effect of latitude on flavonoid biosynthesis in plants. Plant Cell Environ. 33, 1239–1247. doi:  10.1111/j.1365-3040.2010.02154.x, PMID: [DOI] [PubMed] [Google Scholar]
  22. Jin Y. B., Qu R., Li T. F., Liu Y., Wang J. H. (2012). Clustering analysis of the climate in tobacco planting zone of Yunnan. Meteorol. Environ. Res. 3, 77–84. [Google Scholar]
  23. Jin C.-Y., Retsef L., Liang Q., Wang Y., Liu Y., Liu Y. (2021). Food safety inspection and the adoption of traceability in aquatic wholesale markets: A game-theoretic model and empirical evidence. J. Integr. Agric. 20, 2807–2819. doi:  10.1016/S2095-3119(20)63467-1 [DOI] [Google Scholar]
  24. Juhlke F., Lorber K., Wagenstaller M., Buettner A. (2017). Influence of the chemical structure on odor qualities and odor thresholds of halogenated guaiacol-derived odorants. Front. Chem. 5. doi:  10.3389/fchem.2017.00120, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kettler A. J. (2022). “ Dominican republic,” in The americas: an encyclopedia of culture and society, vol. 2. ( Bloomsbury Publishing PLC, London: ). [Google Scholar]
  26. Kumari A., Parida A. K. (2018). Metabolomics and network analysis reveal the potential metabolites and biological pathways involved in salinity tolerance of the halophyte Salvadora persica. Environ. Exp. Bot. 148, 85–99. doi:  10.1016/j.envexpbot.2018.01.009 [DOI] [Google Scholar]
  27. Lanubile A., Stagnati L., Marocco A., Battilani P., Busconi M., Soffritti G., et al. (2024). DNA-based techniques to check quality and authenticity of food, feed and medicinal products of plant origin: A review. Trends Food Sci. Technol. 149, 104568. doi:  10.1016/j.tifs.2024.104568 [DOI] [Google Scholar]
  28. Li Y., Ren K., Hu M., Wang Z., Luo D., Liu S., et al. (2021. b). Cold stress in the harvest period: effects on tobacco leaf quality and curing characteristics. BMC Plant Biol. 21, 1–15. doi:  10.1186/s12870-021-03235-8, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Li S., Tian Y., Jiang P., Lin Z., Liu X., Yang H. (2021. a). Recent advances in the application of metabolomics for food safety control and food quality analyses. Crit. Rev. Food Sci. Nutr. 61, 1448–1469. doi:  10.1080/10408398.2020.1761287, PMID: [DOI] [PubMed] [Google Scholar]
  30. Li J., Wang F., Sayed M. A., Wei J., Liu W., Wang X., et al. (2024). Integrated transcriptomic and metabolomic data reveal the cold stress responses molecular mechanisms of two coconut varieties. Front. Plant Sci. 15. doi:  10.3389/fpls.2024.1353352, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Liang B., Han G., Zeng J., Gao W. (2021). Source and risk assessment of trace metals in red soils from Yunnan Province, Southwest China. CLEAN – Soil Air Water 49, 2000288. doi:  10.1002/clen.202000288 [DOI] [Google Scholar]
  32. Liang N., Sun S., Zhang C., Wang H., Sun B., Liu Y. (2022). Advances in infrared spectroscopy combined with artificial neural network for the authentication and traceability of food. Crit. Rev. Food Sci. Nutr. 62, 2963–2984. doi:  10.1080/10408398.2020.1863329, PMID: [DOI] [PubMed] [Google Scholar]
  33. Liu Y., Du S., Sun L., Kenaan A., Wang G., Li J., et al. (2024). Volatile metabolomics and metagenomics reveal the effects of lactic acid bacteria on alfalfa silage quality, microbial communities, and volatile organic compounds. Commun. Biol. 7, 1565. doi:  10.1038/s42003-024-07049-w, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Maestrello V., Solovyev P., Bontempo L., Camin F. (2022). Nuclear magnetic resonance spectroscopy in extra virgin olive oil authentication. Compr. Rev. Food Sci. Food Saf. 21, 4056–4075. doi:  10.1111/1541-4337.13004, PMID: [DOI] [PubMed] [Google Scholar]
  35. Mata-Pérez C., Sánchez-Calvo B., Padilla M. N., Begara-Morales J. C., Valderrama R., Corpas F. J., et al. (2017). Nitro-fatty acids in plant signaling: new key mediators of nitric oxide metabolism. Redox Biol. 11, 554–561. doi:  10.1016/j.redox.2016.12.031, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Meng L., Song W., Chen S., Hu F., Pang B., Cheng J., et al. (2022). Widely targeted metabolomics analysis reveals the mechanism of quality improvement of flue-cured tobacco. Front. Plant Sci. 13. doi:  10.3389/fpls.2022.1074029, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Miyazawa M., Minamino Y., Kameoka H. (1996). Volatile components of the rhizomes of Rheum palmatum L. Flavour Fragr. J. 11, 57–60. doi:  [DOI] [Google Scholar]
  38. Mulyono A., Setiawan I., Hidayat E., Suryantini S., Idris M., Pramumijoyo S. (2024). Distribution and potential contamination assessment of rare earth elements (REE) in Indonesian volcanic soil. Ecol. Front. 44, 33–41. doi:  10.1016/j.ecofro.2023.12.002 [DOI] [Google Scholar]
  39. Muthubharathi B. C., Gowripriya T., Balamurugan K. (2021). Metabolomics: small molecules that matter more. Mol. Omics 17, 210–229. doi:  10.1039/D0MO00176G, PMID: [DOI] [PubMed] [Google Scholar]
  40. Nandasiri R., Zago E., Thiyam-Holländer U., Eskin M. N. A. (2021). Attenuation of sinapic acid and sinapine-derived flavor-active compounds using a factorial-based pressurized high-temperature processing. J. Am. Oil Chem. Soc 98, 779–794. doi:  10.1002/aocs.12510 [DOI] [Google Scholar]
  41. Nie J., Ma W., Ma X., Si Y., Liu Y., Li C., et al. (2024). Integrated transcriptomic and metabolomic analysis reveal the dynamic process of Bama hemp seed development and the accumulation mechanism of α-linolenic acid and linoleic acid. J. Agric. Food Chem. 72, 10862–10878. doi:  10.1021/acs.jafc.4c01705, PMID: [DOI] [PubMed] [Google Scholar]
  42. Nie Z., Wang L., Zhao P., Zhang X., Li J., Zhou J. (2023). Metabolomics reveals the impact of nitrogen combined with the zinc supply on zinc availability in calcareous soil via root exudates of winter wheat (Triticum aestivum). Plant Physiol. Biochem. 204, 108069. doi:  10.1016/j.plaphy.2023.108069, PMID: [DOI] [PubMed] [Google Scholar]
  43. Obuchowski N. A., Bullen J. A. (2018). Receiver operating characteristic (ROC) curves: review of methods with applications in diagnostic medicine. Phys. Med. Biol. 63, 07TR01. doi:  10.1088/1361-6560/aab4b1, PMID: [DOI] [PubMed] [Google Scholar]
  44. Padalia R. C., Verma R. S., Chauhan A., Chanotiya C. S. (2017). Essential oil composition and antimicrobial activity of methyl cinnamate–linalool chemovariant of Ocimum basilicum L. from India. Rec. Nat. Prod. 11, 193. doi:  10.5281/zenodo.812855 [DOI] [Google Scholar]
  45. Pant P., Pandey S., Dall’Acqua S. (2021). The influence of environmental conditions on secondary metabolites in medicinal plants: A literature review. Chem. Biodivers. 18, e2100345. doi:  10.1002/cbdv.202100345, PMID: [DOI] [PubMed] [Google Scholar]
  46. Pillon N. J., Loos R. J. F., Marshall S. M., Zierath J. R. (2021). Metabolic consequences of obesity and type 2 diabetes: balancing genes and environment for personalized care. Cell 184, 1530–1544. doi:  10.1016/j.cell.2021.02.012, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Priolo A., Vasta V., Fasone V., Lanza M., Biondi L. (2009). Meat odour and flavour and indoles concentration in ruminal fluid and adipose tissue of lambs fed green herbage or concentrates with or without tannins. Animal 3, 454–460. doi:  10.1017/S1751731108003648 [DOI] [PubMed] [Google Scholar]
  48. Rahim M. A., Zhang X., Busatto N., Liu Z., Li Z., Xing Y., et al. (2023). Phenylpropanoid biosynthesis in plants. Front. Plant Sci. 14. doi:  10.3389/fpls.2023.1230664, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Raza A., Charagh S., Zahid Z., Mubarik M. S., Javed R., Siddiqui M. H., et al. (2021). Jasmonic acid: a key frontier in conferring abiotic stress tolerance in plants. Plant Cell Rep. 40, 1513–1541. doi:  10.1007/s00299-021-02687-4, PMID: [DOI] [PubMed] [Google Scholar]
  50. Rigling M., Höckmeier L., Leible M., Herrmann K., Gibis M., Weiss J., et al. (2023). Characterization of the aroma profile of food smoke at controllable pyrolysis temperatures. Separations 10, 176. doi:  10.3390/separations10030176 [DOI] [Google Scholar]
  51. Rivera-Pérez A., Romero-González R., Frenich A. G. (2021). Application of an innovative metabolomics approach to discriminate geographical origin and processing of black pepper by untargeted UHPLC-Q-Orbitrap-HRMS analysis and mid-level data fusion. Food Res. Int. 150, 110722. doi:  10.1016/j.foodres.2021.110722, PMID: [DOI] [PubMed] [Google Scholar]
  52. Robichaud J. L., Noble A. C. (1990). Astringency and bitterness of selected phenolics in wine. J. Sci. Food Agric. 53, 343–353. doi:  10.1002/jsfa.2740530307 [DOI] [Google Scholar]
  53. Shen N., Wang T., Gan Q., Liu S., Wang L., Jin B. (2022). Plant flavonoids: classification, distribution, biosynthesis, and antioxidant activity. Food Chem. 383, 132531. doi:  10.1016/j.foodchem.2022.132531, PMID: [DOI] [PubMed] [Google Scholar]
  54. Shen X., Wang C., Liang N., Liu Z., Lv X., Wang H., et al. (2021). Serum metabolomics identifies dysregulated pathways and potential metabolic biomarkers for hyperuricemia and gout. Arthritis Rheumatol. 73, 1738–1748. doi:  10.1002/art.41733, PMID: [DOI] [PubMed] [Google Scholar]
  55. Soares S., Brandão E., Mateus N., de Freitas V. (2017). Sensorial properties of red wine polyphenols: astringency and bitterness. Crit. Rev. Food Sci. Nutr. 57, 937–948. doi:  10.1080/10408398.2014.946468, PMID: [DOI] [PubMed] [Google Scholar]
  56. Stavrou K., Christou A., Goulas V. (2024). Optimization of green sample preparation for the determination of hydroxycinnamic acids in multi-floral honey using response surface methodology. Appl. Sci. 14, 5781. doi:  10.3390/app14135781 [DOI] [Google Scholar]
  57. Sun P., Huang Y., Yang X., Liu C., Wang D., Li Z., et al. (2023). The role of indole derivative in the growth of plants: A review. Front. Plant Sci. 13. doi:  10.3389/fpls.2022.1120613, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Szudera-Kończal K., Myszka K., Kubiak P., Szwengiel A., Myszka M. (2023). The combined effect of lactic acid bacteria and Galactomyces geotrichum fermentation on the aroma composition of sour whey. Molecules 28, 4308. doi:  10.3390/molecules28114308, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Tossi V. E., Tosar L. M., Pitta-Álvarez S. I., Lamattina L., Cassia R. O. (2021). Casting light on the pathway to betalain biosynthesis: A review. Environ. Exp. Bot. 186, 104464. doi:  10.1016/j.envexpbot.2021.104464 [DOI] [Google Scholar]
  60. Vogel J. T., Tieman D. M., Sims C. A., Odabasi A. Z., Clark D. G., Klee H. J. (2010). Carotenoid content impacts flavor acceptability in tomato (Solanum lycopersicum). J. Sci. Food Agric. 90, 2233–2240. doi:  10.1002/jsfa.4076, PMID: [DOI] [PubMed] [Google Scholar]
  61. Wang M., Li R., Qi H., Li Y., Li Q., Sun M., et al. (2023). Metabolomics and machine learning identify metabolic differences and potential biomarkers for frequent versus infrequent gout flares. Arthritis Rheumatol. 75, 2252–2264. doi:  10.1002/art.42578, PMID: [DOI] [PubMed] [Google Scholar]
  62. Wang Y., Yuan X., Liu L., Zhi L., Yuan W., Yudi H. (2021). Multielement principal component analysis and origin traceability of rice based on ICP-MS/MS. J. Food Qual. 2021, 5536241. doi:  10.1155/2021/5536241 [DOI] [Google Scholar]
  63. Wei Z., Li C., Gao T., Zhang Z., Liang B., Lv Z., et al. (2019). Melatonin increases the performance of Malus hupehensis after UV-B exposure. Plant Physiol. Biochem. 139, 630–641. doi:  10.1016/j.plaphy.2019.04.026, PMID: [DOI] [PubMed] [Google Scholar]
  64. Wu Y., Huang D., Kong G., Jiao F., Zhao G., Li Y., et al. (2023). Geographical origin determination of cigar at different spatial scales based on C and N metabolites and mineral elements combined with chemometric analysis. Biol. Trace Elem. Res. 201, 4191–4201. doi:  10.1007/s12011-022-03514-x, PMID: [DOI] [PubMed] [Google Scholar]
  65. Wu Z., Liu H., Zhan W., Yu Z., Qin E., Liu S., et al. (2021). The chromosome-scale reference genome of safflower (Carthamus tinctorius) provides insights into linoleic acid and flavonoid biosynthesis. Plant Biotechnol. J. 19, 1725–1742. doi:  10.1111/pbi.13586, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Yamori N., Levine C. P., Mattson N. S., Yamori W. (2022). Optimum root zone temperature of photosynthesis and plant growth depends on air temperature in lettuce plants. Plant Mol. Biol. 110, 385–395. doi:  10.1007/s11103-022-01299-0, PMID: [DOI] [PubMed] [Google Scholar]
  67. Yan S., Ren T., Mahari W. A. W., Wang Z., Li J., Chen W., et al. (2022). Soil carbon supplementation: improvement of root-surrounding soil bacterial communities, sugar and starch content in tobacco (Nicotiana tabacum). Sci. Total Environ. 802, 149835. doi:  10.1016/j.scitotenv.2021.149835, PMID: [DOI] [PubMed] [Google Scholar]
  68. Yang L., Liu L., Ji L., Wang J., Li Y., Zhang T., et al. (2024). Analysis of differences in aroma and sensory characteristics of the mainstream smoke of six cigars. Heliyon 10, e26331. doi:  10.1016/j.heliyon.2024.e26331, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Yao J. W., Ma Z., Ma Y. Q., Rong L., Xu D. Q., Wang Y. N., et al. (2021). Role of melatonin in UV-B signaling pathway and UV-B stress resistance in Arabidopsis thaliana. Plant Cell Environ. 44, 114–129. doi:  10.1111/pce.13892, PMID: [DOI] [PubMed] [Google Scholar]
  70. Yao L., Zhang T., Shan Y., Wang J., Li Y., Yang L., et al. (2024). The impact of fangxian huangjiu on the fermentation quality and microbial community dynamics of cigar wrapper leaves. Front. Bioeng. Biotechnol. 12. doi:  10.3389/fbioe.2024.1428750, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Ye H., Yang J., Xiao G., Li Y., Liu Y., Xu Y., et al. (2023). A comprehensive overview of emerging techniques and chemometrics for authenticity and traceability of animal-derived food. Food Chem. 402, 134216. doi:  10.1016/j.foodchem.2022.134216, PMID: [DOI] [PubMed] [Google Scholar]
  72. Zhang L., Jia J., Xu Y., Wang Y., Hao J., Li T. (2012). Production of transgenic Nicotiana sylvestris plants expressing melatonin synthetase genes and their effect on UV-B-induced DNA damage. In Vitro Cell. Dev. Biol.-Plant 48, 275. doi:  10.1007/s11627-012-9434-3 [DOI] [Google Scholar]
  73. Zhang Q., Kong G., Zhao G., Liu J., Jin H., Li Z., et al. (2023). Microbial and enzymatic changes in cigar tobacco leaves during air-curing and fermentation. Appl. Microbiol. Biotechnol. 107, 5789–5801. doi:  10.1007/s00253-023-12663-5, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Zhao J., Li A., Jin X., Pan Y., Zheng G., Liu G., et al. (2022). Discrimination of geographical origin of agricultural products from small-scale districts by widely targeted metabolomics with a case study on Pinggu peach. Front. Nutr. 9. doi:  10.3389/fnut.2022.891302, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Zheng T., Zhang Q., Li P., Wu X., Liu Y., Yang S., et al. (2022). Analysis of microbial community, volatile flavor compounds, and flavor of cigar tobacco leaves from different regions. Front. Microbiol. 13. doi:  10.3389/fmicb.2022.907270, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Zhi L., Yuan W., Yudi H., Wang Y., Liu L., Yuan X., et al. (2023). Multi-stable isotope and multi-element origin traceability of rice from the main producing regions in Asia: A long-term investigation during 2017–2020. Food Chem. 412, 135417. doi:  10.1016/j.foodchem.2023.135417, PMID: [DOI] [PubMed] [Google Scholar]
  77. Zong M., Lin C., Li S., Liu C., Wang Y., Mi X., et al. (2021). Tillage activates iron to prevent soil organic carbon loss following forest conversion to cornfields in tropical acidic red soils. Sci. Total Environ. 761, 143253. doi:  10.1016/j.scitotenv.2020.143253, PMID: [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementaryfile1.doc (943.5KB, doc)

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author/s.


Articles from Frontiers in Plant Science are provided here courtesy of Frontiers Media SA

RESOURCES