Abstract
This review highlights the critical role of chemotaxonomy in the identification, authentication, and discovery of bioactive compounds in medicinal plants. By analyzing secondary metabolites using techniques like UV spectroscopy, FTIR, HPLC, GC-MS, NMR, LC-MS-Qtof, and MALDI-TOF MS, chemotaxonomy ensures accurate plant identification, supporting the safe and effective use of plants in herbal medicine. Key secondary metabolites used in chemotaxonomic identification include alkaloids, flavonoids, terpenoids, phenolics, tannins, and plant peptides. Chemotaxonomy also facilitates the discovery of novel compounds with therapeutic potential, contributing to drug development. The integration of chemotaxonomy with genomics and proteomics allows a deeper understanding of plant biosynthesis and the mechanisms behind bioactive compound production. However, challenges due to variability in metabolite profiles and the lack of standardized methods remain, and future research should focus on developing global databases, improving standardization, and incorporating artificial intelligence and machine learning to enhance plant identification and bioactive compound discovery. The integration of chemotaxonomy with personalized medicine offers the potential to tailor plant-based therapies to individual genetic profiles, advancing targeted treatments. This review underscores chemotaxonomy’s importance in bridging traditional knowledge and modern science, offering sustainable solutions for medicinal plant use and drug development.
Keywords: secondary metabolites, diversity, omics data, artificial intelligence, DNA barcoding
1. Introduction
In plant sciences, “Taxonomy,” the identification, classification, and naming of plants based on shared characteristics and evolutionary relationships, is considered a foundational field of study [1,2]. Taxonomy is crucial in the conservation of endangered species, as identifying and classifying species accurately is necessary for establishing conservation priorities and protecting biodiversity [3]. In the field of medicine, proper plant identification is vital for ensuring the efficacy and safety of plant-derived medicines [4], and taxonomic classification based on secondary metabolites (alkaloids, flavonoids, terpenoids, etc.) aids in discovering new bioactive compounds for drug development [5]. Similarly, in agricultural sciences, taxonomy supports crop improvement by identifying and classifying economically important plants [6], helping to develop pest-resistant varieties and optimize cultivation practices. With the advent of molecular techniques like DNA barcoding, the accuracy and speed of plant identification have greatly improved, allowing precise classification and even parsing species complexes and identifying cryptic species [7,8].
Classical plant taxonomy emphasizes the use of stable morphological characters that are minimally influenced by environmental factors for taxonomic distinction, excluding traits with high phenotypic plasticity [9]. However, in practice, especially when dealing with commercialized plant materials often presented as ground or fragmented forms lacking key diagnostic organs, reliable identification using macroscopic morphology becomes challenging [10]. In these cases, microscopic anatomical features, molecular markers (DNA-based methods), and chemical profiling can provide more robust and accurate means of identification [11], particularly for medicinal plants, where precise authentication is critical. Palynology, which uses the size, shape, and surface texture of pollen grains to identify plant species, is another valuable method, especially in ecological and archaeological contexts [12]. Molecular methods, such as DNA barcoding, have revolutionized plant identification. By providing a genetic “barcode” using markers like rbcL and matK, precise species identification is possible even in cases where morphological features are absent or ambiguous [13]. Additionally, metabolomics, which uses advanced technologies like liquid chromatography–mass spectrometry (LC-MS) and nuclear magnetic resonance (NMR) to profile the entire metabolome, provides in-depth insights into plant chemistry, further aiding species differentiation [14,15]. Moreover, artificial intelligence (AI) and machine learning (ML) are increasingly being integrated with traditional methods, enhancing the efficiency of plant classification by automating data analysis and detecting complex patterns in chemical and molecular datasets [16]. Through these advances, chemotaxonomy has become important, since this science uses secondary metabolites like flavonoids, alkaloids, terpenoids, and phenolic compounds analyzed through techniques such as high-performance liquid chromatography (HPLC), gas chromatography–mass spectrometry (GC-MS), and LC-MS with quadrupole time of flight (LC-MS-Qtof) [17]. Such high-throughput techniques are helpful to distinguish species based on chemical profile, thus offering high accuracy in cases of closely related or cryptic species [18]. Interestingly, multiple plant identification techniques, including DNA barcoding, AI, and chemotaxonomy, are increasingly being integrated, with such protocols often referred to as hybrid methods [19].
Chemotaxonomy is an old but now rapidly evolving field that plays a significant role in plant classification and identification [20]. Secondary metabolites are often unique to plant species and provide valuable insights into their evolutionary relationships [21,22]. This integration of morphological and chemical traits allows for more accurate and reliable plant identification, which is especially critical in the study of medicinal plants. Chemotaxonomy has witnessed significant advances in recent years, particularly with the integration of molecular biology techniques. For instance, DNA barcoding has become a popular method for plant identification, offering a molecular-marker-based approach that complements chemical profiling [23]. Metabolomics, through the comprehensiveness of the catalog of metabolites, can be used for specific plant tissues and has also enhanced the capabilities of chemotaxonomy [24]. Additionally, multivariate analysis techniques, such as principal component analysis (PCA) and cluster analysis (CA), have enabled researchers to better correlate chemical data with taxonomic information, improving the accuracy of plant classification [25]. Furthermore, the integration of bioinformatic tools and databases has facilitated the analysis and storage of large volumes of chemotaxonomic data, making it easier to access and compare plant profiles from different regions and studies [20,26].
The objective of this review is to provide an in-depth exploration of the current trends in chemotaxonomy, particularly in the context of medicinal plant identification. It highlights recent technological advances in chemotaxonomy, including the latest analytical techniques and molecular methods used to study plant chemical composition. It also discusses the integration of chemotaxonomy with traditional taxonomy and molecular biology to improve the accuracy and reliability of plant identification.
2. Concept of Chemotaxonomy and Medicinal Plant Identification
Chemotaxonomy is a discipline that not only utilizes the chemical characteristics of a plant to classify plants but also distinguishes between closely related species [20] and differentiates intraspecific taxa such as subspecies and varieties [27]. This makes it an essential and nuanced tool in plant identification and classification. Unlike traditional morphological classification, chemotaxonomy focuses on the chemical compounds found within a plant. Secondary metabolites are often characteristic of specific plant species and families, making them valuable for taxonomic classification [28]. However, their levels and presence can vary significantly depending on factors such as plant developmental stage, organ type, harvest time, and environmental conditions [29]. Therefore, while secondary metabolites can complement morphological traits in classification, their variability should be carefully considered during chemotaxonomic studies. This technique is often used in conjunction with traditional morphological methods, and together, they provide a deeper, molecular-level understanding of plant diversity. However, by analyzing their chemical profiles, chemotaxonomy can uncover subtle differences between species that are otherwise unobservable (Figure 1).
Figure 1.
Taxonomical identification methods used by researchers in previous and current eras.
3. Primary and Secondary Metabolites in Medicinal Plants
In medicinal plants, primary metabolites (i.e., carbohydrates, amino acids, proteins, and fatty acids) are inherently essential to the plant’s basic growth and cellular processes [30]. They play key roles in energy production, structural functions, and cell division within the plant, and are crucial to overall plant health and survival [31]. Secondary metabolites, on the other hand, are considered non-essential compounds not directly involved in basic plant functions but serve key ecological roles in, for example, defense against herbivores, pathogens, and environmental stressors [32]. These compounds include an array of diverse classes like alkaloids, flavonoids, terpenoids, phenolic acids, glycosides, etc., which are the primary pharmacologically active compounds in medicinal plants [33] (Figure S1). These pharmacological activities include anti-inflammatory, antimicrobial, anticancer, antidiabetic, antiviral, and antioxidant activities, making them critical in the development of new therapeutic agents [34,35] (Table 1, Figure S2).
Table 1.
Primary and secondary metabolites and their role in plant health.
| Metabolite Type | Metabolite Class | Occurrence in Plant Part | Metabolite Role | References |
|---|---|---|---|---|
| Primary metabolite | Carbohydrates | Leaves | Energy source, essential for respiration. |
[30,36] |
| Primary metabolite | Amino Acids | Leaves, Roots | Building blocks of proteins, crucial for plant growth. | [30,36] |
| Primary metabolite | Fatty Acids | Seeds, Leaves | Vital for membrane structure and energy storage. | [30,36] |
| Primary metabolite | Chlorophyll | Leaves | Key for photosynthesis, converting light into energy. | [30,36] |
| Secondary metabolite | Alkaloids | Roots, Seeds | Defensive compounds, deter herbivores and pathogens. | [37,38] |
| Secondary metabolite | Flavonoids | Flowers, Leaves | Provide UV protection, antioxidant properties, and pigmentation. | [37,38] |
| Secondary metabolite | Terpenoids | Leaves, Roots | Involved in plant defense. | [37,38] |
| Secondary metabolite | Phenolics, Tannins | Roots, Leaves | Play roles in defense, antioxidation, and stress response. | [37,38] |
4. Chemotaxonomy vs. Traditional Morphological Taxonomy
Morphological plant identification that is based on physical plant features including leaf shape, flower color, fruit type, and plant size, has been a foundation of plant taxonomy for centuries [39]. While this method provides an easily accessible and non-invasive approach, environmental factors, phenotypic plasticity, and developmental stage variations are key limits that can affect accuracy [40,41]. Chemotaxonomy, on the other hand, relies on the chemical composition of plants, particularly the secondary metabolites, which are more stable and consistent traits in plants [20,42]. Additionally, chemotaxonomy can identify cryptic species that may appear morphologically similar but differ in chemical composition [43]. Chemotaxonomy can therefore be considered as a “complementing tool” that can facilitate robust plant identification.
Combining morphological characteristics with chemical analysis offers a more comprehensive understanding of plant relationships. Chemotaxonomy thus has the potential to enhance the accuracy of plant classification, particularly when used in combination with molecular tools, such as DNA barcoding [9]. Together, these methods can offer a more complete and precise plant identification system, ensuring that medicinal plants are accurately classified and effectively utilized for therapeutic purposes (Table 2).
Table 2.
A comparative review of morphological and chemotaxonomical identification of medicinal plants.
| Feature | Morphological Identification |
Chemotaxonomical Identification |
Citations |
|---|---|---|---|
| Basis of classification | Observable physical traits (leaf shape, flower structure, stem, etc.) | Chemical composition, mainly secondary metabolites and other biochemical markers | [44,45] |
| Attributes for examination | External features (e.g., leaves, flowers) | Secondary metabolites and primary compounds | [44,46] |
| Ecological impact | High (traits may vary due to climate, soil, etc.) | Low (compounds are more stable) | |
| Tools required | Microscope, visual inspection | Chromatography, spectroscopy | [44,46] |
| Resolution | precise due to phenotypic Often limited at intraspecific level (varieties, subspecies) |
Can distinguish species and intraspecific taxa | [44,45,46] |
| Part used | Leaf shape, flower color, stem structure | Alkaloids, flavonoids, composition of plant part (terpenoids, amino acids) | [44,46] |
| Use in modern taxonomy | Foundational, and widely used in conjunction with molecular methods | Widely used in conjunction with molecular methods | [44,46] |
| Speed and accessibility | Relatively quick and low cost, can be carried out in the field | More time-consuming and costly, requires laboratory equipment | [44,45,46] |
| Cryptic species | Phenotype plasticity Difficulties in identification of cryptic Species |
Requires specialized equipment and expertise More effective; can detect biochemical differences in cryptic species |
[44,46] |
5. Applications of Chemotaxonomy in the Herbal and Medicinal Plant Sciences
The primary advantage of chemotaxonomy in medicinal plant identification is its ability to more precisely and reliably distinguish between plant species. As mentioned above, many plants appear similar in terms of their morphology but differ significantly in their chemical composition. This makes chemotaxonomy a powerful tool for identifying plants with unique medicinal properties, as the chemical profile is often directly linked to the plant’s therapeutic potential and ethnopharmacological relevance [47]. Additionally, by focusing on chemical markers, chemotaxonomy allows researchers to identify plants in situations where fresh or whole plants are unavailable, e.g., in cases of adulteration and substitution, which is essential for ensuring the correct plant species is used in herbal medicines and pharmaceuticals [48]. Moreover, chemotaxonomy supports the standardization and quality control of herbal products by providing reproducible chemical markers that act as benchmarks for authentication [49]. This ensures batch-to-batch consistency, which is critical for the efficacy and safety of herbal medicines. Furthermore, it also facilitates regulatory compliance by enabling manufacturers to present validated phytochemical profiles of their raw materials and final products [50] (Figure 2).
Figure 2.
Applications of chemotaxonomy in herbal and medicinal plant sciences.
6. Analytical Methods Used for Compound Identification
Chemotaxonomy involves both qualitative and quantitative analyses of compounds to establish relationships between plant species. In chemotaxonomy, the identification of secondary metabolites is typically carried out using analytical techniques such as HPLC, mass spectrometry (MS, including gas chromatography–MS (GC-MS)], LC-MS-Qtof, matrix-assisted laser desorption/ionization time-of-flight MS (MALDI-TOF MS)), UV and infrared (Fourier transform infrared), and NMR. These techniques allow the efficient separation, identification, and quantification of plant metabolites, providing a chemical “fingerprint” that is unique to each species (Table 3).
Table 3.
Various analytical techniques used in chemotaxonomy.
| Analytical Technique | Typical Uses | Accuracy/Precision | Types of Secondary Metabolites | Citations |
|---|---|---|---|---|
| UV-Vis Spectroscopy | Quantification | Moderate accuracy, ideal for fast and non-destructive quantification. | Flavonoids, phenolic compounds, carotenoids, alkaloids | [51] |
| FTIR | Identification of functional groups and molecular structures | Good resolution for functional group identification. Lower sensitivity compared to MS-based techniques. | Terpenoids, alkaloids, flavonoids, phenolic acids, lipids | [52] |
| HPLC | Separation and quantification of compounds, particularly in mixtures | High accuracy in separating complex mixtures. Precision depends on column and mobile phase. | Alkaloids, flavonoids, phenolic acids, glycosides, terpenoids | [53] |
| GCMS | Identifying and quantifying volatile compounds, especially in complex mixtures | High sensitivity and precision for volatile organic compounds, good for trace analysis. | Volatile terpenes, essential oils, fatty acids, aldehydes | [53,54] |
| LCMS-QTOF | Comprehensive profiling of metabolites and complex biomolecules | Very high sensitivity and accuracy, capable of accurate molecular mass determination, used for complex samples. | Alkaloids, flavonoids, peptides, lipids, steroids, phenolic compounds | [53,55] |
| MALDI-TOF MS | High-throughput analysis of biomolecules, especially proteins and peptides | High sensitivity for large biomolecules like proteins, peptides, and lipids. Excellent for high-throughput applications. | Peptides, proteins, lipids, alkaloids | [53,55] |
| NMR | Structural elucidation, identification of compounds, and quantification in small to medium-sized molecules | High accuracy for molecular structure determination. Limited sensitivity compared to MS techniques, but excellent for structural analysis. | Alkaloids, flavonoids, terpenoids, phenolic compounds, saponins | [53] |
6.1. HPLC
High-performance liquid chromatography has become indispensable in chemotaxonomy. Its versatility, high resolution, and reproducibility make it ideal for the analysis of complex plant samples containing a wide variety of chemical compounds [56]. Researchers have been applying HPLC to chemotaxonomic identification within plant families for decades, and almost all plant families have been extensively studied using HPLC. Here are a few examples: The Fabaceae family, which includes a large number of economically and medicinally important plants, has been the subject of several such studies. Here, HPLC has been used to identify flavonoids and alkaloids, which have become key chemotaxonomic markers [57]. Similarly, within the genera Medicago (alfalfa) and Cicer (chickpea), HPLC has been used to profile flavonoids such as quercetin, kaempferol, and their derivatives [58]. These compounds help differentiate species within these genera, as the presence and concentration of specific flavonoids vary between species. Additionally, the alkaloid profiling of Lupinus (lupine) species has been investigated using HPLC, identifying compounds such as quinolizidine alkaloids, which are characteristic of this genus [59].
Similarly, chemical identification in the family Solanaceae has been extensively performed using HPLC, with the main focus on alkaloids and other bioactive compounds. In Capsicum (chili pepper) species, for instance, the compound responsible for their signature heat, flavor, and pungency, capsaicin, was identified and characterized using HPLC [60]. Similarly, nicotine, the alkaloid that serves as a chemotaxonomic marker for Nicotiana tabacum (tobacco) within the family, was also identified using HPLC [61].
Chemotaxonomic identification based on HPLC has also been used in many other plant groups including, the family Asteraceae, which includes well-known medicinal plants, such as Artemisia annua (used for malaria treatment) and Echinacea purpurea (used for immune support). Flavonoids, sesquiterpenes, and alkaloids have been extensively profiled using HPLC [62,63]. The family Rutaceae (most notable for containing the genus Citrus) is another plant group that has been widely studied using HPLC. Citrus fruits, including oranges (Citrus sinensis) and lemons (Citrus limon), are known for their content of flavonoids and terpenoids, especially hesperidin and narirutin, and these serve as important chemotaxonomic markers within this family [64]. Additionally, detailed data is available on plants of the Rubiaceae family, especially Coffea arabica (coffee), which contains extensive concentrations of caffeine, chlorogenic acids, and other alkaloids that are critical for distinguishing between Coffea species [65]. More recently, the integration of HPLC with chemometric techniques, like PCA and CA, has further enhanced plant classification. For example, Coffea species were differentiated from each other based on their alkaloid and flavonoid content using HPLC and PCA [66]. Similarly, using analyses combining HPLC data with chemometrics, essential oil and secondary metabolite profiles helped define Coriandrum sativum (coriander) and Carum carvi (caraway) as species [67]. Traditional HPLC mainly utilizes retention time and UV-Vis spectral fingerprints for compound analysis. Advanced systems equipped with photodiode array (PDA) detectors incorporate UV-Vis spectral libraries, allowing tentative compound identification through spectral matching against reference databases (e.g., Agilent OpenLab, ChemStation) [68,69,70]. However, due to overlapping UV absorption profiles among structurally related compounds, identification based solely on UV-Vis spectra often lacks specificity and requires complementary techniques for confirmation.
6.2. GC-MS
Gas chromatography–mass spectrometry (GC-MS) is an important technique used for the analysis of volatile compounds. It is widely applied in the study of essential oils containing terpenoids, fatty acids, and polyphenolics, which are responsible for the medicinal properties of many plants [71]. The use of GC-MS in plant chemotaxonomy dates back to the 1970s and 1980s, and this technique plays a key role in chemotaxonomy of volatile compounds.
For several years, GC-Ms has been used for component analysis in a number of diverse plant families. For instance, the Lamiaceae family, which is often referred to as the “mint family,” is well known for its “aromatic” qualities. It includes well-known species like Mentha piperita (mint) and genera like Thymus (thyme) and Ocimum (basil), and it has been extensively studied using GCMS [72]. The Mentha species have been classified based on their high concentrations of menthol and other terpenoids using GCMS [73]. Similarly, Thymus vulgaris (thyme) mainly contains high concentrations of thymol and carvacrol, which are key components of its essential oil [74]. Thymol and carvacrol are considered as valuable chemotaxonomic markers of similar kinds of compounds that can be helpful in assessing the quality of medicinal herbs.
Another notable application of GC-MS is its use in the analysis of Apiaceae, which includes aromatic and medicinal plants like Carum carvi (caraway) and Co. sativum (coriander). GC-MS has been widely used to identify the volatile compounds in the essential oils of these plants, such as carvone, limonene, and linalool, which are characteristic of Ca. carvi and Co. sativum [67,75]. These compounds thus serve as chemotaxonomic markers, aiding in the differentiation of species within the Apiaceae family. Similarly, Citrus species of the Rutaceae family have been studied using GC-MS, with limonene and other terpenes identified as key chemotaxonomic markers for species identification [76].
More recently, GC-MS has been combined with chemometrics to enhance plant species delineation based on chemical profiles [77]. The integration of multivariate statistical techniques, such as PC and CA, with GC-MS data has improved our ability to differentiate closely related plant species, detecting even subtle variations in their chemical profiles [78] Additionally, combining GC-MS and LC-MS allows the development of a complete chemical fingerprint of a plant, helping researchers identify the key compounds responsible for its therapeutic properties [79].
6.3. LC-MS-Qtof
Liquid chromatography–mass spectrometry with quadrupole time of flight is an advanced analytical tool in chemotaxonomy that enables the precise identification of a plant species based on their secondary metabolites [80]. This high-resolution technique combines the separation capabilities of liquid chromatography with the accuracy of time-of-flight mass spectrometry, making it ideal for producing plant chemical profiles. The application of LC-MS-Qtof in chemotaxonomy began in the early 2000s, at which time it represented a new approach. Early studies primarily focused on the analysis of alkaloids, flavonoids, and terpenoids as key metabolites often used in chemotaxonomic studies [80,81]. However, this technique is equally efficient for essential oils. For example, in investigations focusing on the family Lamiaceae, LC-MS-Qtof has been effectively used for the chemotaxonomical profiling of essential oils, flavonoids, and phenolic compounds [82,83].
Similarly, in Citrus species (Rutaceae), LC-MS-Qtof has been used to distinguish between species based on terpenoid profiles, with particular attention paid to limonene, a characteristic compound of these species [84]. Likewise, in the Solanaceae family, LC-MS-Qtof has been extensively used to analyze bioactive alkaloids like capsaicin and nicotine [85]. In an investigation, LC-MS-Qtof was used to study Capsicum annuum (chili pepper), with capsaicinoids identified as key chemotaxonomic markers for this species [86]. Similarly, in the Asteraceae family, LC-MS-Qtof has proven valuable for identifying bioactive compounds in Artemisia (wormwood) and Echinacea species, both of which are widely used in traditional medicine. For example, A. annua is very well known for its antimalarial properties, which rely on the presence of artemisinin. It has been studied extensively using LC-MS-Qtof to determine its sesquiterpene content [87], and E. purpurea, which is known for its immune-boosting properties, has been analyzed to characterize its contents of echinacoside and other caffeic acid derivatives [88]. These examples highlight the increasing use of LC-MS-Qtof for the validation and authentication of medicinal plants. As with other analytical methods, the coupling of LC-MS-Qtof data with PCA and CA is gaining the attention of researchers for its ability to make data analysis more efficient and interpretation more reliable [89]. Modern LC-MS platforms leverage extensive mass spectral libraries to enable rapid and accurate compound identification by matching acquired spectra against reference databases. Prominent libraries include the comprehensive NIST Mass Spectral Library; METLIN, which specializes in metabolites and natural products, the open-access MassBank repository; and the commercial high-resolution mzCloud database with predictive fragmentation features [90,91,92]. Collaborative platforms like GNPS facilitate community-driven annotation of MS/MS data, especially for natural products [93]. Integrated software tools, such as Thermo Fisher’s Compound Discoverer, Agilent’s MassHunter, Waters’ UNIFI, and Sciex OS [93], utilize these libraries for automated spectral matching, molecular formula prediction, and in silico fragmentation, significantly improving identification confidence for complex mixtures.
6.4. MALDI-TOF MS
Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry has become a key technique in chemotaxonomy [94]. The MALDI-TOF MS system employs a matrix to assist in the ionization of molecules, which are then analyzed using time of flight to identify their characteristic mass-to-charge ratios (m/z). This technique’s ability to generate unique spectral fingerprints of an organism’s metabolites, particularly proteins and lipids, has made MALDI-TOF MS invaluable in taxonomic studies [95].
As a recent example, MALDI-TOF MS was utilized to identify alkaloids and other bioactive compounds in species of Cucurbita (squash plants), which are used in traditional medicines [96]. By analyzing the protein and lipid profiles, researchers were able to categorize these species more accurately than when using traditional morphological approaches. In addition, the MALDI-TOF MS analysis revealed the presence of several important bioactive compounds, including cucurbitacins (e.g., cucurbitacin B and cucurbitacin E), quinolone alkaloids, and flavonoids, as well as phenolic acids like caffeic acid and chlorogenic acid [96]. Similarly MALDI-TOF MS analysis provided a comprehensive chemical profiling of closely related species and strains of Echinacea, including E. purpurea and Echinacea angustifolia, that highlighted common chemical compounds [97]. This analysis is crucial in ensuring the correct species is used in medicinal products, as different species may have different active compounds that contribute to immune modulation.
6.5. Nuclear Magnatic Resoanace (NMR)
Nuclear magnetic resonance spectroscopy offers a non-destructive, detailed method for characterizing the structures of plant metabolites and is especially useful for those not easily characterized via other methods [98]. It provides extensive information about the chemical environment of atoms within a molecule, allowing for the identification of complex compounds with high specificity [99]. It is important to note that NMR is often used in conjunction with other techniques, such as HPLC (Preparative), GC-MS, LC-MS, or LC-MS-Qtof. In such combinations, NMR can greatly enhance plant identification in families with diverse metabolites. For instance, in the Apiaceae family, for which flavonoids and coumarins are essential secondary metabolites, studies on Angelica and Coriandrum species have successfully employed NMR to elucidate unique flavonoid, glycoside, and furanocoumarin profiles that are critical for chemotaxonomic classification within these genera [100,101]. Such metabolite profiles can be directly correlated with a plant’s therapeutic properties, not only allowing accurate species identification but also effective quality control [102]. Similarly, NMR spectroscopy has greatly facilitated chemotaxonomy in the Lamiaceae family [20,103], where it has been extensively used to profile terpenoids, such as menthol, carvacrol, and eugenol, which serve as chemical markers [104] for distinguishing between closely related species. For instance, M. piperita and Mentha spicata (spearmint) can be differentiated based on their distinct terpenoid profiles, as revealed through NMR analysis [105]. Overall, the phytochemistry and plant science literature is replete with NMR analyses, and no phytochemical research is complete without one.
7. Current Trends in the Identification of Medicinal Plants
In recent years, advancements in analytical and molecular technologies have significantly enhanced the scope and precision of chemotaxonomic studies. Furthermore, the integration of chemotaxonomy with bioinformatics and computational tools has streamlined data analysis, enabling more comprehensive studies of plant species and their medicinal properties. In this section, we review current trends in chemotaxonomy, focusing on the integration of molecular tools and multivariate analyses and the computational and technological advancements that are pushing the field forward (Figure 3).
Figure 3.
An overview of current trends in chemotaxonomy.
7.1. Integrating Molecular Techniques with Chemotaxonomy
Molecular techniques complement traditional chemotaxonomy analyses well, and recent advancements are enabling even more accurate identification and classification of medicinal plants. Two molecular approaches, DNA barcoding and metabolomics, have gained significant attention [24,106]. By integrating chemical and molecular information, plant identification can be more authentic.
7.1.1. DNA Barcoding
DNA barcoding has emerged as a critical tool in modern plant taxonomy, as it provides a rapid and reliable method for plant species identification. By utilizing the DNA sequences of short, universally accepted genomic regions, such as rbcL and matK gene regions and the ITS (internal transcribed spacer) region of the ribosomal RNA gene [107], DNA barcoding enables precise, species-level identification, even for morphologically similar species or when only incomplete plant specimens are available [108]. These regions are selected due to the balance between conservation and variation their sequences exhibit, which allows differentiation among species while maintaining a high degree of genetic stability across plant families [109]. By helping to catalog species, DNA barcoding aids in the conservation of plant biodiversity, especially in regions with rich but under-studied flora [110]. Barcoding, along with the computational tools and global genomic databases that have grown in its wake, has enabled researchers to address challenges related to species identification with enhanced efficiency and accuracy, driving significant advancements in plant biology and botanical research.
7.1.2. Metabolomics
Metabolomics, the comprehensive study of metabolites in organisms, has emerged as a powerful tool in chemotaxonomy [24]. For medicinal plants, this approach involves analyzing the complete set of metabolites in a plant, including both primary and secondary metabolites, which are critical for identification. By employing techniques like HPLC, GC-MS, and NMR, metabolomics enables the profiling of plant species based on their metabolic signatures [111]. The integration of metabolomics into chemotaxonomy provides a deeper understanding of the functional roles of plant metabolites in medicinal efficacy, enhancing the identification of bioactive compounds and improving the overall quality control of medicinal plant products [20].
7.2. Multivariate Analysis in Chemotaxonomy
Multivariate analyses are crucial for processing and interpreting the large datasets generated by many chemotaxonomic techniques. Principal component analysis and CA are two commonly used methods that help researchers extract meaningful patterns and relationships from complex chemical datasets, facilitating plant identification and classification [44,112].
7.2.1. Principal Component Analyses (PCA)
Principal component analyses are widely employed in chemotaxonomy to reduce the dimensionality of large chemical datasets while retaining the most significant variance [113]. By transforming the data into a set of orthogonal components, PCA facilitates the identification of patterns and trends that may not be immediately apparent in the raw data [17]. In the context of medicinal plants, PCA helps in grouping plant species based on their chemical profiles, providing insights into the chemical diversity of plant families or genera. This technique is particularly useful when dealing with large datasets generated when combining multiple analytical techniques, such as HPLC, GC-MS, and NMR [114].
7.2.2. Cluster Analysis (CA)
Cluster analysis (CA) complements PCA by grouping plant species or samples based on similarities in their chemical profiles. This method uses algorithms, such as hierarchical clustering or k-means clustering, to classify plant species into distinct groups or clusters [115]. It is particularly valuable when dealing with closely related species or varieties that exhibit overlapping chemical traits, as it can identify subtle yet significant chemical differences [116]. By organizing species into clusters based on their chemical characteristics, CA not only aids in the identification of novel plant species but also enhances our understanding of the ecological adaptations of different plant groups and the evolutionary relationships within and between groups [117]. Overall, CA offers chemotaxonomy a powerful tool for the classification and identification of plant species, facilitating the discovery of bioactive compounds and improving the quality control of medicinal plant products.
8. The Role of AI in Chemotaxonomy
Artificial intelligence has emerged as a transformative force in chemotaxonomy, where it is used to enhance plant identification, classification, and the discovery of bioactive compounds [20,118]. In particular, AI models based on ML, deep learning, and natural language processing offer powerful tools for processing and analyzing the vast chemical datasets generated by analytical techniques like NMR, GC-MS, and LC-MS-Qtof [119]. Furthermore, AI can assist in automating plant identification, improving pattern recognition among chemical profiles, and enabling predictive analyses of plant compound bioactivity (Figure 4). In this section, we discuss the key areas where AI is making an impact in chemotaxonomy.
Figure 4.
Potential of AI tools in chemotaxonomical identification of medicinal plants.
8.1. Data Analysis and Pattern Recognition
One of the primary applications of AI in chemotaxonomy is in the analysis of complex datasets through techniques like PCA and CA, as discussed earlier.
8.2. Automation of Plant Identification
Various AI models, especially deep learning algorithms like convolutional neural networks (CNNs), are increasingly used to automate plant species identification [120]. By training AI systems on large datasets of chemical fingerprints, these models learn to recognize specific metabolite patterns that distinguish one species from another [121]. The advantage of AI in this context is its ability to process data much faster than human experts and to handle larger datasets, leading to a more efficient identification process. For instance, CNNs have been applied to GC-MS and LC-MS-Qtof data to automate the classification of plant species [122]. These AI models are able to detect chemical signatures in chemical profiles containing secondary metabolites, such as terpenoids, flavonoids, and alkaloids, which are characteristic of specific plant families [123].
8.3. Integration of Multi-Omics Data
Given their ability to discern patterns in large datasets, AI models are playing a significant role in the integration of various omics data types (e.g., genomics, metabolomics, and environmental data), facilitating more accurate and comprehensive plant identification and classification [124,125]. By efficiently combining metabolomics data from LC-MS-Qtof and GC-MS analyses with genomic data, AI models can provide more robust predictions about plant species and their bioactive compounds [126]. This integration helps enhance plant chemotaxonomy by offering a more holistic view of a plant’s chemical, genetic, and environmental profile.
8.4. Predicting Bioactivity and Medicinal Potential
Artificial intelligence has significantly advanced the prediction of bioactivity, and thus medicinal potential, by allowing the analysis of extensive datasets to identify novel drug candidates and forecast their interactions and efficacy [127,128]. Researchers have been working to expedite drug discovery through machine and deep learning techniques, focusing on the prediction of protein structure, drug–target interactions, and molecular properties [129]. Similarly, AI models can more efficiently predict drug toxicity, bioactivity, and physicochemical properties, thereby streamlining the drug development process [130]. For example, in the evaluation of medicinal plants, AI algorithms have been trained to predict the anti-inflammatory, antimicrobial, and anticancer properties of plant compounds based on their chemical structure [131]. These AI-based methods can drastically reduce the time and resources required for experimental testing by providing preliminary insights into the therapeutic potential of specific compounds.
8.5. Advancements in AI Algorithms and Chemotaxonomy
Recent advancements in AI, particularly in deep learning and natural language processing, have further enhanced its application in chemotaxonomy. Newer algorithms are able to handle larger and more complex datasets, enabling faster and more accurate plant species identification based on chemical profiles [132]. For instance, transformer-based models like BERT (bidirectional encoder representations from transformers) have been applied to metabolomic data, improving our understanding of plant metabolites and their role in species differentiation [133,134]
9. Limitations of Chemotaxonomy
Despite its valuable contributions to the identification and classification of medicinal plants, chemotaxonomy faces several limitations that must be addressed before its full potential can be realized (Table 4). This section discusses several important points that must be considered.
Table 4.
Limitations of chemotaxonomy with possible alternate solutions.
| Limitation | Description | Possible Alternate | Citations |
|---|---|---|---|
| Variability in Secondary Metabolite Profiles | Variability due to environmental, genetic, or developmental factors. | Use DNA barcoding or meta barcoding as an alternative. | [135] |
| Standardization Issues | No standardized methodology for metabolite analysis. | Standardize techniques like mass spectrometry or NMR. | [135,136] |
| Lack of Comprehensive Databases | Chemotaxonomic databases are often incomplete. | Collaborate to build comprehensive chemotaxonomic databases. | [135,137] |
| Accessibility and High Costs of Analytical Techniques | High costs and specialized expertise needed for advanced techniques. | Utilize portable, low-cost devices for on-site analysis. | [135] |
| Challenges in Chemotaxonomic Identification | Overlapping chemical profiles make accurate identification difficult. | Use multi-omics approaches for more accurate identification. | [27,135,138] |
9.1. Variability in Secondary Metabolite Profiles
Secondary metabolites in plants are highly influenced by environmental factors, such as soil conditions, climate, altitude, and seasonal changes, as well as genetic diversity [135]. This variability can lead to significant differences in alkaloid, terpenoid, and flavonoid profiles across regions or growing conditions, which complicates chemotaxonomic identification and classification [139].
9.2. Standardization Issues
A lack of standardized methods for analyzing plant chemical profiles presents another significant challenge in chemotaxonomy [135]. Different researchers may use different analytical techniques, such as HPLC, GC-MS, or NMR, under varying experimental conditions, which can lead to inconsistent results [140]. Additionally, the way plant extracts are prepared, including the solvents used and the extraction methods applied, can influence the chemical composition of the samples [141].
9.3. Lack of Comprehensive Databases
One unavoidable issue in chemotaxonomy is the absence of comprehensive and accessible databases that house detailed chemical profiles of medicinal plants [135]. While several databases have been developed to catalog plant species and their associated chemical compounds, these resources are incomplete, lack uniformity, or are not easily accessible [142]. This limits the utility of these databases for taxonomic and medicinal plant identification.
9.4. Accessibility and High Costs of Analytical Techniques
The high costs associated with advanced analytical techniques like HPLC, GC-MS, NMR, and LC-MS-Qtof are significant barriers to the widespread use of chemotaxonomy, especially in resource-limited settings [17]. The purchase and maintenance costs of the sophisticated equipment required for these techniques, as well as the cost of reagents and consumables, can be prohibitive for many research institutions and laboratories, particularly in developing countries.
9.5. Ethnobotanical Knowledge
Another limitation of chemotaxonomy is a lack of comprehensive ethnobotanical knowledge, which could inform the classification and identification of medicinal plants [135]. Ethnobotanical data, which includes knowledge of traditional plant usage in indigenous communities, is crucial for understanding the medicinal properties of plants and their modern applications [143]. However, much of this knowledge has not been systematically documented and is at risk of being lost as traditional practices fade away.
10. Challenges in Chemotaxonomic Identification
Despite the importance of chemotaxonomy, several limitations do also exist. For instance, there is high variability in plant chemical compositions, which can be influenced by various factors, including environmental conditions, developmental stage, and genetics [29,144]. This variability can complicate identification and classification, leading to inconsistencies in chemotaxonomic results [145]. Another challenge is the high cost and complexity of the analytical techniques required for identifying plant metabolites [146]. Similarly, a lack of standardized methods for using these techniques creates an issue [147]. Moreover, there is a deficiency in globally harmonized chemotaxonomic reference databases, limiting comparative analysis across species or genera. These technical disparities hinder the development of universal chemotaxonomic frameworks, underscoring the need for method standardization, data sharing platforms, and the integration of AI to streamline identification and interpretation.
11. Future Directions in Chemotaxonomy
As new tools and methodologies are emerging, chemotaxonomy is becoming an even more integral part of medicinal plant research and applications. This section highlights several key areas we believe should be considered important focuses of future research.
11.1. Integration of Multi-Omics Approaches
The convergence of genomics, transcriptomics, proteomics, and metabolomics is revolutionizing chemotaxonomy. Tools like MEGA, Cytoscape, and WGCNA enable pathway prediction and co-expression analysis of plant chemical components [148]. Similarly, databases such as MetaboLights, HMDB, and KEGG provide metabolite–gene linkage data that can assist in taxonomic discrimination [149]. Serval multi-omics workflows have recently been applied to cyanobacteria and algae, demonstrating the potential to open new paths in chemotaxonomy [150], which in fact adds to its relevance.
11.2. Application of AI and ML
Both AI and ML models are consistently being used to classify plants based on high-dimensional chemical fingerprints. Random forests and support vector machines have already been used in LC-MS-based medicinal plant classification [151]. Similarly, deep learning-based bioinformatics tools, such as DeepChem and DeepMetabolome, can process large metabolomics datasets for automated taxonomic classification [152].
11.3. Development of Comprehensive Chemotaxonomic Databases
Given the increasing importance of chemotaxonomy, there exists a great need to develop interoperable and centralized platforms that integrate genomics, phytochemistry, and environmental metadata. Databases like KNApSAcK, PlantCyc, MassBank, and NPASS can be used as foundations to create these resources [153].
11.4. Digital Herbarium Platforms with Integrated Chemoprofiling
Future digital herbaria could integrate high-resolution images, geospatial metadata, and chemical fingerprints from NMR, LC-MS, GC-MS, etc., enabling greater taxonomic and biochemical insight [154]. Current systems, like GBIF and iDigBio, lack chemical metadata, which can be helpful in this case [155]. Digital herbaria should allow spectral searchability, further enhancing the synergy created by the integration of data types.
11.5. Synthetic Biology for Metabolic Pathway Validation
Synthetic biology allows scientists to validate biosynthetic pathways inferred from chemotaxonomic studies [156]. By expressing gene clusters in model organisms (e.g., Nicotiana benthamiana’s secondary metabolites expressed in yeast), the origin and regulation of secondary metabolites can be studied [157]. This approach is crucial for confirming gene function, regulatory sequences, and enzyme interactions that are otherwise obscured in native plant systems due to redundancy or low expression levels [158].
11.6. Chemotaxonomy in Conservation and Drug Discovery
By linking phytochemical diversity with taxonomic and phylogenetic information, chemotaxonomy can be a very helpful tool for conservation biology and drug discovery [27]. By identifying taxa with rich and unique metabolite profiles, chemotaxonomy can guide conservation efforts toward chemically and evolutionarily valuable species, many of which may currently be endangered or underexplored [139]. This approach enhances the efficiency of bioprospecting, as taxonomic proximity to known medicinal plants often predicts similar bioactivity profiles. For instance, the chemotaxonomic mapping of phytochemical “hotspots” in the plant kingdom has been proposed as a way to identify priorities for both biodiversity protection and pharmaceutical exploration [159] (Figure 5).
Figure 5.
Future directions in chemotaxonomical identification of medicinal plants.
12. Conclusions
Chemotaxonomy serves as a vital tool in medicinal plant research by enabling precise species identification and medicinal product authentication through the chemical profiling of secondary metabolites. Analytical techniques such as HPLC, GC-MS, NMR, LC-MS/QToF, and Fourier transform infrared spectroscopy are instrumental in detecting bioactive compounds, supporting both quality control and therapeutic agent discovery. When integrated with genomics and proteomics, chemotaxonomic approaches facilitate the elucidation of biosynthetic pathways, enhancing our understanding of metabolite diversity and function. Such methodologies are particularly critical for ensuring the safety and efficacy of herbal medicines by preventing misidentification and adulteration. Moreover, chemotaxonomy contributes to drug development by uncovering novel phytochemicals with pharmacological potential. Emerging trends in this field involve the incorporation of AI and ML to accelerate compound identification and classification. Despite these advancements, the standardization of analytical protocols and the development of comprehensive global databases remain key challenges. Continued progress in chemotaxonomy is expected to drive innovations in plant-based drug discovery and personalized phytotherapeutics and accelerate traditional medicine validation while promoting sustainable utilization of botanical resources.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants14142234/s1, Figure S1. Structure of diverse secondary metabolites (and their classes) commonly found in plants. Figure S2. Overview of plant secondary metabolites in correlation with plant identification.
Author Contributions
Conceptualization, writing—original draft preparation, resources, software, validation, visualization, A.A.; writing—review and editing, S.P. and A.A.; supervision, S.P. All authors have read and agreed to the published version of the manuscript.
Data Availability Statement
Data are contained within the article.
Conflicts of Interest
The authors declare no conflicts of interest.
Funding Statement
This research received no external funding.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.Kunhikannan C., Anju M.V., Prasanna R. Textbook of Forest Science. Springer; Berlin/Heidelberg, Germany: 2025. Taxonomy: Importance, Relevance and Application; pp. 65–85. [Google Scholar]
- 2.Rouhan G., Gaudeul M. Molecular Plant Taxonomy: Methods and Protocols. 2nd ed. Humana Press; Totowa, NJ, USA: 2021. Plant taxonomy: A historical perspective, current challenges, and perspectives; pp. 1–38. [DOI] [PubMed] [Google Scholar]
- 3.Zink R.M., Klicka L.B. The taxonomic basis of subspecies listed as threatened and endangered under the endangered species act. Front. Conserv. Sci. 2022;3:971280. doi: 10.3389/fcosc.2022.971280. [DOI] [Google Scholar]
- 4.Chaachouay N., Zidane L. Plant-derived natural products: A source for drug discovery and development. Drugs Drug Candidates. 2024;3:184–207. doi: 10.3390/ddc3010011. [DOI] [Google Scholar]
- 5.Gong X., Yang M., He C.N., Bi Y.Q., Zhang C.H., Li M.H., Xiao P.G. Plant pharmacophylogeny: Review and future directions. Chin. J. Integr. Med. 2022;28:567–574. doi: 10.1007/s11655-020-3270-9. [DOI] [PubMed] [Google Scholar]
- 6.Demilie W.B. Plant disease detection and classification techniques: A comparative study of the performances. J. Big Data. 2024;11:5. doi: 10.1186/s40537-023-00863-9. [DOI] [Google Scholar]
- 7.Antil S., Abraham J.S., Sripoorna S., Maurya S., Dagar J., Makhija S., Bhagat P., Gupta R., Sood U., Lal R. DNA barcoding, an effective tool for species identification: A review. Mol. Biol. Rep. 2023;50:761–775. doi: 10.1007/s11033-022-08015-7. [DOI] [PubMed] [Google Scholar]
- 8.Wang M., Lin H., Lin H., Du P., Zhang S. From species to varieties: How modern sequencing technologies are shaping Medicinal Plant Identification. Genes. 2024;16:16. doi: 10.3390/genes16010016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zaman W., Ayaz A., Park S. Integrating morphological and molecular data in plant taxonomy. Pak. J. Bot. 2025;57:1453–1466. doi: 10.30848/PJB2025-4(27). [DOI] [Google Scholar]
- 10.Paul T., Kumar K.J. Role of Herbal Medicines: Management of Lifestyle Diseases. Springer; Berlin/Heidelberg, Germany: 2024. Standardization of Herbal Medicines for Lifestyle Diseases; pp. 545–557. [Google Scholar]
- 11.Intharuksa A., Phrutivorapongkul A., Thongkhao K. Integrating DNA barcoding, microscopic, and chemical analyses for precise identification of Plumbago indica L., A prominent medicinal plant. Microchem. J. 2024;199:110038. doi: 10.1016/j.microc.2024.110038. [DOI] [Google Scholar]
- 12.Erdtman G. Pollen Morphology and Plant Taxonomy: Angiosperms (An Untroduction to Palynology) Brill; Leiden, The Netherlands: 2023. [Google Scholar]
- 13.Chac L.D., Thinh B.B. Species identification through DNA barcoding and its applications: A review. Biol. Bull. 2023;50:1143–1156. doi: 10.1134/S106235902360229X. [DOI] [Google Scholar]
- 14.Shen S., Zhan C., Yang C., Fernie A.R., Luo J. Metabolomics-centered mining of plant metabolic diversity and function: Past decade and future perspectives. Mol. Plant. 2023;16:43–63. doi: 10.1016/j.molp.2022.09.007. [DOI] [PubMed] [Google Scholar]
- 15.Waris M., Kocak E., Gonulalan E.M., Demirezer L.O., Kır S., Nemutlu E. Metabolomics analysis insight into medicinal plant science. TrAC Trends Anal. Chem. 2022;157:116795. doi: 10.1016/j.trac.2022.116795. [DOI] [Google Scholar]
- 16.Zhu W., Li W., Zhang H., Li L. Big data and artificial intelligence-aided crop breeding: Progress and prospects. J. Integr. Plant Biol. 2025;67:722–739. doi: 10.1111/jipb.13791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mali S., Yadav R., Gauttam V., Sawale J. An Updated Review on Taxonomy and Chemotaxonomy. Toxicol. Int. 2023;30:121–129. doi: 10.18311/ti/2023/v30i1/32123. [DOI] [Google Scholar]
- 18.Huymann L.R., Hannecker A., Giovanni T., Liimatainen K., Niskanen T., Probst M., Peintner U., Siewert B. Revised taxon definition in European Cortinarius subgenus Dermocybe based on phylogeny, chemotaxonomy, and morphology. Mycol. Prog. 2024;23:26. doi: 10.1007/s11557-024-01959-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zhang X., Yang L.-E., Hu Y., Wu X., Wang Z., Miao Y., Sun H., Nie Z., Tan N. Integrating morphology, molecular phylogeny and chemotaxonomy for the most effective authentication in Chinese Rubia with insights into origin and distribution of characteristic Rubiaceae-type cyclopeptides. Ind. Crops Prod. 2023;191:115775. doi: 10.1016/j.indcrop.2022.115775. [DOI] [Google Scholar]
- 20.Lawson P.A., Patel N.B. Trends in the Systematics of Bacteria and Fungi. CABI Publishing; Wallingford, UK: 2021. The strength of chemotaxonomy; pp. 141–167. [Google Scholar]
- 21.Wu Q., Tong W., Zhao H., Ge R., Li R., Huang J., Li F., Wang Y., Mallano A.I., Deng W. Comparative transcriptomic analysis unveils the deep phylogeny and secondary metabolite evolution of 116 Camellia plants. Plant J. 2022;111:406–421. doi: 10.1111/tpj.15799. [DOI] [PubMed] [Google Scholar]
- 22.Weng J.-K., Lynch J.H., Matos J.O., Dudareva N. Adaptive mechanisms of plant specialized metabolism connecting chemistry to function. Nat. Chem. Biol. 2021;17:1037–1045. doi: 10.1038/s41589-021-00822-6. [DOI] [PubMed] [Google Scholar]
- 23.Kowsalya K., Halka J., Ramalashmi K., Anand A.V., Razia M., Anand M., Begam A.K.U., Arun M. Biotechnology, Multiple Omics, and Precision Breeding in Medicinal Plants. CRC Press; Boca Raton, FL, USA: 2025. Molecular Marker-Assisted Breeding in Medicinal Plants for Enhanced Secondary Metabolites Production; pp. 160–176. [Google Scholar]
- 24.Peters K., Blatt-Janmaat K.L., Tkach N., van Dam N.M., Neumann S. Untargeted metabolomics for integrative taxonomy: Metabolomics, DNA marker-based sequencing, and phenotype bioimaging. Plants. 2023;12:881. doi: 10.3390/plants12040881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Rebiai A., Hemmami H., Zeghoud S., Ben Seghir B., Kouadri I., Eddine L.S., Elboughdiri N., Ghareba S., Ghernaout D., Abbas N. Current application of chemometrics analysis in authentication of natural products: A review. Comb. Chem. High Throughput Screen. 2022;25:945–972. doi: 10.2174/1386207324666210309102239. [DOI] [PubMed] [Google Scholar]
- 26.Parastar H., Tauler R. Big (bio) chemical data mining using chemometric methods: A need for chemists. Angew. Chem. 2022;134:e201801134. doi: 10.1002/ange.201801134. [DOI] [PubMed] [Google Scholar]
- 27.Umoh O.T. Chemotaxonomy: The role of phytochemicals in chemotaxonomic delineation of taxa. taxon. 2020;13:14. doi: 10.9734/aprj/2020/v5i130100. [DOI] [Google Scholar]
- 28.Gershenzon J., Mabry T.J. Secondary metabolites and the higher classification of angiosperms. Nord. J. Bot. 1983;3:5–34. doi: 10.1111/j.1756-1051.1983.tb01442.x. [DOI] [Google Scholar]
- 29.Li Y., Kong D., Fu Y., Sussman M.R., Wu H. The effect of developmental and environmental factors on secondary metabolites in medicinal plants. Plant Physiol. Biochem. 2020;148:80–89. doi: 10.1016/j.plaphy.2020.01.006. [DOI] [PubMed] [Google Scholar]
- 30.Bocso N.-S., Butnariu M. The biological role of primary and secondary plants metabolites. J. Nutr. Food Process. 2022;5:1–7. [Google Scholar]
- 31.Salam U., Ullah S., Tang Z.-H., Elateeq A.A., Khan Y., Khan J., Khan A., Ali S. Plant metabolomics: An overview of the role of primary and secondary metabolites against different environmental stress factors. Life. 2023;13:706. doi: 10.3390/life13030706. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ahmed E., Arshad M., Khan M.Z., Amjad M.S., Sadaf H.M., Riaz I., Sabir S., Ahmad N. Secondary metabolites and their multidimensional prospective in plant life. J. Pharmacogn. Phytochem. 2017;6:205–214. [Google Scholar]
- 33.Böttger A., Vothknecht U., Bolle C., Wolf A., Böttger A., Vothknecht U., Bolle C., Wolf A. Lessons on Caffeine, Cannabis & Co: Plant-Derived Drugs and Their Interaction with Human Receptors. Springer; Berlin/Heidelberg, Germany: 2018. Plant secondary metabolites and their general function in plants; pp. 3–17. [Google Scholar]
- 34.Rai S.K., Chauhan R. Phytochemicals in Medicinal Plants: Biodiversity, Bioactivity and Drug Discovery. De Gruyter; Berlin, Germany: 2023. Phytochemicals in drug discovery.331p [Google Scholar]
- 35.Ekiert H.M., Szopa A. Biological activities of natural products. Molecules. 2020;25:5769. doi: 10.3390/molecules25235769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hounsome N., Hounsome B., Lobo M.G. Handbook of Vegetables and Vegetable Processing. Wiley; Hoboken, NJ, USA: 2018. Biochemistry of Vegetables: Major Classes of Primary Metabolites (Carbohydrates, Amino Acids, Vitamins, Organic Acids, and Fatty Acids) pp. 25–46. [Google Scholar]
- 37.Kandar C.C. Bioactive Natural Products for Pharmaceutical Applications. Springer; Berlin/Heidelberg, Germany: 2020. Secondary metabolites from plant sources; pp. 329–377. [Google Scholar]
- 38.Simsek M., Whitney K. Examination of primary and secondary metabolites associated with a plant-based diet and their impact on human health. Foods. 2024;13:1020. doi: 10.3390/foods13071020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Cope J.S., Corney D., Clark J.Y., Remagnino P., Wilkin P. Plant species identification using digital morphometrics: A review. Expert Syst. Appl. 2012;39:7562–7573. doi: 10.1016/j.eswa.2012.01.073. [DOI] [Google Scholar]
- 40.Duminil J., Di Michele M. Plant species delimitation: A comparison of morphological and molecular markers. Plant Biosyst. 2009;143:528–542. doi: 10.1080/11263500902722964. [DOI] [Google Scholar]
- 41.Rouhan G., Gaudeul M. Molecular Plant Taxonomy: Methods and Protocols. Humana Press; Totowa, NJ, USA: 2014. Plant taxonomy: A historical perspective, current challenges, and perspectives; pp. 1–37. [DOI] [PubMed] [Google Scholar]
- 42.Hadacek F. Secondary metabolites as plant traits: Current assessment and future perspectives. Crit. Rev. Plant Sci. 2002;21:273–322. doi: 10.1080/0735-260291044269. [DOI] [Google Scholar]
- 43.Julier A.C.M., Jardine P.E., Coe A.L., Gosling W.D., Lomax B.H., Fraser W.T. Chemotaxonomy as a tool for interpreting the cryptic diversity of Poaceae pollen. Rev. Palaeobot. Palynol. 2016;235:140–147. doi: 10.1016/j.revpalbo.2016.08.004. [DOI] [Google Scholar]
- 44.Marne P.A., Pawar A.T., Tagalpallewar A.A., Baheti A.M. Pharmacognosy and Phytochemistry: Principles, Techniques, and Clinical Applications. Wiley; Hoboken, NJ, USA: 2025. Comparative Phytochemistry and Chemotaxonomy; pp. 333–346. [Google Scholar]
- 45.Şentürk B., Aytar E.C., Durmaz A., İşler S., Deniz İ.G., Kömpe Y.Ö. Morphological and chemical characteristics of Fritillaria species: Species differentiation through morphometric measurements and GC-MS analysis. Plant Genet. Resour. 2025;23:217–227. doi: 10.1017/S1479262124000686. [DOI] [Google Scholar]
- 46.Abdelfattah H.M.E., Hussein H.A., Teleb S.S., El-Demerdash M.M., George N.M. Chemotaxonomy compared to morphological and anatomical taxonomy of five Hibiscus species. J. Plant Res. 2024;137:967–984. doi: 10.1007/s10265-024-01566-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Bagul V.S., Bafna P.S., Patil D.M., Mutha R.E. Pharmacognosy and Phytochemistry: Principles, Techniques, and Clinical Applications. Wiley; Hoboken, NJ, USA: 2025. Classification of Crude Drugs of Natural Origin; pp. 17–44. [Google Scholar]
- 48.Chaudhary M.K., Misra A., Kumar B., Srivastava S. Traditional Medicines in Drug Discovery and Development. Cambridge Scholars Publishing; Newcastle upon Tyne, UK: 2024. Chemotaxonomy-Guided Effective Herbal Product Development: Asustainable Model For Ayush Industries. [Google Scholar]
- 49.Li Y., Cheng Y., Zhang Y., Nan H., Lin N., Chen Q. Quality markers of Polygala fallax Hemsl decoction based on qualitative and quantitative analysis combined with network pharmacology and chemometric analysis. Phytochem. Anal. 2024;35:1496–1508. doi: 10.1002/pca.3380. [DOI] [PubMed] [Google Scholar]
- 50.Banerjee D., Parekh N., Kaur G., Sharma S., Buttar H.S., Chauhan R., Kaur D., Tuli H.S., Jairoun A.A., Shahwan M. Chemometric perspective on herbal medicine evaluation: Tools, techniques, and trends. J. Appl. Pharm. Sci. 2025;15:085–093. doi: 10.7324/JAPS.2025.226539. [DOI] [Google Scholar]
- 51.Singhal A., Saini U., Chopra B., Dhingra A.K., Jain A., Chaudhary J. Uv-visible spectroscopy: A review on its pharmaceutical and bio-allied sciences applications. Curr. Pharm. Anal. 2024;20:161–177. doi: 10.2174/0115734129300562240408042614. [DOI] [Google Scholar]
- 52.Lestari N.B., Sulistyaningsih Y.C., Umar A.H., Ratnadewi D. Distribution and FTIR-based fingerprint of secondary metabolites in different organs of ant-plant (Myrmecodia tuberosa) Biodiversitas J. Biol. Divers. 2024;25:3. doi: 10.13057/biodiv/d250325. [DOI] [Google Scholar]
- 53.Halder M., Kundu A., Jha S. Plant Specialized Metabolites: Phytochemistry, Ecology and Biotechnology. Springer; Berlin/Heidelberg, Germany: 2024. Secondary Metabolites Identification Techniques of the Current Era; pp. 1–41. [Google Scholar]
- 54.Bajo-Fernández M., Souza-Silva É.A., Barbas C., Rey-Stolle M.F., García A. GC-MS-based metabolomics of volatile organic compounds in exhaled breath: Applications in health and disease. A review. Front. Mol. Biosci. 2024;10:1295955. doi: 10.3389/fmolb.2023.1295955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Verma A., Chattopadhaya A., Gupta P., Tiwari H., Singh S., Kumar L., Gautam V. Integration of Hyphenated Techniques for Characterizing and Chemical Profiling of Natural Products. Chem. Biodivers. 2025:e202500234. doi: 10.1002/cbdv.202500234. early view . [DOI] [PubMed] [Google Scholar]
- 56.Marston A. Role of advances in chromatographic techniques in phytochemistry. Phytochemistry. 2007;68:2786–2798. doi: 10.1016/j.phytochem.2007.08.004. [DOI] [PubMed] [Google Scholar]
- 57.Usman M., Khan W.R., Yousaf N., Akram S., Murtaza G., Kudus K.A., Ditta A., Rosli Z., Rajpar M.N., Nazre M. Exploring the phytochemicals and anti-cancer potential of the members of Fabaceae family: A comprehensive review. Molecules. 2022;27:3863. doi: 10.3390/molecules27123863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Šibul F., Orčić D., Vasić M., Anačkov G., Nađpal J., Savić A., Mimica-Dukić N. Phenolic profile, antioxidant and anti-inflammatory potential of herb and root extracts of seven selected legumes. Ind. Crops Prod. 2016;83:641–653. doi: 10.1016/j.indcrop.2015.12.057. [DOI] [Google Scholar]
- 59.Ganzera M., Krüger A., Wink M. Determination of quinolizidine alkaloids in different Lupinus species by NACE using UV and MS detection. J. Pharm. Biomed. Anal. 2010;53:1231–1235. doi: 10.1016/j.jpba.2010.05.030. [DOI] [PubMed] [Google Scholar]
- 60.Li J., Dadmohammadi Y., Abbaspourrad A. Flavor components, precursors, formation mechanisms, production and characterization methods: Garlic, onion, and chili pepper flavors. Crit. Rev. Food Sci. Nutr. 2022;62:8265–8287. doi: 10.1080/10408398.2021.1926906. [DOI] [PubMed] [Google Scholar]
- 61.Gui Z.-Q., Yuan X.-L., Yang J., Du Y.-M., Zhang P. An updated review on chemical constituents from Nicotiana tabacum L.: Chemical diversity and pharmacological properties. Ind. Crops Prod. 2024;214:118497. doi: 10.1016/j.indcrop.2024.118497. [DOI] [Google Scholar]
- 62.Shinyuy L.M., Loe G.E., Jansen O., Mamede L., Ledoux A., Noukimi S.F., Abenwie S.N., Ghogomu S.M., Souopgui J., Robert A. Secondary metabolites isolated from Artemisia afra and Artemisia annua and their anti-malarial, anti-inflammatory and immunomodulating properties—Pharmacokinetics and pharmacodynamics: A review. Metabolites. 2023;13:613. doi: 10.3390/metabo13050613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Ahmadi F. Phytochemistry, Mechanisms, and Preclinical Studies of Echinacea Extracts in Modulating Immune Responses to Bacterial and Viral Infections: A Comprehensive Review. Antibiotics. 2024;13:947. doi: 10.3390/antibiotics13100947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Alam F., Mohammadin K., Shafique Z., Amjad S.T., Asad M.H.H. Citrus flavonoids as potential therapeutic agents: A review. Phytother. Res. 2022;36:1417–1441. doi: 10.1002/ptr.7261. [DOI] [PubMed] [Google Scholar]
- 65.López-Parra M.B., Gómez-Domínguez I., Iriondo-DeHond M., Villamediana Merino E., Sánchez-Martín V., Mendiola J.A., Iriondo-DeHond A., Del Castillo M.D. The impact of the drying process on the antioxidant and anti-inflammatory potential of dried ripe coffee cherry pulp soluble powder. Foods. 2024;13:1114. doi: 10.3390/foods13071114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Patil S., Shankar S.R., Murthy P.S. Impact of different varieties and mature stages on phytochemicals from Coffea arabica and Coffea robusta leaves. Biochem. Syst. Ecol. 2023;110:104699. doi: 10.1016/j.bse.2023.104699. [DOI] [Google Scholar]
- 67.Hajlaoui H., Arraouadi S., Noumi E., Aouadi K., Adnan M., Khan M.A., Kadri A., Snoussi M. Antimicrobial, antioxidant, anti-acetylcholinesterase, antidiabetic, and pharmacokinetic properties of Carum carvi L. and Coriandrum sativum L. essential oils alone and in combination. Molecules. 2021;26:3625. doi: 10.3390/molecules26123625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Hemida M.H.S. Ph.D. Thesis. University of Tasmania; Hobart, Australia: 2021. Portable liquid chromatography for on-site analysis. [Google Scholar]
- 69.Marittimo N., Grasselli G., Arigò A., Famiglini G., Agostini M., Renzoni C., Palma P., Cappiello A. Liquid electron ionization-mass spectrometry as a novel strategy for integrating normal-phase liquid chromatography with low and high-resolution mass spectrometry. Analyst. 2024;149:2664–2670. doi: 10.1039/D3AN02109B. [DOI] [PubMed] [Google Scholar]
- 70.Dathe B., Otto M. Library search for HPLC diode array detection based on spectral interpolation. Chromatographia. 1993;37:31–36. doi: 10.1007/BF02272184. [DOI] [Google Scholar]
- 71.Pinto T., Aires A., Cosme F., Bacelar E., Morais M.C., Oliveira I., Ferreira-Cardoso J., Anjos R., Vilela A., Gonçalves B. Bioactive (poly) phenols, volatile compounds from vegetables, medicinal and aromatic plants. Foods. 2021;10:106. doi: 10.3390/foods10010106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Osarieme Imade R., Adesina Ayinde B., Alam A. GC-MS Analysis and In Vitro Cytotoxic Effects of Ocimum gratissimum (Lamiaceae) Volatile Oil and Thymol on Cancer Cells. Pharm. Biomed. Res. 2023;9:115–124. doi: 10.32598/PBR.9.2.1110.1. [DOI] [Google Scholar]
- 73.Tian Y., Xu Z., Liu Z., Zhu R., Zhang F., Liu Z., Si X. Botanical discrimination and classification of Mentha plants applying two-chiral column tandem GC–MS analysis of eight menthol enantiomers. Food Res. Int. 2022;162:112035. doi: 10.1016/j.foodres.2022.112035. [DOI] [PubMed] [Google Scholar]
- 74.Galovičová L., Borotová P., Valková V., Vukovic N.L., Vukic M., Štefániková J., Ďúranová H., Kowalczewski P.Ł., Čmiková N., Kačániová M. Thymus vulgaris essential oil and its biological activity. Plants. 2021;10:1959. doi: 10.3390/plants10091959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Wei S., Wei L., Xie B., Li J., Lyu J., Wang S., Khan M.A., Xiao X., Yu J. Characterization of volatile profile from different coriander (Coriandrum sativum L.) varieties via HS-SPME/GC–MS combined with E-nose analyzed by chemometrics. Food Chem. 2024;457:140128. doi: 10.1016/j.foodchem.2024.140128. [DOI] [PubMed] [Google Scholar]
- 76.Sevindik E., Aydın S., Sujka M., Apaydın E., Yıldırım K., Palas G. GC-MS analysis and evaluation of antibacterial and antifungal activity of essential oils extracted from fruit Peel of Citrus aurantium L. (Rutaceae) Grown in the West Anatolian Area. Erwerbs-Obstbau. 2021;63:135–142. doi: 10.1007/s10341-021-00549-7. [DOI] [Google Scholar]
- 77.Al-Busaidi Y.S.A. Ph.D. Thesis. Sultan Qaboos University; Seeb, Oman: 2024. Aroma Profile Characterization of Selected Types of Omani Honey by Gas Chromatography-Mass Spectrometry and Electronic Nose. [Google Scholar]
- 78.Tufariello M., Pati S., Palombi L., Grieco F., Losito I. Use of multivariate statistics in the processing of data on wine volatile compounds obtained by HS-SPME-GC-MS. Foods. 2022;11:910. doi: 10.3390/foods11070910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Asmaey M.A. Propagation to Pharmacopeia. CRC Press; Boca Raton, FL, USA: 2024. Utilizing GC-MS and UPLC-MS for rapid metabolomic analysis in medicinal plants; pp. 361–378. [Google Scholar]
- 80.Nengovhela N. Master’s Thesis. University of Venda; South Africa: 2021. Investigation of Biochemical Strategies Leading to Metabolome Complexity of Two Closely Related Coccinia Species Through LC-QTOF-MS-based Metabolite Fingerprinting. [Google Scholar]
- 81.Li L., He L., Su X., Amu H., Li J., Zhang Z. Chemotaxonomy of Aster species from the Qinghai-Tibetan Plateau based on metabolomics. Phytochem. Anal. 2022;33:23–39. doi: 10.1002/pca.3058. [DOI] [PubMed] [Google Scholar]
- 82.Aydin E. Phytochemicals from Phillyrea latifolia L. leaves and fruit extracted with various solvents: Their identification and quantification by LC-MS and antihyperglycemic effects. Folia Hortic. 2023;35:233–242. doi: 10.2478/fhort-2023-0018. [DOI] [Google Scholar]
- 83.Kandyliari A., Potsaki P., Bousdouni P., Kaloteraki C., Christofilea M., Almpounioti K., Moutsou A., Fasoulis C.K., Polychronis L.V., Gkalpinos V.K. Development of dairy products fortified with plant extracts: Antioxidant and phenolic content characterization. Antioxidants. 2023;12:500. doi: 10.3390/antiox12020500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Mahboubifar M., Zidorn C., Farag M.A., Zayed A., Jassbi A.R. Chemometric-based drug discovery approaches from natural origins using hyphenated chromatographic techniques. Phytochem. Anal. 2024;35:990–1016. doi: 10.1002/pca.3382. [DOI] [PubMed] [Google Scholar]
- 85.Cong W. Ph.D. Thesis. The Ohio State University; Columbus, OH, USA: 2021. Identification of Compounds that Impact Whole Wheat Bread Flavor Liking Using LC-MS Flavoromics. [Google Scholar]
- 86.Munyoki N.M. Master’s Thesis. University of Nairobi; Nairobi, Kenya: 2023. Phytochemical Characterization and Biopesticidal Activity of Aqueous Plant Extracts Against French Bean thrips and Aphids. [Google Scholar]
- 87.Goya L., de Pascual-Teresa S. Effects of polyphenol-rich foods on chronic diseases. Nutrients. 2023;15:4134. doi: 10.3390/nu15194134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Zhu Y., Zhang J., Wang C., Zheng T., Di S., Wang Y., Fei W., Liang W., Wang L. Ameliorative effect of ethanolic echinacea purpurea against hyperthyroidism-induced oxidative stress via AMRK and PPAR signal pathway using transcriptomics and network pharmacology analysis. Int. J. Mol. Sci. 2022;24:187. doi: 10.3390/ijms24010187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Cong W., Tello E., Simons C.T., Peterson D.G. Identification of non-volatile compounds that impact flavor disliking of whole wheat bread made with aged flours. Molecules. 2022;27:1331. doi: 10.3390/molecules27041331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Ali A., Bashmil Y.M., Cottrell J.J., Suleria H.A.R., Dunshea F.R. Lc-ms/ms-qtof screening and identification of phenolic compounds from australian grown herbs and their antioxidant potential. Antioxidants. 2021;10:1770. doi: 10.3390/antiox10111770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Hu Y., Schnaubelt M., Chen L., Zhang B., Hoang T., Lih T.M., Zhang Z., Zhang H. MS-PyCloud: A cloud computing-based pipeline for proteomic and glycoproteomic data analyses. Anal. Chem. 2024;96:10145–10151. doi: 10.1021/acs.analchem.3c01497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Liu L.F., Li M., Mao R.S. Design and implementation of mass spectrometer database; Proceedings of the 13th International Beam Instrumentation Conference; Beijing, China. 9–13 September 2024; pp. 439–443. [Google Scholar]
- 93.Alaribe S.C., Titilayo B.E., Ikpatt U.N., Oladipupo A.R., Alabi F.A., Ikwugbado R.I. Computational methods for advanced mass spectrometry—A review. Phys. Sci. Rev. 2025 doi: 10.1515/psr-2024-0006. [DOI] [Google Scholar]
- 94.Agarwal V., Bajpai M. Imaging and Non-imaging Analytical Techniques Used for Drug Nanosizing and their Patents: An Overview. Recent Pat. Nanotechnol. 2024;18:494–518. doi: 10.2174/0118722105243388230920013508. [DOI] [PubMed] [Google Scholar]
- 95.Bonatto C.C., Silva L.P. A MALDI-TOF mass spectrometry-based approach for molecular profiling of leaves from pasture and feed forages species. Biochem. Syst. Ecol. 2021;94:104215. doi: 10.1016/j.bse.2020.104215. [DOI] [Google Scholar]
- 96.Mejía-Morales C., Rodríguez-Macías R., Salcedo-Pérez E., Zamora-Natera J.F., Rodríguez-Zaragoza F.A., Molina-Torres J., Délano-Frier J.P., Zañudo-Hernández J. Contrasting metabolic fingerprints and seed protein profiles of Cucurbita foetidissima and C. Radicans fruits from feral plants sampled in central Mexico. Plants. 2021;10:2451. doi: 10.3390/plants10112451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Greene L.A., Isaac I., Gray D.E., Schwartz S.A. Streamlining plant sample preparation: The use of high-throughput robotics to process echinacea samples for biomarker profiling by MALDI-TOF mass spectrometry. J. Biomol. Tech. 2007;18:238–244. [PMC free article] [PubMed] [Google Scholar]
- 98.Hicks J.C.E., Dávila A.A.C., Villarruel D.S.-A. Food Security, Safety, and Sustainability. Apple Academic Press; Palm Bay, FL, USA: 2025. Insights into Nuclear Magnetic Resonance Spectroscopy as a Tool for Metabolic Markers in Pre-and PostHarvest Sustainable Production of Food Crops; pp. 145–163. [Google Scholar]
- 99.Brigante F.I., Solovyev P., Bontempo L. Nuclear magnetic resonance applications in Fermented Foods and Plant-based beverages: Challenges and opportunities. Food Rev. Int. 2024;40:3370–3397. doi: 10.1080/87559129.2024.2355271. [DOI] [Google Scholar]
- 100.Batiha G.E.-S., Shaheen H.M., Elhawary E.A., Mostafa N.M., Eldahshan O.A., Sabatier J.-M. Phytochemical constituents, folk medicinal uses, and biological activities of genus angelica: A review. Molecules. 2022;28:267. doi: 10.3390/molecules28010267. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Abdul-Jalil T.Z., Ibrahim R.M., Nassir Z.S. Extraction, isolation and structure elucidation of two phenolic acids from aerial parts of celery and coriander. Biomed. Pharmacol. J. 2023;16:2315–2332. doi: 10.13005/bpj/2807. [DOI] [Google Scholar]
- 102.Wang Z.-F., You Y.-L., Li F.-F., Kong W.-R., Wang S.-Q. Research progress of NMR in natural product quantification. Molecules. 2021;26:6308. doi: 10.3390/molecules26206308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Otify A.M., Serag A., Porzel A., Wessjohann L.A., Farag M.A. NMR Metabolome-based classification of Cymbopogon species: A prospect for phyto-equivalency of its different accessions using chemometric tools. Food Anal. Methods. 2022;15:2095–2106. doi: 10.1007/s12161-022-02257-8. [DOI] [Google Scholar]
- 104.Fabry P., Weber S., Teipel J., Richling E., Walch S.G., Lachenmeier D.W. Quantitative NMR Spectrometry of Phenylpropanoids, including Isoeugenol in Herbs, Spices, and Essential Oils. Foods. 2024;13:720. doi: 10.3390/foods13050720. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Zhao J., Wang M., Lee J., Ali Z., Khan I.A. Characterization, Differentiation, and Adulteration Detection of Peppermint Essential Oil: An NMR Approach. J. Pharm. Biomed. Anal. 2025;263:116941. doi: 10.1016/j.jpba.2025.116941. [DOI] [PubMed] [Google Scholar]
- 106.Alberts P.S.F., Meyer J.J.M. Integrating chemotaxonomic-based metabolomics data with DNA barcoding for plant identification: A case study on south-east African Erythroxylaceae species. South Afr. J. Botany. 2022;146:174–186. doi: 10.1016/j.sajb.2021.10.005. [DOI] [Google Scholar]
- 107.Sawhney G., Sharma M., Agrawal N. Computation in BioInformatics: Multidisciplinary Applications. Wiley; Hoboken, NJ, USA: 2021. DNA-RNA Barcoding and Gene Sequencing; pp. 165–227. [Google Scholar]
- 108.Khan S. Omics Studies of Medicinal Plants. CRC Press; Boca Raton, FL, USA: 2023. Genome-based Barcoding for the Authentication of Medicinal Plants and Their Products; pp. 123–136. [Google Scholar]
- 109.Letsiou S., Madesis P., Vasdekis E., Montemurro C., Grigoriou M.E., Skavdis G., Moussis V., Koutelidakis A.E., Tzakos A.G. DNA Barcoding as a Plant Identification Method. Appl. Sci. 2024;14:1415. doi: 10.3390/app14041415. [DOI] [Google Scholar]
- 110.Gostel M.R., Kress W.J. The expanding role of DNA barcodes: Indispensable tools for ecology, evolution, and conservation. Diversity. 2022;14:213. doi: 10.3390/d14030213. [DOI] [Google Scholar]
- 111.Caesar L.K., Montaser R., Keller N.P., Kelleher N.L. Metabolomics and genomics in natural products research: Complementary tools for targeting new chemical entities. Nat. Prod. Rep. 2021;38:2041–2065. doi: 10.1039/D1NP00036E. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Peltomaa E., Asikainen H., Blomster J., Pakkanen H., Rigaud C., Salmi P., Taipale S. Phytoplankton group identification with chemotaxonomic biomarkers: In combination they do better. Phytochemistry. 2023;209:113624. doi: 10.1016/j.phytochem.2023.113624. [DOI] [PubMed] [Google Scholar]
- 113.Diaz M.A.L., Del Fueyo G.M., D’Angelo J.A., Carrizo M.A. How much does Chemotaxonomy help to resolve the overrepresented Cycadolepis Saporta (Bennettitales)? A case study of the Cretaceous of Patagonia, Argentina. Rev. Palaeobot. Palynol. 2021;293:104489. doi: 10.1016/j.revpalbo.2021.104489. [DOI] [Google Scholar]
- 114.Lau H., Tan L.H., Ee L.Y., Dayal H., Lim S.Y., Liu F., Li S.F.Y. Application of 1H-NMR-and LC-MS based Metabolomic analysis for the evaluation of celery preservation methods. LWT. 2022;169:113938. doi: 10.1016/j.lwt.2022.113938. [DOI] [Google Scholar]
- 115.Lee N., Yoo H., Yang H. Cluster analysis of medicinal plants and targets based on multipartite network. Biomolecules. 2021;11:546. doi: 10.3390/biom11040546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Zhan C., Shen S., Yang C., Liu Z., Fernie A.R., Graham I.A., Luo J. Plant metabolic gene clusters in the multi-omics era. Trends Plant Sci. 2022;27:981–1001. doi: 10.1016/j.tplants.2022.03.002. [DOI] [PubMed] [Google Scholar]
- 117.Tang D., Wei F., Cai Z., Wei Y., Khan A., Miao J., Wei K. Analysis of codon usage bias and evolution in the chloroplast genome of Mesona chinensis Benth. Dev. Genes Evol. 2021;231:1–9. doi: 10.1007/s00427-020-00670-9. [DOI] [PubMed] [Google Scholar]
- 118.Li G., Lin P., Wang K., Gu C.-C., Kusari S. Artificial intelligence-guided discovery of anticancer lead compounds from plants and associated microorganisms. Trends Cancer. 2022;8:65–80. doi: 10.1016/j.trecan.2021.10.002. [DOI] [PubMed] [Google Scholar]
- 119.Srivastava M., Nandan S., Zaidi A., Samani A.S., Shukla V., Aslam H., Srivastava A., Maurya P., Khan M.A., Khan M.F. Artificial Intelligence Driven Applications in Analytical Chemistry, Drug Discovery, and Food Science: Advancements, Outlook, and Challenges. ChemistrySelect. 2025;10:e202404446. doi: 10.1002/slct.202404446. [DOI] [Google Scholar]
- 120.Dey B., Ferdous J., Ahmed R., Hossain J. Assessing deep convolutional neural network models and their comparative performance for automated medicinal plant identification from leaf images. Heliyon. 2024;10:e23655. doi: 10.1016/j.heliyon.2023.e23655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Zhong C., Li L., Wang Y.-Z. Applications of chemical fingerprints and machine learning in plant ecology: Recent progress and future perspectives. Microchem. J. 2024;206:111447. doi: 10.1016/j.microc.2024.111447. [DOI] [Google Scholar]
- 122.Chi J., Shu J., Li M., Mudappathi R., Jin Y., Lewis F., Boon A., Qin X., Liu L., Gu H. Artificial intelligence in metabolomics: A current review. TrAC Trends Anal. Chem. 2024;178:117852. doi: 10.1016/j.trac.2024.117852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Hosseini M., Pereira D.M. The chemical space of terpenes: Insights from data science and AI. Pharmaceuticals. 2023;16:202. doi: 10.3390/ph16020202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Kang M., Ko E., Mersha T.B. A roadmap for multi-omics data integration using deep learning. Brief. Bioinform. 2022;23:bbab454. doi: 10.1093/bib/bbab454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Vahabi N., Michailidis G. Unsupervised multi-omics data integration methods: A comprehensive review. Front. Genet. 2022;13:854752. doi: 10.3389/fgene.2022.854752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Sarkar J., Singh R., Chandel S. Understanding LC/MS-Based Metabolomics: A Detailed Reference for Natural Product Analysis. Proteom. Clin. Appl. 2025;19:e202400048. doi: 10.1002/prca.202400048. [DOI] [PubMed] [Google Scholar]
- 127.Bajorath J., Chávez-Hernández A.L., Duran-Frigola M., Fernández-de Gortari E., Gasteiger J., López-López E., Maggiora G.M., Medina-Franco J.L., Méndez-Lucio O., Mestres J. Chemoinformatics and artificial intelligence colloquium: Progress and challenges in developing bioactive compounds. J. Cheminformatics. 2022;14:82. doi: 10.1186/s13321-022-00661-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Qin Y., Li C., Shi X., Wang W. MLP-based regression prediction model for compound bioactivity. Front. Bioeng. Biotechnol. 2022;10:946329. doi: 10.3389/fbioe.2022.946329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Ishfaq M., Aamir M., Ahmad F., M Mebed A., Elshahat S. Machine learning-assisted prediction of the biological activity of aromatase inhibitors and data mining to explore similar compounds. ACS Omega. 2022;7:48139–48149. doi: 10.1021/acsomega.2c06174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Pérez Santín E., Rodríguez Solana R., González García M., García Suárez M.D.M., Blanco Díaz G.D., Cima Cabal M.D., Moreno Rojas J.M., López Sánchez J.I. Toxicity prediction based on artificial intelligence: A multidisciplinary overview. Wiley WIREs Comput. Mol. Sci. 2021;11:e1516. doi: 10.1002/wcms.1516. [DOI] [Google Scholar]
- 131.Filimonov D.A., Druzhilovskiy D.S., Lagunin A.A., Gloriozova T.A., Rudik A.V., Dmitriev A.V., Pogodin P.V., Poroikov V.V. Computer-aided prediction of biological activity spectra for chemical compounds: Opportunities and limitations. Biomed. Chem. Res. Methods. 2018;1:e00004. doi: 10.18097/BMCRM00004. [DOI] [Google Scholar]
- 132.Dial K. Ph.D. Thesis. McMaster University; Hamilton, ON, Canada: 2023. Democratising Deep Learning In Microbial Metabolites Research. [Google Scholar]
- 133.Soylu N.N., Sefer E. DeepPTM: Protein post-translational modification prediction from protein sequences by combining deep protein language model with vision transformers. Curr. Bioinform. 2024;19:810–824. doi: 10.2174/0115748936283134240109054157. [DOI] [Google Scholar]
- 134.Moldwin A., Shehu A. Foundation Models for AI-Enabled Biological Design. arXiv. 20252505.11610 [Google Scholar]
- 135.Singh R. Natural Products and Drug Discovery. Elsevier; Amsterdam, The Netherlands: 2018. Chemotaxonomy of medicinal plants: Possibilities and limitations; pp. 119–136. [Google Scholar]
- 136.Spring O. Chemotaxonomy based on metabolites from glandular trichomes. Adv. Bot. Res. 2000;31:153–174. [Google Scholar]
- 137.Alston R.E., Mabry T.J., Turner B.L. Perspectives in Chemotaxonomy: Studies of secondary compounds in plants may provide knowledge of phylogenetic relationships. Science. 1963;142:545–552. doi: 10.1126/science.142.3592.545. [DOI] [PubMed] [Google Scholar]
- 138.Ramawat K.G. Biodiversity and Chemotaxonomy. Volume 24 Springer; Berlin/Heidelberg, Germany: 2019. [Google Scholar]
- 139.Wink M., Botschen F., Gosmann C., Schäfer H., Waterman P.G. Annual Plant Reviews Volume 40: Biochemistry of Plant Secondary Metabolism. Wiley; Hoboken, NJ, USA: 2010. Chemotaxonomy seen from a phylogenetic perspective and evolution of secondary metabolism; pp. 364–433. [Google Scholar]
- 140.Issaq H.J., Van Q.N., Waybright T.J., Muschik G.M., Veenstra T.D. Analytical and statistical approaches to metabolomics research. J. Sep. Sci. 2009;32:2183–2199. doi: 10.1002/jssc.200900152. [DOI] [PubMed] [Google Scholar]
- 141.Huie C.W. A review of modern sample-preparation techniques for the extraction and analysis of medicinal plants. Anal. Bioanal. Chem. 2002;373:23–30. doi: 10.1007/s00216-002-1265-3. [DOI] [PubMed] [Google Scholar]
- 142.Miralles A., Bruy T., Wolcott K., Scherz M.D., Begerow D., Beszteri B., Bonkowski M., Felden J., Gemeinholzer B., Glaw F. Repositories for taxonomic data: Where we are and what is missing. Syst. Biol. 2020;69:1231–1253. doi: 10.1093/sysbio/syaa026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Kumar A., Kumar S., Komal, Ramchiary N., Singh P. Role of traditional ethnobotanical knowledge and indigenous communities in achieving sustainable development goals. Sustainability. 2021;13:3062. doi: 10.3390/su13063062. [DOI] [Google Scholar]
- 144.Campbell M.M., Sederoff R.R. Variation in lignin content and composition (mechanisms of control and implications for the genetic improvement of plants) Plant Physiol. 1996;110:3. doi: 10.1104/pp.110.1.3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Polatoglu K. “Chemotypes”–a fact that should not be ignored in natural product studies. Nat. Prod. J. 2013;3:10–14. doi: 10.2174/2210315511303010004. [DOI] [Google Scholar]
- 146.Hegeman A.D. Plant metabolomics—Meeting the analytical challenges of comprehensive metabolite analysis. Brief. Funct. Genom. 2010;9:139–148. doi: 10.1093/bfgp/elp053. [DOI] [PubMed] [Google Scholar]
- 147.Wolfender J.-L., Marti G., Thomas A., Bertrand S. Current approaches and challenges for the metabolite profiling of complex natural extracts. J. Chromatogr. A. 2015;1382:136–164. doi: 10.1016/j.chroma.2014.10.091. [DOI] [PubMed] [Google Scholar]
- 148.Rosita A.S., Shivani J., Saji A., Sundar M.B., Begum T.N., Thilagavathi G., Sumithra A., Anand A.V. Biotechnology, Multiple Omics, and Precision Breeding in Medicinal Plants. CRC Press; Boca Raton, FL, USA: 2025. Bioinformatics Tools for Research on Plant Secondary Metabolites; pp. 177–195. [Google Scholar]
- 149.Marco-Ramell A., Palau-Rodriguez M., Alay A., Tulipani S., Urpi-Sarda M., Sanchez-Pla A., Andres-Lacueva C. Evaluation and comparison of bioinformatic tools for the enrichment analysis of metabolomics data. BMC Bioinform. 2018;19:1. doi: 10.1186/s12859-017-2006-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 150.Zammit G., Zammit M.G., Buttigieg K.G. Emerging technologies for the discovery of novel diversity in cyanobacteria and algae and the elucidation of their valuable metabolites. Diversity. 2023;15:1142. doi: 10.3390/d15111142. [DOI] [Google Scholar]
- 151.Nazarenko D.V., Kharyuk P.V., Oseledets I.V., Rodin I.A., Shpigun O.A. Machine learning for LC–MS medicinal plants identification. Chemom. Intell. Lab. Syst. 2016;156:174–180. doi: 10.1016/j.chemolab.2016.06.003. [DOI] [Google Scholar]
- 152.Pomyen Y., Wanichthanarak K., Poungsombat P., Fahrmann J., Grapov D., Khoomrung S. Deep metabolome: Applications of deep learning in metabolomics. Comput. Struct. Biotechnol. J. 2020;18:2818–2825. doi: 10.1016/j.csbj.2020.09.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 153.Afendi F.M., Okada T., Yamazaki M., Hirai-Morita A., Nakamura Y., Nakamura K., Ikeda S., Takahashi H., Altaf-Ul-Amin M., Darusman L.K. KNApSAcK family databases: Integrated metabolite–plant species databases for multifaceted plant research. Plant Cell Physiol. 2012;53:e1. doi: 10.1093/pcp/pcr165. [DOI] [PubMed] [Google Scholar]
- 154.Lephatsi M.M. Ph.D. Thesis. University of Johannesburg; Johannesburg, South Africa: 2024. Metabolomics and 4IR Technologies for Natural Products Research: Characterization of the Phytochemistry and Biochemistry of Helichrysum Species with Potential Anti-Cancer Activities. [Google Scholar]
- 155.James S.A., Soltis P.S., Belbin L., Chapman A.D., Nelson G., Paul D.L., Collins M. Herbarium data: Global biodiversity and societal botanical needs for novel research. Appl. Plant Sci. 2018;6:e1024. doi: 10.1002/aps3.1024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 156.Jamieson C.S., Misa J., Tang Y., Billingsley J.M. Biosynthesis and synthetic biology of psychoactive natural products. Chem. Soc. Rev. 2021;50:6950–7008. doi: 10.1039/D1CS00065A. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 157.Yao X., Wuzhang K., Peng B., Chen T., Zhang Y., Liu H., Li L., Fu X., Tang K. Engineering the expression of plant secondary metabolites-genistein and scutellarin through an efficient transient production platform in Nicotiana benthamiana L. Front. Plant Sci. 2022;13:994792. doi: 10.3389/fpls.2022.994792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 158.Mohammadi S., Saberidokht B., Subramaniam S., Grama A. Scope and limitations of yeast as a model organism for studying human tissue-specific pathways. BMC Syst. Biol. 2015;9:96. doi: 10.1186/s12918-015-0253-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 159.Hao D.-C., Xiao P.-G. Genomics and evolution in traditional medicinal plants: Road to a healthier life. Evol. Bioinform. 2015;11:EBO-S31326. doi: 10.4137/EBO.S31326. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data are contained within the article.





