Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2018 Oct 24;47(Database issue):D1118–D1127. doi: 10.1093/nar/gky965

CMAUP: a database of collective molecular activities of useful plants

Xian Zeng 1,2, Peng Zhang 2, Yali Wang 2, Chu Qin 2, Shangying Chen 2, Weidong He 2, Lin Tao 2,3, Ying Tan 1, Dan Gao 1, Bohua Wang 4,5, Zhe Chen 3, Weiping Chen 4,, Yu Yang Jiang 1,, Yu Zong Chen 2,
PMCID: PMC6324012  PMID: 30357356

Abstract

The beneficial effects of functionally useful plants (e.g. medicinal and food plants) arise from the multi-target activities of multiple ingredients of these plants. The knowledge of the collective molecular activities of these plants facilitates mechanistic studies and expanded applications. A number of databases provide information about the effects and targets of various plants and ingredients. More comprehensive information is needed for broader classes of plants and for the landscapes of individual plant’s multiple targets, collective activities and regulated biological pathways, processes and diseases. We therefore developed a new database, Collective Molecular Activities of Useful Plants (CMAUP), to provide the collective landscapes of multiple targets (ChEMBL target classes) and activity levels (in 2D target-ingredient heatmap), and regulated gene ontologies (GO categories), biological pathways (KEGG categories) and diseases (ICD blocks) for 5645 plants (2567 medicinal, 170 food, 1567 edible, 3 agricultural and 119 garden plants) collected from or traditionally used in 153 countries and regions. These landscapes were derived from 47 645 plant ingredients active against 646 targets in 234 KEGG pathways associated with 2473 gene ontologies and 656 diseases. CMAUP (http://bidd2.nus.edu.sg/CMAUP/) is freely accessible and searchable by keywords, plant usage classes, species families, targets, KEGG pathways, gene ontologies, diseases (ICD code) and geographical locations.

INTRODUCTION

Many functionally useful plants have been extensively investigated and explored for benefiting humans. Herb-based indigenous medicines and medicinal plants have been and are continuously being used as therapeutics by huge population in the world (1–4). Food and edible plants have been studied for increased food productivity or nutrition yield under different environmental conditions (5,6). Their health-beneficiary (7,8) and toxic (9) effects have been studied for appropriate applications and safe uses. Expanding number of novel garden plants have been discovered or engineered by such methods as genetic manipulations (10).

The broader explorations of the functionally useful plants have been facilitated by the molecular and systems biology studies of their functional mechanisms. For instance, the mechanisms of the synergistic combinations of multiple herbal ingredients (11) and the traditionally used synergistic herb-pairs (12) in traditional Chinese medicine (TCM) have been revealed by the analysis of their molecular interaction profiles and chemical structural similarity landscapes from the perspectives of systems biology. The regulatory roles of the metabolic pathways and transporter genes of food plants have been investigated and manipulated for increased food productivity particularly in the hotter and drier environment (5) and under soil salinity conditions (6). The functions and the expression of the selected combinations of genes have been studied for deriving salt-resistant crops (6). Exotic garden plants have been produced by the alteration of the gene expression levels in specific biosynthetic pathways (10).

Such investigations and developmental efforts can be facilitated by the availability of the information about individual plant’s collective activities and multiple targets of its ingredients, and the regulated biological pathways, biological processes and diseases for more diverse classes of medicinal, food, edible, garden, and other functionally useful plants. Established plant databases (e.g. Plants of the World Online (http://www.plantsoftheworldonline.org), The Plant List (http://www.theplantlist.org), the PLANTS database (https://plants.sc.egov.usda.gov) and the Plant Trait Database (https://www.try-db.org/TryWeb/Home.php)) provide comprehensive information about the characteristics, taxonomy and resources of large number plants, but the plant ingredient and activity information are unincluded. Specialized databases provide additional information about medicinal plants (e.g. KANpSAcK (13), IMPPAT (14), BATMA-TCM (15), NuBBE (16), TM-MC (17), TCM@Taiwan (18), TCMID (19), TIPdb-3D (20), TCMSP (21), TCM-ID (22) and TCMAnalyzer (23), food and edible plants (e.g. KANpSAcK (13) and Nutrichem (24), ePlantLIBRA (25), and the species that produce bioactive compounds (e.g. NPASS (26) and HIT (27).

Some of these databases provide the information about the targets or activities of plant ingredients. In particular, TCMSP (21) and TCMAnalyzer (23) enable convenient access of the multiple targets of individual medicinal plants used in TCM and the regulated diseases by these plants. However, the targets are not grouped into functional categories and no regulated pathways and biological processes are given. These groupings and the systems-level information are highly useful for deciphering the complex multi-target activities of multiple ingredients and the underlying mechanisms of the functions of the individual plants. Moreover, these two databases cover a limited number (499 and 618, respectively) of TCM medicinal plants, lack the medicinal plants from other traditional medicines (e.g. Indian Ayurveda and Japanese Kampo) or used by folks of other regions (e.g. European and African countries) and are without coverage of other classes of functionally useful plants such as food, edible and garden plants are not covered. These limitations restrict our ability to analyze the collective activities and the functional implications of the multiple ingredients of individual plants from more diverse plant groups.

Hence, there is a need for a resource to conveniently access the information about individual plant’s multiple targets and collective activities of its ingredients, and the regulated biological pathways, biological processes and diseases for more diverse ranges of functionally useful plants. To complement the existing plant databases for catering this need, we developed a new database, Collective Molecular Activities of Useful Plants (CMAUP), to provide the collective landscapes of multiple targets (grouped into ChEMBL target classification classes, e.g. G Protein-Coupled Receptors), activity profiles (displayed in 2D target-ingredient heatmap), gene ontologies (grouped into gene ontology GO categories, e.g. GO:0002376 immune system process), biological pathways (grouped into KEGG pathway categories top and second level, e.g. ‘Environmental Information Processing’ and ‘Signal transduction’) and diseases (grouped into the international classification of disease ICD-10 blocks) for the classes of medicinal, food, edible, agricultural (exclude food and edible) and garden plants. The relevant information was extracted from the literature and the established internet resources of functionally useful plants, natural products, activities of natural products, proteins, gene ontologies, biological pathways and diseases. The facilities were set-up for convenient access of CMAUP by multiple search modes including keywords, plant usage classes, species families, targets, KEGG pathways, gene ontologies, diseases (ICD code) and geographical locations.

DATA COLLECTION AND PROCESSING

The functionally useful plants were searched from the internet sources, books and literature database PubMed (28) by using the keywords ‘medicinal’, ‘food’, ‘edible’, ‘agricultural’ and ‘garden’ in combination with the keyword ‘plant’ or ‘plants’. The searched sources must be scientific journals, credible databases, books or official reports of government agencies, botanic gardens, international scientist networks and charitable organizations. Only those plants with the experimentally determined target and quantitative activity value were included in CMAUP. The geographic location of each plant is the location of plant collection or an indigenous medicine reported by the source or literature. The ingredients of the searched plants were from the literature (1562 research articles), published natural product databases or plant databases that provide the ingredients of individual plants reported in a publication.

The protein targets and the quantitative activity values of these plant ingredients were from the literatures and published databases that describe the experimentally determined targets and activity values of natural products reported in a research article. The quantitative activity value may be measured as half-maximal inhibition concentration IC50, equilibrium inhibition constant Ki, half-maximal effective concentration EC50, half-maximal activity concentration AC50, half-maximal growth inhibitory concentration GI50 and potency of half-maximal effect. The disease and its ICD-10 code regulated by each protein target was(were) from Therapeutic Target Database (TTD) (29) and the corresponding ChEMBL target classification class was from ChEMBL (30). The GO category and KEGG pathway affiliated with each target were generated from the enrichment tools GOATOOLS (31) and Enrichr python package (32). Crosslinks to the relevant databases (Plants of the World Online (http://www.plantsoftheworldonline.org), NCBI Taxonomy (33), PubChem (34), ZINC (35), ChEMBL (30), UniProt, (36), QuickGO (37), KEGG (38), TTD (29), PubMed (28) and doi.org) were curated and provided.

The 2D target-ingredient heatmap was created as follows: an activity matrix M of the ingredient-target activities was created with the matrix element Mij representing the activity value of a plant ingredient i against a protein target j. The ‘heatmap.2’ R package was used to cluster the activity matrix M with respect to the targets using the default parameters (Distance method: ‘Euclidean’; Hierarchical cluster linkage method: ‘complete’) (https://cran.r-project.org/web/packages/gplots/gplots.pdf). The activity matrix after hierarchical clustering was displayed on the web interface by using the ‘HighCharts’ heatmap JavaScript library (https://www.highcharts.com).

DATABASE CONTENTS, STRUCTURE AND ACCESS

CMAUP (freely accessible at http://bidd2.nus.edu.sg/CMAUP/, the main webpage in Figure 1) currently contains 5645 plants (references in Supplementary Table S1), 70.2 and 99.9% of which are with identifiable taxonomic information at species and genus level, respectively, which were obtained through matching species name against NCBI Taxonomy database (33). The 5645 plants include 2567 medicinal, 170 food, 1567 edible (exclude food plants), 3 agricultural (exclude food and edible plants) and 119 garden plants collected from or medically used in 153 countries/regions. Medicinal plants include the herbs of indigenous medicines such as TCM, traditional Indian medicines (folk, Ayurveda, Siddha, Unani, Sowa-Rigpa, Homeopathy) and Japanese Kampo and the plants used for medicinal purposes in various 26 Asian, 19 African, 23 European, 8 North and South American and 3 Oceania countries, which were from a number of established databases summarized in Supplementary Table S1.

Figure 1.

Figure 1.

The homepage of CMAUP web interface. The webpage allows users to search plants by keywords (e.g. plant name, target, disease, pathway), search plants by ingredients, browse plants by plant usage classes and other options, and access plants by the interactive world map.

Food plants are the plants used as foods, flavours or preservatives by sizable populations or are edible with higher nutritional values, which were primarily from the European Tea Committee and European Herbal Infusions Association’s report ‘THIE Inventory List of Herbals Considered as Food’, the book chapter ‘Economic Botany- Plants as Food, and the websites of Food Plants International (http://foodplantsinternational.com) and the Gardener’s network (http://gardenersnet.com/gardening/plantbotanicalnames.htm#vegetables). Edible plants (exclude food plants) are the plants used by small population of folks as foods, flavours and preservatives and with less nutritional values, which were mainly from the Plants For A Future (https://pfaf.org) and KANpSAcK (13) databases. Agricultural plants (exclude food and edible plants) are the plants grown by humans for commercial purposes (e.g. timber and rubber), which were from European Commission’s Plant variety database (http://ec.europa.eu/food/plant/plant_propagation_material/ plant_variety_catalogues_databases/search/public/index.cfm), Plant Encyclopedia (https://www.bhg.com/gardening/plant-dictionary), and Wiki pages. Garden plants are for public, commercial and home gardens and landscapes, which were from the gardener’s network (http://gardenersnet.com/gardening/plantbotanicalnames.htm#flowers), Cornell University Home Garden-Based Learning (http://www.gardening.cornell.edu/homegardening), Japanese Garden Plants (http://www.jgarden.org/plants.asp) and Wiki Garden plants of Europe (https://en.wikipedia.org/wiki/Category:Garden_plants_of_Europe).

There are 6892 and 2275 ingredients from 5645 plants that are with both a target and experimentally measured activity value of IC50, Ki, EC50, AC50, GI50 or potency <10 μM (strong and moderate potency) and 10–50 μM (low potency), respectively. The 6892 strong and moderate potency ingredients are active against 646 human protein targets, which are grouped into 73 ChEMBL target classification level-3 classes (e.g. GSK3B in the ‘Protein Kinase’ class). The 646 targets are affiliated with 234 KEGG pathways, which are classified into the 7 top-level categories (e.g. mTOR signaling pathway in the ‘Environmental Information Processing’ category) and 39 second-level categories (e.g. mTOR signaling pathway in the ‘Signal transduction’ category) based on KEGG Orthology. The 646 targets are associated with 2473 gene ontologies, which are grouped into 41 GO categories. Also, 332 of these targets are associated with 656 diseases (a target-disease association means that the target has been reported as a therapeutic target for the disease treatment based on TTD), which are classified into 21 disease ICD-10 code blocks.

The 2275 experimentally determined low-potency ingredients and their targets are also provided in the respective plant ingredient page of CMAUP. Although some of these ingredients are expected to contribute to the collective activities and observed functions of their host plants, a large percentage of them unlikely make significant contribution. A comprehensive study of the synergistic combinations of sub-potent natural products has shown that only a small percentage of the synergistic combinations of sub-potent natural products produce drug-level potent activities (39). Therefore, additional information is needed for selecting those low-potency ingredients with experimentally confirmed or computationally demonstrated ability in producing significantly enhanced collective activities in combination with other ingredients of the same plant. The collection of such information is ongoing and expectedly takes some time. Hence, in the current version of CMAUP, the possible contribution of these ingredients to the collective activities of their host plants is tentatively un included until the completion of our collection works.

There are 47 645 known ingredients derived from the 5645 plants, 38 478 of them are without a reported activity value. To facilitate the investigations of these ingredients, CMAUP provides the experimentally determined targets of the molecules in the ChEMBL database (30) that are structurally similar to each of the 37 712 plant ingredients. These structurally similar molecules were generated by the similarity searching method, which is based on the hypothesis that structurally similar molecules tend to have similar activities (40). Hence, the targets of structurally similar molecules may indicate the potential targets or target classes of the plant ingredients. In our method, the chemical structures are represented by PubChem molecular fingerprints (41,42) and structural similarity between molecules is measured by Tanimoto coefficient (40,42). The targets of these structurally similar molecules are provided in the lower section of the target page of each plant.

For each user-searched plant, its ingredients are visualized in physiochemical property spaces and the reference of each individual plant-ingredient association is provided in the popup window of each chemical structure (Figure 2). The landscape of the multiple targets of its ingredients (Figure 3) is generated with a default target activity cut-off value of IC50, Ki, EC50, AC50, GI50 or potency ≤1 μM. This cut-off gives rise to a landscape of nanomolar range potent activities, which ensures the landscape can reveal some of the observable effects of the plant. It is noted that the synergistic effects of multiple natural products may involve moderate (e.g. sub-micromolar range) activities (39). Therefore, an optional cut-off value of IC50, Ki, EC50, AC50, GI50 or potency <10 μM was introduced for generating an alternative target landscape to cater the need for the study of the contribution of moderate as well as potent activities. CMAUP also provide the overview of the target activity landscapes across all the plant species. Users can access the webpage of this cross-species target landscape by clicking the relevant button in the target page of each individual plant. Moreover, the distribution of each individual ingredient across plant species is also provided in CMAUP. Users can access the webpage of this cross-species ingredient distribution profile by clicking the relevant button in the ingredient page of each individual plant.

Figure 2.

Figure 2.

Ingredients of a specific plant. (A) Chemical ingredients are shown in the graphs based on distributions of few important physiochemical properties (e.g. molecular weight, XlogP, number of hydrogen bond donor). (B) Chemical structure images are provided when available. The chemical information, reference of plant-ingredient association and distribution of the ingredient across plants can be accessed by click the ‘SEE DETAILS’ button (C).

Figure 3.

Figure 3.

An example webpage of protein targets of a specific plant. Target proteins are categorized into classes based on ChEMBL target classification. External links are provided in the table in the right pane.

The landscape of the target-ingredient activity profiles is further displayed by the 2D target-ingredient heatmap (Figure 4). In this heatmap, the activity values of the plant ingredients are clustered against their respective targets, with each red-colored small-block representing the activity value (unit of nM) of an individual ingredient against a target and the brightness of the red-color indicating the level of the activity (the higher activity with the darker color). Users can click the heatmap block to access the detailed information of the activities, including the referenced literature where the activity was reported. Based on the associations of the multiple targets of an individual plant with biological pathway, gene ontology and disease, the corresponding landscapes of the KEGG pathways (Figure 5), GO categories (Figure 6) and disease ICD-10 blocks (Figure 7) were also provided.

Figure 4.

Figure 4.

Activity landscape of a specific plant. The interactive heatmap allows users to access the detailed information of activity by clicking the individual cells.

Figure 5.

Figure 5.

KEGG pathways of a specific plant. The pie chart shows the distribution of pathways in each KEGG Orthology category. The bar chart presents the top 10 enriched pathways ordered by the adjusted P-values. The bottom table lists detailed information of enriched KEGG pathways.

Figure 6.

Figure 6.

Gene Ontology terms of a specific plant. Distributions of GO terms in biological process, molecular function and cellular component categories are displayed in the top pie charts. The bottom table lists detailed information about enriched GO terms.

Figure 7.

Figure 7.

Human diseases related to a specific plant. The upper pie chart shows the number of diseases in each WHO ICD-10 disease category. The below table lists detailed information about these diseases.

CMAUP entries can also be searched by plant ingredients. Multiple search options are provided. These include chemical name, chemical SMILES string and the user-drawing chemical structure by using an on-screen chemical structure drawing tool JSME (43) installed in the relevant structure search section of the CMAUP webpage. JSME is a well-established free molecule editor in JavaScript with multiple functionalities including substituent menu, copy/paste, drag, drop and undo/redo operations. The search options of SMILES strings and user-drawing structures are particularly useful for searching ingredients of unique chemical structures in the situation that an individual ingredient has multiple names or identifiers. Moreover, the structural search option returns additional set of ingredients that are structurally similar to the user-input molecule. These structurally similar ingredients are generated by a similarity searching (40) back-end engine wherein the chemical structures are represented by PubChem molecular fingerprints (41,42) and structural similarity between molecules is measured by Tanimoto coefficient (40,42).

CMAUP was developed on MySQL database and PHP server software. Its web-interfaces were built by using HTML, PHP and JavaScript, and were designed to enable the access of its entries by browsing or searching plant name, species family, plant usage class (e.g. medicinal, food, edible, garden plant), target gene name, target gene ontology (GO categories), target-regulated KEGG pathway and target-regulated disease. While applicable, the plant entries are cross-linked to xxx and NCBI Taxonomy database, ingredient entries are crosslinked to PubChem, ChEMBL, ZINC and SuperNatural (44) databases, and the target genes crosslinked to UniProt database. The references for the plant ingredients and their targets and activity values are also provided in the respective entry pages. All data can be freely and conveniently downloaded from the respective entry page. A download summary section provides hyperlinks for users to download data of interest by selecting specific data groups. Specifically, options are provided for users to download the data in the groups of all plants, all targets, all ingredients, all active ingredients, all plant-ingredient associations and all ingredient-target activity records, respectively.

PERSPECTIVES

Functionally useful plants have been extensively studied and explored for broader applications from multiple perspectives including the species phylogenies (45), phytochemical informatics (46), altered metabolisms by multi-gene and pathway manipulations (5,6), improved activities by multi-targeted systems regulations (11,39), reverse pharmacokinetics (47), CMAUP complements existing plant related databases (13–27) for serving the expanding needs in plant research and development efforts by providing convenient access of the information about individual plant’s multiple targets and collective activities of its ingredients, and the regulated biological pathways, biological processes and diseases for more diverse ranges of functionally useful plants.

The rich collections of activity profiles of the plant ingredients in CMAUP may be combined with the plant metabolomics data (48) for facilitating the annotation of plant metabolomics (49,50) and for the interpretation of metabolic data in plant natural product discovery (51). CMAUP data may also be useful for facilitating other studies. These include the study of the metabolite distribution across plant species (52), the analysis of the relationships of the plant species (53), phylogenies (54), phytochemicals (55) and functions (56) across geographical regions and environmental conditions. There is a need for more comprehensive literature mining for the multi-target activities of multiple ingredients of more diverse ranges of functionally useful plants, and the incorporation of these profiles into this and other relevant databases.

Supplementary Material

Supplementary Data

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

Shenzhen Municipal Government [JCYJ20160324163734374, JCYJ20170413113448742, 20151030A0610001]; Key Project of Zhejiang Province Ministry of Science and Technology [2015C03055]; Key Project of National Natural Science Foundation of China [81730108]. Funding for open access charge: The above acknowledged fundings will pay for the publication charges.

Conflict of interest statement. None declared.

REFERENCES

  • 1. Leonti M., Casu L.. Traditional medicines and globalization: current and future perspectives in ethnopharmacology. Front Pharmacol. 2013; 4:92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Furst R., Zundorf I.. Evidence-Based phytotherapy in Europe: Where do we stand. Planta Med. 2015; 81:962–967. [DOI] [PubMed] [Google Scholar]
  • 3. Leonti M., Verpoorte R.. Traditional Mediterranean and European herbal medicines. J. Ethnopharmacol. 2017; 199:161–167. [DOI] [PubMed] [Google Scholar]
  • 4. Buenz E.J., Verpoorte R., Bauer B.A.. The ethnopharmacologic contribution to bioprospecting natural products. Annu. Rev. Pharmacol. Toxicol. 2018; 58:509–530. [DOI] [PubMed] [Google Scholar]
  • 5. Yang X., Cushman J.C., Borland A.M., Edwards E.J., Wullschleger S.D., Tuskan G.A., Owen N.A., Griffiths H., Smith J.A., De Paoli H.C. et al. A roadmap for research on crassulacean acid metabolism (CAM) to enhance sustainable food and bioenergy production in a hotter, drier world. New Phytol. 2015; 207:491–504. [DOI] [PubMed] [Google Scholar]
  • 6. Roy S.J., Negrao S., Tester M.. Salt resistant crop plants. Curr. Opin. Biotechnol. 2014; 26:115–124. [DOI] [PubMed] [Google Scholar]
  • 7. Tosti V., Bertozzi B., Fontana L.. Health benefits of the mediterranean Diet: Metabolic and molecular mechanisms. J. Gerontol. A Biol. Sci. Med. Sci. 2018; 73:318–326. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Clark A.K., Haas K.N., Sivamani R.K.. Edible plants and their influence on the gut microbiome and acne. Int. J. Mol. Sci. 2017; 18:E1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Kristanc L., Kreft S.. European medicinal and edible plants associated with subacute and chronic toxicity part I: Plants with carcinogenic, teratogenic and endocrine-disrupting effects. Food Chem. Toxicol. 2016; 92:150–164. [DOI] [PubMed] [Google Scholar]
  • 10. Ono E., Fukuchi-Mizutani M., Nakamura N., Fukui Y., Yonekura-Sakakibara K., Yamaguchi M., Nakayama T., Tanaka T., Kusumi T., Tanaka Y.. Yellow flowers generated by expression of the aurone biosynthetic pathway. Proc. Natl. Acad. Sci. U.S.A. 2006; 103:11075–11080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Kim H.U., Ryu J.Y., Lee J.O., Lee S.Y.. A systems approach to traditional oriental medicine. Nat. Biotechnol. 2015; 33:264–268. [DOI] [PubMed] [Google Scholar]
  • 12. Yue S.J., Xin L.T., Fan Y.C., Li S.J., Tang Y.P., Duan J.A., Guan H.S., Wang C.Y.. Herb pair Danggui-Honghua: mechanisms underlying blood stasis syndrome by system pharmacology approach. Sci. Rep. 2017; 7:40318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Nakamura Y., Afendi F.M., Parvin A.K., Ono N., Tanaka K., Hirai Morita A., Sato T., Sugiura T., Altaf-Ul-Amin M., Kanaya S.. KNApSAcK metabolite activity database for retrieving the relationships between metabolites and biological activities. Plant Cell Physiol. 2014; 55:e7. [DOI] [PubMed] [Google Scholar]
  • 14. Mohanraj K., Karthikeyan B.S., Vivek-Ananth R.P., Chand R.P.B., Aparna S.R., Mangalapandi P., Samal A.. IMPPAT: A curated database of Indian Medicinal Plants, Phytochemistry And Therapeutics. Sci. Rep. 2018; 8:4329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Liu Z., Guo F., Wang Y., Li C., Zhang X., Li H., Diao L., Gu J., Wang W., Li D. et al. BATMAN-TCM: a bioinformatics analysis tool for molecular mechANism of traditional chinese medicine. Sci. Rep. 2016; 6:21146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Valli M., dos Santos R.N., Figueira L.D., Nakajima C.H., Castro-Gamboa I., Andricopulo A.D., Bolzani V.S.. Development of a natural products database from the biodiversity of Brazil. J. Nat. Prod. 2013; 76:439–444. [DOI] [PubMed] [Google Scholar]
  • 17. Kim S.K., Nam S., Jang H., Kim A., Lee J.J.. TM-MC: a database of medicinal materials and chemical compounds in Northeast Asian traditional medicine. BMC Complement. Altern. Med. 2015; 15:218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Chen C.Y. TCM Database@Taiwan: the world's largest traditional Chinese medicine database for drug screening in silico. PLoS One. 2011; 6:e15939. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Huang L., Xie D., Yu Y., Liu H., Shi Y., Shi T., Wen C.. TCMID 2.0: a comprehensive resource for TCM. Nucleic. Acids. Res. 2018; 46:D1117–D1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Tung C.W., Lin Y.C., Chang H.S., Wang C.C., Chen I.S., Jheng J.L., Li J.H.. TIPdb-3D: the three-dimensional structure database of phytochemicals from Taiwan indigenous plants. Database (Oxford). 2014; 2014:bau055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Ru J., Li P., Wang J., Zhou W., Li B., Huang C., Li P., Guo Z., Tao W., Yang Y. et al. TCMSP: a database of systems pharmacology for drug discovery from herbal medicines. J. Cheminform. 2014; 6:13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Chen X., Zhou H., Liu Y.B., Wang J.F., Li H., Ung C.Y., Han L.Y., Cao Z.W., Chen Y.Z.. Database of traditional Chinese medicine and its application to studies of mechanism and to prescription validation. Br. J. Pharmacol. 2006; 149:1092–1103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Liu Z., Du J., Yan X., Zhong J., Cui L., Lin J., Zeng L., Ding P., Chen P., Zhou X. et al. TCMAnalyzer: A Chemo- and bioinformatics web service for analyzing traditional chinese medicine. J. Chem. Inf. Model. 2018; 58:550–555. [DOI] [PubMed] [Google Scholar]
  • 24. Jensen K., Panagiotou G., Kouskoumvekaki I.. NutriChem: a systems chemical biology resource to explore the medicinal value of plant-based foods. Nucleic. Acids. Res. 2015; 43:D940–D945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Plumb J., Lyons J., Norby K., Thomas M., Norby E., Poms R., Bucchini L., Restani P., Kiely M., Finglas P. et al. ePlantLIBRA: A composition and biological activity database for bioactive compounds in plant food supplements. Food Chem. 2016; 193:121–127. [DOI] [PubMed] [Google Scholar]
  • 26. Zeng X., Zhang P., He W., Qin C., Chen S., Tao L., Wang Y., Tan Y., Gao D., Wang B. et al. NPASS: natural product activity and species source database for natural product research, discovery and tool development. Nucleic. Acids. Res. 2018; 46:D1217–D1222. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Ye H., Ye L., Kang H., Zhang D., Tao L., Tang K., Liu X., Zhu R., Liu Q., Chen Y.Z. et al. HIT: linking herbal active ingredients to targets. Nucleic. Acids. Res. 2011; 39:D1055–D1059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Coordinators, N.R. Database resources of the National Center for Biotechnology Information. Nucleic. Acids. Res. 2018; 46:D8–D13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Li Y.H., Yu C.Y., Li X.X., Zhang P., Tang J., Yang Q., Fu T., Zhang X., Cui X., Tu G. et al. Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics. Nucleic. Acids. Res. 2018; 46:D1121–D1127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Gaulton A., Hersey A., Nowotka M., Bento A.P., Chambers J., Mendez D., Mutowo P., Atkinson F., Bellis L.J., Cibrian-Uhalte E. et al. The ChEMBL database in 2017. Nucleic. Acids. Res. 2017; 45:D945–D954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Klopfenstein D.V., Zhang L., Pedersen B.S., Ramirez F., Warwick Vesztrocy A., Naldi A., Mungall C.J., Yunes J.M., Botvinnik O., Weigel M. et al. GOATOOLS: A python library for gene ontology analyses. Sci. Rep. 2018; 8:10872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Kuleshov M.V., Jones M.R., Rouillard A.D., Fernandez N.F., Duan Q., Wang Z., Koplev S., Jenkins S.L., Jagodnik K.M., Lachmann A. et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic. Acids. Res. 2016; 44:W90–W97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Federhen S. Type material in the NCBI Taxonomy Database. Nucleic. Acids. Res. 2015; 43:D1086–D1098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Kim S., Thiessen P.A., Bolton E.E., Chen J., Fu G., Gindulyte A., Han L., He J., He S., Shoemaker B.A. et al. PubChem substance and compound databases. Nucleic. Acids. Res. 2016; 44:D1202–D1213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Sterling T., Irwin J.J.. ZINC 15–Ligand discovery for everyone. J. Chem. Inf. Model. 2015; 55:2324–2337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. UniProt Consortium, T. UniProt: the universal protein knowledgebase. Nucleic. Acids. Res. 2017; D1:D158–D169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Binns D., Dimmer E., Huntley R., Barrell D., O’Donovan C., Apweiler R.. QuickGO: a web-based tool for Gene Ontology searching. Bioinformatics. 2009; 25:3045–3046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Kanehisa M., Furumichi M., Tanabe M., Sato Y., Morishima K.. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic. Acids. Res. 2017; 45:D353–D361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Qin C., Tan K.L., Zhang C.L., Tan C.Y., Chen Y.Z., Jiang Y.Y.. What does it take to synergistically combine sub-potent natural products into drug-level potent combinations. PLoS One. 2012; 7:e49969. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Willett P. Similarity-based virtual screening using 2D fingerprints. Drug Discov. Today. 2006; 11:1046–1053. [DOI] [PubMed] [Google Scholar]
  • 41. Cheng T., Pan Y., Hao M., Wang Y., Bryant S.H.. PubChem applications in drug discovery: a bibliometric analysis. Drug Discov. Today. 2014; 19:1751–1756. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Zhang C., Tao L., Qin C., Zhang P., Chen S., Zeng X., Xu F., Chen Z., Yang S.Y., Chen Y.Z.. CFam: a chemical families database based on iterative selection of functional seeds and seed-directed compound clustering. Nucleic. Acids. Res. 2015; 43:D558–D565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Bienfait B., Ertl P.. JSME: a free molecule editor in JavaScript. J Cheminform. 2013; 5:24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Banerjee P., Erehman J., Gohlke B.O., Wilhelm T., Preissner R., Dunkel M.. Super Natural II–a database of natural products. Nucleic. Acids. Res. 2015; 43:D935–D939. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Saslis-Lagoudakis C.H., Savolainen V., Williamson E.M., Forest F., Wagstaff S.J., Baral S.R., Watson M.F., Pendry C.A., Hawkins J.A.. Phylogenies reveal predictive power of traditional medicine in bioprospecting. Proc. Natl. Acad. Sci. U.S.A. 2012; 109:15835–15840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Ehrman T.M., Barlow D.J., Hylands P.J.. Phytochemical informatics of traditional Chinese medicine and therapeutic relevance. J. Chem. Inf. Model. 2007; 47:2316–2334. [DOI] [PubMed] [Google Scholar]
  • 47. Hao H., Zheng X., Wang G.. Insights into drug discovery from natural medicines using reverse pharmacokinetics. Trends Pharmacol. Sci. 2014; 35:168–177. [DOI] [PubMed] [Google Scholar]
  • 48. Bais P., Moon S.M., He K., Leitao R., Dreher K., Walk T., Sucaet Y., Barkan L., Wohlgemuth G., Roth M.R. et al. PlantMetabolomics.org: a web portal for plant metabolomics experiments. Plant Physiol. 2010; 152:1807–1816. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Iijima Y., Nakamura Y., Ogata Y., Tanaka K., Sakurai N., Suda K., Suzuki T., Suzuki H., Okazaki K., Kitayama M. et al. Metabolite annotations based on the integration of mass spectral information. Plant J. 2008; 54:949–962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Yang Z., Nakabayashi R., Okazaki Y., Mori T., Takamatsu S., Kitanaka S., Kikuchi J., Saito K.. Toward better annotation in plant metabolomics: isolation and structure elucidation of 36 specialized metabolites from Oryza sativa (rice) by using MS/MS and NMR analyses. Metabolomics. 2014; 10:543–555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Hur M., Campbell A.A., Almeida-de-Macedo M., Li L., Ransom N., Jose A., Crispin M., Nikolau B.J., Wurtele E.S.. A global approach to analysis and interpretation of metabolic data for plant natural product discovery. Nat. Prod. Rep. 2013; 30:565–583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Riach A.C., Perera M.V., Florance H.V., Penfield S.D., Hill J.K.. Analysis of plant leaf metabolites reveals no common response to insect herbivory by Pieris rapae in three related host-plant species. J. Exp. Bot. 2015; 66:2547–2556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Kreft H., Jetz W.. Global patterns and determinants of vascular plant diversity. Proc. Natl. Acad. Sci. U.S.A. 2007; 104:5925–5930. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Donoghue M.J. Colloquium paper: a phylogenetic perspective on the distribution of plant diversity. Proc. Natl. Acad. Sci. U.S.A. 2008; 105:11549–11555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Richards L.A., Dyer L.A., Forister M.L., Smilanich A.M., Dodson C.D., Leonard M.D., Jeffrey C.S.. Phytochemical diversity drives plant-insect community diversity. Proc. Natl. Acad. Sci. U.S.A. 2015; 112:10973–10978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Reich P.B., Walters M.B., Ellsworth D.S.. From tropics to tundra: global convergence in plant functioning. Proc. Natl. Acad. Sci. U.S.A. 1997; 94:13730–13734. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES