Abstract
The numerous natural products and their bioactivity potentially afford an extraordinary resource for new drug discovery and have been employed in cancer treatment. However, the underlying pharmacological mechanisms of most natural anticancer compounds remain elusive, which has become one of the major obstacles in developing novel effective anticancer agents. Here, to address these unmet needs, we developed an anticancer herbs database of systems pharmacology (CancerHSP), which records anticancer herbs related information through manual curation. Currently, CancerHSP contains 2439 anticancer herbal medicines with 3575 anticancer ingredients. For each ingredient, the molecular structure and nine key ADME parameters are provided. Moreover, we also provide the anticancer activities of these compounds based on 492 different cancer cell lines. Further, the protein targets of the compounds are predicted by state-of-art methods or collected from literatures. CancerHSP will help reveal the molecular mechanisms of natural anticancer products and accelerate anticancer drug development, especially facilitate future investigations on drug repositioning and drug discovery. CancerHSP is freely available on the web at http://lsp.nwsuaf.edu.cn/CancerHSP.php.
Cancer has been declared to be one of the major global public health problems, and there is a disturbing increase in both incidence and mortality rates in recent year1. As a source for remedies, natural products have been recognized since ancient times and still constitute a substantial percentage of today’s new drugs and will promote drug discovery going forward2,3,4,5. To date, increasing natural products with anticancer effects were identified and isolated from herbs in last decades6. However, pharmacokinetics (the absorption, distribution, metabolism, and excretion (ADME) properties of a drug) and targets of these products were seldom fully evaluated owing to the time consuming and costing of the traditional methods, which have become one of major obstacles in developing novel effective anticancer agents7,8. As a holistic approach, computer-based systems approach integrates large scale experimental studies and computational analyses to provide a mechanistic understanding of drug action across multiple scales of complexity ranging from molecular and cellular levels to tissue and organism levels, leading to a new opportunity to overcome these obstacles9,10. Thus developing an anticancer natural products repository with systems methods will accelerate the development of modern medicine for cancer treatment.
Although tremendous efforts have been made in the past for providing databases containing cancer related information, to our knowledge, no such dedicated comprehensive repository of anticancer herbs and anticancer herb-originating natural products has been developed currently as yet. Some databases like CancerDR11 and CancerPPD12 have been made in the past for providing comprehensive data involved in anticancer ingredients. However, the CancerDR mainly focuses on FDA (US Food and Drug Administration) approved and experimental drugs, and CancerPPD is a database of anticancer peptides and proteins. Considering the bleak situation of cancer and absence of systematic database for anticancer herbal products, for the first time, we have developed a comprehensive repository named anticancer herbs database of systems pharmacology (CancerHSP).
CancerHSP records anticancer herbal medicines with anticancer compounds. Further more, holistic evaluation were performed for each compound. At molecular level, protein targets of the ingredients are obtained through computational predictions and literatures. At cellular level, anticancer activities of these compounds based on cancer cell lines were obtained from literatures. After mapping cell lines to primary organ or tissue sites, the therapeutic effects of each compound were emerging at the organ or tissue level. For whole body level, pharmacokinetic ADME parameters are provided by a series of computational methods. CancerHSP will not only provide more effective new therapies, but also accelerate the process of drug resistance study, drug repositioning and even anticancer drug discovery.
Results
Data
As a comprehensive anticancer herbs database of systems pharmacology, CancerHSP were consisted of six major parts of data: (1) anticancer herbs, (2) anticancer ingredients for each herb, (3) targets with action mode for ingredients, (4) anticancer activities for ingredients based on cancer cell lines, (5) primary sites (organ or tissue) of cell lines, and (6) pharmacokinetic properties (ADME) of ingredients (Fig. 1). All anticancer herbs, anticancer ingredients for herbs and anticancer activities for all ingredients were compiled from research articles and book chapters.
The current release of CancerHSP has recorded 2439 anticancer herbal medicines and 3575 anticancer ingredients coupled with anticancer activities based on 492 different cancer cell lines. These cell lines are distributed in 21 tissues and organs (Fig. 2). Further, 13 protein targets on average and nine key ADME parameters for each ingredient are predicted by state-of-art methods (see methods).
Database description and utility
Search
In our web server, we have provided a user-friendly search tool for exploring CancerHSP. Users can search the information by herbal name, chemical name, InChIKey, CAS number, target name, and bioactivity in the search box at the CancerHSP homepage (Fig. 3). The results in search options come in the form of a table, which displays details in initial option as selected. For example, when users select “Chemical name” option in the search box and use “Taxol” as a keyword for searching, the browser will display the page of chemical entries which contains the highlighted “taxol” substring in the chemical names. Particularly, CancerHSP provides a very useful gadget which is a feasible way to filter and sort data, thus user could easily find the full matched entry in this page. After clicking the “taxol” in “Name & synonyms” column, molecule information page of taxol will be visible, where molecule structure, pharmacological and molecular properties, anticancer activities, targets, related herbs, references, etc., were well organized and displayed. In addition, user can easily visit other pages such as the information of herbs and targets by the inner hyperlink provided in this page (Fig. 3). The search for other keywords option can be similarly done as mentioned above. In addition, a case study was provided in CancerHSP to illustrate how to use this database to help uncover the underlying anticancer mechanisms of natural anticancer compounds (http://lsp.nwsuaf.edu.cn/load_intro.php?site=CancerHSP&id=60).
Browse
We have also introduced a powerful browsing option, which can provide an overall view on cell line data. Similar to result page of searching, the browsing page also allows users to sort and filter the result in every column of table. The primary organ or tissue sites of cell lines were provided in this page. In addition, the cell lines in browsing page are internal linked to result page of searching and partially external linked to Cancer Cell Line Encyclopedia (CCLE) database (Fig. 3), where the users could obtain more detailed information of the cell lines.
Download and update
As a public-facing database, CancerHSP provides a “Download” page where the entire database can be downloaded for further analysis. To maintain CancerHSP comprehensive and up to date, a submission interface was provided where users can submit his/her own data concerning to natural products for cancer treatment. However, to ensure accuracy of the submission, we will scrutinize the authentication of them.
Discussion
To date, increasing herbal anticancer products were identified, isolated and characterized from herbs in last decades and constitute a substantial percentage of today’s new drugs. Unfortunately, most of these drugs were impeded by the major problems including narrow therapeutic index, strong side effects and drug resistance, which could reduce life quality of cancer survivors13. The most important reasons are unknown targets, poor absorption or rapid metabolism or excretion of a drug, resulting in unintended off-targets and low drug concentration levels in serum14,15. One of the useful way to develop a successful anticancer drug is to gather these information when the lead compound was found. However, to our knowledge, there is no such repository have been developed so far, which systematically provides information of anticancer herbs, anticancer ingredients, targets and ADME parameters of each ingredient. Inspired by the process of absorbed drugs and success of systems pharmacology (Fig. 1), we have developed CancerHSP which is the first step on this direction for systematic collection and evaluation of efficient natural products for cancer.
CancerHSP integrates large scale experimental studies and computational analyses to provide a mechanistic understanding of drug action across multiple scales of complexity ranging from molecular and cellular levels to tissue and organism levels. It can be used to identify targets for new drugs, study drug repositioning, and evaluate drug side effects and adverse events. Availability of natural products in the database for development of innovative combination cancer therapy regimens will provide a benchmark for the resolution of the cancer therapy translational research enterprise.
In all cancer treatment cases, the effectiveness of the treatment is directly related to the therapeutic ability to targets and to kill the cancer cells while affecting as few healthy cells as possible16. The data concerning to effect on healthy cells are also of importance. But there are insufficient data concerning to drug effects on normal cells and whole body, such as normal cells inhibit rate and life prolong rate in the current release of CancerHSP, due to the limitation of current research articles. We hope that much more work could be done for these kinds of data when an anticancer product was identified. Although we have included the most recent data from literature in CancerHSP, in order to maintain it comprehensive and up to date, we will incorporate the new data as soon as they will be available, and plan to update CancerHSP annually to incorporate new functions. In the future versions, more anticancer compounds and experimental information will be included, such as the drug sensitive, drug-drug interactions, drug metabolites and drug toxicities.
Methods
Data collection
In order to develop a comprehensive information resource of anticancer herbs and ingredients, research articles providing information related to keywords such as “herb”, “traditional Chinese medicine”, etc., coupled with keywords like “anticancer”, “cytotoxicity”, “anti-proliferation” and “apoptosis” were extracted from PubMed and Google scholar (as of 30th 10, 2014). After careful reading these articles, information related to herbs, herbal ingredients and anticancer activities based on cancer cell lines were manually extracted and compiled. Further, the primary sites of cell lines and external links to CCLE17 were integrated into CancerHSP.
Molecular information
The molecular structures of herbal ingredients were downloaded from PubChem18 and ChemSpider19, or produced by ISIS Draw 2.5 (MDL Information Systems, Inc.) and further optimized by Sybyl 6.9 (Tripos, Inc.) with Sybyl force field and default parameters20. InChIKey, a fixed length (25 character) condensed digital representation of the InChI (IUPAC International Chemical Identifier), was produced by Open Babel21 for each molecule. Further, for keeping uniformity, different format types of the chemical files were converted to SDF format by Open Babel, and the duplicates were removed according to InChIKey22. To facilitate searching, synonyms and CAS numbers for each molecule were obtained from PubChem and ChemSpider and included to CancerHSP.
Drug targets
Drug targeting was performed by two in-house tools: SysDT and WES. SysDT is a systematic tool that efficiently integrates the chemical, genomic, and pharmacological information for drug targeting and discovery on a large scale, based on two powerful methods of random forest and support vector machine8. WES is a novel computational model which has been constructed to detect drug direct targets on a large scale based on the newly developed weighted ensemble similarity method. In order to strictly assess the relationships between compounds and corresponding targets, the mode of action was predicted by another in-house tool preAM, which is an accurate model to classify drug-target interactions into different action modes, i.e. activation and inhibition, based on the robust random forest algorithm.
ADME properties
Analysis of ADME related pharmacokinetic properties, such as oral bioavailability (OB) and Caco-2 permeability (Caco-2), blood-brain barrier (BBB) and Lipinski’s rule of five (MW, AlogP, TPSA , Hdon, Hacc, RBN) were performed as previously described22. Detailed parameters’ information and calculation can be obtained from CancerHSP.
Database framework and web interface
CancerHSP is designed as a relational database on an apache server. All data were organized in a publicly available MySQL database as the back end (Fig. 4a), with a user-friendly web interface based on HTML, CSS, PHP and JavaScript programming languages as the front end (Fig. 4b).
Additional Information
How to cite this article: Tao, W. et al. CancerHSP: anticancer herbs database of systems pharmacology. Sci. Rep. 5, 11481; doi: 10.1038/srep11481 (2015).
Acknowledgments
Funding: This work was supported by the Fund of Northwest A & F University and was financially supported by the National Natural Science Foundation of China [Grant number 31170796, 81373892] and New Century Excellent Talents in University of Ministry of Education of China.
Footnotes
Author Contributions W.T., B.L., Y.B., P.A.S., W.Z., Z.G., K.S., C.H., Y.F. and C.Z collected, organized and compiled the data. B.L., Y.B., J.M., T.P., Y.W. and Y.L. scrutinized the data. W.T. and B.L. wrote the manuscript and prepared figures. W.T., B.L. and S.G. developed the web interface and integrated the tools. Y.W. conceived the idea and coordinated the project.
References
- Siegel R., Naishadham D. & Jemal A. Cancer statistics, 2013. CA. Cancer J. Clin. 63, 11–30 (2013). [DOI] [PubMed] [Google Scholar]
- da Rocha A. B., Lopes R. M. & Schwartsmann G. Natural products in anticancer therapy. Curr. Opin. Pharmacol. 1, 364–369 (2001). [DOI] [PubMed] [Google Scholar]
- Newman D. J. & Cragg G. M. Natural Products As Sources of New Drugs over the 30 Years from 1981 to 2010. J. Nat. Prod. 75, 311–335 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu F. et al. Clustered patterns of species origins of nature-derived drugs and clues for future bioprospecting. Proc. Natl. Acad. Sci. USA 108, 12943–12948 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tao L. et al. Nature’s contribution to today’s pharmacopeia. Nat. Biotechnol. 32, 979–980 (2014). [DOI] [PubMed] [Google Scholar]
- Demain A. L. & Vaishnav P. Natural products for cancer chemotherapy. Microb. Biotechnol. 4, 687–699 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu X. et al. A novel chemometric method for the prediction of human oral bioavailability. Int. J. Mol. Sci. 13, 6964–6982 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu H. et al. A systematic prediction of multiple drug-target interactions from chemical, genomic, and pharmacological data. PLoS One 7, e37608 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berger S. I. & Iyengar R. Network analyses in systems pharmacology. Bioinformatics 25, 2466–2472 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berger S. I. & Iyengar R. Role of systems pharmacology in understanding drug adverse events. Wiley Interdiscip. Rev. Syst. Biol. Med. 3, 129–135 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumar R. et al. CancerDR: cancer drug resistance database. Sci. Rep. 3, 1445 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tyagi A. et al. CancerPPD: a database of anticancer peptides and proteins. Nucleic Acids Res. 43, D837–843 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wong A. S. T., Che C.-M. & Leung K.-W. Recent advances in ginseng as cancer therapeutics: a functional and mechanistic overview. Nat. Prod. Rep. 32, 256–272 (2015). [DOI] [PubMed] [Google Scholar]
- Gottesman M. M. Mechanisms of cancer drug resistance. Annu. Rev. Med. 53, 615–627 (2002). [DOI] [PubMed] [Google Scholar]
- Lounkine E. et al. Large-scale prediction and testing of drug activity on side-effect targets. Nature 486, 361–367 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brannon-Peppas L. & Blanchette J. O. Nanoparticle and targeted systems for cancer therapy. Adv. Drug Deliv. Rev. 56, 1649–1659 (2004). [DOI] [PubMed] [Google Scholar]
- Barretina J. et al. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483, 603–607 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Q., Cheng T., Wang Y. & Bryant S. H. PubChem as a public resource for drug discovery. Drug Discov. Today 15, 1052–1057 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pence H. E. & Williams A. ChemSpider: An Online Chemical Information Resource. J. Chem. Educ. 87, 1123 (2010). [Google Scholar]
- Li X. et al. A system-level investigation into the mechanisms of Chinese Traditional Medicine: Compound Danshen Formula for cardiovascular disease treatment. PLoS One 7, e43918 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Boyle N. M. et al. Open Babel: An open chemical toolbox. J. Cheminform. 3, 33 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ru J. et al. TCMSP: a database of systems pharmacology for drug discovery from herbal medicines. J. Cheminform. 6, 13 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]