Skip to main content
Heliyon logoLink to Heliyon
. 2023 Aug 15;9(8):e19151. doi: 10.1016/j.heliyon.2023.e19151

Comprehensive survey of target prediction web servers for Traditional Chinese Medicine

Xia Ren a,b,1, Chun-Xiao Yan a,b,1, Run-Xiang Zhai a,b, Kuo Xu a,b, Hui Li a,b, Xian-Jun Fu a,b,c,
PMCID: PMC10468387  PMID: 37664753

Abstract

Traditional Chinese medicine (TCM) is characterized by multi-components, multiple targets, and complex mechanisms of action and therefore has significant advantages in treating diseases. However, the clinical application of TCM prescriptions is limited due to the difficulty in elucidating the effective substances and the lack of current scientific evidence on the mechanisms of action. In recent years, the development of network pharmacology based on drug systems research has provided a new approach for understanding the complex systems represented by TCM. The determination of drug targets is the core of TCM network pharmacology research. Over the past years, many web tools for drug targets with various features have been developed to facilitate target prediction, significantly promoting drug discovery. Therefore, this review introduces the widely used web tools for compound-target interaction prediction databases and web resources in TCM pharmacology research, and it compares and analyzes each web tool based on their basic properties, including the underlying theory, algorithms, datasets, and search results. Finally, we present the remaining challenges for the promising future of compound-target interaction prediction in TCM pharmacology research. This work may guide researchers in choosing web tools for target prediction and may also help develop more TCM tools based on these existing resources.

Keywords: Target prediction, Nature product, Traditional Chinese medicine, Web tools, Databases

Graphical abstract

Image 1

1. Introduction

New drug discovery is a complex, expensive, time-consuming, and challenging undertaking [1]. Drugs play a role in treating diseases by acting on specific targets and producing corresponding biological outcomes [2,3]. Drug targets generally refer to molecules in the human body and pathogens that can interact with drugs, including enzymes, receptors, ion channels, transporters, nucleic acids, and other molecules [4]. In addition, there are other therapeutic targets, such as protein-protein interactions and nucleic acid (DNA/RNA)-protein interactions. Traditional drug discovery has mainly focused on highly specific inhibitors with a single target, in which the most specific drugs acting on one particular target of individual illnesses are sought [5]. It is challenging to treat complex diseases effectively using drugs with a single target. Using monomer compounds to treat complex diseases makes obtaining an excellent curative effect difficult (see Fig. 1, Fig. 2, Fig. 3).

Fig. 1.

Fig. 1

Number of citations of each web serves.

Fig. 2.

Fig. 2

Application of SEA web servers: (A) journal publications; (B) Top 10 research areas; (C) Top 10 countries/regions where authors belong.

Fig. 3.

Fig. 3

Application of TCMSP web servers: (A) journal publications; (B) Top 10 research areas; (C) Top 10 countries/regions where authors belong.

Traditional Chinese medicine (TCM) has a long history in China and has played an essential role in the struggle between the Chinese nation and disease. It is the accumulation of much experience to form classic prescriptions with definitive clinical efficacy. TCM provides a rich material basis for discovering and developing modern drugs. TCM-based new drug development, especially for treating complex diseases, has gradually attracted increasingly more attention. However, due to the complex chemical compositions and modes of action of TCM, the understanding of TCM is still limited; the theoretical research of modern TCM lags, which hinders the development of modern TCM [6]. TCM includes various active compounds, and drug-drug interactions may occur among each combination, affecting the pharmacokinetics of drugs in vivo [7]. TCM's essence, concepts, and methods in treating diseases differ from those of modern medicine, which are discovered by targeting a specific protein. TCM plays its role through multiple targets and pathways. Therefore, correct identification and validation of drug-target interactions is the first step in activity prediction and mechanism elucidation of TCM. This highlights the critical role of identifying compound target interactions for the accuracy of specific TCMs.

There are many methods for target prediction, which are mainly divided into experimental and computational categories. The most direct method experimentally is the biochemical test, which directly evaluates the binding ability of small molecule compounds and corresponding proteins after incubation and elution to judge whether the small molecule binds to the target [8]. In addition, there are genetic and genomic approaches; for example, after locking the target of small molecules in a specific range of gene clusters, a series of point mutations or knockouts are performed on the gene clusters [9]. If the mutated or knock-out samples show the same effects as the small molecules, the protein encoded by the mutated or knock-out genes will likely be a target of the small molecules. The accuracy of the experimental method is relatively high; however, these experiments are costly and time-consuming, and a thorough search is not feasible because there are millions of drug-like compounds and hundreds of potential targets [10]. In recent years, computational methods that can quickly predict compound targets have emerged. Their strong development momentum could provide new potential drug-target interaction candidates for experimental biological validation and reduce the time and cost of biological experiments [11,12]. These methods are often called target identification, “in silico target prediction,” or “target fishing” [13,14]. Recently, increasingly more evidence indicates that a combination of target prediction and network pharmacology analyses is a feasible and powerful way to analyze the molecular mechanism of a drug and TCM [15,16].

In recent years, researchers have developed many machine learning methods to predict the targets of small molecules, many of which provide web services. The advantage of a web-based target prediction tool is that it is easy to operate and can be used even if you do not understand the underlying methods, which pharmaceutical chemists and biologists favor. In recent years, web-based target prediction tools have also been applied to TCM, which has played an important role in elucidating TCM's active substances and mechanisms of action. In this article, we first summarized the currently widely used drug-target interaction prediction databases and web resources for TCM pharmacology research, an area that is under tremendous development. Subsequently, we compared different web resources based on their basic properties, including fundamental theory, algorithms, and datasets. Third, we compared different web resources based on their basic properties and search results.

2. Public databases make drug-target data available

With the rapid development of bioinformatics, the number of drug-target interaction databases has increased in recent years, and they provide crucial support for TCM pharmacology research. These biological activity databases contain many chemical compositions, structure information, and activity data; provide free data retrieval for research; and deepen our understanding of the targets [14]. An overview of many of the significant public databases used in TCM pharmacology is presented in Table 1. These databases contain biological activity data verified through experiments of many compounds for many proteins. Their common goal is to integrate different data types to facilitate drug design and new drug development, which is also of great interest to pharmaceutical companies [17].

Table 1.

Overview of many public drug-target data resources available today. Due to the fast development of the area, this list does not claim to be exhaustive.

Databases Latest release Data Web address
ChEMBL [18] ChEMBL 32, released 03-01-2023 ChEMBL contains >2 million small molecules with >1.2 million bioactivity data points. https://www.ebi.ac.uk/chembl/g/
DrugBank [19] version 5.1.10, released 01-04-2023 DrugBank contains 15,441 drug entries, including 2739 approved small molecule drugs, 1575 approved biologics (proteins, peptides, vaccines, and allergenics), 134 nutraceuticals, and over 6716 experimental (discovery-phase) drugs. Additionally, 5294 non-redundant protein (i.e. drug target/enzyme/transporter/carrier) sequences are linked to these drug entries. http://www.drugbank.ca
Comparative Toxico- genomics Database [20] version 6.1, released 03-01-2023 CTD provide 51 million toxicogenomic relationships for over 17,100 chemicals, 54,300 genes, 6100 phenotypes, 7270 diseases and 202,000 exposure events, from 600 comparative species http://ctdbase.org/
Therapeutic Target Database [21] 2022 version, released 09-29-2021 TTD contains 5059 patented drugs (collected from 3145 patents of WIPO, USA, Europe, Japan, etc.) targeting 215 successful, 236 clinical trial and 207 patent-recorded targets (with the structures of 4774 patented drugs drawn and provided in.mol files, and with the target activities of 3388 patented drugs collected from literature). http://db.idrblab.net/ttd/
BindingDB [22] 2023 version, released 03-01-2023 BindingDB contains 41,328 Entries, each with a DOI, containing 2,680,191 binding data for 9000 protein targets and 1,153,558 small molecules. http://www.bindingdb.org/bind/index.jsp
TDR Targets Database [23] version 6.1, released 03-01-2021 TDR Targets Database contained 14,315 drug entries and 5239 targets. https://tdrtargets.org/
The IUPHAR/BPS Guide to PHARMACOLOGY [24] version 04-2022 The IUPHAR/BPS Guide to PHARMACOLOGY contains 11,596 ligands, 6219 targets proteins, and 49558 binding constants. http://www.guidetopharmacology.org/

Public drug-target databases vary in data size, type, and how they are implemented. Public databases such as ChEMBL [18], DrugBank [19], and TTD [25] continuously increase in size, with the ChEMBL latest release comprising approximately 2 million distinct small molecules and approximately 1.2 million bioactivity data points. In recent years, a remarkable development is that the size of databases has increased, and different types of data have begun to be integrated [26]. DrugBank and TTD link approved small molecules with their known and explored therapeutic proteins and pathways. CTD provides manually curated chemical-gene/protein interactions, and chemical-disease and gene-disease relationships [20]. Combined with functional and pathway data, these data help develop hypotheses about the underlying mechanisms of environmental effects on diseases. In addition, the update speed of these data is relatively fast. Except for SuperTarget, all the other databases had versions released after 2020, ensuring the timeliness of this database information. These databases provide a convenient method for researchers to find identified targets of compounds.

3. Web servers for target prediction

With the massive growth of drug target databases in recent years, the calculation tools for predicting small molecule protein targets have become more critical. The user inputs the molecular structure into the computational bioactivity model and generates the “prediction” target of the compound based on the “informed guess” of the database without experiments [27]. However, their use is limited by complex algorithms and specific operating software systems, even if the algorithm can be obtained as open-source material. It is encouraging that web-based technology provides an opportunity to solve the problem of using computer simulation target prediction methods. Some free web resources for assisted target prediction analysis have recently emerged. These web services bring great convenience to the research of TCM pharmacology. By consulting the literature, we summarized the target prediction databases commonly used for the pharmacology of TCMs. Table 2 lists websites that apply to the target prediction of small molecules. It is worth noting that many platforms, such as TCMID [28], TCMSP [29], and BATMAN-TCM [30], which combine TCM ingredients, properties, targets, and diseases based on TCM information, have been established in recent years. However, we have also noticed that some web services are inaccessible (see Table 3).

Table 2.

Public web services that can be used for the target prediction of small molecules.

Type Database Web address Year
Target prediction web servers SEA [31] http://sea.bkslab.org/ 2007
STITCH [32] http://stitch.embl.de/ 2008
PharmMapper [33] http://www.lilab-ecust.cn/pharmmapper/ 2010
idTarget [34] http://idtarget.rcas.sinica.edu.tw 2012
HitPick [35] http://mips.helmholtz-muenchen.de/hitpick/ 2013
ChemMapper [36] http://lilab-ecust.cn/chemmapper/ 2013
SwissTargetPrediction [37] http://www.swisstargetprediction.ch/index.php 2014
Superpred [38] http://prediction.charite.de/ 2014
Polypharmacology Browser 2 [39] http://ppb2.gdb.tools 2018
TCM related databases TCMID [28] http://www.megabionet.org/tcmid/ 2012
TCMSP [29] http://tcmspw.com/tcmsp.php 2014
BATMAN-TCM [30] http://bionet.ncpsb.org/batman-tcm/ 2016
ETCM [40] http://www.tcmip.cn/ETCM/index.php/Home/Index/index.html 2018
SymMap [41] https://www.symmap.org 2019
HERB [42] http://herb.ac.cn/ 2021
HIT2.0 [43] http://www.badd-cao.net:2345/ 2021
Not available TarFisDock [44] http://www.dddc.ac.cn/tarfisdock 2008
DRAR-CP [45] https://cpi.bio-x.cn/drar/?page=home 2011
ChemProt [46] http://www.cbs.dtu.dk/services/ChemProt/ 2011
TCMAnalyzer [47] http://www.rcdd.org.cn/tcmanalyzer 2016
TCM-Mesh [48] http://mesh.tcm.microbioinformatics.org/ 2017

Table 3.

Initial publishing journal of the web server articles and impact factors.

Type Database Publish IF
Target prediction web servers SEA Nature Biotechnology 68.16
STITCH Nucleic Acids Research 19.16
PharmMapper Nucleic Acids Research 19.16
idTarget Nucleic Acids Research 19.16
HitPick Bioinformatics 6.931
ChemMapper Bioinformatics 6.931
SwissTargetPrediction Nucleic Acids Research 19.16
Superpred Nucleic Acids Research 19.16
Polypharmacology Browser 2 Journal of Chemical Information and Modeling 6.162
TCM related databases TCMID Nucleic Acids Research 19.16
TCMSP Journal of Cheminformatics 8.489
BATMAN-TCM Scientific Reports 4.996
ETCM Nucleic Acids Research 19.16
SymMap Nucleic Acids Research 19.16
HERB Nucleic Acids Research 19.16
HIT 2.0 Nucleic Acids Research 19.16

3.1. Impact of the web servers

One way to measure the impact of these databases is to consider the impact factors of the journals in which these databases were published. The journal impact factors are from the Journal Citation Reports (Clarivate Analytics, 2022). Journals that published these databases had an impact factor of >5.998, and the median impact factor of these journals was 19.16 (Table 5). We also collected and compared their citation counts from the Web of Science on March 15, 2023. Most of the 13 databases included in the analysis enjoyed a high number of citations (median = 112). The top three published databases, TCMSP, STITCH, and SwissTargetPrediction, have accumulated 2650 citations, more significant than the combined citations of the remaining 10 predictors. These results suggested that target prediction tools are based on solid interest of the scientific community.

Table 5.

Comparisons among the Web servers for TCM-related web servers.

Index Function Prediction methods Dataset
Database Input type
Formulas Herb Ingredients Target Dieases
TCMID TCMID provides information on six different areas (Fomula, Herb, Compound, Disease, Drug and Target) and their links. NA 99582 10846 43413 17521 2679 Encyclopedia of Traditional Chinese Medicines/TCM@Taiwan/DrugBank/OMIM TCM formula name
Herb name in Chinese name or English
Compound name STITCH database ID
CAS number
TCMSP Compounds, targets, and diseases;
Herbal information ingredients with their ADME-related properties; Compounds-Targets/Targets-Diseases relationships;
SysDT model NA 499 29384 3311 837 Drugbank/TTD/PharmGKB Herbal name ingredient's chemical name InChIKey
CAS number target name disease name
BATMAN-TCM TCM ingredients' target prediction; Functional analyses of targets; The visualization of ingredient-Target-pathway/disease association network;
KEGG biological pathway with highlighted targets; Comparison analysis of multiple TCMs
Similarity-based target prediction 46914 8159 25210 NA NA TCMID/Drugbank/TTD/KEGG/OMIM TCM formula name Herb name PubChem_CID chemical structure of InChI format
ETCM ETCM includes multiple aspects of clinical and functional essential information on TCM herb species, TCM formulas, herbal ingredients, validated or predicted drug targets, as well as related diseases Similarity-based target prediction 3959 402 7284 2266 4323 The Fourth National Survey on Chinese/Materia Medica Resources Pharmacopoeia of the People's Republic of China (2015 version)/China Food and Drug Administration/Pubchem/OMIM DisGeNET/ORPHANET/HPO TCM formula name Herb name in Chinese name or English Herbal name ingredient's chemical name
SymMap SymMap integrates traditional Chinese medicine (TCM) with modern medicine (MM) through both internal molecular mechanism and external symptom mapping, thus provides massive information on herbs/ingredients, targets, as well as the clinical symptoms and diseases they are used to treat for drug screening efforts. Similarity-based target prediction NA 698 25975 20965 14082 HERB/ETCM/TCMSP/BATMAN-TCM/GeneCards MalaCards Herb name in Chinese name or English or latin name or pinyin name Compound name
CAS number PubChem ID
HERB HERB database is a natural medicine database platform that integrates high-throughput experimental data and reference mining data. This database provides functions such as browsing, searching, viewing, and downloading for TCM herbs, TCM herbal active ingredients, target gene, disease, high-throughput experiment and reference mining data. NA NA 7263 49258 12933 28212 SymMap/TCMID/TCMSP/TCM-ID/NCBI GEO database/PubMed database Herb name in Chinese name or English or latin name or pinyin name Compound name
CAS number Gene name Disease name
HIT 2.0 HIT2.0 is a comprehensive searching and curation platform for herbal ingredients and target information based on literature evidences.Alternatively, uses can enter My-target curation system to check the detailed compound-target relationship and create the latest personal targeting profiles Text-mining technology NA 1250 1237 2208 NA PubMed literatures/TTD/Drugbank Herbs:Chinese name, Chinese characters or English name. Herbal ingredient: name, CAS number or Pubchem CID

3.2. Applications of the web servers

With the rapid growth of web servers for target prediction, their use has steadily increased over the past years. We selected two widely used databases, SEA and TCMSP, and analyzed the publication year, research area, and country or region of the publication that was cited. The number of publications using both databases increased with time, showing similar rapidly growing trends. For a deeper understanding of the related research areas of the journals using these databases, we further counted the number of research areas of these journal papers. The results showed that among the types of literature using the SEA database, the number of published types of literature in pharmacology pharmacy was the highest, followed by biochemistry, molecular biology, chemistry, and computer science. Unlike the SEA database, the types of literature using the TCMSP database were complementary, and the number of integrative medicine documents was the highest. Pharmacology pharmacy ranked second regarding the proportion of research volume in journal publications. This indicates that researchers are more inclined to use the database dedicated to TCM for integrative, complementary medicine. This can also be shown in the countries and regions where the authors of these journals belong. Among the researchers who use TCMSP data, 93.8% are from China. However, countries and regions are widely distributed in the SEA database, and Chinese researchers still make up a considerable proportion. Overall, applications of web servers in pharmacology and complementary integrative medicine have become increasingly popular in recent years. Notably, the research in integrative, complementary medicine accounts for a large proportion of all research.

3.3. Overview of the target prediction web servers

3.3.1. Prediction method

Several methods for predicting targets have been developed. The existing computational target prediction methods are generally divided into ligand-based methods centered on small molecules and structure-based processes based on protein three-dimensional structure information [49]. The ligand-based method predicts the target according to the principle of chemical similarity; that is, similar chemical structures show similar biological activities [50,51]. These methods first collect known ligands acting on the target and then determine the similarity between the chemical under evaluation and the known ligand set to infer the drug's interaction target [52]. The majority of web tools for target prediction are based on the principle of ligand-based methods.

The model based on the chemical structure is premised on similar chemical structures showing similar biological activities and depends on prior knowledge of bioactive ligands and protein structures [53,54]. Therefore, describing molecular structures to the computer is a crucial problem of this model. When a user enters a compound, a more complex binary vector, such as a molecular fingerprint, is usually obtained in the intermediate conversion step from a 1D sequential textual format (e.g., SMILES) to a 2D structure. The 3D structure requires additional information about the 3D conformation of the compounds. There are many molecular descriptors, and they show great differences in evaluating the similarity between molecules [55]. As shown in Table 4, compounds in the chemical structure-based model are divided into 2D and 3D fingerprints. The 2D molecular fingerprinting method focuses on the similarity of molecular chemical structure, while the 3D focuses more on pharmacophore similarity. Similarity ensemble approach (SEA) [31], SuperPred [38], HitPick [35], and The Polypharmacology Browser (PPB2) [39] use 2D molecular fingerprinting. Still, some specific methods exist; the difference lies in defining substructures and coding methods. SEA, SuperPred, and PPB2 adopted circular fingerprint-extended connectivity fingerprints, which were the most widely used 2D molecular fingerprints for building quantitative structure-activity relationship models of compounds [56]. PPB also represents two other molecular fingerprints: molecular fingerprint encoding composition, molecular shape, and pharmacophores. HitPick generated Morgan fingerprints with feature invariants similar to functional-class fingerprints [57]. In addition, numerous online tools using 3D similarity methods are emerging, such as ChemMapper, which explores chemical and target pharmacology relationships against any given small molecule by SHAFTS. The 3D similarity calculation is driven by molecular shape and chemotype features [58]. SwissTargetPrediction searches for similar molecules in 2D and 3D; the 2D measure uses path-based binary fingerprints, while the 3D measure is based on Electroshape 5D (ES5D) [59,60].

Table 4.

Comparisons among the Web servers for Target Prediction. The comparison contains the search method, molecular fingerprints, algorithm, different input types, general statistics, and limitations of several TCM databases.

Type of Method Web servers Searching Method Molecular fingerprints Algorithm Dataset
Source Input type
Compounds Targets interactions
Chemical structure-based SEA 2D Similarity ECFP BLAST 3665 N N ChEMBL MDDR SMILES
STITCH NA NA Random Forests 430,000 N N DrugBank GLIDA TTD CTD ChEMBL PDSP PDB SMILES
Superpred 2D Similarity ECFP4 Tanimoto coefficients 341000 1800 665000 SuperTarget ChEMBL BindingDB PubChem ID
SMILES
MOL file
Chemical structure
SwissTargetPrediction 2D and 3D Similarity Openbabel FP2/ES5D Tanimoto coefficients/Manhattan distance 376,342 3068 580496 ChEMBL Chemical structure SMILES SDF MOL2
HitPick 2D Similarity FCFP NN searches with NB machine learning 99,572 1375 145549 STITCH SMILES
PPB 2 2D Similarity searching MQN
Xfp
ECFP
NN searches with NB machine learning/NN searches with NB machine learning 344163 1720 555346 ChEMBL SMILES Chemical structure
ChemMapper 3D Similarity searching SHAFTS Random walk algorithm 800,000 NA NA ChEMBL SMILES Chemical structure
Structure-based methods PharmMapper Pharmacophore based NA Triangle hashing (TriHash) Genetic algorithm (GA) optimization NA 23236 16159 TargetBank DrugBank BindingDB PDTD SMILES Chemical structure

Note: ECFP: Extended Connectivity Fingerprints; FCFP: Functional-Class Fingerprints; ES5D: Electroshape vectors; MQN: Molecular Quantum Number; Xfp: atom category extended atom-pair fingerprint SHAFTS: SHApe-FeaTure Similarity; NN: nearest neighbor; NB: Naıve Bayes; DNN: Deep Neural Network.

Another widely used target recognition method is the structure-based target prediction, which refers to the protein-ligand interaction mode corresponding to the desired pharmacological action, and can be regarded as the greatest common divisor shared by a group of active molecules [61,62]. Therefore, the interaction between protein targets and any ligand can be characterized by matching the 3D chemical structures with the model characteristics [63]. PharmMapper is an online web server based on a pharmacophore model that identifies potential target candidates for small molecules [64].

3.3.2. Searching method

The target prediction method of each web server is different. The simplest and fastest target prediction method relies on querying the structural similarity of compounds, such as in SuperPred and SwissTargetPrediction. SuperPred suggests protein targets according to the molecular similarity between the query compound and compounds with known biological activities using the Tanimoto coefficient and sorts the targets accordingly. SwissTargetPrediction predicts the target of a compound based on its similarity to the 2D and 3D structures of known compounds. The 2D similarity is also quantified by the Tanimoto coefficient of the query molecule and the screening molecule, and 3D similarity is quantified by the Manhattan distance between the electric shape vectors (ES5D) of 20 conformational isomers of the query molecule and the screening molecule. The final score corresponds to the combination of similarity measures based on a logistic regression of similarity values, in which the most similar ligands use 2D and 3D similarity measures [65]. Unlike single compound similarity, the SEA server uses the similarity ensemble method based on the sequence alignment algorithm BLAST to group proteins based on ligand topology. A set of web servers, such as HitPick and PPB2, use machine learning algorithms to generate predictions. HitPick applies the B-score method for hit identification combining one-nearest-neighbor similarity searching [66] and Laplacian-modified naïve Bayesian target models [67] to predict targets of the identified hits. PPB2 combines nearest neighbor searches with naïve Bayes and deep neural network (DNN) machine learning to enhance performance. Nearest neighbor and naïve Bayes combinations perform best in terms of precision statistics in PPB2.

3.3.3. Data set

Target prediction methods focus on ligand-based methods to identify structures similar to those from a bioactivity database. Therefore, the number of internal datasets, usually from several databases, is a critical factor affecting the web server quality. These web servers cover between 3665 and 800,000 compounds, including drugs and drug-like compounds, while the number of protein targets ranges between 1720 and 23,236 (Table 4). ChemMapper, PharmMapper, and STITCH contain human proteins and proteins of other species. Another important indicator by which to evaluate the web servers is the number of compound/ligand-target interactions. This number varies between 16,159 and 665,000. The SuperPred database contains the largest number of compound/ligand-target interactions. These compounds and target data derived from source databases include MDDR [68], PDSP [69], GLIDA [70], Binding DB [22], TTD [71], CTD [72], KEGG [73], DrugBank [19], SuperTarget Matador [74], STITCH [32], ChEMBL [18], and PDTD [75]. Typically, the ChEMBL database is used as a public source for chemical structures, while DrugBank, TTD, and BindingDB are for bioactivity. Generally, SuperPred and SwissTargetPrediction contain the largest quantity of data.

3.4. TCM-related web servers

As databases for herbal ingredients emerge, various web servers are used for TCM analysis to bridge the gap between TCM and modern Western medicine [47]. Unlike the commonly used target prediction websites, these web servers often record information related to TCM collected from different resources and using text-mining methods, including information on herbs, chemicals, targets, drug-target networks, and related drug target-disease networks. Recent studies compared the currently available web servers for TCM analysis in more detail [76]. Our paper focuses on the TCMID, TCMSP, BATMAN-TCM, ETCM, SymMap, and HERB databases, which store herbal compound-target information (Table 5). These databases all provide herbal compound-target-disease networks, which support studies on the mechanism of TCM. These databases also have their unique functions.

TCMID has the greatest number of medicines and TCMs, with 99,582 and 8,159, respectively. HERB has the most number of components collected, with 49,258. The greatest number of targets is in SymMap, with 20,965 targets. In addition to TCM and its components, SymMap includes 1717 corresponding TCM syndromes, corresponding to 961 Western medicine symptoms. In this way, SymMap relates TCM and modern medicine from phenotype to the molecular level. The TCMSP and ETMC databases include many herbal entries with ADME properties for drug screening and evaluation.

In addition, TCMSP, BATMAN-TCM, ETCM, SymMap, and HERB are intelligent, integrated pharmacological research platforms of TCM that can be used for target prediction, functional analysis, and reverse searches of TCM and compound prescriptions. It is worth noting that TCMSP not only contains the known targets of compounds but also establishes a target prediction method based on random forest and support vector machine (SVM), called the SysDT model, which integrates chemical, genomic, and pharmacological information [77]. The target prediction method of BATMAN-TCM is based on the similarity of chemical structure [30]. The core idea is to rank possible drug-target interactions based on their similarity to known drug-target interactions obtained from the DrugBank, KEGG, and TTD databases [16].

3.5. Comparative analysis of target prediction web servers in TCM pharmacology research

To further compare the web servers, we collected 54 indexed components of different TCMs from Chinese Pharmacopoeia and compared the ability of other web servers to predict known targets (Supplementary Table S1). Compound structures (SMILE) were collected from PubChem by querying their compound name. Next, the known targets were retrieved from DrugBank and TTD, and 54 known targets of chemical constituents were collected (Supplementary Table S2) (see Fig. 4).

Fig. 4.

Fig. 4

Web servers based on chemical structure-based model workflow.

We counted the number of targets predicted by each web server, and the results showed that the number varied greatly, ranging from several to hundreds. The number of target outputs by SwissTargetPrediction and PPB2 database was certain. The number of targets predicted by most web servers was less than 100. Next, we performed statistical tests to identify the accuracy of each web server in predicting known targets. As shown in Fig. 5, considering that at least one target was predicted, SwissTargetPrediction demonstrated the best performance in accuracy (53.7%), followed by PPB2 (51.9%), SuperPred (44.4%), and SEA (37.0%), while the others were below 25%. It is worth noting that the prediction accuracy of the unique databases of TCM, such as TCMID, TCMSP, and BATMAN-TCM, was not high.

Fig. 5.

Fig. 5

The search results of different web servers for 54 compounds: (A) the number of targets predicted by each web server; (B) the predicted results of different web servers.

4. Discussion and conclusion

Drug target identification is one of drug discovery's most critical and complex steps [78]. The construction and development of computational methods and data-driven technological approaches have brought new ideas for studying the material basis and action mechanisms of TCM. These systems provide new ideas for researching complex TCM systems and offer further scientific and technological support for rational clinical drug use and new drug research and development. In this review, we witnessed the efforts of current computing technology in drug target prediction and the modernization of TCM in the form of online tools. We also note that all developers have made commendable efforts to provide non-computing users with tools to enable a broad audience to analyze drug targets quickly. Each web server has its advantages regarding the specificity of its data and algorithm. Although significant progress has been made in its methodology and applications, current TCM target research has several problems and limitations.

Unfortunately, the target prediction databases applied to TCM are based on Western medicine compositions and target databases. Unlike Western drugs, most herbal composite compound targets are unknown. For example, even by comprehensive literature curation, only 6% of 9862 herbal ingredients in HIT were found to have direct or indirect targets in PubMed abstracts [79]. TCM-related information has typical big data characteristics. The existing web tools have their emphases, and the information in a single database is limited and biased. Therefore, we should build high-quality databases and strengthen the association of major databases. The prediction methods used in the target prediction database mainly include target discovery based on ligand structure and target prediction based on receptor structure characteristics. Ligand-based drug target prediction is performed according to the similarity between molecules to those in the database. However, the affinity between the compounds and potential targets is not reflected. Moreover, the mode of interaction between the compound and the target cannot be predicted. Therefore, it is usually necessary to comprehensively use different prediction tools and make a selective experimental verification of the target through manual judgment.

Many scholars have proposed AI algorithm models. With the rapid accumulation and digital transformation of drug research and development data, as well as the accelerated development of AI technology, machine learning models such as decision trees, random forest, and SVM, as well as deep learning algorithms such as DNNs, convolutional neural networks, and cyclic neural networks, are gradually applied to the field of drug discovery, such as peptide synthesis, virtual screening, toxicity prediction, drug monitoring and release, pharmacophore modeling, quantitative structure-activity relationships, drug repositioning, multiple pharmacological and physiological activities, and other drug discovery processes [80]. For example, a multi-target drug prediction platform for anti-AD was established based on a machine learning algorithm. The global drug database was predicted and mined, from which 13 drugs were selected, which acted on at least one anti-AD drug target that could resist AD through various actions [81]. AI technology based on computational learning is also suitable for predicting and screening the activity of TCM with complex components. AI technology has been successfully applied to the analysis of complex functions of TCM and the screening of new drugs in TCM, providing a new direction and ideas for speeding up the modernization of TCM.

By summarizing and sorting out the multi-dimensional big data of TCM from different sources, an extensive dataset of molecular-target interactions according to the characteristics of TCM is constructed. Using deep learning to establish an intelligent prediction model of "molecular-target" can improve predictability, reliability, and repeatability. Moreover, efficient algorithms that follow the complex characteristics of TCM are lacking. Therefore, the development of TCM needs high-quality data and efficient algorithms. This is conducive to the modernization and internationalization of TCM and may provide significant technical support for drug development, clinical diagnosis, and personalized medicine.

Author contribution statement

All authors listed have significantly contributed to the development and the writing of this article. ⟨/p⟩

5. Data availability statement

Data included in article/supplementary material/referenced in article.

Funding

This work was supported by the National Natural Science Foundation of China (Nos. 41806191), China's National Key Research and Development Program (2017YFC1702703), Shandong Taishan Scholar Climbing Project, China, the youth innovation team of Shandong University of Traditional Chinese Medicine, China.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.heliyon.2023.e19151.

Appendix A. Supplementary data

The following is the Supplementary data to this article.

Multimedia component 1
mmc1.doc (198.5KB, doc)

References

  • 1.Sayers E.W., Barrett T., Benson D.A., et al. 2010. Database Resources of the National Center for Biotechnology Information. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Santos R., Ursu O., Gaulton A., Bento A.P., Donadi R.S., Bologa C.G., Karlsson A., Al-Lazikani B., Hersey A., Oprea T.I., et al. A comprehensive map of molecular drug targets. Nat. Rev. Drug Discov. 2017;16(1):19–34. doi: 10.1038/nrd.2016.230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Shao L., Tai P., Fan W., et al. Network pharmacology in traditional Chinese medicine. Evid Based Compl & Alt. 2014;2014:1–2. doi: 10.1155/2014/138460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lafferty-Whyte K., Mormeneo D., Del Fresno Marimon M. Trial watch: opportunities and challenges of the 2016 target landscape. Nat. Rev. Drug Discov. 2017;16(1):10–11. doi: 10.1038/nrd.2016.263. [DOI] [PubMed] [Google Scholar]
  • 5.Hopkins A.L. Network pharmacology: the next paradigm in drug discovery. Nat. Chem. Biol. 2008;4(11):682–690. doi: 10.1038/nchembio.118. [DOI] [PubMed] [Google Scholar]
  • 6.Corson T.W., Crews C.M. Molecular understanding and modern application of traditional medicines: triumphs and trials. Cell. 2007;130(5):769–774. doi: 10.1016/j.cell.2007.08.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wang J.H. Dalian University of Technology; 2017. Study of the Interaction Mechanism between Drugs and Targets Based on Multi-Level. [Google Scholar]
  • 8.Forouzesh A., Samadi Foroushani S., Forouzesh F., et al. Reliable target prediction of bioactive molecules based on chemical similarity without employing statistical methods. Front. Pharmacol. 2019;10:835. doi: 10.3389/fphar.2019.00835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Burdine L., Kodadek T. Target identification in chemical genetics: the (often) missing link. Chem Biol. 2004;11(5):593–597. doi: 10.1016/j.chembiol.2004.05.001. [DOI] [PubMed] [Google Scholar]
  • 10.Zheng X.S., Chan T.F., Zhou H.H. Genetic and genomic approaches to identify and study the targets of bioactive small molecules. Chem Biol. 2004;11(5):609–618. doi: 10.1016/j.chembiol.2003.08.011. [DOI] [PubMed] [Google Scholar]
  • 11.Chen X., Liu M.X., Yan G.Y. Drug–target interaction prediction by random walk on the heterogeneous network. Mol. Biosyst. 2012;8(7):1970–1978. doi: 10.1039/c2mb00002d. [DOI] [PubMed] [Google Scholar]
  • 12.Kim J., Park S., Min D., Kim W. Comprehensive survey of recent drug discovery using deep learning. Int. J. Mol. Sci. 2021;22(18) doi: 10.3390/ijms22189983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Peska L., Buza K., Koller J. Drug-target interaction prediction: a Bayesian ranking approach. Comput Methods Programs Biomed. 2017;152:15–21. doi: 10.1016/j.cmpb.2017.09.003. [DOI] [PubMed] [Google Scholar]
  • 14.Sydow D., Burggraaff L., Szengel A., et al. Advances and challenges in computational target prediction. J Chem Inf Mod. 2019;59(5):1728–1742. doi: 10.1021/acs.jcim.8b00832. [DOI] [PubMed] [Google Scholar]
  • 15.Cheng T., Li Q., Wang Y., Bryant S.H. Identifying compound-target associations by combining bioactivity profile similarity search and public databases mining. J Chem Inf Mod. 2011;51(9):2440–2448. doi: 10.1021/ci200192v. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Perlman L., Gottlieb A., Atias N., et al. Combining drug and gene similarity measures for drug-target elucidation. J. Comput. Biol. 2011;18(2):133. doi: 10.1089/cmb.2010.0213. [DOI] [PubMed] [Google Scholar]
  • 17.Rognan D. Chemogenomic approaches to rational drug design. Brit J Pharm. 2007;152(1):38–52. doi: 10.1038/sj.bjp.0707307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Anna G., Bellis L.J., Patricia B.A., et al. ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res. 2012;40(Database issue):D1100–D1107. doi: 10.1093/nar/gkr777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wishart D.S., Knox C., Guo A.C., et al. DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 2006;34(Database issue):D668–D672. doi: 10.1093/nar/gkj067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Davis A.P., Grondin C.J., Johnson R.J., et al. Comparative toxicogenomics database (CTD): update 2021. Nucleic Acids Res. 2021;49(D1):D1138–D1143. doi: 10.1093/nar/gkaa891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Nikolai H., Jessica A., Joachim V.E., et al. SuperTarget goes quantitative: update on drug–target interactions. Nucleic Acids Res. 2012;40(D1):D1113–D1117. doi: 10.1093/nar/gkr912. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Liu T., Lin Y., Wen X., et al. BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res. 2007;35(Database issue):D198–D201. doi: 10.1093/nar/gkl999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Magariños M., Carmona S.J., Crowther G.J., et al. TDR Targets: a chemogenomics resource for neglected diseases. Nucleic Acids Res. 2012;D1:D1118–D1127. doi: 10.1093/nar/gkr1053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Armstrong J.F., Faccenda E., Harding S.D., et al. The IUPHAR/BPS Guide to PHARMACOLOGY in 2020: extending immunopharmacology content and introducing the IUPHAR/MMV Guide to MALARIA PHARMACOLOGY. Nucleic Acids Res. 2019;48(Suppl. 1) doi: 10.1093/nar/gkz951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Qin C., Zhang C., Zhu F., et al. Therapeutic target database update 2014: a resource for targeted therapeutics. Nucleic Acids Res. 2014;(D1):1118–1123. doi: 10.1093/nar/gkt1129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Kuhn M., Campillos M., Letunic I., et al. A side effect resource to capture phenotypic effects of drugs. Mol Syst Biolo. 2014;6(1) doi: 10.1038/msb.2009.98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Koutsoukas A., Simms B., Kirchmair J., et al. From in silico target prediction to multi-target drug design: current databases, methods and applications. J Proteomics. 2011;74(12):2554–2574. doi: 10.1016/j.jprot.2011.05.011. [DOI] [PubMed] [Google Scholar]
  • 28.Xue R., Fang Z., Zhang M., et al. TCMID: traditional Chinese Medicine integrative database for herb molecular mechanism analysis. Nucleic Acids Res. 2013;41(Database issue):D1089–D1095. doi: 10.1093/nar/gks1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ru J., Li P., Wang J., et al. TCMSP: a database of systems pharmacology for drug discovery from herbal medicines. J Cheminform. 2014;6(1):13. doi: 10.1186/1758-2946-6-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Liu Z., Guo F., Wang Y., et al. BATMAN-TCM: a bioinformatics analysis tool for molecular mechanism of traditional Chinese medicine. Sci Rep-UK. 2016;6 doi: 10.1038/srep21146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Keiser M.J., Roth B.L., Armbruster B.N., et al. Relating protein pharmacology by ligand chemistry. Nat Biotech. 2007;25(2):197–206. doi: 10.1038/nbt1284. [DOI] [PubMed] [Google Scholar]
  • 32.Kuhn M., von Mering C., Campillos M., Jensen L.J., Bork P. STITCH: interaction networks of chemicals and proteins. Nucleic Acids Res. 2008;36(Database issue):D684–D688. doi: 10.1093/nar/gkm795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Liu X., Ouyang S., Yu B., et al. PharmMapper server: a web server for potential drug target identification using pharmacophore mapping approach. Nucleic Acids Res. 2010;38:W609–W614. doi: 10.1093/nar/gkq300. Web Server issue) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Wang J.C., Chu P.Y., Chen C.M., et al. idTarget: a web server for identifying protein targets of small chemical molecules with robust scoring functions and a divide-and-conquer docking approach[J] Nucleic Acids Res. 2012;(W1):W393–W399. doi: 10.1093/nar/gks496. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Liu X., Ingo V., Tanseem H., et al. HitPick: a web server for hit identification and target prediction of chemical screenings. Bioinformatics. 2013;(15):1910–1912. doi: 10.1093/bioinformatics/btt303. [DOI] [PubMed] [Google Scholar]
  • 36.Gong J., Cai C., Liu X., et al. ChemMapper: a versatile web server for exploring pharmacology and chemical structure association based on molecular 3D similarity method. Bioinformatics. 2013;(14):1827–1829. doi: 10.1093/bioinformatics/btt270. [DOI] [PubMed] [Google Scholar]
  • 37.David G., Aurélien Grosdidier, Matthias W., et al. SwissTargetPrediction: a web server for target prediction of bioactive small molecules. Nucleic Acids Res. 2014;42(Web Server issue):32–38. doi: 10.1093/nar/gku293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Janette N., Bjoern-Oliver G., Jevgeni E., et al. SuperPred: update on drug classification and target prediction[J] Nucleic Acids Res. 2014;(W1):W26–W31. doi: 10.1093/nar/gku477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Awale M., Reymond J.L. The polypharmacology browser: a web-based multi-fingerprint target prediction tool using ChEMBL bioactivity data. J Cheminform. 2017;9:11. doi: 10.1186/s13321-017-0199-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Xu H.Y., Zhang Y.Q., Liu Z.M., et al. ETCM: an encyclopaedia of traditional Chinese medicine. Nucleic Acids Res. 2019;47(D1):D976–D982. doi: 10.1093/nar/gky987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Yang W., Fei L., Zhang K., et al. Symmap: an integrative database of traditional Chinese medicine enhanced by symptom mapping. Nucleic Acids Res. 2019 8;47(D1):D1110–D1117. doi: 10.1093/nar/gky1021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Fang S.S., Dong L., Liu L., et al. HERB: a high-throughput experiment- and reference-guided database of traditional Chinese medicine. Nucleic Acids Res. 2020;D1:D1. doi: 10.1093/nar/gkaa1063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Yan Deyu, Zheng Genhui, Wang Caicui, et al. Nucleic Acids Research; 2021. HIT 2.0: an Enhanced Platform for Herbal Ingredients' Targets. gkab1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Li H., Gao Z., Kang L., et al. TarFisDock: a web server for identifying drug targets with docking approach. Nucleic Acids Res. 2006;34:W219–W224. doi: 10.1093/nar/gkl114. Web Server issue) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Luo H., Chen J., Shi L., et al. DRAR-CPI: a server for identifying drug repositioning potential and adverse drug reactions via the chemical-protein interactome. Nucleic Acids Res. 2011;39:W492–W498. doi: 10.1093/nar/gkr299. Web Server issue) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Taboureau O., Nielsen S.K., Audouze K., et al. ChemProt: a disease chemical biology database. Nucleic Acids Res. 2011 Jan;39(Database issue):D367–D372. doi: 10.1093/nar/gkq906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Liu Z., Du J., Yan X., et al. TCMAnalyzer: a chemo-and bioinformatics web service for analyzing traditional Chinese medicine. J Chem Inf Mod. 2018;58(3):550–555. doi: 10.1021/acs.jcim.7b00549. [DOI] [PubMed] [Google Scholar]
  • 48.Zhang R.Z., Yu S.J., Bai H., et al. TCM-Mesh: the database and analytical system for network pharmacology analysis for TCM preparations. Sci. Rep. 2017;7(1):2821. doi: 10.1038/s41598-017-03039-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Liu X., Xu Y., Li S., et al. In Silico target fishing: addressing a "Big Data" problem by ligand-based similarity rankings with data fusion. J Cheminform. 2014;6(1):33. doi: 10.1186/1758-2946-6-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Schenone M., Dančík V., Wagner B.K., et al. Target identification and mechanism of action in chemical biology and drug discovery. Nat. Chem. Biol. 2013;9(4):232–240. doi: 10.1038/nchembio.1199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Bender A., Glen R.C. Molecular similarity: a key technique in molecular informatics. Org. Biomol. Chem. 2004;2(22):3204–3218. doi: 10.1039/B409813G. [DOI] [PubMed] [Google Scholar]
  • 52.Martin Y.C., Kofron J.L., Traphagen L.M. Do structurally similar molecules have similar biological activity? J. Med. Chem. 2002;45(19):4350–4358. doi: 10.1021/jm020155c. [DOI] [PubMed] [Google Scholar]
  • 53.Bender A., Glen R.C. 2008. Molecular Similarity: a Key Technique in Molecular Informatics†. [DOI] [PubMed] [Google Scholar]
  • 54.Peter W., John M., Barnard G.M., et al. Chemical similarity searching. J. Chem. Inf. Model. 1998;38(6):983–996. [Google Scholar]
  • 55.Bender A. How similar are those molecules after all? Use two descriptors and you will have three different answers. Expet Opin. Drug Discov. 2010;5(12):1141–1151. doi: 10.1517/17460441.2010.517832. [DOI] [PubMed] [Google Scholar]
  • 56.Rogers D., Hahn M. Extended-connectivity fingerprints. J Chem Inf Mod. 2010;50(5):742–754. doi: 10.1021/ci100050t. [DOI] [PubMed] [Google Scholar]
  • 57.Hamad S., Adornetto G., Naveja J.J., et al. HitPickV2: a web server to predict targets of chemical compounds. Bioinformatics. 2019;35(7):1239–1240. doi: 10.1093/bioinformatics/bty759. [DOI] [PubMed] [Google Scholar]
  • 58.Liu X., Jiang H., Li H. SHAFTS: a hybrid approach for 3D molecular similarity calculation. 1. Method and assessment of virtual screening. J Chem Inf Mod. 2011;51(9):2372–2385. doi: 10.1021/ci200060s. [DOI] [PubMed] [Google Scholar]
  • 59.Armstrong M.S., Finn P.W., Morris G.M., et al. Improving the accuracy of ultrafast ligand-based screening: incorporating lipophilicity into ElectroShape as an extra dimension. J Comput-aid Mol Des. 2011;25(8):785–790. doi: 10.1007/s10822-011-9463-8. [DOI] [PubMed] [Google Scholar]
  • 60.Armstrong M.S., Morris G.M., Finn P.W., et al. ElectroShape: fast molecular similarity calculations incorporating shape, chirality and electrostatics. J Comput-aid Mol Des. 2010;24(9):789–801. doi: 10.1007/s10822-010-9374-0. [DOI] [PubMed] [Google Scholar]
  • 61.Schuster D. 3D pharmacophores as tools for activity profiling. Drug Discov. Today Technol. 2010;7(4):e205–e211. doi: 10.1016/j.ddtec.2010.11.006. [DOI] [PubMed] [Google Scholar]
  • 62.Nettles J.H., Jenkins J.L., Williams C., et al. Flexible 3D pharmacophores as descriptors of dynamic biological space. J Mol Graphi Model. 2008;26(3):622–633. doi: 10.1016/j.jmgm.2007.02.005. [DOI] [PubMed] [Google Scholar]
  • 63.Wang X., Pan C., Gong J., et al. Enhancing the enrichment of pharmacophore-based target prediction for the polypharmacological profiles of drugs. J Chem Inf Mod. 2016;56(6):1175–1183. doi: 10.1021/acs.jcim.5b00690. [DOI] [PubMed] [Google Scholar]
  • 64.Liu X., Ouyang S., Yu B., et al. PharmMapper server: a web server for potential drug target identification using pharmacophore mapping approach. Nucleic Acids Res. 2010;38:W609–W614. doi: 10.1093/nar/gkq300. Web Server issue) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.David G., Olivier M., Vincent Z. Shaping the interaction landscape of bioactive molecules. Bioinformatics. 2013;(23):3073–3079. doi: 10.1093/bioinformatics/btt540. [DOI] [PubMed] [Google Scholar]
  • 66.66Schuffenhauer A., Floersheim P., Acklin P., et al. Similarity metrics for ligands reflecting the similarity of the target proteins. J Chem Inf Comp Sci. 2003;43(2):391. doi: 10.1021/ci025569t. [DOI] [PubMed] [Google Scholar]
  • 67.Nidhi N., Glick M., Davies J.W., et al. Prediction of biological targets for compounds using multiple-category bayesian models trained on chemogenomics databases. ChemInform. 2006;37(3):1124–1133. doi: 10.1021/ci060003g. [DOI] [PubMed] [Google Scholar]
  • 68.None . vol. 29. Inc; 1995. (MDL Information Systems). [Google Scholar]
  • 69.Receptor D. 2015. PDSP Ki Database. [Google Scholar]
  • 70.Okuno Y., Tamon A., Yabuuchi H., et al. GLIDA: GPCR—ligand database for chemical genomics drug discovery—database and tools update. Nucleic Acids Res. 2008;36(1):907–912. doi: 10.1093/nar/gkm948. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Wang Y., Zhang S., Li F., et al. Therapeutic target database 2020: enriched resource for facilitating research and early development of targeted therapeutics. Nucleic Acids Res. 2020;48:D1031–D1041. doi: 10.1093/nar/gkz981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Mattingly C.J., Rosenstein M.C., Colby G.T., et al. The Comparative Toxicogenomics Database (CTD): a resource for comparative toxicological studies. J. Exp. Zool. Part A Comparative Experimental Biology. 2010;305(9):689–692. doi: 10.1002/jez.a.307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Minoru K., Miho F., Mao T., et al. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 2017;D1:D353–D361. doi: 10.1093/nar/gkw1092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Stefan G., Michael K., Mathias D., et al. SuperTarget and Matador: resources for exploring drug-target relationships. Nuclc Acids Res. 2008;36 doi: 10.1093/nar/gkm862. (Database issue) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Gao Z., Li H., Zhang H., et al. PDTD: a web-accessible protein database for drug target identification. BMC Bioinf. 2008;9(1):104. doi: 10.1186/1471-2105-9-104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Zhang R., Zhu X., Bai H., et al. Network pharmacology databases for traditional Chinese medicine: review and assessment. Front. Pharmacol. 2019;10:123. doi: 10.3389/fphar.2019.00123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Hua Y., Chen J., Xue X., et al. A systematic prediction of multiple drug-target interactions from chemical, genomic, and pharmacological data. PLoS One. 2012;7(5) doi: 10.1371/journal.pone.0037608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Mendez-Lucio O., Baillif B., Clevert D.A., et al. De novo generation of hit-like molecules from gene expression signatures using artificial intelligence. Nat. Commun. 2020;11(1):10. doi: 10.1038/s41467-019-13807-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Hao Y., Li Y., Kang H., et al. HIT: linking herbal active ingredients to targets. Nucleic Acids Res. 2011;39(Database issue):D1055. doi: 10.1093/nar/gkq1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Sagi O., Rokach L. Ensemble learning: a survey. Wiley Interdiscip. Rev. 2018;8:1–18. [Google Scholar]
  • 81.Zhang B.Y., Pang X.C., Jia H., et al. Repositioning drug discovery for Alzheimer's disease based on global marketed drug data. Acta Pharm. Sin. 2019;54(7):1214. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Multimedia component 1
mmc1.doc (198.5KB, doc)

Data Availability Statement

Data included in article/supplementary material/referenced in article.


Articles from Heliyon are provided here courtesy of Elsevier

RESOURCES