Abstract
DNA‐based biological ingredient identification and downstream pharmacology network analysis are commonly used in research for Traditional Chinese Medicine preparations (TCM formulas). Advancements in bioinformatics tools and the accumulation of related data have become driving forces for progress in this field. However, a lack of a platform integrating biological ingredient identification and downstream pharmacology network analysis hinders the deep understanding of TCM. In this study, we developed the TCM‐Suite platform composed of two sub‐databases, Holmes‐Suite and Watson‐Suite, for TCM biological ingredient identification and network pharmacology investigation, respectively, both are among the most complete: In the Holmes‐Suite, we collected and processed six types of marker gene sequences, accounting for 1,251,548 marker gene sequences. In the Watson‐Suite, we curated and integrated a massive number of entries from more than 10 public databases. Importantly, we developed a comprehensive pipeline to integrate TCM biological ingredient identification and downstream network pharmacology research, allowing users to simultaneously identify components of a TCM formula and analyze its potential pharmacology mechanism. Furthermore, we designed search engines and a user‐friendly platform to better search and visualize these rich resources. TCM‐Suite is a comprehensive and holistic platform for TCM‐based drug discovery and repurposing. TCM‐Suite website: http://TCM-Suite.AImicrobiome.cn.
Keywords: composition identification, database, holistic pipeline, network pharmacology, Traditional Chinese Medicine, webserver
TCM‐Suite platform is composed of two sub‐databases, Holmes‐Suite and Watson‐Suite. Holmes‐Suite database covers six marker genes, resulting in 235,470 kinds of biological ingredients. Watson‐Suite retrieved massive data for mining the “herb‐compound‐protein‐disease” interpretation. A holistic pipeline was designed to connect Holmes and Watson, showed by a user‐friendly platform.

Highlights
TCM‐Suite platform is composed of two sub‐databases, Holmes‐Suite and Watson‐Suite.
Holmes‐Suite database covers six marker genes, resulting in 235,470 kinds of biological ingredients.
Watson‐Suite retrieved massive data for mining the “herb‐compound‐protein‐disease” interpretation.
A holistic pipeline was designed to connect Holmes and Watson, shown by a user‐friendly platform.
INTRODUCTION
Traditional Chinese Medicine (TCM) is a branch of traditional medicine based on over 3500 years of Chinese medical practice encompassing various forms of herbal medicine [1, 2]. TCM preparation (i.e., formula) has considerable promise for treating various diseases and is a great resource for drug development [3–5]. With the advancement in bioinformatic tools and the accumulation of related data, TCM component (biological ingredient) identification based on advanced sequencing technology has become an emergent issue [6, 7]: For the accurate and efficient identification of TCM biological ingredients, it is crucial to generate a comprehensive and accurate DNA‐based marker sequence database, as well as to design efficient sequence search engines. Additionally, these TCM biological ingredient identification results could be linked to target proteins and molecular mechanisms of drug effects to enhance the understanding of the TCM formulas [8].
DNA‐based molecular marker genes (i.e., DNA barcodes) are efficient and accurate tools for identifying TCM biological ingredients [9, 10], especially in mixtures containing many species [11, 12]. DNA barcode is defined as a standardized short DNA sequence from a conservative region of a species' genome [10, 13], which can be used to delineate between taxonomic lineages [14, 15]. ITS2, matK, trnH‐psbA, trnL, rpoc1, ycf1, and other typical marker genes [16–18] are extensively used in TCM research. These DNA barcodes make it feasible and convenient to identify biological ingredients from TCM formula, which is a typical mixed system [19, 20]. Identification of TCM biological ingredients is essentially equivalent to identifying the taxonomy species belonging to various raw materials, depending on the precise identification of DNA barcodes of TCM biological ingredients. Using high‐throughput sequencing, the metagenomic approach has become one of the most important and effective approaches to deciphering the taxonomical profile of a mixture of biological ingredients based on marker genes [13, 21, 22], which could help to establish an accurate and efficient method for biological ingredient analysis of the TCM formula.
Typically, the TCM formula comprises of several herbs containing tens, hundreds, or even thousands of compounds [20, 21]. Consequently, TCM formulas represent a rich resource for clinical practice and drug development, while the identification of clear biological ingredients of TCM provides the basis for downstream pharmacological research. Each biological ingredient may contribute to the treatment of corresponding syndromes [23, 24]. Hence, after detecting the biological ingredients (i.e., herbs) of a TCM formula, investigation of the active compounds for each herb can effectively explore the herb's target genes and diseases [25, 26]. From a network perspective, network pharmacology may explain intricate relationships between biological systems of the active compounds (or small molecules), human proteins (or genes), and diseases [27–29]. When the herbs in the TCM formula are parsed, network pharmacology studies can decipher the role of a biological ingredient at the molecular level. Moreover, by examining the “compound‐protein‐disease” interaction network in medication processes, the potential effects of biological ingredients can be systematically assessed [30–32].
Recently, various databases have been developed for the analysis of TCM formulas, focusing on both the identification of biological ingredients and the inference of network pharmacology. For the TCM biological ingredient identification, databases like ITS2 Ribosomal RNA Database [33] and TCMBarcode Database [34] were constructed. For network pharmacology investigation, Super Natural II [35], TCM@Taiwan [36], ChEMBL [37], and TCMID [38] have been developed for the identification of formula‐herb and herb‐compound relationships for TCM formulas; STITCH [39] was constructed for compound‐protein association analysis, STRING [40] for protein–protein interaction analysis, and OMIM [41], and GAD [42] for protein‐disease association analysis. Finally, for a comprehensive understanding of TCM effects, toxicity [43], and side effect [43] databases have been developed for safety assessments.
Despite the remarkable success of these TCM‐related databases, the biggest bottleneck for the modernization and globalization of TCM research has not been overcome: what are the accurate component of a TCM formula, what are the special active compounds, and what are their targeted proteins in human disease? These questions remain largely unaddressed by any of the previously described TCM‐related databases. The limitations of current TCM‐related databases are as follows: (I) Currently, biological and chemical methods were far from complete and accurate. (II) For TCM biological ingredient identification, although many databases have been constructed for TCM network pharmacology analysis, the lack of integration and redundancy in multiple databases led to error‐prone, incomplete, and inefficient network pharmacology research. (III). Importantly, due to the current situation (e.g., lacking marker sequences for biological ingredients), a holistic pipeline is still lacking for TCM analysis, which includes identification of herbs of TCM formulas, mining their related active compounds and protein targets, and analysis of their associations with diseases.
In this study, we developed the TCM‐Suite database to facilitate TCM‐related information management, data‐driven hypothesis generation, and potential drug discovery. First, to construct a comprehensive database, a large number of entries covering different types of TCM‐related data were collected. For TCM biological ingredient identification, we collected sequences and related information for six marker genes (ITS2, matK, trnH‐psbA, trnL, rpoc1, and ycf1), resulting in the Holmes‐Suite sub‐database, containing 235,470 kinds of biological ingredients and 1,251,548 marker gene sequences. For downstream network pharmacology analysis, we retrieved large sets of public records, constructing the Watson‐Suite sub‐database, which includes 6,692 formulas, 7,322 herbs, 704,321 compounds, and 19,319 proteins, and 15,437 diseases, as well as corresponding correlations between different types of entries. Second, a holistic pipeline was developed to seamlessly integrate these two sub‐databases, covering the whole process from TCM biological ingredient identification to downstream network pharmacology investigation. Third, based on the enormous number of entries and holistic analysis pipeline, we designed a user‐friendly platform for users to explore and visualize these rich resources. We believe that TCM‐Suite could serve as a resource for TCM‐based drug discovery and repurposing, and that ongoing work on this integrated database and analytical platform will contribute to pharmaceutical research and clinical practice worldwide. The TCM‐Suite database website is available at http://TCM-Suite.AImicrobiome.cn.
CONSTRUCTION AND CONTENT
Data collection and processing for Holmes‐Suite
Raw sequences were obtained from the NCBI nucleotide database (https://www.ncbi.nlm.nih.gov/nucleotide/) in GenBank format in December 2020, using gene names as keywords (ITS2, matK, trnH‐psbA, trnL, rpoc1, and ycf1). We developed a TCM marker gene database and TCM ingredient identification system in the Holmes‐Suite database to satisfy these stringent requirements for TCM biological ingredient identification. The approach using machine learning to process different marker genes works as follows: First, for every type of marker gene, 1000 manually checked sequences representing clean and complete marker genes devoid of ambiguous base pairs and out‐of‐boundary sequences, as well‐annotated marker gene sequences, were selected. Second, a Hidden Markov Model (HMM) was trained using these well‐annotated marker gene sequences to identify the potential marker gene regions of these candidate sequences before these sequences could be included in our curated database. All downloaded sequences and sub‐sequences that did not fit the model were filtered out. Third, the sequences of positive entries not covered by the model were manually gathered. A detailed process workflow is exemplified by the ITS2 marker gene in the Holmes‐Suite database and provided on the help page (http://TCM-Suite.AImicrobiome.cn/help). Based on these high‐quality marker genes, a sequence‐based search engine with high accuracy and coverage was constructed.
Data collection and processing for Watson‐Suite
Watson‐Suite was composed of six data fields: formula, herb, ingredient, compound, protein, and disease. In the Watson database, different data fields were linked by direct correlations collected and curated from the existing database. Based on the data in existing databases, the direct correlations include formula‐herb correlations, herb‐ingredient, ingredient‐compound, compound‐protein, and protein‐disease [44, 45]. the other correlations among these six data were defined as indirect correlations, calculated based on the network pharmacology.
For these collected data from different databases, data integration and filtration were conducted, followed by manual annotations and corrections. First, to improve the compatibility of data, we converted all of them into a standard format. Second, based on these integrated data, each entry was manually checked and tagged with a unique label, and then duplicate records were removed. The detailed process procedure was supplied on our website(http://TCM-Suite.AImicrobiome.cn/help).
Interconnection between entries in Holmes‐Suite and Watson‐Suite
This interconnection was established by connecting the “species” field in Holmes‐Suite and the “herb” field in Watson‐Suite, as both fields indicated the Latin name of the biological ingredient of TCM‐related herbs (Figure 1A). Fuzzy matching and Elastic search (https://www.elastic.co) were used to match these two fields. For the matched herbs, we integrated the detailed information of the two sub‐databases on the website for the TCM biological ingredient in Holmes‐Suite (Figure 1B) or determination of the target herb in Watson‐Suite (Figure 1C), while providing a hyperlink to the corresponding database. Similarly, this interconnection was also uncovered using the Holmes‐Suite search module (Figure 1D).
Figure 1.

Interconnection of Holmes‐Suite and Watson‐Suite sub‐databases. (A) Fuzzy matching of biological ingredients for Holmes‐Suite and Watson‐Suite connectivity. The “species” field in Holmes‐Suite was linked to the “herb” field in Watson‐Suite via Elastic search. (B) Entry‐specific information from the Holmes‐Suite sub‐database. The details of each entry contained the taxonomical classification. The frame of “Link to Watson” provided the hyperlink for corresponding network pharmacology analysis in Watson‐Suite. (C) Detailed herbal information page for Watson‐Suite database. The species name and drug‐related properties for this entry were provided. Similarly, the hyperlink for the species in Holmes‐Suite was provided. (D) The sequence search result in the Holmes‐Suite database also supplied the hyperlink to the record in Holmes‐Suite. Users could conveniently obtain the taxonomical information of this search result and the corresponding network pharmacology information.
Implementation of TCM‐Suite
The TCM‐Suite website was developed using the Python‐Flask, Nginx, and React JavaScript frameworks and is compatible with most major browsers (tested with Chrome, Edge, Firefox, and Safari) under different operating systems. TCM‐Suite database supports 10 people to search the database and 30 people to browse online at the same time. TCM‐Suite stores its data in a PostgreSQL database. TCM‐Suite is dedicated to providing a user‐friendly web interface for users to browse, search, visualize, and download data. TCM‐Suite is freely accessible at http://TCM-Suite.AImicrobiome.cn without any registration.
Data resources and database construction
To establish a comprehensive database, we collected and integrated data from multiple sources for the construction of the TCM‐Suite database: To incorporate the marker gene sequences for identifying the TCM biological ingredient, the Holmes‐Suite sub‐database was constructed (http://TCM-Suite.AImicrobiome.cn/holmes/browse). For downstream network pharmacology analysis, the Watson‐Suite sub‐database was constructed (http://TCM-Suite.AImicrobiome.cn/watson/browse). Watson‐Suite contains up to 6,692 formulas, 65,231 formula‐herb associations, 7,322 herbs, 54,868 herb‐compound associations, 704,321 compounds, 4,350,151 protein‐compound associations, 19,319 proteins, and 814,250 gene target sites for proteins, 160,095 protein‐disease related records, 15,437 diseases, and 160,169 side effect records. The databases from which these entries were collected are summarized in Supporting Information: Table 1.
Detailed statistics of the data in the Holmes and Watson databases were depicted in Figure 2. In the Holmes database, the species distribution for six marker genes was presented (Figure 2A). According to this species distribution, the six marker genes have distinct species distributions, although they nevertheless share 1131 species entries. The high quality of these marker genes is reflected by a long average length based on the length distribution (Figure 2). In the Watson database, the characters of pharmacology networks for different types of entries are reflected by direct correlations collected and curated from the existing database (Figure 2C) and indirect correlations deduced by network pharmacology analysis (Figure 2D). Compared to previous databases, our database has improved both the number of marker genes and the complexity of the pharmacology network [44, 45].
Figure 2.

Statistical results for the Holmes‐Suite and Watson‐Suite sub‐databases. (A) Distribution of species across six marker genes. The Venn diagram illustrates the number of species grouped by six marker gene databases. Six types of marker gene share 1131 common species. (B) Sequence length distribution for six marker genes. For each type of marker gene, length distribution was calculated and labeled in different colors. (C) Direct correlations for six categorizations of pharmacology networks in the Watson‐Suite sub‐database. Direct correlations were collected and curated from the existing database. Bar plots show the total number of direct associations for five types of direct associations. (D) Indirect correlations for five categorizations of pharmacology networks in the Watson‐Suite database. Indirect correlations were deduced by network pharmacology analysis. Bar plots show the total number of indirect associations for five direct associations.
Evaluation of the Holmes‐Suite database
The requirements of high‐throughput analysis of TCM biological ingredients have resulted in an elevated standard for accuracy (high sensitivity required), completeness (cover as many species as possible), as well as efficiency (processing batch and bulk data quickly) of identification. To evaluate the accuracy, completeness, and efficiency of the Holmes‐Suite database, we compared it to two existing databases: ITS2 Ribosomal RNA Database (http://its2.bioapps.biozentrum.uni-wuerzburg.de/) and TCM‐Barcode Database (http://www.tcmbarcode.cn/en/), using marker gene ITS2 (1000 ITS2 raw entries were randomly selected from NCBI as queries, restricted to the plant ITS2), evaluated both on genus and species levels. The data was downloaded in December 2020. The comparison result illustrated that Holmes‐Suite performs better in searching accuracy, searching efficiency, and data coverage compared to the other two databases (Figure 3).
Figure 3.

Comparison of identification accuracy using ITS2 across the three databases Holmes‐Suite, ITS2 Ribosomal RNA, and TCMBarcode databases. (A) Genus‐level and (B) species‐level sensitivities for the three databases. For each of the horizontal histograms, the number inside each bar shows the number and rate of sequences correctly identified at the target taxonomy level. (C) The number of entries (DNA‐based marker sequences) contained in each database (blue) and data component of Holmes‐ITS2 (ITS2 part of the Holmes‐Suite database). Note that only plant species were used here for comparison. (D) Holmes‐Suite covers a relatively complete phylogenetic tree for species identifiable by ITS2. From the inside to outside of the taxonomy hierarchical tree, each node represents a taxon with a distinct level of taxonomy, such as kingdom, phylum, or class. Each connection represents an affiliation of the outer node to the inner node. (E) HMM prediction for training the ITS2 sequence edge model and (F) identification with BLAST and Kraken search engines. Solid horizontal bars represent the time cost of the classification procedure for a different number of queries. Trend lines of the time cost of different procedures are illustrated by dashed lines. The classification time cost of different steps showed a linear increase; hence, time costs under one million and one hundred million queries are projected values indicated by horizontal dashed line bars.
First, the search accuracy for the three databases was evaluated and compared, revealing that Holmes‐Suite outperforms the other two. For accuracy benchmark on genus level, the Holmes‐Suite database using BLAST as the search engine achieved the highest score among the three databases (Figure 3A), with a sensitivity of up to 95.5%; whereas the Holmes‐Suite database using Kraken as search engine reached 95.4% sensitivity. The sensitivity of the Holmes‐Suite database with both search engines was greater than that of TCMBarcode (73.4%) and the ITS2 Ribosomal RNA database (77.6%). For the species‐level accuracy benchmark (Figure 3B), the Holmes‐Suite database using BLAST as a search engine achieved the highest classification sensitivity (73.1%) across all three databases, demonstrating a substantial advantage (Holmes with Kraken search engine: 64.5%; TCMBarcode:5 4.2%; ITS2 Ribosomal RNA database: 49.5%). Consequently, Holmes‐Suite showed the best performance in classification accuracy among the three databases. Although our findings are based on ITS2 sequences as query sequences, we believe that when processed with the same pipeline, the results on the other six marker genes would also be very accurate.
Second, the database completeness for the three databases was evaluated and compared (Figure 3C,D). With an order of magnitude larger database of Holmes‐ITS2 (ITS2 part of the Holmes‐Suite database) than the ITS2 Ribosomal RNA database and the TCMBarcode database (Supporting Information: Table 2, Figure 3C), the TCM‐Suite would cover a rather comprehensive phylogenetic tree for species identifiable by ITS2 (Figure 3D). Interestingly, given that Holmes‐Suite was far bigger than other databases, its higher sensitivity may indicate the superior quality of the content of the Holmes‐Suite database. However, arguably, bigger databases are not always associated with the cost of slower database queries. This is not the case for the Holmes‐Suite database.
Third, the efficiency was evaluated by the classification speed of two main procedures (Figure 3E,F). The time cost analysis on the HMM model (a trained model to trim off the ambiguous boundary regions of the download ITS2 sequences for higher accuracy, see Materials and Methods) (Figure 3E) processing and searching by BLAST (Figure 3F) have shown that these costs have linear relationships with the increase in query number; on average, the speed of the two procedures were 250 queries and 15 queries per second, respectively, based on tests using our web server. It is noteworthy that the time cost of classification by Kraken, which did not significantly rise with the number of queries tested, is an average classification speed of more than 6,000 sequences per second. During the process of identification of a small amount of data, fluctuation in time cost resulted because of a significant difference from the maximum identification speed of Kraken.
Evaluation of Watson‐Suite database
To evaluate the accuracy and efficiency of the Watson‐Suite database, 125 compounds from the herb Panax ginseng (i.e. ginseng) were used as a search query and compared to the TCMID database [46] (Supporting Information: Figure 1). For this herb, 60 compounds were retrieved from the Watson‐Suite database, however, the TCMID database only returned 44 compounds (Supporting Information: Figure 1A). Then, we evaluated the pharmacology network associated with ginseng retrieved from the Watson‐Suite database: To identify the most reliable correlations, we selected 10 genes for each of the 21 compounds, ranked by the combined score [47, 48], yielding a collection of 152 genes and 483 pairs of gene interactions for the 152 genes. By targeting some of these 152 genes, ginseng could exert effects on a variety of diseases, such as protopine, sleep disorder, atherosclerosis, diabetes, Alzheimer's disease, hypertension, cardiac dysrhythmias, insulin resistance, pyresis, and depression. Based on the correlations with a large number of genes and diseases (Supporting Information: Figure 1B), these 21 compounds were related to the biological traits of the ginseng, demonstrating the pharmacological activity of ginseng similar to previous studies [49, 50].
To validate the “herb‐compound‐protein‐disease” network generated for ginseng, we investigated the main active compounds of ginseng and discovered that the ginsenoside Rg1‐related genes, i.e., TNF, are related to diabetes, Alzheimer's disease, hypertension, and atherosclerosis, which was consistent with previous studies [51, 52]. In addition, a ginsenoside Rb1‐related gene, AKT1, was found to be associated with diabetes. Furthermore, the genes related to other compounds of ginseng identified from the TCM‐Suite, such as ADA, ADK, ADORA2A, and GCG, have been previously reported to be associated with sleep disorders [53, 54] and diabetes [55].
To further evaluate the efficiency of the Watson‐Suite database searches, 100 FDA‐approved drugs were selected randomly from DrugBank [56] as a benchmark data set. For all of these 100 drugs, the average searching time was 0.648 s and 0.617 s for the first and second queries, respectively (Supporting Information: Figure 1C). As the TCM‐Suite could save the result of the first search in a cache, the second query consumed less time. Furthermore, we used varying quantities of compounds to search in our database (Supporting Information: Figure 1D). The result showed that retrieval efficiency grows linearly as the quantity of compounds increases.
UTILITY AND DISCUSSION
A holistic pipeline for TCM analysis
To facilitate more in‐depth mining of the massive amount of information on natural products and the clinical application of TCM, we realized the interconnection of the marker gene database (Holmes‐Suite) and downstream network pharmacology analysis (Watson‐Suite). Based on the machine learning method, 153,016 species entries (65% of all 235,470 species entries) in TCM‐Suite could be matched to the “herb” field in Watson‐Suite. For the matched herbs, we integrated the detailed information from the two sub‐databases. Consequently, this holistic pipeline allowed users to search for a TCM‐source sequence, for which TCM‐Suite would return the matched entries in the Holmes‐Suite database utilizing the sequence‐based search engine. Additionally, as all the data were interconnected, TCM‐Suite would also return the related network pharmacology information. And vice versa, the user could query the interesting entry in Watson‐Suite and would return all the related entries in the Watson‐Suite database by network pharmacology analysis. TCM‐Suite would also return the related marker gene information in Holmes‐Suite. This integrated pipeline highlights the explainability of our database, hence broadening and deepening TCM‐related data mining.
Database features and web interface
The TCM‐Suite database provides an easy‐to‐use interface for users to browse data and search related entries in the database and mine the related information. The front page and data browsing pages for TCM‐Suite are shown in Figure 4: On the homepage of TCM‐Suite (Figure 4A), the key functions of the TCM‐Suite were provided: on the top right, links to all modules were shown. Additionally, we provided an interface for users to directly search all the database entries. The search interface was followed by basic database statistics, and the conceptions of TCM‐Suite were also described (Figure 4B), providing the logic principles behind our database. On the Holmes‐Suite sub‐database, the browse page (Figure 4C) listed all the marker gene entries after the selection of the marker gene. The column labeled “Detail” provided the hyperlink to the classification information for each entry and the link to Watson‐Suite, which provided the corresponding network pharmacology information. The browse page of Watson‐Suite (Figure 4D) displayed all the entries in the database after selection of the category of network pharmacology. For each entry in the Watson‐Suite database, the associated data obtained from other categorizations were also listed (Figure 4F). To comprehensively and intuitively display the complex correlations among different categorizations of entries, we designed a visualization interface using Cytoscape Web (http://cytoscapeweb.cytoscape.org/) to show the associations in the pharmacology network. If the detailed information contains the category of “herb” (i.e., if the marker sequence is accessible), a link to the corresponding records in Holmes‐Suite was provided on the “detail” page (Figure 4E).
Figure 4.

Screenshots of TCM‐Suite databases interface. (A) The homepage of TCM‐Suite. In the top right corner, hyperlinks to all modules are provided. (B) The browsing page of marker gene ITS2 for Holmes‐Suite. Detailed taxonomical classification is shown, along with a detailed description linked to Wikipedia. (C) The front page of Holmes‐Suite. The browse page of marker gene entry in Holmes‐Suite consisted of two parts: taxonomical information and a link to Watson‐Suite for mining downstream network pharmacology information. (D) The detailed information page of an herb for Watson‐Suite. All the records for this herb are listed at the top of the page, followed by the link to marker gene sequences in the Holmes‐Suite database. The pharmacology network generated for the selected herb in the Watson‐Suite database is shown. Information was displayed when the cursor was pointed to the corresponding area. Finally, the detailed information and corresponding link were provided under the network and organized by category of entry.
Network pharmacology investigation for COVID‐19
The coronavirus disease 2019 (COVID‐19) has garnered international attention [57, 58]. Based on a large number of clinical studies, several TCM formulas were proven to be effective in the treatment of COVID‐19 and were included in the guidelines on COVID‐19 diagnosis and treatment [59, 60]. As a powerful platform for integrating TCM resources, the TCM‐Suite platform also allows the investigation of COVID‐19‐related pharmacology networks for the discovery of potential gene targets and therapeutic drugs.
Based on previous studies, five candidate genes were selected as the targets for treating COVID‐19 were proposed [61, 62]. For these candidate genes, we constructed the “herb‐protein‐disease” networks using our database. For the five query genes (ACE2, SLC6A20, SIT1, CCR9, and CXCR6), we identified compounds (12, 23, 1, 2, and 14, respectively) and herbs (58, 128, 2, 8, and 54, respectively) based on the entries in our database. Among these five genes, most records in our database were identified as being associated with genes SLC6A20 and ACE2, which was also observed in previous studies [61, 62]. Further investigations into these potential target genes indicated that most of the effects of the 152 herbs could act through “clear heat and detoxify” and “facilitate the flow of the lung and relieves toxicity,” consistent with previous studies [63, 64].
We also investigated COVID‐19 clinical symptoms to construct pharmacological networks. Based on the COVID‐19 clinical symptoms identified in previous studies [65], we conducted a network pharmacological survey (Supporting Information: Table 3). Pneumonia is an important symptom of COVID‐19 infection [66, 67], and interestingly, many records in the TCM‐Suite database were associated with “pneumonia.”
We combined these pharmacological networks and obtained a target candidate gene cluster and potential therapeutic drugs for COVID‐19. The common target genes and biological ingredients in these networks (in over five pharmacological networks) were calculated, resulting in a total of 15 drug targets, 52 compounds, and 86 biological ingredients. Many of these target genes have also been identified in previous studies, such as ACE2, SLC6A20, SIT1, CCR9, and CXCR6 [68–71]. Many of these 86 biological ingredients were identified as important components of clinically effective TCM to cure COVID‐19 [69]. For example, the biological ingredients Lonicera japonica, Scutellaria baicalensis, and Forsythia suspensa are the active ingredients of Shuanghuanglian; Forsythia suspensa, Ephedra sinica, Lonicera japonica, Isatis indigotica, Mentha haplocalyx, and Dryopteris crassirhizoma are the main components of the Lianhua Qingwen capsule. All these TCM formulas were identified as effective for COVID‐19 treatment [69–71]. In this case study, the TCM‐Suite platform showed immense potential for the evaluation of TCM's effectiveness on diseases.
CONCLUSION
Existing databases for TCM biological ingredient identification and network pharmacology inference have limited in search accuracy, database completeness, and search speed. In developing the TCM‐Suite, we aimed to consolidate and combine existing database resources to facilitate the seamless integration of TCM biological ingredient identification (Holmes‐Suite) and downstream network pharmacological analysis (Watson‐Suite) into a holistic analysis platform. For TCM biological ingredient identification, we developed a highly accurate, comprehensive, and efficient identification system Holmes‐Suite, which is user‐friendly and highly interactive, and empowered by machine learning techniques such as HMM for entry matches and sequence searches. The collection of data from six marker genes (ITS2, matK, trnH‐psbA, trnL, rpoc1, and ycf1) resulted in the Holmes‐Suite sub‐database, covering 235,470 kinds of biological ingredients and 1,251,548 marker gene sequences. This would boost the usage of Holmes‐Suite and its value for effective biological ingredient identification for TCM formulas. For downstream TCM‐related network pharmacology analysis, Watson‐Suite enables deeper mining of the pipeline for “herb‐compound‐protein‐disease” interpretation. The Watson‐Suite sub‐database covers 6,692 formulas, 7,322 herbs, 704,321 compounds, 19,319 proteins, and 15,437 diseases, and corresponding network pharmacology entries. Watson‐Suite can establish a comprehensive view of network pharmacology by integrating existing data.
TCM‐Suite achieved a smooth interconnection between the two sub‐databases, Holmes‐Suite and Watson‐Suite. Our platform has great potential for TCM‐based drug discovery and repurposing, which is reflected in our benchmark work. In the benchmark of Holmes‐Suite, 1,000 ITS2 raw entries randomly selected from NCBI served as test data set queries found that 552 (55.2%) of them could be retrieved in Watson‐Suite for network pharmacology analysis. Similar interconnections were observed in the benchmark of Watson‐Suite. These results demonstrated a strong connection between Holmes‐Suite and Watson‐Suite databases, but also revealed insufficient biological ingredient information, as well as network pharmacology information, for TCM formulas.
Additionally, the high speed of TCM biological ingredient identification and network pharmacology search empowers TCM‐Suite for effective application. Compared to other databases, the query speed of Holmes‐Suite is faster and a linear relationship was detected between the time required for the query and the number of queries. Both results reflect the high efficiency of the search engine inside the Holmes‐Suite sub‐database. Meanwhile, Watson‐Suite also works rapidly: when querying 100 drugs, the reaction time of the database was almost 0.6 s. Moreover, with a user‐friendly interface, the TCM‐Suite could be used easily for many pharmacology applications. Taken together, TCM‐Suite is a rich resource for TCM‐based drug discovery and repurposing and could contribute to both fundamental and clinical research.
TCM research, especially that related to molecular biology, is becoming popular, and huge amounts of data have been accumulated. TCM‐Suite is planned to be updated bi‐annually to reflect the latest advancements in biological ingredient identification and network pharmacology. Additionally, more advanced artificial intelligence techniques, such as deep learning for more accurate TCM ingredient identification model building (not limited to biological ingredient only), as well as better data mining techniques, such as network motif discovery for network pharmacology examination, may be applied. We believe continuous efforts in this integrated database and analytical platform will contribute to pharmaceutical research and clinical practice worldwide.
AUTHOR CONTRIBUTIONS
Hong Bai, Pengshuo Yang, Hongjun Li and Kang Ning conceived and proposed the idea, and designed the study. Pengshuo Yang, Jidong Lang, and Hongjun Li performed the experiments. Hong Bai, Pengshuo Yang, Hongjun Li, Jidong Lang, Geng Tian, and Kang Ning analyzed the data. Hong Bai, Pengshuo Yang, Hanyang Lin, Jingxiang Lu, Jialiang Yang and Kang Ning. contributed to editing and proofreading of the manuscript. All authors read and approved the final manuscript.
CONFLICT OF INTEREST
Jidong Lang and Jialiang Yang are employed by Genesis Beijing Co. Ltd. The remaining authors declare no conflict of interest.
Supporting information
Supporting information.
ACKNOWLEDGMENTS
We thank LetPub (www.letpub.com) for its linguistic assistance during the preparation of this manuscript. This study was partially supported by the National Science Foundation of China grants 81774008, 81573702, 62072283, 31871334, and 31671374, the Ministry of Science and Technology (High‐Tech) grant no. 2021YFA0910500 and 2018YFC0910502, the Natural Science Foundation of Hunan Province (No. 2018JJ2461), and the project to introduce intelligence from oversea experts to Changsha City (No. 2089901).
Yang, Pengshuo , Lang Jidong, Li Hongjun, Lu Jinxiang, Lin Hanyang, Tian Geng, Bai Hong, Yang Jialiang, and Ning Kang. 2022. “TCM‐Suite: A comprehensive and holistic platform for Traditional Chinese Medicine component identification and network pharmacology analysis.” iMeta 1, e47. 10.1002/imt2.47
Pengshuo Yang and Jidong Lang contributed equally to this study.
Contributor Information
Hong Bai, Email: baihong@hust.edu.cn.
Jialiang Yang, Email: yangjl@geneis.cn.
Kang Ning, Email: ningkang@hust.edu.cn.
DATA AVAILABILITY STATEMENT
The TCM‐Suite database website is available at http://TCM-Suite.AImicrobiome.cn/. Supplementary materials (figures, tables, scripts, graphical abstract, slides, videos, Chinese translated version and update materials) may be found in the online DOI or iMeta Science http://www.imeta.science/.
REFERENCES
- 1. Cyranoski, David . 2018. “Why Chinese Medicine is Heading for Clinics Around the World.” Nature 561: 448–50. 10.1038/d41586-018-06782-7 [DOI] [PubMed] [Google Scholar]
- 2. Zhang, Rui‐Wen . 2017. “Traditional Chinese Medicine: Research and Development, Globalization, and Regulation.” Chinese Journal of Natural Medicines 15: 1–3. 10.1016/S1875-5364(17)30002-X [DOI] [PubMed] [Google Scholar]
- 3. Gu, Peiqin , and Chen Huajun. 2014. “Modern Bioinformatics Meets Traditional Chinese Medicine.” Briefings in Bioinformatics 15: 984–1003. 10.1093/bib/bbt063 [DOI] [PubMed] [Google Scholar]
- 4. Sun, Ya‐Zhou , Zhang De‐Hong, Ming Zhong, Li Jian‐Qiang, and Chen Xing. 2017. “DLREFD: A Database Providing Associations of Long Non‐Coding RNAs, Environmental Factors and Phenotypes.” Database 2017: bax084. 10.1093/database/bax084 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Zhou, Xiaonan , Lu Qingqing, Kang Xiaozhen, Tian Geng, Ming Deguo, and Yang Jialiang. 2021. “Protective Role of a New Polysaccharide Extracted From Lonicera japonica Thunb in Mice With Ulcerative Colitis Induced by Dextran Sulphate Sodium.” BioMed Research International 2021: 8878633. 10.1155/2021/8878633 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Arulandhu, Alfred J. , Staats Martijn, Hagelaar Rico, Voorhuijzen Marleen M., Prins Theo W., Scholtens Ingrid, Costessi Adalberto, et al. 2017. “Development and Validation of a Multi‐Locus DNA Metabarcoding Method to Identify Endangered Species in Complex Samples.” Gigascience 6: 1–18. 10.1093/gigascience/gix080 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Srirama, Ramanujam , Santhosh Kumar J. U., Seethapathy G. S., Newmaster Steven G., Ragupathy S., Ganeshaiah K. N., Uma Shaanker R., and Ravikanth Gudasalamani. 2017. “Species Adulteration in the Herbal Trade: Causes, Consequences and Mitigation.” Drug Safety 40: 651–61. 10.1007/s40264-017-0527-0 [DOI] [PubMed] [Google Scholar]
- 8. de Boer, Hugo J. , Ghorbani Abdolbaset, Manzanilla Vincent, Raclariu Ancuta‐Cristina, Kreziou Anna, Ounjai Sarawut, Osathanunkul Maslin, and Gravendeel Barbara. 2017. “DNA Metabarcoding of Orchid‐Derived Products Reveals Widespread Illegal Orchid Trade.” Proceedings of the Royal Society B: Biological Sciences 284: 20171182. 10.1098/rspb.2017.1182 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Besse, P. 2014. “Nuclear Ribosomal RNA Genes: ITS Region.” Methods in Molecular Biology 1115: 141–49. 10.1007/978-1-62703-767-9_7 [DOI] [PubMed] [Google Scholar]
- 10. Li, De‐Zhu , Gao Lian‐Ming, Li Hong‐Tao, Wang Hong, Ge Xue‐Jun, Liu Jian‐Quan, Chen Zhi‐Duan, et al. 2011. “Comparative Analysis of a Large Dataset Indicates That Internal Transcribed Spacer (ITS) Should be Incorporated Into the Core Barcode for Seed Plants.” Proceedings of the National Academy of Sciences of the United States of America 108: 19641–46. 10.1073/pnas.1104551108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Liu, Yong‐Xin , Qin Yuan, Chen Tong, Lu Meiping, Qian Xubo, Guo Xiaoxuan, and Bai Yang. 2021. “A Practical Guide to Amplicon and Metagenomic Analysis of Microbiome Data.” Protein Cell 12: 315–30. 10.1007/s13238-020-00724-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Chen, Tingting , Chen Xu, Zhang Sisi, Zhu Junwei, Tang Bixia, Wang Anke, Dong Lili, et al. 2021. “The Genome Sequence Archive Family: Toward Explosive Data Growth and Diverse Data Types.” Genomics, Proteomics & Bioinformatics 19: 578–83. 10.1016/j.gpb.2021.08.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Chen, Shilin , Pang Xiaohui, Song Jingyuan, Shi Linchun, Yao Hui, Han Jianping, and Leon Christine. 2014. “A Renaissance in Herbal Medicine Identification: From Morphology to DNA.” Biotechnology Advances 32: 1237–44. 10.1016/j.biotechadv.2014.07.004 [DOI] [PubMed] [Google Scholar]
- 14. John, Reshma , and Mathew Linu. 2017. “Endophytic Fungal Assemblage in Achyranthes aspera Linn. Revealed by Internal Transcribed Spacer Region of Nuclear Ribosomal RNA Genes.” 3 Biotech 7: 109. 10.1007/s13205-017-0748-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Yang, Jialiang , Grünewald Stefan, and Wan Xiu‐Feng. 2013. “Quartet‐net: A Quartet‐Based Method to Reconstruct Phylogenetic Networks.” Molecular Biology and Evolution 30: 1206–17. 10.1093/molbev/mst040 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. An, Wenli , Li Jing, Yang Zerui, Huang Yuying, Huang Song, and Zheng Xiasheng. 2020. “Characteristics Analysis of the Complete Wurfbainia villosa Chloroplast Genome.” Physiology and Molecular Biology of Plants 26: 747–58. 10.1007/s12298-019-00748-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Xiong, Chao , Sun Wei, Li Jingjian, Yao Hui, Shi Yuhua, Wang Ping, Huang Bisheng, et al. 2018. “Identifying the Species of Seeds in Traditional Chinese Medicine Using DNA Barcoding.” Frontiers in Pharmacology 9: 701. 10.3389/fphar.2018.00701 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Zhang, Dequan , Mo Xinchun, Xiang Jianying, and Zhou Nong. 2016. “Molecular Identification of Original Plants of Fritillariae Cirrhosae Bulbus, A Traditional Chinese Medicine (TCM) Using Plant DNA Barcoding.” African Journal of Traditional, Complementary and Alternative Medicines 13: 74–82. 10.21010/ajtcam.v13i6.12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Miller, Scott E. 2007. “DNA Barcoding and the Renaissance of Taxonomy.” Proceedings of the National Academy of Sciences of the United States of America 104: 4775–76. 10.1073/pnas.0700466104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Soininen, Eeva M. , Valentini Alice, Coissac Eric, Miquel Christian, Gielly Ludovic, Brochmann Christian, Brysting Anne K., et al. 2009. “Analysing Diet of Small Herbivores: The Efficiency of DNA Barcoding Coupled With High‐Throughput Pyrosequencing for Deciphering the Composition of Complex Plant Mixtures.” Frontiers in Zoology 6: 16. 10.1186/1742-9994-6-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Blaxter, Mark , Mann Jenna, Chapman Tom, Thomas Fran, Whitton Claire, Floyd Robin, and Abebe Eyualem. 2005. “Defining Operational Taxonomic Units Using DNA Barcode Data.” Philosophical Transactions of the Royal Society B: Biological Sciences 360: 1935–43. 10.1098/rstb.2005.1725 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Handelsman, Jo . 2004. “Metagenomics: Application of Genomics to Uncultured Microorganisms.” Microbiology and Molecular Biology Reviews 68: 669–85. 10.1128/MMBR.68.4.669-685.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Tang, Hsin‐Chieh , Huang Hung‐Jin, Lee Cheng‐Chun, and Chen Calvin Yu Chian. 2017. “Network Pharmacology‐Based Approach of Novel Traditional Chinese Medicine Formula for Treatment of Acute Skin Inflammation In Silico.” Computational Biology and Chemistry 71: 70–81. 10.1016/j.compbiolchem.2017.08.013 [DOI] [PubMed] [Google Scholar]
- 24. Zhou, Zhuchen , Chen Bing, Chen Simiao, Lin Minqiu, Chen Ying, Jin Shan, Chen Weiyan, and Zhang Yuyan. 2020. “Applications of Network Pharmacology in Traditional Chinese Medicine Research.” Evidence‐Based Complementary and Alternative Medicine 2020: 1–7. 10.1155/2020/1646905 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Tao, Quyuan , Du Jiaxin, Li Xiantao, Zeng Jingyan, Tan Bo, Xu Jianhua, Lin Wenjia, and Chen Xin‐lin. 2020. “Network Pharmacology and Molecular Docking Analysis on Molecular Targets and Mechanisms of Huashi Baidu Formula in the Treatment of COVID‐19.” Drug Development and Industrial Pharmacy 46: 1345–53. 10.1080/03639045.2020.1788070 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Guo, Wei , Huang Jihan, Wang Ning, Tan Hor‐Yue, Cheung Fan, Chen Feiyu, and Feng Yibin. 2019. “Integrating Network Pharmacology and Pharmacological Evaluation for Deciphering the Action Mechanism of Herbal Formula Zuojin Pill in Suppressing Hepatocellular Carcinoma.” Front Pharmacol 10: 1185. 10.3389/fphar.2019.01185 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Hu, Rui‐Feng , and Sun Xiao‐Bo. 2017. “Design of New Traditional Chinese Medicine Herbal Formulae for Treatment of Type 2 Diabetes Mellitus Based on Network Pharmacology.” Chinese Journal of Natural Medicines 15: 436–41. 10.1016/S1875-5364(17)30065-1 [DOI] [PubMed] [Google Scholar]
- 28. Lee, Won‐Yung , Lee Choong‐Yeol, Kim Youn‐Sub, and Kim Chang‐Eop. 2019. “The Methodological Trends of Traditional Herbal Medicine Employing Network Pharmacology.” Biomolecules 9(8): 362. 10.3390/biom9080362 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Yang, Jialiang , Peng Shouneng, Zhang Bin, Houten Sander, Schadt Eric, Zhu Jun, Suh Yousin, and Tu Zhidong. 2020. “Human Geroprotector Discovery by Targeting the Converging Subnetworks of Aging and Age‐Related Diseases.” Geroscience 42: 353–72. 10.1007/s11357-019-00106-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Li, Shao , and Zhang Bo. 2013. “Traditional Chinese Medicine Network Pharmacology: Theory, Methodology and Application.” Chinese Journal of Natural Medicines 11: 110–20. 10.3724/SP.J.1009.2013.00110 [DOI] [PubMed] [Google Scholar]
- 31. Lin, Lan , Wang Qiuhong, Yi Yongxin, Wang Shihan, and Qiu Zonglin. 2016. “Liuwei Dihuang Pills Enhance the Effect of Western Medicine in Treating Diabetic Nephropathy: A Meta‐Analysis of Randomized Controlled Trials.” Evidence‐Based Complementary and Alternative Medicine 2016: 1–9. 10.1155/2016/1509063 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Yuan, Hai‐Dan , Kim Jung‐Tae, Kim Sung‐Hoon, and Chung Sung‐Hyun. 2012. “Ginseng and Diabetes: The Evidences From In Vitro, Animal and Human Studies.” Journal of Ginseng Research 36: 27–39. 10.5142/jgr.2012.36.1.27 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Koetschan, Christian , Hackl Thomas, Müller Tobias, Wolf Matthias, Förster Frank, and Schultz Jörg. 2012. “ITS2 Database IV: Interactive Taxon Sampling for Internal Transcribed Spacer 2 Based Phylogenies.” Molecular Phylogenetics and Evolution 63: 585–88. 10.1016/j.ympev.2012.01.026 [DOI] [PubMed] [Google Scholar]
- 34. Shi, Lin‐Chun , Yao Hui, Xie L‐Fang, Zhu Ying‐Jie, Song Jing‐Yuan, Zhang Hui, and Chen Shi‐Lin. 2014. “[Integrated DNA Barcoding Database for Identifying Chinese Animal Medicine].” Zhongguo Zhong Yao Za Zhi 39: 2155–59. [PubMed] [Google Scholar]
- 35. Banerjee, Priyanka , Erehman Jevgeni, Gohlke Björn‐Oliver, Wilhelm Thomas, Preissner Robert, and Dunkel Mathias. 2015. “Super Natural II—A Database of Natural Products.” Nucleic Acids Research 43: D935–39. 10.1093/nar/gku886 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Chen, Calvin Yu‐Chian . 2011. “TCM Database@Taiwan: The World's Largest Traditional Chinese Medicine Database for Drug Screening In Silico.” PLoS One 6: e15939. 10.1371/journal.pone.0015939 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Gaulton, Anna , Bellis Louisa J., Bento A. Patricia, Chambers Jon, Davies Mark, Hersey Anne, Light Yvonne, et al. 2012. “ChEMBL: A Large‐Scale Bioactivity Database for Drug Discovery.” Nucleic Acids Research 40: D1100–107. 10.1093/nar/gkr777 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Xue, Ruichao , Fang Zhao, Zhang Meixia, Yi Zhenghui, Wen Chengping, and Shi Tieliu. 2012. “TCMID: Traditional Chinese Medicine Integrative Database for Herb Molecular Mechanism Analysis.” Nucleic Acids Research 41: D1089–95. 10.1093/nar/gks1100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Kuhn, Michael , Szklarczyk Damian, Pletscher‐Frankild Sune, Blicher Thomas H., von Mering Christian, Jensen Lars J., and Bork Peer. 2014. “STITCH 4: Integration of Protein‐Chemical Interactions With User Data.” Nucleic Acids Research 42: D401–407. 10.1093/nar/gkt1207 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Szklarczyk, Damian , Franceschini Andrea, Wyder Stefan, Forslund Kristoffer, Heller Davide, Huerta‐Cepas Jaime, Simonovic Milan, et al. 2015. “STRING v10: Protein‐Protein Interaction Networks, Integrated Over the Tree of Life.” Nucleic Acids Research 43: D447–52. 10.1093/nar/gku1003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Hamosh, Ada. 2004. “Online Mendelian Inheritance in Man (OMIM), A knowledgebase of Human Genes and Genetic Disorders.” Nucleic Acids Research 33: D514–17. 10.1093/nar/gki033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Becker, Kevin G. , Barnes Kathleen C., Bright Tiffani J., and Wang S Alex. 2004. “The Genetic Association Database.” Nature Genetics 36: 431–32. 10.1038/ng0504-431 [DOI] [PubMed] [Google Scholar]
- 43. Kuhn, Michael , Letunic Ivica, Jensen Lars Juhl, and Bork Peer. 2016. “The SIDER Database of Drugs and Side Effects.” Nucleic Acids Research 44: D1075–79. 10.1093/nar/gkv1075 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Wu, Yang , Zhang Feilong, Yang Kuo, Fang Shuangsang, Bu Dechao, Li Hui, Sun Liang, et al. 2019. “SymMap: An Integrative Database of Traditional Chinese Medicine Enhanced by Symptom Mapping.” Nucleic Acids Research 47: D1110–17. 10.1093/nar/gky1021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Xu, Hai‐Yu , Zhang Yan‐Qiong, Liu Zhen‐Ming, Chen Tong, Lv Chuan‐Yu, Tang Shi‐Huan, Zhang Xiao‐Bo, et al. 2019. “ETCM: An Encyclopaedia of Traditional Chinese Medicine.” Nucleic Acids Research 47: D976–82. 10.1093/nar/gky987 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Huang, Lin , Xie Duoli, Yu Yiran, Liu Huanlong, Shi Yan, Shi Tieliu, and Wen Chengping. 2018. “TCMID 2.0: A Comprehensive Resource for TCM.” Nucleic Acids Research 46: D1117–120. 10.1093/nar/gkx1028 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Wu, Changsong , Dai Yishan, Yuan Gailing, Su Jianguo, and Liu Xiaoling. 2019. “Immunomodulatory Effects and Induction of Apoptosis by Different Molecular Weight Chitosan Oligosaccharides in Head Kidney Macrophages From Blunt Snout Bream (Megalobrama amblycephala).” Frontiers in Immunology 10: 869. 10.3389/fimmu.2019.00869 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Zhang, Run‐zhi , Yu Shao‐jun, Bai Hong, and Ning Kang. 2017. “TCM‐Mesh: The Database and Analytical System for Network Pharmacology Analysis for TCM Preparations.” Scientific Reports 7: 2821. 10.1038/s41598-017-03039-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Sabouri‐Rad, Sarvenaz , Sabouri‐Rad Sara, Sahebkar Amirhossein, and Tayarani‐Najaran Zahra. 2017. “Ginseng in Dermatology: A Review.” Current Pharmaceutical Design 23: 1649–66. 10.2174/1381612822666161021152322 [DOI] [PubMed] [Google Scholar]
- 50. Smith, Imogen , Williamson Elizabeth M., Putnam Sophie, Farrimond Jonathan, and Whalley Benjamin J.. 2014. “Effects and Mechanisms of Ginseng and Ginsenosides on Cognition.” Nutrition Reviews 72: 319–33. 10.1111/nure.12099 [DOI] [PubMed] [Google Scholar]
- 51. Remmert, Michael , Biegert Andreas, Hauser Andreas, and Söding Johannes. 2012. “HHblits: Lightning‐Fast Iterative Protein Sequence Searching by HMM‐HMM Alignment.” Nature Methods 9: 173–75. 10.1038/nmeth.1818 [DOI] [PubMed] [Google Scholar]
- 52. Boison, Detlev . 2013. “Adenosine Kinase: Exploitation for Therapeutic Gain.” Pharmacological Reviews 65: 906–43. 10.1124/pr.112.006361 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Khosrowbeygi, Ali , Shiamizadeh Najmeh, and Taghizadeh Nima. 2016. “Maternal Circulating Levels of Some Metabolic Syndrome Biomarkers in Gestational Diabetes Mellitus.” Endocrine 51: 245–55. 10.1007/s12020-015-0697-4 [DOI] [PubMed] [Google Scholar]
- 54. Mazzotti, Diego Robles , Guindalini Camila, de Souza Altay Alves Lino, Sato João Ricardo, Santos‐Silva Rogério, Bittencourt Lia Rita Azeredo, and Tufik Sergio. 2012. “Adenosine Deaminase Polymorphism Affects Sleep EEG Spectral Power in a Large Epidemiological Sample.” PLoS One 7: e44154. 10.1371/journal.pone.0044154 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Rupp, Tracy L. , Wesensten Nancy J., Newman Rachel, and Balkin Thomas J.. 2013. “PER3 and ADORA2A Polymorphisms Impact Neurobehavioral Performance During Sleep Restriction.” Journal of Sleep Research 22: 160–65. 10.1111/j.1365-2869.2012.01062.x [DOI] [PubMed] [Google Scholar]
- 56. Wishart, David S. , Feunang Yannick D., Guo An C., Lo Elvis J., Marcu Ana, Grant Jason R., Sajed Tanvir, et al. 2017. “DrugBank 5.0: A Major Update to the DrugBank Database for 2018.” Nucleic Acids Research 46: D1074–82. 10.1093/nar/gkx1037 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Stebbing, Justin , Phelan Anne, Griffin Ivan, Tucker Catherine, Oechsle Olly, Smith Dan, and Richardson Peter. 2020. “COVID‐19: Combining Antiviral and Anti‐Inflammatory Treatments.” The Lancet Infectious Diseases 20: 400–402. 10.1016/S1473-3099(20)30132-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Peng, Lihong , Tian Xiongfei, Tian Geng, Xu Junlin, Huang Xin, Weng Yanbin, Yang Jialiang, and Zhou Liqian. 2020. “Single‐Cell RNA‐seq Clustering: Datasets, Models, and Algorithms.” RNA Biology 17: 765–83. 10.1080/15476286.2020.1728961 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Jun‐Ling, Jun‐ling , Zhang Ai‐Hua, and Wang Xi‐Jun. 2020. “Traditional Chinese Medicine for COVID‐19 Treatment.” Pharmacological Research 155: 104743. 10.1016/j.phrs.2020.104743 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Zhu, Na , Zhang Dingyu, Wang Wenling, Li Xingwang, Yang Bo, Song Jingdong, Zhao Xiang, et al. 2020. “A Novel Coronavirus From Patients With Pneumonia in China, 2019.” New England Journal of Medicine 382: 727–33. 10.1056/NEJMoa2001017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. South, Andrew M. , Diz Debra I., and Chappell Mark C.. 2020. “COVID‐19, ACE2, and the Cardiovascular Consequences.” American Journal of Physiology‐Heart and Circulatory Physiology 318: H1084–90. 10.1152/ajpheart.00217.2020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Ellinghaus, D. , Degenhardt F., Bujanda L., Buti M., Albillos A., Invernizzi P., et al 2020. “Genomewide Association Study of Severe Covid‐19 with Respiratory Failure.” New England Journal of Medicine 383: 1522–34. 10.1056/NEJMoa2020283 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Xu, H. Y. , Zhang Y. Q., Qing Y. W., Zhao H. Y., Wang P., and Liu F.. 2020. “Exploration on Scientific Connotation of TCM Syndromes and Recommended Prescriptions Against COVID‐19 Based on TCMTP V2.0].” Zhongguo Zhong Yao Za Zhi 45: 1488–98. 10.19540/j.cnki.cjcmm.20200229.401 [DOI] [PubMed] [Google Scholar]
- 64. Song, Zhengbo , Chen Xiangbin, Shi Yi, Huang Rongfang, Wang Wenxian, Zhu Kunshou, Lin Shaofeng, et al. 2020. “Evaluating the Potential of T Cell Receptor Repertoires in Predicting the Prognosis of Resectable Non‐Small Cell Lung Cancers.” Molecular Therapy—Methods & Clinical Development 18: 73–83. 10.1016/j.omtm.2020.05.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Majumder, Joydeb , and Minko Tamara. 2021. “Recent Developments on Therapeutic and Diagnostic Approaches for COVID‐19.” AAPS Journal 23: 14. 10.1208/s12248-020-00532-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Shi, Heshui , Han Xiaoyu, Jiang Nanchuan, Cao Yukun, Alwalid Osamah, Gu Jin, Fan Yanqing, and Zheng Chuansheng. 2020. “Radiological Findings From 81 Patients With COVID‐19 Pneumonia in Wuhan, China: A Descriptive Study.” The Lancet Infectious Diseases 20: 425–34. 10.1016/S1473-3099(20)30086-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Yu, Nan , Li Wei, Kang Qingling, Xiong Zhi, Wang Shaoshuai, Lin Xingguang, Liu Yanyan, et al. 2020. “Clinical Features and Obstetric and Neonatal Outcomes of Pregnant Patients With COVID‐19 in Wuhan, China: A Retrospective, Single‐Centre, Descriptive Study.” The Lancet Infectious Diseases 20: 559–64. 10.1016/S1473-3099(20)30176-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Luo, Erdan , Zhang Daiyan, Luo Hua, Liu Bowen, Zhao Keming, Zhao Yonghua, Bian Ying, and Wang Yitao. 2020. “Treatment Efficacy Analysis of Traditional Chinese Medicine for Novel Coronavirus Pneumonia (COVID‐19): An Empirical Study From Wuhan, Hubei Province, China.” Chinese Medicine 15: 34. 10.1186/s13020-020-00317-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Yang, Yang , Islam Md Shahidul, Wang Jin, Li Yuan, and Chen Xin. 2020. “Traditional Chinese Medicine in the Treatment of Patients Infected With 2019‐New Coronavirus (SARS‐CoV‐2): A Review and Perspective.” International Journal of Biological Sciences 16: 1708–17. 10.7150/ijbs.45538 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Du, Hong‐Zhi , Hou Xiao‐Ying, Miao Yu‐Huan, Huang Bi‐Sheng, and Liu Da‐Hui. 2020. “Traditional Chinese Medicine: An Effective Treatment for 2019 Novel Coronavirus Pneumonia (NCP.” Chinese Journal of Natural Medicines 18: 206–210. 10.1016/S1875-5364(20)30022-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Runfeng, Li , Yunlong Hou, Jicheng Huang, Weiqi Pan, Qinhai Ma, Yongxia Shi, Chufang Li, et al. 2020. “Lianhuaqingwen Exerts Anti‐Viral and Anti‐Inflammatory Activity Against Novel Coronavirus (SARS‐CoV‐2.” Pharmacological Research 156: 104761. 10.1016/j.phrs.2020.104761 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supporting information.
Data Availability Statement
The TCM‐Suite database website is available at http://TCM-Suite.AImicrobiome.cn/. Supplementary materials (figures, tables, scripts, graphical abstract, slides, videos, Chinese translated version and update materials) may be found in the online DOI or iMeta Science http://www.imeta.science/.
