Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2020 Apr 27;63(16):8653–8666. doi: 10.1021/acs.jmedchem.9b01989

Drug Research Meets Network Science: Where Are We?

Maurizio Recanatini 1,*, Chiara Cabrelle 1
PMCID: PMC8007104  PMID: 32338900

Abstract

graphic file with name jm9b01989_0005.jpg

Network theory provides one of the most potent analysis tools for the study of complex systems. In this paper, we illustrate the network-based perspective in drug research and how it is coherent with the new paradigm of drug discovery. We first present data sources from which networks are built, then show some examples of how the networks can be used to investigate drug-related systems. A section is devoted to network-based inference applications, i.e., prediction methods based on interactomes, that can be used to identify putative drug–target interactions without resorting to 3D modeling. Finally, we present some aspects of Boolean networks dynamics, anticipating that it might become a very potent modeling framework to develop in silico screening protocols able to simulate phenotypic screening experiments. We conclude that network applications integrated with machine learning and 3D modeling methods will become an indispensable tool for computational drug discovery in the next years.

1. Introduction

The first decade of the 2000s has seen a consistent modification of the drug research landscape, due, among other aspects, to a rethinking of the drug discovery paradigm1 and to the entrance into the era of Big Data.2 The paradigm change in drug discovery is derived from both the need to improve the success rate of pharmaceutical industry and the diffusion across the drug research community of discoveries and concepts related to systems biology.3 In fact, driven by the initially debated and finally accepted proposal of multitarget drugs,4 the conventional one-disease–one-target idea was abandoned to embrace a less reductionist view. The inspired concept of network pharmacology elaborated by Andrew L. Hopkins5 allowed us to move toward a new system-based paradigm.6,7 On the other hand, the availability of relevant amounts of data, from molecular descriptors to bioassay results, from -omic information to clinical records, provides the starting material for the elaboration of multilevel integrated models that take full advantage of the constantly growing informatics technology for model building, analysis, interpretation, and exploitation.

Complexity is a distinctive trait of living systems, and recently researchers have started to investigate natural complex systems with adequate theoretical tools.8 In particular, network science permits catching of the behavior of a system as a whole, especially regarding its emergent properties, that is, the features that arise from the interaction of the systems’ parts and not as a mere sum of them. In the field of medicine, the point of view proposed by Albert L. Barabási is illuminating: if disease phenotypes can be viewed as emergent properties deriving from the interconnection of pathobiological processes, in turn arising from the cross-talk of molecular, metabolic, and regulatory networks at cellular level, a framework of “network medicine” might help in exploring causes and finding therapies at a global integrated level.9

Networks rely on the idea of modeling a real system as a map of interconnected dots and lines representing the set of elements and the set of relationships between them, respectively. The mathematical description of a network is addressed by the graph theory such that graph stands for network, and often, even though inaccurately, these terms are used as synonyms. Elements of the network are called nodes, connections between them, links or edges. The start of graph theory is commonly traced back to Leonhard Euler’s paper Solutio problematis ad geometriam situs pertinentis that appeared in 1741 on the Commentarii Academiae Scientiarum Petropolitanae.10 The problem to be solved was to find a pathway going through all seven bridges of the city of Koenigsberg without crossing the same bridge twice. Euler demonstrated the inexistence of such a pathway, based on an abstract representation of the four areas of the city (nodes) linked by the seven bridges (links). In other words, he, for the first time, used a graph to represent and solve a mathematical problem. From then on, graph theory and network science developed mainly in the fields of mathematics and physics, respectively, and today they form a sound body of science and provide formidable tools to deal with the architecture and properties of complex systems.1113

Now the question is to what extent the network-based approach is influencing or has already influenced the way new therapies are looked for. Actually, the ways network theory is applied to drug discovery are numerous and aimed at different purposes. Limiting ourselves to the medicinal chemistry/drug design area, the main fields where network-based methods are employed are target identification and drug repurposing, both eventually in the context of polypharmacology. Further fields of application are the analysis of chemical spaces and the prediction of adverse drug reactions or toxicity. Several examples of these applications of network theory have appeared in the literature in the past years, and they have been reviewed in a number of papers. It is worth mentioning here the comprehensive review on the argument published in 2013 by Peter Csermely et al.14 that laid the foundations of a network-based drug science. In the present Perspective article, we recapitulate the basis of the approach and see how far we have gone with it by illustrating some recently published results. We also discuss some possible advancement and points of attention in the field.

2. Data, Databases, and Networks

2.1. Data and Databases

The first and most important issue to consider when dealing with the construction of networks to investigate pathobiological processes and drug effects on them concerns the material that we use to build these models, that is, what we usually call “data”. By this term we mean a wide variety of items, expressed in numerical, alphabetical, or digital form and collected in databases that often are publicly available. Although in the drug discovery community we make use of data sets at least since the early days of QSAR, the information made available by the high-throughput experimental technologies is growing at an unprecedented pace. Nowadays, we can access data sources on compounds, targets, and diseases that cover millions of molecules and thousands of proteins and genes in almost every therapeutic field.

2.1.1. Chemical Databases

On the chemical compounds side, the number and variety of freely accessible databases increase continuously, and each of these data sets contains information for numbers of molecules that range from 103 to 107. Recently, Yang et al. attempted to classify the public chemical databases into six categories based on their content, namely, (1) chemical information, (2) bioactivity, (3) drugs, (4) natural products, (5) commercial availability, and (6) fragments.15 All types of data contained in the chemical databases can be useful for drug design purposes in general, but for what concerns network applications, data on bioactivity, drugs, and natural products are most interesting. With this respect, the most popular databases are CHEMBL16 and PubChem,17 which provide knowledge on bioactive compounds, particularly data from activity assays and target information. DrugBank18 contains data on approved and experimental drugs and can be an important source of information in target identification and drug repurposing studies. On the purely chemical side, ChemSpider19 is a very rich source of physicochemical and spectral data, as well as of names, synonyms, and identifiers. In Table 1, the main features of the mentioned databases are summarized.

Table 1. Chemical Databases.
database name description url ref.
ChemBL Collection of bioactive drug-like small molecules with 2D structures, calculated chemical properties and bioactivities. https://www.ebi.ac.uk/chembl/ (16)
PubChem Open chemistry database of mostly small molecules that collects information on chemical structures, chemical and physical properties, and biological activities. It is structured into three linked databases: substance, compound, and bioassay. https://pubchem.ncbi.nlm.nih.gov/ (17)
DrugBank Freely accessible data on small molecules and biotechnological drugs (with chemical, pharmaceutical, and pharmacological profiles), and drug targets (sequence and functions of target/enzyme/transporter/carrier) intended as drugs encyclopedia. https://www.drugbank.ca/ (18)
ChemSpider Free chemical structures database collecting structures and related information, such as physicochemical properties and interactive spectra, made accessible through a fast search engine allowing search by name, structure, or advanced options. http://www.chemspider.com/ (19)

The most important issue to consider when using a chemical data set is the need of preprocessing its content, a crucial operation that may take a long time but indispensable to lower the probability of obtaining misleading results and/or building erroneous models. Tropsha and co-workers have addressed this argument in the context of QSAR modeling,20 but their conclusions and suggestions fit perfectly here. In particular, the curation of a data set should consider chemical, biological, and item identification aspects, as, for example, representation of chemical structure and presence in the data set of nonstandardized structures (salts, ions, etc.), variability of biological data for the same compound, reproducibility of results through different laboratories, presence of activity cliffs, misspelled, or mislabeled compounds, and incorrect identifiers. The same authors provided thoughtful guidelines for chemical and biological data curation to avoid errors and to ensure the stability and reliability of the models.21

2.1.2. Biological Databases

When dealing with target identification or drug repurposing studies in a system-wide perspective, it is mandatory to include in the network the knowledge that is increasingly generated by the huge array of biological techniques. Fortunately, the availability of free access biological databases is even bigger than that of chemical substances. The journal Nucleic Acid Research (NAR) in 2019 published the 26th annual issue of the Molecular Biology Database Collection,22 presenting an impressive list of 1613 databases, shortly describing the new entries, and eventually updating old ones. Also in this case, databases are grouped, reflecting the categorization of the databases in the NAR online database collection, that is, (1) nucleic acid sequence and structure and transcriptional regulation; (2) protein sequence and structure; (3) metabolic and signaling pathways, enzymes, and networks; (4) genomics of viruses, bacteria, protozoa, and fungi; (5) genomics of human and model organisms plus comparative genomics; (6) human genomic variation, diseases, and drugs; (7) plants; (8) other topics. It is worth noting that the above-mentioned data sources are mostly genotype-based, leveraging the fast development of the -omic technologies. Among the conspicuous number of available databases, some can be of particular interest for the drug design field, like those containing information regarding proteins, as either general sequence (e.g., SMART,23 UniProt24) or sequence of individual protein families (e.g., GPCRdb,25 Kinomer26), protein structure or protein–protein interactions (e.g., PDB,27 STRING28), metabolic and signaling pathways (e.g., Reactome29), human genes and diseases (e.g., DisGeNET30). A special mention is deserved here of two important resources that provide drug researchers with invaluable information on targets and mechanisms of action of small molecules: the ConnectivityMap31 and LINCS32 platforms that give access to gene transcriptional profiles in response to perturbation by drugs or other chemical compounds. In Table 2, the main features of these databases are summarized.

Table 2. Biological Databases.
database name description url ref
SMART Simple modular architecture research tool is a Web tool for identification of protein domains and exploration of domain architectures. http://smart.embl-heidelberg.de/ (23)
UniProtKB UniProtKB consists of two databases: SwissProt (manually annotated) and TrEMBL (automatically annotated and unreviewed). UniProtKB collects protein sequences providing rich annotations. https://www.uniprot.org/ (24)
GPCRdb GPCRdb stores reference data on GPCRs sequence, structures, mutations, and ligand interactions. GPCRdb also provides a suite of Web tools and interactive diagrams for illustrative purposes. https://gpcrdb.org/ (25)
Kinomer Protein kinases database contains annotated classification of 43 eukaryotic genomes. Kinomer is accessible through a Web interface, which enables the retrieval of sequences and the classification of arbitrary sequences. https://www.compbio.dundee.ac.uk/kinomer (26)
PDB Protein Data Bank is a repository of experimentally determined 3D structures of proteins, nucleic acids, and complexes with metal ions, drugs, and small molecules. https://www.rcsb.org/ (27)
STRING Search tool for retrieval of interacting genes/proteins (STRING) database of known and predicted protein–protein interactions, including physical and functional associations for a large number of organisms. STRING allows users to visualize interactions network and to make analysis. https://string-db.org/ (28)
Reactome Reactome is a relational pathway database in which signaling and metabolic molecules are organized into biological pathways and processes. https://reactome.org/ (29)
DisGeNET DisGeNET is a platform that integrates and standardizes data about disease associated genes and variants (628 685 gene-disease associations, GDAs, and 210 498 variant-disease associations, VDAs). https://www.disgenet.org/ (30)
ConnectivityMap CMap database contains gene-expression profiles from cultured human cells treated with perturbagens (bioactive small molecules). https://clue.io/cmap (31)
LINCS Library of integrated network-based cellular signatures catalogues perturbation-response signatures employing a various set of perturbations, model systems, and assay types. http://www.lincsproject.org/ (32)

2.1.3. Phenotypic Data

Moving to the field of human phenotypic data, we leave the territory of drug discovery to enter into the precision medicine arena. In this context, the pervasive digitalization of healthcare is providing quantitatively very important sources of phenotypic data, like primarily those contained in the electronic health records (EHRs) but also those generated by wearable devices or apps.33 Limiting to EHRs, the information embedded in these documents includes the description of the health/disease status of individuals, clinical test results, drug prescriptions, and eventual adverse effects.34 Of course, privacy issues limit the availability of this kind of data, and we cannot find publicly accessible databases yet. Nevertheless, several initiatives already exist to collect EHRs and related data and make them available for the biomedical research, as, for example, the “All of Us” initiative.35 As regards the contribution of this kind of information to the drug research, we observe that integration of phenotypic and genotypic data might be a necessary step toward a deeper understanding of the biological processes at the base of the onset and progression of diseases.36 Even though it is not yet clear how this integration will be translated to the discovery of new drugs, this approach is being applied for drug repurposing, as shown by the recent work by Khosravi et al.,37 who proposed a list of repurposed drug candidates for melanoma treatment after the analysis of genome- and phenome-wide association studies.

2.2. Building the Network

Given the wide availability of data on molecules, genes, proteins, cells, tissues, and diseases and the assumption that these entities are connected and representative of a more or less complex system, one needs to build and visualize the network (actually, the graph representing the network) in view of the subsequent analysis. The computational tools available for network visualization and analysis are countless and range in complexity depending on the dimensions of the data set and of the task to be executed. There are several popular and efficient software that can work on personal computer/workstation and allow one to perform all the basic operations on the network, from visualization to analysis of the basic parameters. Cytoscape38 is the most popular tool, but also Gephi,39 Pajek,40 and NetworkX41 are rather widespread in the biological community (see Table 3 for details on the main features of the softwares). A recent review by Pavlopoulos et al.42 analyzes and compares the performance of different software tools for the visualization of even large networks up to the order of magnitude of 106 nodes and edges. However, when the amount of data increases, the memory requirements to load large matrices become prohibitive even for powerful workstations, and higher performance computing is needed to analyze the network. A solution can be to distribute data and processes on high numbers of cores by means of frameworks, like Hadoop43 or Apache Spark44 (see Table 3). In this way, computations are partitioned across clusters of machines that work in parallel and carry out the jobs in reasonable time and with high efficiency.

Table 3. Network Building and Visualization Systems.

database name description url ref
Cytoscape Cytoscape is a platform for visualization, analysis, and integration of networks via basic functionalities or through apps; conceived mainly for biological research. https://cytoscape.org/ (38)
Gephi Gephi allows the visualization and exploration of all types of large graphs in real-time through a 3D render engine. https://gephi.org/ (39)
Pajek Pajek enables analysis and visualization of large networks having some thousands or millions of vertices. http://mrvar.fdv.uni-lj.si/pajek/ (40)
NetworkX NetworkX is a Phyton package designed for the creation and analysis of structure, dynamics, and functions of networks. https://networkx.github.io/ (41)
Apache Hadoop Open source framework for storing and processing large data sets across clusters of computers in a distributed environment through simple programming models. https://hadoop.apache.org/ (43)
Apache Spark A fast cluster computing system for large-scale data processing powering different libraries (SQL, MLlib, GraphX), and easy to use interactively from the Scala, Python, R, and SQL shells. https://spark.apache.org/ (44)

3. Networks to Study Systems of Pharmaceutical Interest

As a first step to illustrate the use of networks in drug research, we can consider studies aimed at visualizing and analyzing systems of biomedical/pharmacological interest. A short explanation of basic concepts of network theory and of the terminology used in this field is provided in the Supporting Information.

3.1. Networks for the Analysis of Molecules Data Sets

A relevant example of the use of network analysis in the small organic molecules context is that of chemical space networks (CSN), a framework proposed and developed by G. Maggiora and J. Bajorath.45,46 In their first review on the argument, these authors stated that “‘molecular networks’ are thought to provide an alternative way to represent and navigate chemical space” and that “chemical space exploration [...] is often motivated by the need to better understand structure–property relationships of small organic compounds”.45 In the initial applications of the CSN formalism, the aim was simply to obtain a new coordinate-free representation of the chemical space, based upon pairwise compound similarities, instead of the more commonly used coordinate-based representations referring to the descriptor space. In subsequent papers, Bajorath and co-workers showed how well network representations allowed highlighting of the properties of a chemical space viewed as a complex system to whom emergent properties like biological activity could be associated.46 Different similarity metrics were introduced and validated,47 and together with the analysis of the network topology parameters, they were shown to be a powerful tool to visualize and analyze the structure–activity relationships (SARs) of moderately sized compound sets.46,48 In this context, the analysis of the CSN through adequate metrics and algorithms can reveal the presence of communities (clusters) of compounds sharing latent characteristics not immediately evident from the common table format.

To illustrate a simple CSN application, in Figure 1 a network of 62 poly(ADP-ribose)polymerase (PARP) inhibitors is shown. The network accounts for relationships between the compounds, and the links among them were derived from pairwise similarity values calculated based on fingerprints. The inhibitors are represented by nodes (62) that are connected by edges (188) if their structure similarity exceeds a threshold (see the legend to Figure 1). The nodes are colored according to potency. This visualization of the chemical space based on similarity calculation facilitates the identification of the different structural families of PARP inhibitors (the main connected components of the network), and the color-coding allows one to grasp immediately the SAR of the set of compounds.

Figure 1.

Figure 1

Exemplary CSN of PARP inhibitors. PARPs 1, 2, and 3 family inhibitors with a measured EC50 were retrieved from CHEMBL.16 The pairwise chemical similarities between compounds were assessed by means of Tanimoto coefficient (Tc) values calculated for the ECFP4 fingerprints49 of the molecules generated by Canvas50 (Schrödinger, LLC, New York, NY, 2019). Pairs of inhibitors were connected only if their calculated Tc value exceeded the threshold value of 0.55. The chemical structures of the inhibitors are shown inside the nodes that are colored according to pEC50 values ranging from red (lowest potency) to green (highest potency) and sized based on node degree from small (low degree) to large (high degree). Edges are weighted by Tc values from thin (Tc = 0.55) to thick (Tc = 1) width. The network was generated by means of Cytoscape38 version 3.7.2.

The usefulness of the CSN framework was also recently shown in the field of natural product extraction and characterization, as reported in a paper by Nothias-Esposito et al.51 Here, the authors analyzed the extracts from two species of Euphorbia plants by means of tandem mass spectrometry and generated the network representing the isolated compounds using the mass spectra as nodes linked by similarity. In particular, to give rise to the network, the spectra were converted into vectors that were used to calculate a similarity score between each pair of spectra.52 The network then allowed visualization of structurally related molecules (based on their spectra, irrespective of their structure) that were subsequently identified by performing a search against a public reference spectral library.

The above examples deal with relatively small data sets, but the versatility of network analysis can be exploited even when the amount of data becomes much greater. This is the case of a work recently published by Miho et al., who performed a large-scale analysis on networks built from antibody repertoires.53 To appreciate the dimension of the problem, it is enough to note that to build the network of a repertoire of, for example, 106 clones (the nodes), one needs a similarity matrix of 1012 elements to define the edges. This is a relevant computational task that the authors tackled by leveraging the power of parallel distributed computing through the Apache Spark framework.44 The analysis of the network of complete antibody repertoires allowed overcoming of the limitations of using only portions of the network that are not a priori statistically representative of the properties (parameters) of the whole system and led the authors to derive some general principles on immune repertoire architecture.53

3.2. Protein Structure Networks

If we consider that a protein, like any other molecule, is an ensemble of interacting elements (the amino acid residues, in this case), it is immediately derived that it represents a complex system in which structure, dynamics, and eventually function can be viewed as emergent properties stemming from the relationships among the residues. In this context, protein structure networks (PSN) are widely studied, as the network approach is considered quite suitable to deal with the structure–function relationships, also in the light of the fast growth of analytical/biophysical technologies applicable to protein structure determination. PSNs are built by considering the amino acids (usually the Cα atoms) as nodes that are connected by a link if the distance between them falls within a cutoff value. The analysis of the parameters describing the properties of the PSNs lends itself to the study of the protein’s 3D architecture and its implications in issues like allosteric communication, folding, and model validation.54 In a recent review, Di Paola et al. discussed several applications of graph theory to the description of protein properties in terms of network parameters and advanced the hypothesis that the protein contact graph (representing the PSN) might in the future be considered as the structural formula of proteins.55 It is interesting to note that the PSN analysis can be carried out also in combination with other computational techniques to investigate the protein structural/conformational behavior. In particular, molecular dynamics simulations and eventually binding free energy calculations can be synergistically applied to studies of pharmaceutical interest. For example, Verkhivker showed the usefulness of this combined approach to interpret how allosteric effects act on the modulation of protein functions in the presence of inhibitors56 or cancer driver mutations.57 Recently, Amusengeri et al.58 applied a computational protocol including a residue network analysis to identify allosteric inhibitors of Plasmodium falciparum 70 kDa heat shock proteins as new antimalarial drug candidates.

3.3. Human Disease Network and Drug Discovery

Considering drug research from a system-wide, broad perspective, we cannot help but face the studies on the human disease network carried out by Barabási and his co-workers. In 2007, this author proposed the term “diseasome”59 to indicate the network of all genetic disorders systematically linked to all disease genes: it is an example of a bipartite network (see Supporting Information), where diseases are connected to the genes known to cause or effect them.60 This approach provides a wider view on diseases with respect to the usual classification, revealing possible connections between some of them, and eventually allowing for new possibilities of treatment. The same authors further elaborated on the diseasome concept by taking into consideration molecular interaction networks, i.e., the “interactomes”. The latter can be gene regulatory networks (GRNs), protein–protein interaction (PPI) networks, or metabolic networks and are indispensable further elements to consider to build a system view of the cellular mechanisms underlying the phenotype–genotype relationships in human diseases.61 It is evident here the effort to hold together different layers of a complex system, by leveraging the systems biology approach,62 in order to escape the simplifications of reductionism.

Focusing more specifically on the relevance of the above considerations in drug research, first of all, we must recall that, however, as far as drug action is concerned, the drug–target interaction remains a key event that maintains its centrality across paradigms, from the old “magic bullet” idea to the present polypharmacology, systems-based approach. Having said that, it appears clear how crucial can be the modeling of the drug–target interactomes. This goal has been pursued for a decade or more, from the early representations due to the work of Hopkins5,63 to the important contributions from the Barabási group.6466 In essence, from the data sets of experimental information (as, for example, DrugBank, Table 1), a bipartite drug–target network (DTN) is built from which projections on the drug or the target component are obtained. This provides both the drug network (DN), and the target network (TN), each made by nodes of the same type (drugs or targets, respectively) connected to each other if they share at least one target (the DN) or one drug (the TN). Analysis of the topology of DTNs, DNs, and TNs can reveal interesting and sometimes unexpected details on how and why drugs act on certain targets or target communities within the diseasome. It can also provide useful information for the design of drug combinations, the drug repurposing, or the interpretation of toxic effects. Applications of this kind of analysis are increasingly appearing in the literature and are reviewed in recent articles (e.g., see refs (67) and (68)).

As an example of a complex interactome, in Figure 2, a DTN generated from DrugBank data sets is shown. The network displays the interactions between 1636 approved small molecule drugs and 1991 human protein targets; the edges represent 7521 unique interactions. Drugs (circles) and targets (diamonds) are connected if an interaction between them is reported in the database. As it is evident from the picture, the network includes a large connected component constituted by 3368 nodes, 1510 of which are drugs. It may be noted that a relevant number of them cluster in a tightly interconnected community comprising mostly neurological (light pink), cardiovascular (brown), and respiratory system (aquamarine) drugs. The analysis of this kind of network provides a global picture of the molecular pharmacological space and might help to identify trends or possible areas of future development in drug research. An example thereof can be found in the paper by Yildirim et al.64 Clearly, these models cannot be definitive, and they evolve as new knowledge adds to the database. Starting from this consideration, in the next section, a further step into the use of networks in drug discovery will be illustrated.

Figure 2.

Figure 2

Drug–target network. The DTN was built from DrugBank18 version 5.1.5 retrieving the drug–target interactions between approved small molecule drugs and human protein targets. Drugs are represented as circle-shaped nodes, and protein targets are represented as diamond-shaped nodes. As shown in the inset, drugs are color-coded according to the first level anatomical therapeutic chemical (ATC) codes as reported in DrugBank. The nodes size accounts for the node degree from small (low degree) to large (high degree). Edges connect only drugs and targets nodes. The network was generated by means of Cytoscape38 version 3.7.2.

4. Network-Based Inferences

Going back to the idea that networks are simplified representations of complex systems, it is plausible to recognize that they are not definitive models, but they can evolve as new information becomes available. In other words, when dealing with networks, one has to do with the problem of missing information, a common situation in the study of biological systems, where the difficulty of obtaining experimental evidence of interactions makes the network inherently incomplete but where, on the other hand, new knowledge is continuously added. This issue is quite basic in network theory and pertains to the network reconstruction scope69 in the general context of the network statistical physics.11,70 In a tighter perspective, however, the possibility of inferring missing links in a network, which is the prediction of a new link between two yet unconnected nodes, is of more practical and immediate interest. Actually, a number of methods of link prediction are available that find wide application, e.g., in social or information networks, where predicting the possible association between individuals or documents, respectively, can be very useful.71

4.1. Link Prediction Methods

Focusing on drugs, one of the hottest issues in medicine today is the identification of disease-related genes, so much so that considerable experimental efforts are carried out to pursue this goal. Applying network modeling in this field has produced a relevant amount of knowledge so far, and many examples thereof regard the network-based identification of targets (see, for example, ref (72) in the framework of precision medicine) for the elucidation of mechanisms of action, for drug repurposing, or for the discovery of new drugs. In practice, one attempts to predict potential drug–target interactions (DTIs) using one of the numerous available prediction methods, often borrowed from such distant fields as social sciences, communication networking, economy and finance, and so on. The goal is to produce a list of potential DTIs and rank them based on some predefined metrics. The starting point is the construction of a heterogeneous network on which to run a link prediction algorithm. Generally, in these cases, the heterogeneous network integrates available information on drugs, targets, and drug–target interactions obtained from different databases. Given the availability of the data, the key steps in these methods are (1) the calculation of the drug–drug and target–target similarities and (2) the application of a drug–target association inference method. As regards the former, after the initial simple use of fingerprints and primary sequences to compare drug molecules and proteins, respectively, more complex and information rich similarity metrics73 have been devised in order to take into account also the information coming from known drug–target interactions,74 as well as protein and network topological information,75 eventually elaborated through ML.76 On the methods side, network-based prediction approaches for DTIs vary broadly, often depending on the user’s preference or expertise, but the most popular ones are derived from either recommendation77 or network propagation78 algorithms, both belonging to the class of the so-called similarity-based algorithms.71 The methods based on recommendation algorithms aim at predicting a node’s preferences for unconnected nodes based on previously calculated similarity scores (a technique also called collaborative filtering). This approach was applied by Cheng et al.,74 who developed the network-based inference (NBI) method for the prediction of novel DTIs based on a bipartite drug–target graph built from an adjacency matrix obtained from known drug–target interactions. Alaimo et al.79 enhanced this method by integrating into the model further domain-dependent biological knowledge, which is drug–drug and target–target similarity measures. On the other hand, under the framework of network propagation algorithms several methods are included that work by simulating the spread of information across the network starting from seed nodes. The most famous one is the Google page rank algorithm80 that uses the random walk through Web pages to calculate their importance. In DTI prediction, the random walk with restart (RWR) variant has been developed and successfully applied to drug–target heterogeneous networks.81,82 The output of these calculations is a ranked list of probabilities of drug–target association. Further methods based on network propagation have recently been proposed, among which those developed by Sharan and co-workers deserve mention, as they are particularly versatile and applicable also in the personalized medicine setting.83,84

4.2. Applications to Drug Repurposing

As an illustrative example of the impact a network-based approach can have on drug repurposing, here we briefly describe a study called Project Repethio (https://think-lab.github.io/p/rephetio/) recently published as a research article on eLife.85 In this work, the authors report the construction of a heterogeneous network to capture the connections among drugs and diseases (Hetionet version 1.0, https://neo4j.het.io/browser/) and its use to predict new drug/disease associations as probabilities of treatment for the repositioning of known drugs. Hetionet integrates data from public sources and is composed of ∼50 000 nodes of 11 types linked by ∼2.25 million of edges of 24 types. In Figure 3a, the conceptual scheme of the network is shown (a metagraph, i.e., a graph in which the nodes are sets of objects, and the edges connect the sets86), indicating types of nodes and links, and in Figure 3b, the whole network is visualized with the nodes grouped by type within circles, and the links are color-coded by type.

Figure 3.

Figure 3

Hetionet version 1.0. (a) Metagraph showing the types of nodes used to build the network and the types of links defined to connect the nodes. A detailed description of the meaning of each link type as well as the sources of information used to collect the nodes and to draw the edges is reported in ref (85). (b) Visualization of the whole heterogeneous network. Nodes of the same type are grouped within circles, and links are colored by type. This Figure 3 is reproduced from Figure 1 of Himmelstein D. S.; Lizee A.; Hessler C.; Brueggeman L.; Chen S. L.; Hadley D.; Green A.; Khankhanian P.; Baranzini S. E.; 2017; eLife (ref (85)), published under the Creative Commons Attribution 4.0 International Public License (CC BY 4.0; https://creativecommons.org/licenses/by/4.0/).

In order to extract disease treatment predictions from the drug–disease connections in the network, the authors implemented a ML procedure (logistic regression) trained on a previously compiled gold standard of 755 disease-modifying indications. These known treatments were the positive data, while ∼30 000 nontreatments were used as negatives. In short, the algorithm learns which types of compound-disease paths (metapaths) discriminate between treatments and nontreatments, and on this basis, it predicts new drug–disease associations. For 1538 compounds and 136 diseases, 209 168 associations were prioritized, and the authors provided illustrative validation through literature data and output analysis for two of them regarding nicotine dependence and epilepsy. In our opinion, the Repethio project gives a clear idea of how network-based data analysis can impact drug research, also considering the suitability of this kind of approaches to be interfaced with the powerful ML methods for features selection and prediction. Moreover, it is an example of the use of publicly available data integrated into an online platform that in turn is open to users who can access it and take advantage for their local purposes of a time- and resource-intensive assembly and integration work. We envisage that the more research data become available for the public domain, the more frequent initiatives of this kind develop, leveraging in full the combined potential of big data and network science.

Noteworthy, the variety of network-based DTI prediction methods is growing constantly in these years, and the integration with ML-based tools is frequent, as shown also by the case illustrated above. Interested readers can refer to the numerous recent reviews on the argument, e.g., like ref (87) (see also Discussion). Another direction of development is the consideration of a further source of information in the construction of the heterogeneous network, namely, diseases. This results in the building of tripartite networks, wherein drug–disease and target (gene)–disease associations are included, thus adding a further layer to the knowledge base on which predictions are calculated.8890 For instance, Zong et al.89 proposed a DTIs prediction method based on the use of a tripartite network called linked tripartite network (LTN), where drugs, targets, and diseases are nodes connected by drug–target, drug–disease, and target–disease associations. In practice, they merged a bipartite network built from the drug–target associations from DrugBank (Table 1) with the drug–disease and gene–disease associations from the diseasome published by Goh et al.60 to obtain the LTN containing 1452 diseases, 8201 drug–disease associations, and 1684 target–disease associations. The method showed high efficiency in the prediction of the associations between drugs and targets within the network, but it was not able to extend the predictivity to new drugs or targets not included in the model.

5. Network Dynamics

A step forward in the application of network science in drug research can be taken if we consider the possibility of modeling the time evolution of networks, that is, network dynamics. To appreciate the prospective importance of this field for future drug discovery, it is necessary to briefly introduce Boolean networks that were proposed in 1969 by S. Kauffman in the context of a general hypothesis aimed at accounting for the regulatory circuits controlling homeostasis and differentiation in cells.91,92 Interested readers might refer to the thoughtful review by B. Drossel.93

5.1. Boolean Networks

Boolean networks are directed networks, built in such a way where nodes are genes and links represent the functional connections between them; each gene can be “on” or “off”, and a set of rules or update functions are associated with each node to define the state of the gene at subsequent time steps. The dynamics of the system is calculated starting from input gene(s) by updating simultaneously at each discrete time step all gene states based on the predefined rules. Given the way in which it was built, a Boolean system is deterministic and has a finite number of initial network states (2N, where N is the number of genes, and 2 refers to the two states on and off). After a number of iterations (time steps), it will reach a stable situation that can be a fixed point or a self-looping circle: such a network state can be accessed following different network state sequences (trajectories), and it seems to attract the system to it, being therefore defined an attractor (Figure 4a). Several attractors may exist for a single Boolean network, and the ensemble of them together with the trajectories from the initial to the attractor states gives rise to the attractor landscape (Figure 4b). It may be noted that a Boolean network can simulate the dynamical state of a system in normal conditions, but it might also be perturbed by modifying the update rules, i.e., setting on or off some nodes. It has been shown that the attractor states correspond to the cellular phenotypes in response to external stimuli (see below), and consequently, exploring the attractor landscape leads to the definition of the phenotype landscape (Figure 4c), where a phenotype includes all the network states that lead the system to the same attractor(s).

Figure 4.

Figure 4

Boolean network dynamics. (a) Nodes are colored red or gray based on their “on” or “off” state, respectively. The stepwise evolution of the interactions between nodes determines some sequential steps that are calculated based on a set of rules. The stable state of the network at time step t = S represents an attractor. (b) Attractor landscape. Gray circles represent network states, colored circles represent attractor states. The landscape contains all the possible network states. The sets of states that converge toward an attractor form the basin of that attractor (colored areas). (c) The basins of attractors can be associated with cell phenotypes, and the gene states of the attractors determine the nature of the phenotype.

As a final observation on this kind of models, we point out a claim made by Kauffman in a recent paper, where he revisited his initial hypothesis: answering the question “What is a cell type?”, he wrote “We conclude that there is good evidence that cell types are high dimensional attractors in the dynamics of the genetic regulatory network.”94 This means that Boolean networks can be considered as simple cell models able to simulate the dynamic cell behavior in response to both physiological and external stimuli and to predict the cell fate in terms of phenotypic output. In other words, we might predict cellular responses to gene activation/inactivation (such as proliferation, apoptosis, cell cycle arrest, and so on) by calculating the attractors characterizing the attractor landscape (Figure 4b and Figure 4c). By the way, Huang et al. in a beautiful paper were the first to experimentally demonstrate that an attractor state corresponds to a unique cell phenotype that can be reached via different trajectories (gene expression patterns) in response to different perturbations.95

5.2. Network Dynamics and Drug Discovery

The possible contributions of this network dynamics modeling approach to the field of drug discovery are manifold, as illustrated in the review by Bloomingdale et al.96 First of all, simulating transcriptional regulation networks can be useful in view of the possibility of postulating new drug targets. For example, De Anda-Jauregui et al.97 built a simplified Boolean model of the estrogen receptor regulatory network and ran dynamics simulations in both unperturbed and perturbed mode. To simulate the perturbations of the network, the authors systematically overexpressed (on) and knocked out (off) all genes, both singularly and in combination. The results of the attractor landscape analysis pointed out some known gene expression regulators as the proteins chiefly involved in the altered proliferative state, thus demonstrating the suitability of the approach for the exploration of cancer-related systems in the search for new drug targets.

Incidentally, it must be noted that to obtain the attractor landscape, the state space of the system should in principle be fully explored by carrying out dynamics simulations starting from all the possible initial states (2N; see above). This is a computationally expensive task, and some methods have been developed to reduce the size of the network while preserving the properties of the system.96

As a further example of the applications of network dynamics, we cite here the work by Choi et al.,98 who studied the attractor landscape of the p53 regulatory network and used their analysis to identify druggable targets and putative drug combinations. The simulations allowed them to identify critical nodes in the p53 network able to drive normal and breast cancer MCF7 cells toward proliferation, cell cycle arrest, or cell death (the attractors). Then, on the basis of this information, the authors tested different putative therapeutic interventions (simulated as node or link deletions) aimed at pushing MCF7 cells toward apoptosis. They found and experimentally confirmed that the combined treatment with nutlin-3 (Mdm2-p53 PPI disruptor) and Wip1 inhibitor resulted in enhanced cell death, even in the absence of DNA damage. In a subsequent paper, the same group extended their approach to a panel of 83 human cancer cell lines, for which they mapped the genomic alterations of the p53 network. This allowed them to define 45 differently wired p53 networks that were then submitted to the attractor landscape analysis to identify cancer-specific therapeutic interventions.99

Despite the above-mentioned cases, network dynamics is still an underexplored field in the context of drug research, but its promises make it worthy of great attention, particularly if we consider its suitability to the simulation of complex GRNs, e.g., like those involved in carcinogenesis.100 In this regard, it is worth mentioning Enrico Capobianco’s observation that “cell state dynamics could be studied by discrete-time Markov models”,100 a statement that foreshadows a bridge toward some advanced computational approaches in drug design. Actually, in the molecular simulations setting, Markov state models (MSMs) are currently used to investigate the kinetic aspects of protein molecular dynamics,101,102 and their applications in GRN dynamics studies are also known.103 Indeed, the possibility to use a common mathematical formalism to describe the dynamics of such different systems as a protein and a GRN opens a formidable perspective in view of the multiscale modeling of biological systems for drug discovery.

6. Discussion

Networks and network analysis tools are rather widespread in biology/chemical biology, not so in chemistry/medicinal chemistry. In any case, if we consider the broad field of drug research, we see an increased use of this kind of in silico modeling approach. In our view, medicinal chemists should be aware of this and take every opportunity to enter this fascinating field, the same way as when the LFER/QSAR paradigm104 was introduced by Corwin Hansch.

In the design of a new drug, the scenario depicted by a system-based network model can be very useful and illuminating, for both practical and theoretical reasons. As regards the former, in perspective, the process of identification and selection of new drug candidates based on a network representation of the target biosystem (“network-driven drug discovery”105) might be viewed as a kind of “in silico integrated screening”, in some way simulating and expanding the experimental phenotypic screening (e.g., see Turbine, an artificial intelligence platform for studying cancer, https://turbine.ai). In fact, on the basis of the available knowledge, one can build the network of interactions governing the cell behavior (an in silico cell) and identify those interventions able to drive the cell toward the desired fate. The network might include several layers of description, from the molecular to the in vivo ones and even above. Considering the already mentioned growing production of molecular, -omic, clinical, etc. data, it seems reasonable to foresee the possibility of realizing the “vertical model integration” proposed by Xie et al.7 through the construction of multipartite networks. This “in silico pharmacology/systems biology continuum”106 might allow researchers to take into consideration simultaneously most of the effects determining a drug’s action process, thus increasing the reliability of predictions and lowering the costs of drug candidates selection.

On another side, from a theoretical point of view, taking a network approach to investigate human diseases means considering the intrinsic complexity of living organisms, which is nowadays accepted as a mandatory standpoint to confront pathologies.68 Coming to drugs and limiting ourselves to the pharmacological way of tackling the disease problem, the holistic view of the target system might help to devise new strategies of design. Peter Csermely gave an interesting example of this by proposing two strategies of network-based drug identification: the central hit strategy (CHS) and the network influence strategy (NIS).14 In the first one (not new, actually), the aim is to damage a cellular network by hitting a critical node (as in antiinfectious or anticancer therapies), while in the second one, drug(s) should hit influential nodes in order to rewire the diseased network toward its normal state. Of course, key points in the application of these strategies are the analysis of the network topology, and, in the case of NIS, also the network dynamics. Moreover, Csermely convincingly described how such a way of tackling drug-related issues, while taking into account the complexity of the system, might well incorporate current operative concepts, e.g., like PPI inhibitors, multitarget drugs, allosteric drugs, and hit/lead development. In this sense, a network-based view allows one to expand both the starting point and the landscape from which the drug discovery process is considered. Not being merely a technological improvement, it could eventually lead to devising alternative paradigms of pharmacological intervention.

A further theme worthy of discussion is how the network modeling approach can be integrated with the well-developed computational techniques currently employed in drug design/discovery, namely, molecular modeling and simulations, and ML. Indeed, ML and deep learning methods are already widely used in computational drug discovery,107,108 and they are perfectly suited to integrate in both the network construction techniques and the network-based prediction approaches. Examples thereof can be found, for example, in the works of Yamanishi et al.109 (application of a kernel regression method to build a pharmacological space by integrating a chemical space and a genomic space) or Mei et al.110 (use of a supervised bipartite model incorporating additional training from neighbors in order to predict DTIs for drugs and targets not included in the network). In a recent paper, Zhou et al.111 proposed a classification for the computational models used for DTI prediction comprising both network-based methods (i.e., the similarity-based algorithms outlined in section 4.1) that employ algorithms derived from network theory, and ML-based methods that belong to the realm of statistical learning.112 Taking this point of view, we can depict a drug discovery scenario where, given a complex system to be interpreted or on which predictions have to be made, one can rely on a rich toolbox of methods that are purely network-based (section 4.1) or ML-based112 or a merge of both.87

Regarding the inclusion in networks of the atomic-level 3D information obtained by molecular modeling and simulations, this issue has to be carefully considered in order to appreciate its potential in drug discovery. In fact, if we had to do with a drug–target network or a PPI network, why not consider the protein nodes as conformational ensembles? In this way, it might become straightforward to expand each node into the set of protein conformations. In such a case, this would lead to the obtainment of a 3D interactome that could take into account the possibility of a protein binding different partners with different conformations. In an analogous way, in a bipartite drug–target network, different conformations of the same protein could bind different drugs, and should the network describe this feature, it would increase substantially the informative content of the model and its ability to predict potential DTIs. If we imagine such a “3D network model” to be used for the selection of drug candidates, it might constitute a bridge between 3D agnostic network-based prediction and classical target-based molecular design, with a great synergistic potential in terms of efficiency. In a recent commentary,113 Mih and Palsson discussed the perspectives of a similar scenario in systems biology. They presented several studies where structural information on ligands and proteins were included in genome-scale metabolic network models, thus allowing a more detailed level of description of such systems.

Finally, limitations and open challenges of the network-based approach to drug discovery have to be highlighted. The first problem has already been pointed out in the section on data and databases and pertains to the quality of the data of whatever type they are and whatever source they come from. Moreover, biological data may be incomplete, biased, or sparse, and the languages used to build the databases may be different and/or incompatible. All this limits severely the possibility of even building a network. The second challenge has to do with the dimensions of the data and consequently of the network. Depending on number of nodes and node degrees, the number of links to calculate can increase enormously, and this again limits the possibility of building or analyzing the network, even though some tools allow one to deal with up to millions of nodes and edges (see the review by Pavlopoulos et al.42). However, as the analysis becomes more detailed or complex (e.g., network dynamics), the computational demand becomes prohibitive. A way to overcome the computational problems is to distribute the workload on the cloud, a choice that seems the best technological option presently available, while waiting for the effective accessibility of quantum computers.

7. Conclusion

In his “General Systems Theory” book, in 1969, Ludwig von Bertalanffy wrote “In one way or another, we are forced to deal with complexities, with ‘wholes’ or ‘systems’, in all fields of knowledge. This implies a basic re-orientation in scientific thinking.”114 This statement fits perfectly the idea at the basis of this paper about the way drug research should be approached. Nowadays, we are well aware that the research paradigm in the pharmaceutical field has changed in this sense, and what is presented here is an in silico framework coherent with the new way of thinking drug discovery. By no means does this imply that the classical computational approaches to drug design should be replaced by network theory but on the contrary that they might be more efficiently employed if integrated in a network context. With a careful awareness to the open challenges outlined above, network-based discovery methods will be key players in the next decades drug research.

Glossary

Abbreviations Used

QSAR

quantitative structure–activity relationship

ML

machine learning

NAR

Nucleic Acid Research (journal)

EHR

electronic health record

CSN

chemical space network

SAR

structure–activity relationship

PSN

protein structure network

GRN

gene regulatory network

PPI

protein–protein interaction

DTI

drug–target interaction

NBI

network-based inference

RWR

random walk with restart

MSM

Markov state model

LFER

linear free-energy relationship

CHS

central hit strategy

NIS

network influence strategy

Biographies

Maurizio Recanatini is a Professor of Medicinal Chemistry since year 2000, now at the Department of Pharmacy and Biotechnology of the University of Bologna (Italy). His research fields are the design/identification of biologically active molecules and the study of target biological systems through the application of computational tools ranging from QSAR (today, we’d rather say statistical learning) to molecular modeling and simulations. The main therapeutic areas of interest are neurodegeneration, cancer, and channelopathies.

Chiara Cabrelle received her M.S. in Pharmaceutical Chemistry and Technologies from University of Padova, Italy (2017). At present, she is a student in the Ph.D. program of Biotechnological, Biocomputational, Pharmaceutical and Pharmacological Sciences at the University of Bologna (Italy). Her current research interests focus on the application of network approaches in drug discovery in the field of computational medicinal chemistry.

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jmedchem.9b01989.

  • Short presentation of the basic concepts of network theory (PDF)

The authors declare no competing financial interest.

Supplementary Material

jm9b01989_si_001.pdf (864.9KB, pdf)

References

  1. Dugger S. A.; Platt A.; Goldstein D. B. Drug Development in the Era of Precision Medicine. Nat. Rev. Drug Discovery 2018, 17, 183–196. 10.1038/nrd.2017.226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Zhu H. Big Data and Artificial Intelligence Modeling for Drug Discovery. Annu. Rev. Pharmacol. Toxicol. 2020, 60, 573–589. 10.1146/annurev-pharmtox-010919-023324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Van Der Greef J.; Mcburney R. N. Rescuing Drug Discovery: In Vivo Systems Pathology and Systems Pharmacology. Nat. Rev. Drug Discovery 2005, 4, 961–967. 10.1038/nrd1904. [DOI] [PubMed] [Google Scholar]
  4. Cavalli A.; Bolognesi M. L.; Minarini A.; Rosini M.; Tumiatti V.; Recanatini M.; Melchiorre C. Multi-Target-Directed Ligands to Combat Neurodegenerative Diseases. J. Med. Chem. 2008, 51, 347–372. 10.1021/jm7009364. [DOI] [PubMed] [Google Scholar]
  5. Hopkins A. L. Network Pharmacology: The next Paradigm in Drug Discovery. Nat. Chem. Biol. 2008, 4, 682–690. 10.1038/nchembio.118. [DOI] [PubMed] [Google Scholar]
  6. Hart T.; Xie L. Providing Data Science Support for Systems Pharmacology and Its Implications to Drug Discovery. Expert Opin. Drug Discovery 2016, 11, 241–256. 10.1517/17460441.2016.1135126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Xie L.; Draizen E. J.; Bourne P. E. Harnessing Big Data for Systems Pharmacology. Annu. Rev. Pharmacol. Toxicol. 2017, 57, 245–262. 10.1146/annurev-pharmtox-010716-104659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Gentili P. L.Untangling Complex Systems. A Grand Challenge for Science; CRC Press: Boca Raton, FL, U.S., 2019. [Google Scholar]
  9. Loscalzo J.; Barabasi A. L. Systems Biology and the Future of Medicine. Wiley Interdiscip. Rev. Syst. Biol. Med. 2011, 3, 619–627. 10.1002/wsbm.144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Euler L.Solutio Problematis Ad Geometriam Situs Pertinentis. (1741). Euler Archive—All Works; No. 53, https://scholarlycommons.pacific.edu/euler-works/53. [Google Scholar]
  11. Newman M. E. J. The Structure and Function of Complex Networks. SIAM Rev. 2003, 45, 167–256. 10.1137/S003614450342480. [DOI] [Google Scholar]
  12. Estrada E. Graph and Network Theory in Physics. arXiv 2013, 1302.4378. [Google Scholar]
  13. Barabási A.-L.Network Science; Cambridge University Press: Cambridge, U.K., 2016. [Google Scholar]
  14. Csermely P.; Korcsmáros T.; Kiss H. J. M.; London G.; Nussinov R. Structure and Dynamics of Molecular Networks: A Novel Paradigm of Drug Discovery: A Comprehensive Review. Pharmacol. Ther. 2013, 138, 333–408. 10.1016/j.pharmthera.2013.01.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Yang J.; Wang D.; Jia C.; Wang M.; Hao G.; Yang G. Freely Accessible Chemical Database Resources of Compounds for In Silico Drug Discovery. Curr. Med. Chem. 2020, 26, 7581–7597. 10.2174/0929867325666180508100436. [DOI] [PubMed] [Google Scholar]
  16. Mendez D.; Gaulton A.; Bento A. P.; Chambers J.; De Veij M.; Félix E.; Magariños M. P.; Mosquera J. F.; Mutowo P.; Nowotka M.; Gordillo-Marañón M.; Hunter F.; Junco L.; Mugumbate G.; Rodriguez-Lopez M.; Atkinson F.; Bosc N.; Radoux C. J.; Segura-Cabrera A.; Hersey A.; Leach A. R. ChEMBL: Towards Direct Deposition of Bioassay Data. Nucleic Acids Res. 2019, 47 (D1), D930–D940. 10.1093/nar/gky1075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Kim S.; Chen J.; Cheng T.; Gindulyte A.; He J.; He S.; Li Q.; Shoemaker B. A.; Thiessen P. A.; Yu B.; Zaslavsky L.; Zhang J.; Bolton E. E. PubChem 2019 Update: Improved Access to Chemical Data. Nucleic Acids Res. 2019, 47 (D1), D1102–D1109. 10.1093/nar/gky1033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Wishart D. S.; Feunang Y. D.; Guo A. C.; Lo E. J.; Marcu A.; Grant J. R.; Sajed T.; Johnson D.; Li C.; Sayeeda Z.; Assempour N.; Iynkkaran I.; Liu Y.; Maciejewski A.; Gale N.; Wilson A.; Chin L.; Cummings R.; Le D.; Pon A.; Knox C.; Wilson M. DrugBank 5.0: A Major Update to the DrugBank Database for 2018. Nucleic Acids Res. 2018, 46 (D1), D1074–D1082. 10.1093/nar/gkx1037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Pence H. E.; Williams A. ChemSpider: An Online Chemical Information Resource. J. Chem. Educ. 2010, 87, 1123–1124. 10.1021/ed100697w. [DOI] [Google Scholar]
  20. Fourches D.; Muratov E.; Tropsha A. Trust, but Verify: On the Importance of Chemical Structure Curation in Cheminformatics and QSAR Modeling Research. J. Chem. Inf. Model. 2010, 50, 1189–1204. 10.1021/ci100176x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Fourches D.; Muratov E.; Tropsha A. Curation of Chemogenomics Data. Nat. Chem. Biol. 2015, 11, 535. 10.1038/nchembio.1881. [DOI] [PubMed] [Google Scholar]
  22. Rigden D. J.; Fernández X. M. The 26th Annual Nucleic Acids Research Database Issue and Molecular Biology Database Collection. Nucleic Acids Res. 2019, 47 (D1), D1–D7. 10.1093/nar/gky1267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Letunic I.; Bork P. 20 Years of the SMART Protein Domain Annotation Resource. Nucleic Acids Res. 2018, 46 (D1), D493–D496. 10.1093/nar/gkx922. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. UniProt: A Worldwide Hub of Protein Knowledge. Nucleic Acids Res. 2019, 47 (D1), D506–D515. 10.1093/nar/gky1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Pándy-Szekeres G.; Munk C.; Tsonkov T. M.; Mordalski S.; Harpsøe K.; Hauser A. S.; Bojarski A. J.; Gloriam D. E. GPCRdb in 2018: Adding GPCR Structure Models and Ligands. Nucleic Acids Res. 2018, 46 (D1), D440–D446. 10.1093/nar/gkx1109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Martin D. M. A.; Miranda-Saavedra D.; Barton G. J. Kinomer v. 1.0: A Database of Systematically Classified Eukaryotic Protein Kinases. Nucleic Acids Res. 2009, 37 (Suppl. 1), D244–D250. 10.1093/nar/gkn834. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Burley S. K.; Berman H. M.; Bhikadiya C.; Bi C.; Chen L.; Di Costanzo L.; Christie C.; Dalenberg K.; Duarte J. M.; Dutta S.; Feng Z.; Ghosh S.; Goodsell D. S.; Green R. K.; Guranović V.; Guzenko D.; Hudson B. P.; Kalro T.; Liang Y.; Lowe R.; Namkoong H.; Peisach E.; Periskova I.; Prlić A.; Randle C.; Rose A.; Rose P.; Sala R.; Sekharan M.; Shao C.; Tan L.; Tao Y. P.; Valasatava Y.; Voigt M.; Westbrook J.; Woo J.; Yang H.; Young J.; Zhuravleva M.; Zardecki C. RCSB Protein Data Bank: Biological Macromolecular Structures Enabling Research and Education in Fundamental Biology, Biomedicine, Biotechnology and Energy. Nucleic Acids Res. 2019, 47 (D1), D464–D474. 10.1093/nar/gky1004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Szklarczyk D.; Gable A. L.; Lyon D.; Junge A.; Wyder S.; Huerta-Cepas J.; Simonovic M.; Doncheva N. T.; Morris J. H.; Bork P.; Jensen L. J.; Mering C. V. STRING V11: Protein–Protein Association Networks with Increased Coverage, Supporting Functional Discovery in Genome-Wide Experimental Datasets. Nucleic Acids Res. 2019, 47 (D1), D607–D613. 10.1093/nar/gky1131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Jassal B.; Matthews L.; Viteri G.; Gong C.; Lorente P.; Fabregat A.; Sidiropoulos K.; Cook J.; Gillespie M.; Haw R.; Loney F.; May B.; Milacic M.; Rothfels K.; Sevilla C.; Shamovsky V.; Shorser S.; Varusai T.; Weiser J.; Wu G.; Stein L.; Hermjakob H.; D’Eustachio P. The Reactome Pathway Knowledgebase. Nucleic Acids Res. 2020, 48 (D1), D498–D503. 10.1093/nar/gkz1031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Piñero J.; Bravo À.; Queralt-Rosinach N.; Gutiérrez-Sacristán A.; Deu-Pons J.; Centeno E.; García-García J.; Sanz F.; Furlong L. I. DisGeNET: A Comprehensive Platform Integrating Information on Human Disease-Associated Genes and Variants. Nucleic Acids Res. 2017, 45 (D1), D833–D839. 10.1093/nar/gkw943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Subramanian A.; Narayan R.; Corsello S. M.; Peck D. D.; Natoli T. E.; Lu X.; Gould J.; Davis J. F.; Tubelli A. A.; Asiedu J. K.; Lahr D. L.; Hirschman J. E.; Liu Z.; Donahue M.; Julian B.; Khan M.; Wadden D.; Smith I. C.; Lam D.; Liberzon A.; Toder C.; Bagul M.; Orzechowski M.; Enache O. M.; Piccioni F.; Johnson S. A.; Lyons N. J.; Berger A. H.; Shamji A. F.; Brooks A. N.; Vrcic A.; Flynn C.; Rosains J.; Takeda D. Y.; Hu R.; Davison D.; Lamb J.; Ardlie K.; Hogstrom L.; Greenside P.; Gray N. S.; Clemons P. A.; Silver S.; Wu X.; Zhao W. N.; Read-Button W.; Wu X.; Haggarty S. J.; Ronco L. V.; Boehm J. S.; Schreiber S. L.; Doench J. G.; Bittker J. A.; Root D. E.; Wong B.; Golub T. R. A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles. Cell 2017, 171, 1437–1452. 10.1016/j.cell.2017.10.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Stathias V.; Turner J.; Koleti A.; Vidovic D.; Cooper D.; Fazel-Najafabadi M.; Pilarczyk M.; Terryn R.; Chung C.; Umeano A.; Clarke D. J. B.; Lachmann A.; Evangelista J. E.; Ma’ayan A.; Medvedovic M.; Schürer S. C. LINCS Data Portal 2.0: Next Generation Access Point for Perturbation-Response Signatures. Nucleic Acids Res. 2020, 48 (D1), D431–D439. 10.1093/nar/gkz1023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Dash S.; Shakyawar S. K.; Sharma M.; Kaushik S. Big Data in Healthcare: Management, Analysis and Future Prospects. J. Big Data 2019, 6, 54. 10.1186/s40537-019-0217-0. [DOI] [Google Scholar]
  34. Häyrinen K.; Saranto K.; Nykänen P. Definition, Structure, Content, Use and Impacts of Electronic Health Records: A Review of the Research Literature. Int. J. Med. Inform. 2008, 77, 291–304. 10.1016/j.ijmedinf.2007.09.001. [DOI] [PubMed] [Google Scholar]
  35. Denny J. C.; Devaney S.; Rutter J. L.; Boerwinkle E.; Brilliant M.; Butts P.; Castro M.; Chaffins J.; Chang K.; Channamsetty V.; Chinea C.; Cotler A.; Daviglus M.; Falcón A.; Funk G.; Gabriel S.; Gebo K.; Rico E. G.; Glazer D.; Goldstein D. B.; Grand J. M.; Greenland P.; Hall K.; Hochfel B. I. The “All of Us” Research Program. N. Engl. J. Med. 2019, 381, 668–676. 10.1056/NEJMsr1809937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Wu P. Y.; Cheng C. W.; Kaddi C. D.; Venugopalan J.; Hoffman R.; Wang M. D. -Omic and Electronic Health Record Big Data Analytics for Precision Medicine. IEEE Trans. Biomed. Eng. 2017, 64, 263–273. 10.1109/TBME.2016.2573285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Khosravi A.; Jayaram B.; Goliaei B.; Masoudi-Nejad A. Active Repurposing of Drug Candidates for Melanoma Based on GWAS, PheWAS and a Wide Range of Omics Data. Mol. Med. 2019, 25, 30. 10.1186/s10020-019-0098-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Shannon P.; Markiel A.; Ozier O.; Baliga N. S.; Wang J. T.; Ramage D.; Amin N.; Schwikowski B.; Ideker T. Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Res. 2003, 13, 2498–2504. 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Bastian M.; Heymann S.; Jacomy M.. Gephi: An Open Source Software for Exploring and Manipulating Networks. Presented at the Third International AAAI Conference on Weblogs and Social Media, 2009.
  40. de Nooy W.; Mrvar A.; Batagelj V.. Exploratory Social Network Analysis with Pajek: Revised and Expanded Edition for Updated Software; Cambridge University Press: Cambridge, U.K., 2018. [Google Scholar]
  41. Hagberg A. A.; Schult D. A.; Swart P. J.. Exploring Network Structure, Dynamics, and Function Using NetworkX. In Proceedings of the 7th Python in Science Conference (SciPy 2008), Pasadena, CA, U.S.; 2008; pp 11–15. [Google Scholar]
  42. Pavlopoulos G. A.; Paez-Espino D.; Kyrpides N. C.; Iliopoulos I. Empirical Comparison of Visualization Tools for Larger-Scale Network Analysis. Adv. Bioinf. 2017, 2017, 1278932. 10.1155/2017/1278932. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Manikandan S. G.; Ravi S.. Big Data Analysis Using Apache Hadoop. Proceedings, 2014 International Conference on IT Convergence and Security (ICITCS), Beijing, China, 2014; IEEE, 2014; 10.1109/ICITCS.2014.7021746. [DOI] [Google Scholar]
  44. Zaharia M.; Chowdhury M.; Franklin M. J.; Shenker S.; Stoica I.. Spark: Cluster Computing with Working Sets. HotCloud ’10: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing; USENIX Association, 2010. [Google Scholar]
  45. Maggiora G. M.; Bajorath J. Chemical Space Networks: A Powerful New Paradigm for the Description of Chemical Space. J. Comput.-Aided Mol. Des. 2014, 28, 795–802. 10.1007/s10822-014-9760-0. [DOI] [PubMed] [Google Scholar]
  46. Vogt M.; Stumpfe D.; Maggiora G. M.; Bajorath J. Lessons Learned from the Design of Chemical Space Networks and Opportunities for New Applications. J. Comput.-Aided Mol. Des. 2016, 30, 191–208. 10.1007/s10822-016-9906-3. [DOI] [PubMed] [Google Scholar]
  47. Zhang B.; Vogt M.; Maggiora G. M.; Bajorath J. Comparison of Bioactive Chemical Space Networks Generated Using Substructure- and Fingerprint-Based Measures of Molecular Similarity. J. Comput.-Aided Mol. Des. 2015, 29, 595–608. 10.1007/s10822-015-9852-5. [DOI] [PubMed] [Google Scholar]
  48. Zwierzyna M.; Vogt M.; Maggiora G. M.; Bajorath J. Design and Characterization of Chemical Space Networks for Different Compound Data Sets. J. Comput.-Aided Mol. Des. 2015, 29, 113–125. 10.1007/s10822-014-9821-4. [DOI] [PubMed] [Google Scholar]
  49. Rogers D.; Hahn M. Extended-Connectivity Fingerprints. J. Chem. Inf. Model. 2010, 50, 742–754. 10.1021/ci100050t. [DOI] [PubMed] [Google Scholar]
  50. Duan J.; Dixon S. L.; Lowrie J. F.; Sherman W. Analysis and Comparison of 2D Fingerprints: Insights into Database Screening Performance Using Eight Fingerprint Methods. J. Mol. Graphics Modell. 2010, 29, 157–170. 10.1016/j.jmgm.2010.05.008. [DOI] [PubMed] [Google Scholar]
  51. Nothias-Esposito M.; Nothias L. F.; Da Silva R. R.; Retailleau P.; Zhang Z.; Leyssen P.; Roussi F.; Touboul D.; Paolini J.; Dorrestein P. C.; Litaudon M. Investigation of Premyrsinane and Myrsinane Esters in Euphorbia Cupanii and Euphobia Pithyusa with MS2LDA and Combinatorial Molecular Network Annotation Propagation. J. Nat. Prod. 2019, 82, 1459–1470. 10.1021/acs.jnatprod.8b00916. [DOI] [PubMed] [Google Scholar]
  52. Quinn R. A.; Nothias L. F.; Vining O.; Meehan M.; Esquenazi E.; Dorrestein P. C. Molecular Networking As a Drug Discovery, Drug Metabolism, and Precision Medicine Strategy. Trends Pharmacol. Sci. 2017, 38, 143–154. 10.1016/j.tips.2016.10.011. [DOI] [PubMed] [Google Scholar]
  53. Miho E.; Roškar R.; Greiff V.; Reddy S. T. Large-Scale Network Analysis Reveals the Sequence Space Architecture of Antibody Repertoires. Nat. Commun. 2019, 10, 1321. 10.1038/s41467-019-09278-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Bhattacharyya M.; Ghosh S.; Vishveshwara S. Protein Structure and Function: Looking through the Network of Side-Chain Interactions. Curr. Protein Pept. Sci. 2015, 17, 4–25. 10.2174/1389203716666150923105727. [DOI] [PubMed] [Google Scholar]
  55. Di Paola L.; De Ruvo M.; Paci P.; Santoni D.; Giuliani A. Protein Contact Networks: An Emerging Paradigm in Chemistry. Chem. Rev. 2013, 113, 1598–1613. 10.1021/cr3002356. [DOI] [PubMed] [Google Scholar]
  56. Verkhivker G. M. Molecular Dynamics Simulations and Modelling of the Residue Interaction Networks in the BRAF Kinase Complexes with Small Molecule Inhibitors: Probing the Allosteric Effects of Ligand-Induced Kinase Dimerization and Paradoxical Activation. Mol. BioSyst. 2016, 12, 3146–3165. 10.1039/C6MB00298F. [DOI] [PubMed] [Google Scholar]
  57. Verkhivker G. M. Biophysical Simulations and Structure-Based Modeling of Residue Interaction Networks in the Tumor Suppressor Proteins Reveal Functional Role of Cancer Mutation Hotspots in Molecular Communication. Biochim. Biophys. Acta, Gen. Subj. 2019, 1863, 210–225. 10.1016/j.bbagen.2018.10.009. [DOI] [PubMed] [Google Scholar]
  58. Amusengeri A.; Astl L.; Lobb K.; Verkhivker G. M.; Bishop Ö. T. Establishing Computational Approaches towards Identifying Malarial Allosteric Modulators: A Case Study of Plasmodium falciparum Hsp70s. Int. J. Mol. Sci. 2019, 20, 5574. 10.3390/ijms20225574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Barabási A.-L. Network Medicine — From Obesity to the “Diseasome”. N. Engl. J. Med. 2007, 357, 404–407. 10.1056/NEJMe078114. [DOI] [PubMed] [Google Scholar]
  60. Goh K. II; Cusick M. E.; Valle D.; Childs B.; Vidal M.; Barabási A. L. The Human Disease Network. Proc. Natl. Acad. Sci. U. S. A. 2007, 104, 8685–8690. 10.1073/pnas.0701361104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Vidal M.; Cusick M. E.; Barabási A. L. Interactome Networks and Human Disease. Cell 2011, 144, 986–998. 10.1016/j.cell.2011.02.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Vidal M. A Unifying View of 21st Century Systems Biology. FEBS Lett. 2009, 583, 3891–3894. 10.1016/j.febslet.2009.11.024. [DOI] [PubMed] [Google Scholar]
  63. Paolini G. V.; Shapland R. H. B.; Van Hoorn W. P.; Mason J. S.; Hopkins A. L. Global Mapping of Pharmacological Space. Nat. Biotechnol. 2006, 24, 805–815. 10.1038/nbt1228. [DOI] [PubMed] [Google Scholar]
  64. Yildirim M. A.; Goh K. II; Cusick M. E.; Barabási A. L.; Vidal M. Drug-Target Network. Nat. Biotechnol. 2007, 25, 1119–1126. 10.1038/nbt1338. [DOI] [PubMed] [Google Scholar]
  65. Guney E.; Menche J.; Vidal M.; Barábasi A. L. Network-Based in Silico Drug Efficacy Screening. Nat. Commun. 2016, 7, 10331. 10.1038/ncomms10331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Cheng F.; Kovács I. A.; Barabási A. L. Network-Based Prediction of Drug Combinations. Nat. Commun. 2019, 10, 1197. 10.1038/s41467-019-09186-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Conte F.; Fiscon G.; Licursi V.; Bizzarri D.; D’Antò T.; Farina L.; Paci P. A Paradigm Shift in Medicine: A Comprehensive Review of Network-Based Approaches. Biochim. Biophys. Acta, Gene Regul. Mech. 2020, 1863, 194416. 10.1016/j.bbagrm.2019.194416. [DOI] [PubMed] [Google Scholar]
  68. Sonawane A. R.; Weiss S. T.; Glass K.; Sharma A. Network Medicine in the Age of Biomedical Big Data. Front. Genet. 2019, 10, 294. 10.3389/fgene.2019.00294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Squartini T.; Caldarelli G.; Cimini G.; Gabrielli A.; Garlaschelli D. Reconstruction Methods for Networks: The Case of Economic and Financial Systems. Phys. Rep. 2018, 757, 1–47. 10.1016/j.physrep.2018.06.008. [DOI] [Google Scholar]
  70. Albert R.; Barabási A. L. Statistical Mechanics of Complex Networks. Rev. Mod. Phys. 2002, 74, 47–97. 10.1103/RevModPhys.74.47. [DOI] [Google Scholar]
  71. Lü L.; Zhou T. Link Prediction in Complex Networks: A Survey. Phys. A 2011, 390, 1150–1170. 10.1016/j.physa.2010.11.027. [DOI] [Google Scholar]
  72. Ozturk K.; Dow M.; Carlin D. E.; Bejar R.; Carter H. The Emerging Potential for Network Analysis to Inform Precision Cancer Medicine. J. Mol. Biol. 2018, 430, 2875–2899. 10.1016/j.jmb.2018.06.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Vilar S.; Hripcsak G. The Role of Drug Profiles as Similarity Metrics: Applications to Repurposing, Adverse Effects Detection and Drug – Drug Interactions. Briefings Bioinf. 2017, 18, 670–681. 10.1093/bib/bbw048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Cheng F.; Liu C.; Jiang J.; Lu W.; Li W.; Liu G.; Zhou W.; Huang J.; Tang Y. Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference. PLoS Comput. Biol. 2012, 8, e1002503 10.1371/journal.pcbi.1002503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Li Z. C.; Huang M. H.; Zhong W. Q.; Liu Z. Q.; Xie Y.; Dai Z.; Zou X. Y. Identification of Drug-Target Interaction from Interactome Network with “guilt-by-Association” Principle and Topology Features. Bioinformatics 2016, 32, 1057–1064. 10.1093/bioinformatics/btv695. [DOI] [PubMed] [Google Scholar]
  76. Lee I.; Nam H. Identification of Drug-Target Interaction by a Random Walk with Restart Method on an Interactome Network. BMC Bioinf. 2018, 19, 208. 10.1186/s12859-018-2199-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Alaimo S.; Giugno R.; Pulvirenti A. Recommendation Techniques for Drug–Target Interaction Prediction and Drug Repositioning. Methods Mol. Biol. 2016, 1415, 441–462. 10.1007/978-1-4939-3572-7_23. [DOI] [PubMed] [Google Scholar]
  78. Cowen L.; Ideker T.; Raphael B. J.; Sharan R. Network Propagation: A Universal Amplifier of Genetic Associations. Nat. Rev. Genet. 2017, 18, 551–562. 10.1038/nrg.2017.38. [DOI] [PubMed] [Google Scholar]
  79. Alaimo S.; Pulvirenti A.; Giugno R.; Ferro A. Drug-Target Interaction Prediction through Domain-Tuned Network-Based Inference. Bioinformatics 2013, 29, 2004–2008. 10.1093/bioinformatics/btt307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Brin S.; Page L. The Anatomy of a Large-Scale Hypertextual Web Search Engine. Computer Networks and ISDN Systems 1998, 30, 107–117. 10.1016/S0169-7552(98)00110-X. [DOI] [Google Scholar]
  81. Chen X.; Liu M. X.; Yan G. Y. Drug-Target Interaction Prediction by Random Walk on the Heterogeneous Network. Mol. BioSyst. 2012, 8, 1970–1978. 10.1039/c2mb00002d. [DOI] [PubMed] [Google Scholar]
  82. Seal A.; Ahn Y.-Y.; Wild D. J. Optimizing Drug–Target Interaction Prediction Based on Random Walk on Heterogeneous Networks. J. Cheminf. 2015, 7, 40. 10.1186/s13321-015-0089-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Vanunu O.; Magger O.; Ruppin E.; Shlomi T.; Sharan R. Associating Genes and Protein Complexes with Disease via Network Propagation. PLoS Comput. Biol. 2010, 6, e1000641 10.1371/journal.pcbi.1000641. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Shnaps O.; Perry E.; Silverbush D.; Sharan R. Inference of Personalized Drug Targets via Network Propagation. Pacific Symp. Biocomput. 2016, 156–167. 10.1142/9789814749411_0015. [DOI] [PubMed] [Google Scholar]
  85. Himmelstein D. S.; Lizee A.; Hessler C.; Brueggeman L.; Chen S. L.; Hadley D.; Green A.; Khankhanian P.; Baranzini S. E. Systematic Integration of Biomedical Knowledge Prioritizes Drugs for Repurposing. eLife 2017, 6, e26726 10.7554/eLife.26726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Gaur D.; Shastri A.; Biswas R.. Metagraph: A New Model of Data Structure. Proceedings, 2008 International Conference on Computer Science and Information Technology, ICCSIT 2008, Singapore, 2008; IEEE, 2008; pp 729–733. [Google Scholar]
  87. Zhang W.; Chien J.; Yong J.; Kuang R. Network-Based Machine Learning and Graph Theory Algorithms for Precision Oncology. npj Precis. Oncol. 2017, 1, 25. 10.1038/s41698-017-0029-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Chen H.; Zhang Z. Prediction of Drug-Disease Associations for Drug Repositioning through Drug-MiRNA-Disease Heterogeneous Network. IEEE Access 2018, 6, 45281–45287. 10.1109/ACCESS.2018.2860632. [DOI] [Google Scholar]
  89. Zong N.; Kim H.; Ngo V.; Harismendy O. Deep Mining Heterogeneous Networks of Biomedical Linked Data to Predict Novel Drug-Target Associations. Bioinformatics 2017, 33, 2337–2344. 10.1093/bioinformatics/btx160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Luo H.; Wang J.; Li M.; Luo J.; Ni P.; Zhao K.; Wu F.; Pan Y.. Computational Drug Repositioning with Random Walk on a Heterogeneous Network. IEEE/ACM Transactions on Computational Biology and Bioinformatics; IEEE, 2018; 10.1109/TCBB.2018.2832078. [DOI] [PubMed] [Google Scholar]
  91. Kauffman S. A. Metabolic Stability and Epigenesis in Randomly Constructed Genetic Nets. J. Theor. Biol. 1969, 22, 437–467. 10.1016/0022-5193(69)90015-0. [DOI] [PubMed] [Google Scholar]
  92. Kauffman S. Homeostasis and Differentiation in Random Genetic Control Networks. Nature 1969, 224, 177–178. 10.1038/224177a0. [DOI] [PubMed] [Google Scholar]
  93. Drossel B.Random Boolean Networks. In Reviews of Nonlinear Dynamics and Complexity; Wiley-VCH: Weinheim, Germany, 2008; pp 69–110. [Google Scholar]
  94. Bornholdt S.; Kauffman S. Ensembles, Dynamics, and Cell Types: Revisiting the Statistical Mechanics Perspective on Cellular Regulation. J. Theor. Biol. 2019, 467, 15–22. 10.1016/j.jtbi.2019.01.036. [DOI] [PubMed] [Google Scholar]
  95. Huang S.; Eichler G.; Bar-Yam Y.; Ingber D. E. Cell Fates as High-Dimensional Attractor States of a Complex Gene Regulatory Network. Phys. Rev. Lett. 2005, 94, 128701. 10.1103/PhysRevLett.94.128701. [DOI] [PubMed] [Google Scholar]
  96. Bloomingdale P.; Nguyen V. A.; Niu J.; Mager D. E. Boolean Network Modeling in Systems Pharmacology. J. Pharmacokinet. Pharmacodyn. 2018, 45, 159–180. 10.1007/s10928-017-9567-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. De Anda-Jáuregui G.; Espinal-Enríquez J.; Sandoval-Motta S.; Hernández-Lemus E. A Boolean Network Approach to Estrogen Transcriptional Regulation. Complexity 2019, 2019, 8740279. 10.1155/2019/8740279. [DOI] [Google Scholar]
  98. Choi M.; Shi J.; Jung S. H.; Chen X.; Cho K. H. Attractor Landscape Analysis Reveals Feedback Loops in the P53 Network That Control the Cellular Response to DNA Damage. Sci. Signaling 2012, 5, ra83. 10.1126/scisignal.2003363. [DOI] [PubMed] [Google Scholar]
  99. Choi M.; Shi J.; Zhu Y.; Yang R.; Cho K. H. Network Dynamics-Based Cancer Panel Stratification for Systemic Prediction of Anticancer Drug Response. Nat. Commun. 2017, 8, 1940. 10.1038/s41467-017-02160-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Capobianco E. Next Generation Networks: Featuring the Potential Role of Emerging Applications in Translational Oncology. J. Clin. Med. 2019, 8, 664. 10.3390/jcm8050664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Husic B. E.; Pande V. S. Markov State Models: From an Art to a Science. J. Am. Chem. Soc. 2018, 140, 2386–2396. 10.1021/jacs.7b12191. [DOI] [PubMed] [Google Scholar]
  102. Bernetti M.; Masetti M.; Recanatini M.; Amaro R. E.; Cavalli A. An Integrated Markov State Model and Path Metadynamics Approach to Characterize Drug Binding Processes. J. Chem. Theory Comput. 2019, 15, 5689–5702. 10.1021/acs.jctc.9b00450. [DOI] [PubMed] [Google Scholar]
  103. Chu B. K.; Tse M. J.; Sato R. R.; Read E. L. Markov State Models of Gene Regulatory Networks. BMC Syst. Biol. 2017, 11, 14. 10.1186/s12918-017-0394-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Hansch C.; Fujita T. ρ-σ-π Analysis. A Method for the Correlation of Biological Activity and Chemical Structure. J. Am. Chem. Soc. 1964, 86, 1616–1626. 10.1021/ja01062a035. [DOI] [Google Scholar]
  105. Whitmore A. V.; Wray J. Networks for Better Drug Discovery: Making Optimal Use of Existing Chemical Space. Future Med. Chem. 2018, 10, 2497–2499. 10.4155/fmc-2018-0265. [DOI] [PubMed] [Google Scholar]
  106. Oprea T. I.; May E. E.; Leitão A.; Tropsha A. Computational Systems Chemical Biology. Methods Mol. Biol. 2011, 672, 459–488. 10.1007/978-1-60761-839-3_18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Vamathevan J.; Clark D.; Czodrowski P.; Dunham I.; Ferran E.; Lee G.; Li B.; Madabhushi A.; Shah P.; Spitzer M.; Zhao S. Applications of Machine Learning in Drug Discovery and Development. Nat. Rev. Drug Discovery 2019, 18, 463–477. 10.1038/s41573-019-0024-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Yang X.; Wang Y.; Byrne R.; Schneider G.; Yang S. Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery. Chem. Rev. 2019, 119, 10520–10594. 10.1021/acs.chemrev.8b00728. [DOI] [PubMed] [Google Scholar]
  109. Yamanishi Y.; Araki M.; Gutteridge A.; Honda W.; Kanehisa M. Prediction of Drug-Target Interaction Networks from the Integration of Chemical and Genomic Spaces. Bioinformatics 2008, 24, i232–240. 10.1093/bioinformatics/btn162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Mei J. P.; Kwoh C. K.; Yang P.; Li X. L.; Zheng J. Drug-Target Interaction Prediction by Learning from Local Information and Neighbors. Bioinformatics 2013, 29, 238–245. 10.1093/bioinformatics/bts670. [DOI] [PubMed] [Google Scholar]
  111. Zhou L.; Li Z.; Yang J.; Tian G.; Liu F.; Wen H.; Peng L.; Chen M.; Xiang J.; Peng L. Revealing Drug-Target Interactions with Computational Models and Algorithms. Molecules 2019, 24, 1714. 10.3390/molecules24091714. [DOI] [PMC free article] [PubMed] [Google Scholar]
  112. Hastie T.; Tibshirani R.; Friedman J.. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed.; Springer, 2009. [Google Scholar]
  113. Mih N.; Palsson B. O. Expanding the Uses of Genome-scale Models with Protein Structures. Mol. Syst. Biol. 2019, 15, e8601 10.15252/msb.20188601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  114. von Bertalanffy L.General System Theory. Foundations, Development, Applications; George Braziller, Inc.: New York, 1969. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

jm9b01989_si_001.pdf (864.9KB, pdf)

Articles from Journal of Medicinal Chemistry are provided here courtesy of American Chemical Society

RESOURCES