Abstract
Background
A great deal of data has accumulated on signalling pathways. These large datasets are thought to contain much implicit information on their molecular structure, interaction and activity information, which provides a picture of intricate molecular networks believed to underlie biological functions. While tremendous advances have been made in trying to understand these systems, how information is transmitted within them is still poorly understood. This ever growing amount of data demands we adopt powerful computational techniques that will play a pivotal role in the conversion of mined data to knowledge, and in elucidating the topological and functional properties of protein - protein interactions.
Results
A computational framework is presented which allows for the description of embedded networks, and identification of common shared components thought to assist in the transmission of information within the systems studied. By employing the graph theories of network biology - such as degree distribution, clustering coefficient, vertex betweenness and shortest path measures - topological features of protein-protein interactions for published datasets of the p53, nuclear factor kappa B (NF-κB) and G1/S phase of the cell cycle systems were ascertained. Highly ranked nodes which in some cases were identified as connecting proteins most likely responsible for propagation of transduction signals across the networks were determined. The functional consequences of these nodes in the context of their network environment were also determined. These findings highlight the usefulness of the framework in identifying possible combination or links as targets for therapeutic responses; and put forward the idea of using retrieved knowledge on the shared components in constructing better organised and structured models of signalling networks.
Conclusion
It is hoped that through the data mined reconstructed signal transduction networks, well developed models of the published data can be built which in the end would guide the prediction of new targets based on the pathway's environment for further analysis. Source code is available upon request.
Background
"Any classification in a division of objects into groups is based on a set of rules - it is neither true nor false (unlike, for example, a theory) and should be judged largely on the usefulness of the results" [1].
For many years, model organisms have been studied extensively by scientists as they tried to better understand the functional implication of processes initiated during cellular signalling, and how organisms can use this to respond to perturbations outside of the cell [2]. With the advent of high throughput experimentation, the identification and characterization of molecular components involved in transduction events became possible in a systematic way. In addition to this, the discovered interactions between each of these components promoted the reconstruction of reactions leading to signaling pathways. Thus, elucidating the functional consequences of these interactions will be crucial in understanding the ways in which cells respond to extra cellular cues and how they communicate with one another.
Activities of biological cells are regulated by proteins carrying signals that modify the expression of different genes at any given time, and these extra-cellular signals drive cell proliferation and programmed cell death via complex signal transduction circuits comprising of receptors, kinases, phosphatases, transcription factors and many others. It is unsurprising that many components of these signal transduction circuits are oncogenes or tumour suppressors, emphasizing the importance of understanding signalling in normal tissues and targeting aberrant signalling in diseases [3]. Signalling networks which are chiefly based on interactions between proteins are the means by which a cell converts an external signal (e.g. stimulus) into an appropriate cellular response (e.g. cellular rhythms - periodic biological process observed in cell cycles or day-night cycles (circadian rhythms) of animals and plants) [4-6]. It is from the resulting basic cellular responses that complex behaviour in multi-cellular organisms emerges.
Signal transduction pathways have typically been drawn as separate linear entities, however it has become increasingly clear that signalling pathways are extensively interconnected and are embedded in networks with common protein components and cross talk with other networks [7-11]. In addition to this, signal transduction networks do not depend merely on the shifting of relevant protein concentrations from one steady state level to another, rather, the signals often have a significant temporal variation that carries much more information that is propagated in a complex manner through the networks [12-15].
Traditionally, study of the complex behaviour of networks require dynamic models that contain both the biochemical reactions as well as their rate constant counterparts [16-19]. This information is usually not accessible directly through experiments for systems less well studied. Fortunately for many biological systems partial prior knowledge about the connectivity patterns of the networks is becoming available and readily stored in databases [20-23], even though the detailed mechanisms still remain undiscovered. An important goal of this research therefore is to attain a reconstruction of the network of interactions that gives rise to signalling pathways in a biologically meaningful way, which in turn allows the mathematical analysis of the emerging properties of the network [24,25].
So far, a great deal of data has accumulated on signalling systems and these large datasets are thought to contain much information on the structure of their underlying networks. However, this information is hidden and requires advanced algorithms and methods, such as data mining and graph theories of network biology to make sense of it all [26-28]. Data mining deals with the discovery of hidden knowledge, unexpected patterns and new rules [29]; nevertheless, there are some limitations with this technique. A fundamental issue is that biological data repositories are normally presented in heterogeneous and unstructured forms [30-33]. Therefore, there is a great need to develop effective data mining methodologies to extract, process, integrate and discover useful knowledge from multiple data sources [34]. The retrieved knowledge can then be better organized and structured to develop models, which in the end, would guide the prediction of new targets based on the pathway's environment [24,26-28,35,36].
In this report, we present a systems analysis framework to examine how protein-protein interactions within these systems relate to multi-cellular functions, and how high throughput technologies allow the study of the different aspects of signalling networks for modelling. We assume that since mammalian cells are constantly remodelling their transcriptional activity profiles in response to a combination of inputs, the understanding of their coordinated responses have been lacking, and in essence requires a framework which examines the system or systems by extracting information on their topological and functional properties. An example of a system activated in response to a variety of signals is the NF-κB pathway [19,37-40] (a family of proteins which functions as DNA-binding proteins and transcription factors); the disruption of which in recent years have been shown to contribute towards the many human diseases presently known. We also know from literature [41-43] that the NF-κB network does not exist in isolation, since many of its mechanisms have been shown to integrate their activity with other cell signalling networks. Such as the p53 system [17,44-49] (another transcriptional activator that plays an important role in the regulation of apoptosis) and the E2F-1 [50-53] - a cell cycle transcriptional target that controls the expression of a number of genes needed for DNA synthesis and progression into S phase [46,49,54-59]. It is thought that the cooperation between p53, NF-κB and E2F-1 is most likely to reflect on their ability to function together to induce expression of target genes regulated by promoters containing p53, NF-κB and E2F-1 binding sites [53,60,61], since target genes translated to proteins in one way or another affect the individual system in a positive or negative way.
To capture the possible events involved in the pathways, only proteins involved in the oscillatory feedback loops of the systems were considered - which are ubiquitous feature of the biological examples given which can be adapted to yield distinct system level properties [16,17,40,62]. To generate the networks, the molecular components and their interactions were extracted from publicly available datasets [20-23]. In addition, associations of these networks with some cell cycle proteins, in particular, the G1/S phase cell cycle proteins [63,64] were also examined. Cell cycle proteins were considered since previously published literature showed some of its proteins to be activated by one pathway and to be relevant for the regulation of another [44,65-70]; and thus may be useful in showing a level of complexity not visible by looking at the NF-κB and p53 systems alone. We next identified key nodes of significant influence in the isolated systems investigated using some graph theories of network biology, namely, degree, vertex betweenness, and clustering coefficient measures. We used shortest paths calculation to find connecting nodes, most likely responsible for the propagation of transduction signals across the networks. And cross referencing them with reference databases, the interpretation of the functional properties of these key nodes, as well as, the highly ranked connecting nodes within the systems were realised. The idea is that through the data mined reconstructed signal transduction pathways which are comparable to the previously modelled networks of the real system, a phenomenological model of all the published data can be derived from which the key components of the system can be highlighted for further analysis. In fact, as we will show in this report, it is possible to reconstruct signalling networks in this way without additional constraint.
Methods
The development of high-throughput molecular assay technologies, as well as breakthroughs in information processing and storage technologies provide integrated views of biological and medical information. Databases enabling systematic data mining on bio-molecular interactions, pathways and molecular disease associations are becoming increasingly available, which it is hoped will facilitate the understanding of the dynamics of biological function in complex diseases. Summarised below are descriptions of the analytical methods used in this study - see Figure 1 for a schematic representation of the framework.
Definition of Reference Databases
Over the last few years many of the experimental data from gene expression studies have been made freely available for academic research in the form of reference databases [20-23] of which several exist. These different databases have their strengths and weaknesses and there is no universal method best for storing these data sets. A number of different approaches have been used to extract signalling data and integrate them for biologically valid conclusions to be drawn from the vast and comprehensive data sets available [71,72]. Table 1 lists a description of the individual databases used in this study, each of which was used to retrieve information related to the proteins considered. These databases contain information on proteins, protein interactions and biological processes.
Table 1.
Database | Description | URL | Statistics | Data extracted |
---|---|---|---|---|
Uniprot [20] |
comprehensive, high-quality and freely accessible resource of protein sequence and functional information. | http://www.uniprot.org | 220,325 entries | function,, post-translation modification, location, developmental stage, etc. |
I2d [21] |
on-line database of known and predicted mammalian and eukaryotic protein-protein interactions | http://ophid.utoronto.ca/ | 424,066 entries (92,561 for human) | protein interaction |
Reactome [22] |
curated resource of core pathways and reactions in human biology. | http://www.reactome.org | 928 pathways for human | Pathway |
PID [23] |
curated pathway and interactions | http://pid.nci.nih.gov/ | 133 pathways | Pathway |
Data extraction and data-mining
The concerted efforts of genetics, molecular biology, biochemistry and physiology have led to the accumulation of an enormous amount of data on molecular components of signalling networks reported in the literature or stored in databases [73]. The availability of these vast amounts of data provides an opportunity for investigating further the design principles underlying structure and dynamics of signalling networks [71,72,74]. However, these data are diverse and dispersed in different databases. For this reason, data mining is employed and takes the responsibility of mining this amount of data in the hope that it will return useful hypotheses supporting life sciences. Due to its capability of processing different kinds of data, data mining has the ability to integrate these spread-out data in a unified framework thus solving more efficiently the problems that may arise due to their differences [29,30,32].
We started by looking into four databases: Universal Protein Resource (Uniprot), Interologous Interaction Database (i2d), Reactome and Pathway Interaction Database (PID), which we have listed in Table 1. Since different databases have different names for each entry, the Uniprot name for identifying proteins was used as the standard and thus all protein names were converted accordingly to their Uniprot counterparts. In addition, in the Uniprot database, protein information is published for a wide-range of organisms and curated from different sources. A search for p53 in Uniprot returns 1,624 results, such as [Uniprot:P04637] (P53_HUMAN) for human, [Uniprot:P02340] (P53_MOUSE) for mouse, [Uniprot:P13481] (P53_CERAE) for green monkey. To assure the proteins extracted from Uniprot are the exact proteins from the organism of interest, a form of verification was implemented, where the identity of the mined data is confirmed through a form of literature search. This step avoids the confusion and ambiguity that often occurs when mining and integrating multiple data. Table 2 lists the search proteins considered in the study (highlighted proteins are proteins reported to be activated in one system and involved in the regulation of another).
Table 2.
Network | Uniprot accession | Uniprot entry name | Alternative name |
---|---|---|---|
p53 pathway | P04637 | P53_HUMAN | p53 |
Q00987 | MDM2_HUMAN | mdm2 | |
P38936 | CDN1A_HUMAN | p21 | |
Q8N726 | CD2A2_HUMAN | p14ARF | |
NF-κB pathway | O00221 | IKBE_HUMAN | NF-κB inhibitor epsilon |
O14920 | IKKB_HUMAN | IKK2 | |
O15111 | IKKA_HUMAN | IKK1 | |
P19838 | NFKB1_HUMAN | Nuclear factor NF-κB p105 subunit | |
P25963 | IKBA_HUMAN | IκB-alpha | |
Q00653 | NFKB2_HUMAN | Nuclear factor NF-κB p100 subunit | |
Q01201 | RELB_HUMAN | Transcription factor RelB | |
Q04206 | TF65_HUMAN | Transcription factor p65 (RelA) | |
Q04864 | REL_HUMAN | C-Rel protein | |
Q14164 | IKKE_HUMAN | Inhibitor of nuclear factor κB kinase subunit epsilon | |
Q15653 | IKBB_HUMAN | NF-kappa-B inhibitor beta | |
Q96HD1 | CREL1_HUMAN | Crel1 | |
Q6UXH1 | CREL2_HUMAN | Crel2 | |
Q9Y6K9 | NEMO_HUMAN | IKKγ | |
G1/S phase cell | P24385 | CCND1_HUMAN | Cyclin D1 |
cycle proteins | Q01094 | E2F1_HUMAN | E2F-1 |
P06400 | RB_HUMAN | Rb | |
P46527 | CDN1B_HUMAN | P27 |
The proteins have been listed according to their Uniprot accession names.
Using the i2d database, information on protein-protein interactions was extracted. Such information is potentially useful in identifying proteins and their families, the interplay with their interacting partners, the influence of certain proteins in a network and key regulatory relationships which are most influenced by extracellular signals. More comprehensive knowledge concerning the proteins of interest and their connector proteins, for example, biological process, cellular component, coding sequence diversity, developmental stage, disease, domain, ligand, molecular function and post-translation modification were also extracted. For elucidating the functional consequences of the interactions, the Reactome database - which gives pathway information by combining with graph information of the PID database - was the database of choice. Table 3 presents a list of pathways and/or processes the explored proteins were revealed to be involved in. The data mining implementation was done in Perl programming language http://www.perl.org/ and derived from BioPython library http://biopython.org/wiki/Main_Page.
Table 3.
Uniprot accession | Uniprot entry name | Pathway |
---|---|---|
P25963 | IKBB_HUMAN | [2 processes]: Signalling in Immune system; Signalling by NGF |
Q15653 | REL_HUMAN | Signalling in Immune system |
O15111 | IKKB_HUMAN | [2 processes]: Signalling in Immune system; Signalling by NGF |
O14920 | IKKE_HUMAN | [2 processes]: Signalling in Immune system; Signalling by NGF |
P19838 | IKBA_HUMAN | [2 processes]: Signalling in Immune system; Signalling by NGF |
Q00653 | IKBE_HUMAN | Signalling in Immune system |
Q04206 | IKBZ_HUMAN | [2 processes]: Signalling in Immune system; Signalling by NGF |
P04637 | P53_HUMAN | Cell Cycle Checkpoints |
P38936 | CDN1A_HUMAN | [3 processes]: Cell Cycle Checkpoints; Cell Cycle, Mitotic; DNA Replication |
Q00987 | MDM2_HUMAN | [2 processes]: Cell Cycle Checkpoints; Signalling by NGF |
P46527 | CDN1B_HUMAN | Signalling by NGF |
Reactome can either be directly browsed or queried by text search using, for instance using UniProt accession numbers, to identify events or pathways considered search proteins are involved in.
Network Biology
The actions of specific proteins in a network have been investigated in this report. A network can be described as a series of nodes/vertices that are connected to each other by links. Formally it was referred to as a graph and the links as edges [26,75-77]. The nodes in biological networks are the gene products/proteins and the links the interactions between two components [13,78]. A number of metrics have been used to characterise the networks of the systems studied:
• The first, the degree (or connectivity) of a node/vertex k, indicates how many links/edges the node has to the other nodes. Of particular importance is the degree distribution P(k), which measures the probability that a selected node has exactly k links. The degree distribution is used to distinguish between the different classes of network (which has not been reported in this account).
• The second, vertex betweenness (Bi) is a measure of the centrality and influence of nodes in the networks [79-82].
• The third, average clustering coefficient C(k), characterises the overall tendency of nodes to form clusters or groups; and C(k) the average clustering coefficient of all nodes with k links is an important measure of the network structure [15].
• And finally, the shortest path, which is found between two vertices (or nodes) such that the sum of the weights of its constituent edges is minimized [82,83].
A graph G(E, V) consists of a set of vertices (V) and a set of edges (E) between them. An edge eij connects vertex vi with vertex vj. Here, undirected graph is investigated since our studied protein interaction networks are undirected. An undirected graph has the property that eij and eji are considered identical. Therefore, the neighbourhood N for a vertex vi is defined as it's immediately connected neighbours in Eq. (1):
(1) |
where the degree ki of a vertex is defined as the number of vertices |Ni|, in its neighbourhood Ni.
The betweenness centrality of a vertex vi is defined as the number of shortest paths between pairs of other vertices that run through vi as Eq. (2):
(2) |
where i ≠ j ≠ k, gjk is the number of equally shortest paths between nodes vj and vk, and gjk(i) the number of the shortest paths where node vi is located [84].
The clustering coefficient Ci for a vertex vi is given by the proportion of links between the vertices within its neighbourhood divided by the number of links that could possibly exist between them [15]. Therefore, if a vertex vi has ki neighbours, ki(ki-1)/2 edges could exist among the vertices within the neighbourhood where the clustering coefficient for undirected graphs can be defined as Eq.(3):
(3) |
For the shortest path, given a real-value weight function f: E → R, and a start node vi of V, we find a path p of P (the set of paths) from vi to each vj of V if present (Eq. (4), so that
(4) |
If the protein-protein interaction networks here constitute an unweighted graph, the weight function f can be considered as a path length l (the number of edges in path p). In this case, the shortest path problem is to find a path p having the minimal path length. A Breadth-First Search algorithm [82,83] has been employed to find the shortest paths between two nodes (the starting node vi and destination node vj) (see Figure 2). The shortest paths may have different path lengths (l = 1, l = 2, l = 3, l = 4, etc.). In the example shown in Figure 2, there are different shortest paths from start node P1 to destination nodes (P5, P8, P10, P11) via different connector nodes (P6, P7, P9). If the path length is 1, this signifies a direct connection, where two nodes are directly connected (e.g., P1 and P11). For the shortest paths with l = 2, there are three nodes: a start node (P1), a connector node (P9), and a destination node (P10). Using this form of analysis the path lengths were used to obtain knowledge on the functional interactions between the proteins. For the purpose of this report we will only discuss findings for the shortest paths between two nodes of interest with path length l = 1 or l = 2; their connector nodes and their frequency ranking (fi) [see Additional file 1: Suppl. 1-5 for the full list of shortest paths with other path lengths]. A node is said to have a high frequency if there is an increase in the number of paths passing through it; thus a high frequency node may be the centre of the networks' cross talk. For a given set of 2-length shortest paths [26,85] {p1, p2, p3, ... pn} between two sets of nodes Vi and Vj, with v1, v2, v3 as connectors of those paths, if frequency f1 of v1 = 2 (if v1 is the connectors of two paths); f2 = 10 (if v2 is the connector of ten paths); f3 = 5 (if v3 is the connector of five paths); then the results of the ranking is v2 - v3 - v1. The highly ranked nodes obtained, is then suggested to be the most important nodes within the network.
Network biology computation was implemented in CoSBiLab-Graph http://www.cosbi.eu/index.php/research/prototypes/overview. CoSBiLab-Graph is a tool suitable for a variety of tasks on graphs like construction, visualisation, and modification. CoSBiLab-Graph can be used to calculate measures, run algorithms and layout graphs. The network visualisation is performed by the software NAViGaTOR ((Network Analysis, Visualization, & Graphing TORonto)) http://ophid.utoronto.ca/navigator/. NAViGaTOR is the add-in software package of i2d database, and thus supports the simulation of the protein interaction networks extracted from i2d in this report. Other network analysis tools used are Social Network Analysis Software http://www.analytictech.com/, and Centralities in Biological Networks http://centibin.ipk-gatersleben.de/.
Results and Discussion
Recognising that individual signalling pathways do not act in isolation, an integrated approach to investigate the dynamic relationships between components, their organisation and regulation in signalling systems was undertaken. We started by searching the i2d database (containing 92,561 human protein interactions) for the proteins of interest. This search retrieved a total of 1,881 protein-protein interactions for components of p53 and NF-κB networks (see Table 2). To increase the confidence in the extracted interactions information, we excluded 47 interactions shown to have been derived from other organisms (other than human) by homologous methods, so that the number of protein interactions obtained involving both the NF-κB and p53 networks consists of 1,834 interactions. Information on protein-protein interactions within the NF-κB and p53 pathways were also retrieved and analysed. Finally, the interlinking connections between the NF-κB and p53, and proteins involved in the G1/S phase of the cell cycle (in particular, RB_HUMAN, CCND1_HUMAN, CDN1B_HUMAN, CD2A2_HUMAN, E2F1_HUMAN and CDN1A_HUMAN) were also investigated (see Table 4 for statistical information retrieved for the networks).
Table 4.
Network | Number of nodes | Number of interactions | Number of articulation points |
---|---|---|---|
p53 | 436 | 506 | 7 |
NF-κB | 788 | 1352 | 15 |
Cell cycle - Cyclin D1, Rb, E2F-1, p27 | 527 | 299 | 4 |
NF-κB and p53 | 1105 | 1834 | 18 |
NF-κB, p53 and Rb, E2F-1 | 1208 | 2032 | 20 |
NF-κB, p53 and Cyclin D1, Rb, E2F-1, p27 | 1239 | 2127 | 22 |
Network of Interactions
Following data extraction, descriptive analysis of the data was performed. The degree, betweenness and cluster coefficient values for the network's components were calculated in order to ascertain the level of connectivity of the three systems. Figure 3 illustrates the molecular interactions obtained for the NF-κB, p53 and the G1/S phase cell cycle, respectively. Figure 3A and Table 4 show for the proteins in the p53 network, 506 interactions and 436 nodes. Seven of which are articulation points (four original search nodes (in red) and three other associated nodes obtained from the extraction process (in cyan)). Articulation nodes (or cut vertex) [86,87] are nodes that play an important role in a network, where the removal of the node may drastically alter the network topology leading to it's fragmentation. Conversely, for the NF-κB network (see Table 4 & Figure 3B) 788 nodes and 1,352 interactions were observed. The articulation points were fifteen in number, fourteen of which were the search proteins considered (in yellow) and an associated TIP60_HUMAN (in cyan) obtained during the extraction process. A subset of the highest connectivity or degree values are shown in Table 5 and 6 [see Additional file 1: Suppl. 6-10 for connectivity values obtained for nodes not included in the Tables]. We found that for the three networks examined, the calculated degree for the initial list of proteins, with the exception of the CREL2 protein in the NF-κB network (Table 2), were discovered to be much higher than the associated proteins found during the mining process; and therefore underscored the central role of the initial list within their individual networks (search proteins highlighted on Table 5 and 6; please note other nodes - TIP60_HUMAN in the NF-κB network (Figure 3B), and TCP4_HUMAN, PINX1_HUMAN and PM14_HUMAN in the p53 network (Figure 3A) - are associated articulation points).
Table 5.
Network p53 | Network NF-κB | ||||||
---|---|---|---|---|---|---|---|
Uniprot accession | Uniprot entry name | Degree | Clustering coefficient | Uniprot accession | Uniprot entry name | Degree | Clustering coefficient |
P04637 | P53_HUMAN | 300 | 9.10E-04 | Q14164 | IKKE_HUMAN | 324 | 1.90E-04 |
Q00987 | MDM2_HUMAN | 72 | 0.0133 | Q04206 | TF65_HUMAN | 186 | 0.01796 |
P38936 | CDN1A_HUMAN | 72 | 0.00509 | Q9Y6K9 | NEMO_HUMAN | 157 | 0.01764 |
Q8N726 | CD2A2_HUMAN | 43 | 0.00664 | Q00653 | NFKB2_HUMAN | 145 | 0.03218 |
P53999 | TCP4_HUMAN | 12 | 0 | P19838 | NFKB1_HUMAN | 118 | 0.04578 |
Q9Y3B4 | PM14_HUMAN | 10 | 0 | P25963 | IKBA_HUMAN | 85 | 0.05546 |
P49459 | UBE2A_HUMAN | 3 | 0.66667 | O14920 | IKKB_HUMAN | 75 | 0.07279 |
Q16665 | HIF1A_HUMAN | 3 | 0.66667 | Q15653 | IKBB_HUMAN | 73 | 0.06963 |
P06748 | NPM_HUMAN | 3 | 0.66667 | O15111 | IKKA_HUMAN | 71 | 0.08089 |
P25490 | TYY1_HUMAN | 3 | 0.66667 | Q01201 | RELB_HUMAN | 64 | 0.06399 |
P62988 | UBIQ_HUMAN | 3 | 0.66667 | O00221 | IKBE_HUMAN | 48 | 0.11702 |
P51959 | CCNG1_HUMAN | 3 | 0.66667 | Q04864 | REL_HUMAN | 38 | 0.14794 |
Q92793 | CBP_HUMAN | 3 | 0.66667 | Q96HD1 | CREL1_HUMAN | 12 | 0.68182 |
P62081 | RS7_HUMAN | 3 | 0.66667 | P07437 | TBB5_HUMAN | 12 | 0.68182 |
Q99816 | TS101_HUMAN | 3 | 0.66667 | P62158 | CALM_HUMAN | 12 | 0.68182 |
The table is arranged in descending order. Only the first fifteen proteins within the network with high degree values are listed.
Table 6.
Uniprot accession | Uniprot entry name | Degree | Clustering coefficient |
---|---|---|---|
P06400 | RB_HUMAN | 156 | 0.00232 |
P46527 | CDN1B_HUMAN | 50 | 0.00245 |
P24385 | CCND1_HUMAN | 48 | 0.01330 |
Q01094 | E2F1_HUMAN | 48 | 0.01418 |
Q13309 | SKP2_HUMAN | 3 | 0.33333 |
P78396 | CCNA1_HUMAN | 3 | 0.33333 |
P08047 | SP1_HUMAN | 3 | 0.66667 |
P24941 | CDK2_HUMAN | 3 | 0.66667 |
P38398 | BRCA1_HUMAN | 3 | 0.66667 |
P11802 | CDK4_HUMAN | 3 | 0.66667 |
P20248 | CCNA2_HUMAN | 2 | 0.00000 |
Q9NQX5 | NPDC1_HUMAN | 2 | 0.00000 |
P00519 | ABL1_HUMAN | 2 | 0.00000 |
P33993 | MCM7_HUMAN | 2 | 0.00000 |
P30281 | CCND3_HUMAN | 2 | 0.00000 |
The highest-degree node (or connectivity) uncovered for the NF-κB network was IKKE_HUMAN, a protein responsible for inhibiting the NF-κB inhibitory subunits with 324 interactions (see Table 5) [88]. A discovery that suggests IKKE_HUMAN to be the most studied protein of the NF-κB system; and maybe a possible molecular target for therapy in the NF-κB system. In addition to this, four other proteins were found to have interacting proteins numbering over 100. These were: TF65_HUMAN (RelA), NEMO_HUMAN (IKKγ), NFKB2_HUMAN (p52), and NFKB1_HUMAN (p50) [Note - this finding could also be a reflection of the fact that these proteins may be the most studied members of the NF-κB network]. For the cell cycle network, the highly connected nodes were four in number (see Figure 3C and Table 4). Compared to the NF-κB network (Figure 3B), the p53 (Figure 3A) and the cell cycle (Figure 3C) networks appeared to be sparse, with each node connected to a relatively small number of edges within the network, many of whom "know" each other. The sparse nature could be explained by the fact that only proteins involved in the oscillatory feedback loops of the systems of interest, and not the entire published members were considered in this study. The highest-degree node for the p53 network was the P53_HUMAN protein, and RB_HUMAN for the selected cell cycle proteins (both with degree connectivity value's, 300 and 156 respectively - Table 5 and 6); a result signifying their importance in their various networks. Similarly, the vertex betweenness measure [80] also confirms IKKE_HUMAN, P53_HUMAN and RB_HUMAN as prominent nodes in their networks (Table 7) [see Additional file 1: Suppl. 11-13 for results obtained from other centrality measures]. In addition, various highly interconnected subgroups were also uncovered, namely: P53_HUMAN with MDM2_HUMAN; RELB_HUMAN with NFKB2_HUMAN; and E2F1_HUMAN with RB_HUMAN [see Additional file 1: Suppl. 1-3]. These subgroups could also be described as network motifs [89-91], frequently recurring groups of interactions, usually highly conserved, which are thought to perform specific information processing roles in the networks; in some cases supporting their roles as oscillators [5,18,63,92].
Table 7.
Network p53 | Network NF-κB | Cell Cycle | ||||||
---|---|---|---|---|---|---|---|---|
Uniprot accession | Uniprot entry name | Bi | Uniprot accession | Uniprot entry name | Bi | Uniprot accession | Uniprot entry name | Bi |
P04637 | P53_HUMAN | 81612.87 | Q14164 | IKKE_HUMAN | 166771.25 | P06400 | RB_HUMAN | 26218.55 |
P38936 | CDN1A_HUMAN | 22279.58 | Q04206 | TF65_HUMAN | 70319.46 | P24385 | CCND1_HUMAN | 9196.14 |
Q00987 | MDM2_HUMAN | 18352.39 | Q9Y6K9 | NEMO_HUMAN | 60543.29 | Q01094 | E2F1_HUMAN | 8081.61 |
Q8N726 | CD2A2_HUMAN | 9223 | Q00653 | NFKB2_HUMAN | 48010.40 | P46527 | CDN1B_HUMAN | 7318.69 |
P53999 | TCP4_HUMAN | 6801.47 | P19838 | NFKB1_HUMAN | 40763.53 | P24941 | CDK2_HUMAN | 1481.39 |
Q96BK5 | PINX1_HUMAN | 4250 | P25963 | IKBA_HUMAN | 40680.12 | P11802 | CDK4_HUMAN | 1481.39 |
Q9Y3B4 | PM14_HUMAN | 3870 | Q15653 | IKBB_HUMAN | 21318.82 | Q13309 | SKP2_HUMAN | 1195.79 |
P68400 | CSK21_HUMAN | 1706.11 | Q01201 | RELB_HUMAN | 17557.99 | P78396 | CCNA1_HUMAN | 1195.79 |
P20226 | TBP_HUMAN | 1607.64 | O14920 | IKKB_HUMAN | 15356.68 | P38398 | BRCA1_HUMAN | 872.01 |
Q09472 | EP300_HUMAN | 1607.64 | O15111 | IKKA_HUMAN | 14387.00 | P08047 | SP1_HUMAN | 872.01 |
P41235 | HNF4A_HUMAN | 1192.02 | O00221 | IKBE_HUMAN | 12109.01 | Q9Y3I1 | FBX7_HUMAN | 614.12 |
P12004 | PCNA_HUMAN | 514.08 | Q92993 | KAT5_HUMAN | 8503 | Q00526 | CDK3_HUMAN | 614.12 |
P21675 | TAF1_HUMAN | 415.62 | Q96HD1 | CREL1_HUMAN | 7785 | P20248 | CCNA2_HUMAN | 614.12 |
P06748 | NPM_HUMAN | 363.33 | Q04864 | REL_HUMAN | 6563.01 | P30281 | CCND3_HUMAN | 614.12 |
P08238 | HS90B_HUMAN | 363.33 | P07437 | TBB5_HUMAN | 3982.81 | P30279 | CCND2_HUMAN | 614.12 |
Results obtained by vertex betweenness produced similar results to the degree of connectivity index reported in Tables 5 and 6.
Following the characterisation of the three networks with respect to their degree of connectivity, further calculations were made on their clustering coefficients. It was discovered that MDM2_HUMAN (mdm2) in the p53 network, REL_HUMAN (C-Rel) in the NF-κB network and E2F1_HUMAN (E2F-1) of the cell cycle were proteins found to have the highest clustering coefficient values; a finding reflecting on the nodes connectivity within their neighbourhood. That is to say, even though P53_HUMAN, RB_HUMAN and IKKE_HUMAN were found to be proteins with the most interaction within their individual networks; MDM2_HUMAN, REL_HUMAN (C-Rel) and E2F1_HUMAN were revealed to be proteins best at forming cliques in their networks.
Having discovered for each system, the highly connected nodes, as well as the nodes with the most number of neighbours, it was of interest to study how all the individual system studied relates to each other. In order to do this, we set out to calculate the shortest paths and the frequency of proteins linking the systems to one another; thereby identifying key connector proteins thought to assist in the transmission of information (or cross talk) across the three networks. It was hoped that through this form of analysis, characteristics of the connector proteins linking the systems will be uncovered.
Network of interactions between p53 and NF-κB pathways
Since it has been suggested that the topology of a network affects the spread of information carried by a signal and thus diseases [34], the network of interactions between the p53 and NF-κB systems were investigated. Figure 4 illustrates the complex network formed between the p53 and the NF-κB systems, and the connector proteins linking them (proteins in the p53 network are denoted in red, and those of the NF-κB are in yellow - Figure 4A). We found 365 paths connect proteins in the p53 network to proteins in the NF-κB network; among which, only two are direct connections and 295 require a connector protein. The two direct interactions were revealed to be between: P53_HUMAN and IKKA_HUMAN, and P53_HUMAN and IKBA_HUMAN proteins; illustrating potential connection route to consider when creating a unified model of the NF-κB and p53 system. Indirect links for the rest of the nodes were found to require protein mediators to act as connector proteins. The proteins acting as connectors between the two networks are shown in blue in Figure 4A, B and 4C. It is evident that the P53_HUMAN protein can itself act as a connecting protein between members of the NF-κB pathway and members of the p53 system (for example, CDN1A_HUMAN - P53_HUMAN - IKKA_HUMAN; and, MDM2_HUMAN - P53_HUMAN - IKKA_HUMAN).
After having determined the shortest paths linking the p53 and NF-κB systems, the identified connector proteins linking the two systems were grouped according to their frequency values, and cross referenced with reference databases, for the interpretation of their functional properties. Table 8 provides a list of the top ten connecting nodes with the most number of paths passing through it [see Additional file 1: Suppl. 14-18 for further information extracted for these proteins]. It shows the frequency values for the connector protein with shortest paths l = 2, and the biological processes associated with the connector protein. Members of the NF-κB and p53 network sharing the same connector protein are also listed in Table 8 (Note, only two examples for each connection with the same frequency values have been presented). Interestingly, Heat shock protein HSP 90-beta (HS90B_HUMAN) and Ubiquitin (UBIQ_HUMAN) were revealed to be important proteins with the highest frequency, f = 15, linking proteins in the p53 system to proteins in the NF-κB system. Two examples of each connection are: CD2A2_HUMAN - HS90B_HUMAN - TF65_HUMAN; and P53_HUMAN - HS90B_HUMAN - IKBB_HUMAN for HSP 90-beta; and MDM2_HUMAN - UBIQ_HUMAN - RELB_HUMAN; and CDN1A_HUMAN - UBIQ_HUMAN - NFKB2_HUMAN for Ubiquitin. The frequency results were useful in establishing the importance of shared proteins between systems.
Table 8.
Connector protein | Frequency | Biological process involved | Protein in p53 | Protein in NF-κB. |
---|---|---|---|---|
Heat shock protein HSP 90-beta (HS90B_HUMAN) | 15 | CD2A2_HUMAN | TF65_HUMAN | |
P53_HUMAN | IKBB_HUMAN | |||
Ubiquitin (UBIQ_HUMAN) | 15 | 13 processes]: APC-Cdc20 mediated degradation of Nek2A; APC/C:Cdh1-mediated degradation of Skp2; Apoptosis; Cdc20:Phospho-APC/C mediated degradation of Cyclin A; Cell Cycle Checkpoints; Cell Cycle, Mitotic; DNA Replication; HIV Infection; Regulation of activated PAK-2p34 by proteasome mediated degradation; Signalling by EGFR; Signalling by Wnt; Signalling in Immune system; Signalling by NGF | MDM2_HUMAN | RELB_HUMAN |
CDN1A_HUMAN | NFKB2_HUMAN | |||
Poly [ADP-ribose] polymerase 1 (PARP1_HUMAN) | 10 | CDN1A_HUMAN | NFKB1_HUMAN | |
P53_HUMAN | IKKB_HUMAN | |||
Stress-70 protein, mitochondrial (GRP75_HUMAN) | 9 | P53_HUMAN | REL_HUMAN | |
P53_HUMAN | NFKB1_HUMAN | |||
Heat shock cognate 71 kDa protein (HSP7C_HUMAN) | 9 | Membrane Trafficking | P53_HUMAN | IKBE_HUMAN |
P53_HUMAN | IKKE_HUMAN | |||
CREB-binding protein (CBP_HUMAN) | 8 | Gene Expression | MDM2_HUMAN | IKKA_HUMAN |
MDM2_HUMAN | IKKB_HUMAN | |||
Immunoglobulin heavy chain-binding protein (GRP78_HUMAN) | 8 | Hemostasis | P53_HUMAN | NFKB1_HUMAN |
P53_HUMAN | NFKB2_HUMAN | |||
Heat shock 70 kDa protein 1L (HS71L_HUMAN) | 8 | P53_HUMAN | REL_HUMAN | |
P53_HUMAN | RELB_HUMAN | |||
Heat shock protein HSP 90-alpha (HS90A_HUMAN) | 7 | P53_HUMAN | IKBB_HUMAN | |
P53_HUMAN | IKBE_HUMAN | |||
Nuclear receptor subfamily 3 group C member 1 (GCR_HUMAN) | 7 | Gene Expression | MDM2_HUMAN | NFKB1_HUMAN |
MDM2_HUMAN | NFKB2_HUMAN |
Network of interactions between p53, NF-κB and the G1/S phase of the Cell cycle
Since it has been suggested, that some cell cycle proteins are activated by one pathway and are relevant for the regulation of another [44,65-69], it was of interest to investigate the relationship between the NF-κB, p53 and the cell cycle systems. For this study, only events leading to the G1/S transition phase of the cell cycle, the point where NF-κB and p53 signal transduction events are active the most [93] were considered. We start by exploring the interactions between RB_HUMAN and E2F1_HUMAN cell cycle proteins, with members of the p53 and NF-κB networks. Figure 5 show the network obtained from this analysis. Proteins that link the proteins in the p53 and NF-κB networks to RB_HUMAN are denoted in green, whilst the proteins connecting the two networks to E2F1_HUMAN are in blue (Figure 5A). Common protein shared between the p53 and NF-κB networks have been represented in the form of green triangles (for links to RB_HUMAN) and blue triangles (for links with E2F1_HUMAN) (see Figure 5B). Closer evaluation of the interactions linking the p53 network to the cell cycle proteins (Table 9), identified 46 shortest paths for interactions with RB_HUMAN (44 of which are indirect links mediated by a single node and 2 direct links {CDN1A_HUMAN - RB_HUMAN [66,94]; MDM2_HUMAN - RB_HUMAN }); and 19 shortest paths for interactions with E2F1_HUMAN (17 of which are indirect links mediated by a single node, and 2 direct links {CD2A2_HUMAN - E2F1_HUMAN; P53_HUMAN - E2F1_HUMAN }). These results therefore suggest an active role of CDN1A_HUMAN (p21) and MDM2_HUMAN (mdm2) on the activity of the RB_HUMAN protein in the cell cycle. And thus implies possible connection routes to consider when constructing a unified model of the p53 and the G1/S phase of the cell cycle networks. Likewise, for the NF-κB network, 74 shortest paths were identified linking NF-κB proteins to RB_HUMAN (all of which were indirect links); and 36 shortest paths for interactions with E2F1_HUMAN (of which only a single direct link was observed, IKBA_HUMAN - E2F1_HUMAN).
Table 9.
p53 network and cell cycle proteins | ||||
---|---|---|---|---|
Rb | E2F-1 | P27 | Cyclin D1 | |
Direct link (l = 1) | 2 | 2 | 0 | 1 |
Path with l = 2 | 44 | 17 | 31 | 25 |
Total shortest paths of all length l (l = 1, l = 2, l = 3, l = 4, ...) | 46 | 19 | 31 | 26 |
NF-κB network and cell cycle proteins | ||||
Rb | E2F-1 | P27 | Cyclin D1 | |
Direct link (l = 1) | 0 | 1 | 0 | 0 |
Path with l = 2 | 72 | 35 | 33 | 46 |
Total shortest paths of all length l (l = 1, l = 2, l = 3, l = 4, ...) | 74 | 36 | 83 | 91 |
Path lengths with l = 1 is said to be a direct link. Total shortest path length is where l = 1, l = 2, l = 3 and l = 4
We repeated this analysis to include interactions between the rest of the G1/S cell cycle proteins (RB_HUMAN, CCND1_HUMAN, CDN1B_HUMAN, and E2F1_HUMAN) and the members of the p53 and NF-κB networks (see Figure 6 - only the connecting nodes linking CDN1B_HUMAN (p27, circle, yellow), CCND1_HUMAN (Cyclin D1, circle, magenta), RB_HUMAN (Rb, circle, green) and E2F1_HUMAN (E2F-1, circle, blue) to the p53 and NF-κB networks have been colour coded - Figure 6A and 6B). Shortest path lengths calculated for interactions between proteins in the p53 network and CDN1B_HUMAN, numbered 31 (all indirect links with path length = 2); and 26 for interactions with CCND1_HUMAN (25 indirect connection with path length = 2 and 1 direct connection {CDN1A_HUMAN - CCND1_HUMAN}). Similarly, for the NF-κB system, 83 shortest paths connecting CDN1B_HUMAN (33 of which have path length = 2), and 91 shortest paths connecting CCND1_HUMAN(46 of which are indirect links mediated by a single connector path length = 2) to members of the NF-κB network were determined (see Table 9 for shortest paths statistics). Frequency values and functional properties ascertained for nodes linking the p53 and cell cycle networks, as well for those linking the NF-κB with the cell cycle network have been reviewed in Table 10, 11, 12 and 13 [see Additional file 1: Suppl. 19-24 for a full list].
Table 10.
Connector protein | Frequency | Protein in NF-κB | Additional information of connector proteins |
---|---|---|---|
NFKB1_HUMAN | 9 | IKBA_HUMAN | Nuclear factor NF-kappa-B p105 subunit |
PARP1_HUMAN | 4 | NFKB2_HUMAN | Poly [ADP-ribose] polymerase 1 |
NCOA3_HUMAN | 3 | IKKA_HUMAN | Nuclear receptor coactivator 3 |
CUL1_HUMAN | 3 | IKBB_HUMAN | Cullin-1 |
CBP_HUMAN | 3 | TF65_HUMAN | CREB-binding protein (Involved in Gene Expression process) |
SP1_HUMAN | 2 | TF65_HUMAN REL_HUMAN |
Transcription factor Sp1 |
P53_HUMAN | 2 | IKKA_HUMAN | Cellular tumor antigen p53 |
TIP60_HUMAN | 1 | CREL1_HUMAN | Histone acetyltransferase HTATIP |
PHB_HUMAN | 1 | TF65_HUMAN | Prohibitin |
PA2G4_HUMAN | 1 | IKKE_HUMAN | Proliferation-associated protein 2G4 |
Table 11.
Connector protein | Frequency | Protein in NF-κB | Additional information of connector proteins |
---|---|---|---|
HSP7C_HUMAN | 10 | IKKB_HUMAN | Heat shock cognate 71 kDa protein (Involved in Membrane Trafficking process) |
HDAC2_HUMAN | 4 | NFKB1_HUMAN | Histone deacetylase 2 (Involved in 2 processes: Gene Expression; Signalling by NGF) |
ESR1_HUMAN | 3 | IKKB_HUMAN | Estrogen receptor |
HDAC1_HUMAN | 3 | IKKA_HUMAN | Histone deacetylase 1 (Involved in 2 processes: Gene Expression; Signalling by NGF) |
SMCA4_HUMAN | 3 | RELB_HUMAN | Probable global transcription activator SNF2L4 |
TBP_HUMAN | 3 | NFKB2_HUMAN | TATA-box-binding protein (Involved in 3 processes: Gene Expression; HIV Infection; Transcription) |
BRCA1_HUMAN | 2 | TF65_HUMAN | Breast cancer type 1 susceptibility protein (Involved in DNA Repair process) |
ANDR_HUMAN | 2 | TF65_HUMAN | Androgen receptor |
CEBPB_HUMAN | 2 | NFKB1_HUMAN | CCAAT/enhancer-binding protein beta |
CDK9_HUMAN | 2 | TF65_HUMAN | Cell division protein kinase 9 (Involved in 9 processes: Elongation arrest and recovery; Gene Expression; HIV Infection; HIV-1 elongation arrest and recovery; Pausing and recovery of HIV-1 elongation; Pausing and recovery of Tat-mediated HIV-1 elongation; Pausing and recovery of elongation; Tat-mediated HIV-1 elongation arrest and recovery; Transcription) |
Table 12.
Connector protein | Frequency | Protein in p53 | Additional information of connector proteins |
---|---|---|---|
P53_HUMAN | 2 | CDN1A_HUMAN | Cellular tumor antigen p53 |
CBP_HUMAN | 2 | MDM2_HUMAN | CREB-binding protein |
RB_HUMAN | 2 | CDN1A_HUMAN | Retinoblastoma-associated protein |
TIP60_HUMAN | 1 | MDM2_HUMAN | Histone acetyltransferase HTATIP |
SKP2_HUMAN | 1 | CDN1A_HUMAN | S-phase kinase-associated protein 2 |
PARP1_HUMAN | 1 | CDN1A_HUMAN | Poly [ADP-ribose] polymerase 1 |
ATM_HUMAN | 1 | MDM2_HUMAN | Serine-protein kinase ATM (Involved in 2 processes: Cell Cycle Checkpoints; DNA Repair) |
MDM4_HUMAN | 1 | MDM2_HUMAN | Protein Mdm4 |
CHK2_HUMAN | 1 | MDM2_HUMAN | Serine/threonine-protein kinase Chk2 (Involved in 3 processes: Cdc20:Phospho-APC/C mediated degradation of Cyclin A; Cell Cycle) |
CDK3_HUMAN | 1 | CDN1A_HUMAN | Cell division protein kinase 3 |
Table 13.
Connector protein | Frequency | Protein in p53 | Additional information of connector proteins |
---|---|---|---|
P53_HUMAN | 2 | CDN1A_HUMAN | Cellular tumor antigen p53 |
CBP_HUMAN | 2 | MDM2_HUMAN | CREB-binding protein |
RB_HUMAN | 2 | CDN1A_HUMAN | Retinoblastoma-associated protein |
TIP60_HUMAN | 1 | MDM2_HUMAN | Histone acetyltransferase HTATIP |
SKP2_HUMAN | 1 | CDN1A_HUMAN | S-phase kinase-associated protein 2 |
PARP1_HUMAN | 1 | CDN1A_HUMAN | Poly [ADP-ribose] polymerase 1 |
ATM_HUMAN | 1 | MDM2_HUMAN | Serine-protein kinase ATM (Involved in 2 processes: Cell Cycle Checkpoints; DNA Repair) |
MDM4_HUMAN | 1 | MDM2_HUMAN | Protein Mdm4 |
CHK2_HUMAN | 1 | MDM2_HUMAN | Serine/threonine-protein kinase Chk2 (Involved in 3 processes: Cdc20:Phospho-APC/C mediated degradation of Cyclin A; Cell Cycle) |
CDK3_HUMAN | 1 | CDN1A_HUMAN | Cell division protein kinase 3 |
Conclusion
A network is usually thought of as a coherent system that comprises of units interacting in some kind of orchestrated and regulated fashion - such that the emergent behaviour of the whole (i.e. the network) is recognisable and can be characterised. Once some of the behaviour is recognised, the system can be described at a level of detail appropriate to the system's behaviour whilst ignoring the details of the constituent parts. Since molecular networks are large and complex, with their components and their interactions quite heterogeneous characterising the relationship between structure and dynamics of the system makes it far from straightforward. Although research aiming at coping with these challenges has become very popular, it is important to bear in mind that the current efforts can only profit from a combined theoretical and experimental approach. This is where the approach presented in this paper becomes beneficial. The idea is that by combining both the data driven and knowledge driven strategies, direct and or combinatorial interaction parameters of many protein can be captured from the information gained, and can thus be used to construct, guide and or unify dynamical models of signal transduction pathways from which a realistic model of the systems behaviour can be determined. The resulting dynamical model can then provide the conceptual and explanatory linkage between the observed phenomena and the predicted.
This framework of computational modelling of molecular networks at various levels or organisation has the potential to allow cost effective experimentation and hypothesis exploration, computationally uncovering the behaviour of molecular species and combinatorial interactions that would be difficult and too expensive to carry out in a wet-lab setting. While, network topology analysis is thus useful for showing which proteins in the network depend on which other protein, it does not give us any further information on the regulatory effects of these dependencies. Despite these methodological limitations, our results offer a view, demonstrating the importance of elucidating the functional roles key or shared components play in the propagation of signals across transduction systems.
The main implication of the presented application is the recognition that changes in one signalling system, undoubtedly causes a ripple effect on the rest of the surrounding system - as shown by the extensive interconnection of the systems studied and their common shared components. It is hoped that the use of this form of analysis may also be beneficial in highlighting areas of research where very little is known for further future study.
Authors' contributions
AECI conceived the project and design. AECI and TPN prepared the data. TPN extracted processed data from public databases, implemented the algorithms and analysed the results. AECI and TPN wrote the paper. All authors read and approved the document.
Supplementary Material
Contributor Information
Adaoha EC Ihekwaba, Email: ihekwaba@cosbi.eu.
Phuong T Nguyen, Email: nguyen@cosbi.eu.
Corrado Priami, Email: priami@cosbi.eu.
Acknowledgements
This project has been partially funded by FIRB Project RBPR0523C3. The authors wish to thank colleagues at CoSBi: Ferenc Jordán, Sean Sedwards, Matteo Cavaliere and Ivan Mura for valuable discussions and editorial suggestions.
References
- Everitt BS. Cluster Analysis. London: Edward Arnold; 1993. [Google Scholar]
- Alberts B, Bray D, Lewis J, Raff M, Roberts K. Molecular biology of the cell. 4. Garland publishing; 2002. [Google Scholar]
- Nicolau M, Tibshirani R, Borresen-Dale AL, Jeffrey SS. Disease-specific genomic analysis: identifying the signature of pathologic biology. Bioinformatics. 2007;23(8):957–965. doi: 10.1093/bioinformatics/btm033. [DOI] [PubMed] [Google Scholar]
- Goldbeter A. Computational approaches to cellular rhythms. Nature. 2002;420(6912):238–245. doi: 10.1038/nature01259. [DOI] [PubMed] [Google Scholar]
- Nelson DE, Ihekwaba AE, Elliott M, Johnson JR, Gibney CA, Foreman BE, Nelson G, See V, Horton CA, Spiller DG. Oscillations in NF-kappaB signaling control the dynamics of gene expression. Science. 2004;306(5696):704–708. doi: 10.1126/science.1099962. [DOI] [PubMed] [Google Scholar]
- Goldbeter A. A model for circadian oscillations in the Drosophila period protein (PER) Proc R Soc Lond B Biol Sci. 1995;261(1362):319–324. doi: 10.1098/rspb.1995.0153. [DOI] [PubMed] [Google Scholar]
- Gagneur J, Casari G. From molecular networks to qualitative cell behavior. FEBS Lett. 2005;579(8):1867–1871. doi: 10.1016/j.febslet.2005.02.007. [DOI] [PubMed] [Google Scholar]
- Gagneur J, Krause R, Bouwmeester T, Casari G. Modular decomposition of protein-protein interaction networks. Genome Biol. 2004;5(8):R57. doi: 10.1186/gb-2004-5-8-r57. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bhalla US. Understanding complex signaling networks through models and metaphors. Prog Biophys Mol Biol. 2003;81(1):45–65. doi: 10.1016/S0079-6107(02)00046-9. [DOI] [PubMed] [Google Scholar]
- Kell DB. Metabolomics, machine learning and modelling: towards an understanding of the language of cells. Biochem Soc Trans. 2005;33(Pt 3):520–524. doi: 10.1042/BST0330520. [DOI] [PubMed] [Google Scholar]
- Yaffe MB. Signaling networks and mathematics. Sci Signal. 2008;1(43):eg7. doi: 10.1126/scisignal.143eg7. [DOI] [PubMed] [Google Scholar]
- Barabasi AL, Albert R. Emergence of scaling in random networks. Science. 1999;286(5439):509–512. doi: 10.1126/science.286.5439.509. [DOI] [PubMed] [Google Scholar]
- Barabasi AL, Oltvai ZN. Network biology: understanding the cell's functional organization. Nat Rev Genet. 2004;5(2):101–113. doi: 10.1038/nrg1272. [DOI] [PubMed] [Google Scholar]
- Strogatz SH. Exploring complex networks. Nature. 2001;410(6825):268–276. doi: 10.1038/35065725. [DOI] [PubMed] [Google Scholar]
- Watts DJ, Strogatz SH. Collective dynamics of 'small-world' networks. Nature. 1998;393(6684):440–442. doi: 10.1038/30918. [DOI] [PubMed] [Google Scholar]
- Hoffmann A, Levchenko A, Scott ML, Baltimore D. The IkappaB-NF-kappaB signaling module: temporal control and selective gene activation. Science. 2002;298(5596):1241–1245. doi: 10.1126/science.1071914. [DOI] [PubMed] [Google Scholar]
- Ma L, Wagner J, Rice JJ, Hu W, Levine AJ, Stolovitzky GA. A plausible model for the digital response of p53 to DNA damage. Proc Natl Acad Sci USA. 2005;102(40):14266–14271. doi: 10.1073/pnas.0501352102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lahav G, Rosenfeld N, Sigal A, Geva-Zatorsky N, Levine AJ, Elowitz MB, Alon U. Dynamics of the p53-Mdm2 feedback loop in individual cells. Nat Genet. 2004;36(2):147–150. doi: 10.1038/ng1293. [DOI] [PubMed] [Google Scholar]
- Ihekwaba AE, Wilkinson SJ, Waithe D, Broomhead DS, Li P, Grimley RL, Benson N. Bridging the gap between in silico and cell-based analysis of the nuclear factor-kappaB signaling pathway by in vitro studies of IKK2. Febs J. 2007;274(7):1678–1690. doi: 10.1111/j.1742-4658.2007.05713.x. [DOI] [PubMed] [Google Scholar]
- Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M. The Universal Protein Resource (UniProt) Nucleic Acids Research. 2005;33:D154–D159. doi: 10.1093/nar/gki070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown KR, Jurisica I. Unequal evolutionary conservation of human protein interactions in interologous networks. Genome Biology. 2007;8(5):R95. doi: 10.1186/gb-2007-8-5-r95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joshi-Tope G, Gillespie M, Vastrik I, D'Eustachio P, Schmidt E, de Bono B, Jassal B, Gopinath GR, Wu GR, Matthews L. Reactome: a knowledgebase of biological pathways. Nucl Acids Res. 2005;33(suppl 1):428–432. doi: 10.1093/nar/gki072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schaefer CF, Anthony K, Krupa S, Buchoff J, Day M, Hannay T, Buetow KH. PID: the Pathway Interaction Database. Nucl Acids Res. 2009;37(suppl_1):D674–679. doi: 10.1093/nar/gkn653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Priami C, Ballarini P, Qualia P. Proc 7th International Conference on Computational Methods in Systems Biology: 2009. Bologna: Springer; 2009. BlenX4Bio: BlenX for biologists. [Google Scholar]
- Dematte L, Priami C, Romanel A. The Beta Workbench: a computational tool to study the dynamics of biological systems. Brief Bioinform. 2008;9(5):437–449. doi: 10.1093/bib/bbn023. [DOI] [PubMed] [Google Scholar]
- Schlitt T, Brazma A. Current approaches to gene regulatory network modelling. BMC Bioinformatics. 2007;8(Suppl 6):S9. doi: 10.1186/1471-2105-8-S6-S9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schlitt T, Palin K, Rung J, Dietmann S, Lappe M, Ukkonen E, Brazma A. From gene networks to gene function. Genome Res. 2003;13(12):2568–2576. doi: 10.1101/gr.1111403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Priami C. Algorithmic Systems Biology. An opportunity for computer science. Communications of the ACM. 2009;52(5):80–88. doi: 10.1145/1506409.1506427. [DOI] [Google Scholar]
- Han J, Kamber M. Data Mining: Concepts and Techniques (The Morgan Kaufmann Series in Data Management Systems) San Francisco: Morgan Kaufmann; 2000. [Google Scholar]
- Wang JTL, Zaki MJ, Toivonen HTT, Shasha DE. Data Mining in Bioinformatics. London: Springer; 2005. [Google Scholar]
- Bhaskar H, Hoyle DC, Singh S. Machine learning in bioinformatics: A brief survey and recommendations for practitioners. Intelligent Technologies in Medicine and Bioinformatics. 2006;36(10):1104–1125. doi: 10.1016/j.compbiomed.2005.09.002. [DOI] [PubMed] [Google Scholar]
- Larranaga P, Calvo B, Santana R, Bielza C, Galdiano J, Inza I, Lozano JA, Armananzas R, Santafe G, Perez A. Machine learning in bioinformatics. Brief Bioinform. 2006;7(1):86–112. doi: 10.1093/bib/bbk007. [DOI] [PubMed] [Google Scholar]
- Jang H, Lim J, Lim J-H, Park S-J, Lee K-C, Park S-H. Finding the evidence for protein-protein interactions from PubMed abstracts. Bioinformatics. 2006;22(14):e220–226. doi: 10.1093/bioinformatics/btl203. [DOI] [PubMed] [Google Scholar]
- Lee DS, Park J, Kay KA, Christakis NA, Oltvai ZN, Barabasi AL. The implications of human metabolic network topology for disease comorbidity. Proc Natl Acad Sci USA. 2008;105(29):9880–9885. doi: 10.1073/pnas.0802208105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brazhnik P, de la Fuente A, Mendes P. Gene networks: how to put the function in genomics. Trends Biotechnol. 2002;20(11):467–472. doi: 10.1016/S0167-7799(02)02053-X. [DOI] [PubMed] [Google Scholar]
- Lecca P, Palmisano A, Ihekwaba A, Priami C. Calibration of dynamic models of biological systems with KInfer. Eur Biophys J. 2009. [DOI] [PubMed]
- Baud V, Karin M. Is NF-kappaB a good target for cancer therapy? Hopes and pitfalls. Nat Rev Drug Discov. 2009;8(1):33–40. doi: 10.1038/nrd2781. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghosh S, May MJ, Kopp EB. NF-kappa B and Rel proteins: evolutionarily conserved mediators of immune responses. Annu Rev Immunol. 1998;16:225–260. doi: 10.1146/annurev.immunol.16.1.225. [DOI] [PubMed] [Google Scholar]
- Ihekwaba AEC, Broomhead DS, Grimley R, Benson N, White MRH, Kell DB. Synergistic control of oscillations in the NF-kappaB signalling pathway. IEE Systems Biology. 2005;152(3):153–160. doi: 10.1049/ip-syb:20050050. [DOI] [PubMed] [Google Scholar]
- Ihekwaba AEC, Broomhead DS, Grimley RL, Benson N, Kell DB. Sensitivity analysis of parameters controlling oscillatory signalling in the NF-κB pathway: the roles of IKK and IκBα. Systems Biology. 2004;1(1):93–103. doi: 10.1049/sb:20045009. [DOI] [PubMed] [Google Scholar]
- Araki K, Kawauchi K, Tanaka N. IKK/NF-kappaB signaling pathway inhibits cell-cycle progression by a novel Rb-independent suppression system for E2F transcription factors. Oncogene. 2008;27(43):5696–5705. doi: 10.1038/onc.2008.184. [DOI] [PubMed] [Google Scholar]
- Kawauchi K, Araki K, Tobiume K, Tanaka N. Activated p53 induces NF-kappaB DNA binding but suppresses its transcriptional activation. Biochem Biophys Res Commun. 2008;372(1):137–141. doi: 10.1016/j.bbrc.2008.05.021. [DOI] [PubMed] [Google Scholar]
- Kawauchi K, Araki K, Tobiume K, Tanaka N. p53 regulates glucose metabolism through an IKK-NF-kappaB pathway and inhibits cell transformation. Nat Cell Biol. 2008;10(5):611–618. doi: 10.1038/ncb1724. [DOI] [PubMed] [Google Scholar]
- Webster GA, Perkins ND. Transcriptional cross talk between NF-kappaB and p53. Mol Cell Biol. 1999;19(5):3485–3495. doi: 10.1128/mcb.19.5.3485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pigolotti S, Krishna S, Jensen MH. Oscillation patterns in negative feedback loops. Proc Natl Acad Sci USA. 2007;104(16):6533–6537. doi: 10.1073/pnas.0610759104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schumm K, Rocha S, Caamano J, Perkins ND. Regulation of p53 tumour suppressor target gene expression by the p52 NF-kappaB subunit. Embo J. 2006;25(20):4820–4832. doi: 10.1038/sj.emboj.7601343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tergaonkar V, Bottero V, Ikawa M, Li Q, Verma IM. IkappaB kinase-independent IkappaBalpha degradation pathway: functional NF-kappaB activity and implications for cancer therapy. Mol Cell Biol. 2003;23(22):8070–8083. doi: 10.1128/MCB.23.22.8070-8083.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tergaonkar V, Pando M, Vafa O, Wahl G, Verma I. p53 stabilization is decreased upon NFkappaB activation: a role for NFkappaB in acquisition of resistance to chemotherapy. Cancer Cell. 2002;1(5):493–503. doi: 10.1016/S1535-6108(02)00068-5. [DOI] [PubMed] [Google Scholar]
- Tergaonkar V, Perkins ND. p53 and NF-kappaB crosstalk: IKKalpha tips the balance. Mol Cell. 2007;26(2):158–159. doi: 10.1016/j.molcel.2007.04.006. [DOI] [PubMed] [Google Scholar]
- Trimarchi JM, Lees JA. Sibling rivalry in the E2F family. Nat Rev Mol Cell Biol. 2002;3(1):11–20. doi: 10.1038/nrm714. [DOI] [PubMed] [Google Scholar]
- Frolov MV, Dyson NJ. Molecular mechanisms of E2F-dependent activation and pRB-mediated repression. J Cell Sci. 2004;117(Pt 11):2173–2181. doi: 10.1242/jcs.01227. [DOI] [PubMed] [Google Scholar]
- Frolov MV, Huen DS, Stevaux O, Dimova D, Balczarek-Strang K, Elsdon M, Dyson NJ. Functional antagonism between E2F family members. Genes Dev. 2001;15(16):2146–2160. doi: 10.1101/gad.903901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lim CA, Yao F, Wong JJ, George J, Xu H, Chiu KP, Sung WK, Lipovich L, Vega VB, Chen J. Genome-wide mapping of RELA(p65) binding identifies E2F1 as a transcriptional activator recruited by NF-kappaB upon TLR4 activation. Mol Cell. 2007;27(4):622–635. doi: 10.1016/j.molcel.2007.06.038. [DOI] [PubMed] [Google Scholar]
- Phillips AC, Ernst MK, Bates S, Rice NR, Vousden KH. E2F-1 potentiates cell death by blocking antiapoptotic signaling pathways. Mol Cell. 1999;4(5):771–781. doi: 10.1016/S1097-2765(00)80387-1. [DOI] [PubMed] [Google Scholar]
- Hitchens MR, Robbins PD. The role of the transcription factor DP in apoptosis. Apoptosis. 2003;8(5):461–468. doi: 10.1023/A:1025586207239. [DOI] [PubMed] [Google Scholar]
- Baguley BC, Marshall E. Do negative feedback oscillations drive variations in the length of the tumor cell division cycle? Oncol Res. 2005;15(6):291–294. doi: 10.3727/096504005776404544. [DOI] [PubMed] [Google Scholar]
- Barre B, Perkins ND. A cell cycle regulatory network controlling NF-kappaB subunit activity and function. Embo J. 2007;26(23):4841–4855. doi: 10.1038/sj.emboj.7601899. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- Perkins ND. Integrating cell-signalling pathways with NF-kappaB and IKK function. Nat Rev Mol Cell Biol. 2007;8(1):49–62. doi: 10.1038/nrm2083. [DOI] [PubMed] [Google Scholar]
- Campbell KJ, Perkins ND. Regulation of NF-kappaB function. Biochem Soc Symp. 2006. pp. 165–180. [DOI] [PubMed]
- Vousden KH. Outcomes of p53 activation--spoilt for choice. J Cell Sci. 2006;119(Pt 24):5015–5020. doi: 10.1242/jcs.03293. [DOI] [PubMed] [Google Scholar]
- Phillips AC, Vousden KH. E2F-1 induced apoptosis. Apoptosis. 2001;6(3):173–182. doi: 10.1023/A:1011332625740. [DOI] [PubMed] [Google Scholar]
- Lahav G. The strength of indecisiveness: oscillatory behavior for better cell fate determination. Sci STKE. 2004;2004(264):pe55. doi: 10.1126/stke.2642004pe55. [DOI] [PubMed] [Google Scholar]
- Calzone L, Gelay A, Zinovyev A, Radvanyi F, Barillot E. A comprehensive modular map of molecular interactions in RB/E2F pathway. Mol Syst Biol. 2008;4:173. doi: 10.1038/msb.2008.7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Novak B, Tyson JJ, Gyorffy B, Csikasz-Nagy A. Irreversible cell-cycle transitions are due to systems-level feedback. Nat Cell Biol. 2007;9(7):724–728. doi: 10.1038/ncb0707-724. [DOI] [PubMed] [Google Scholar]
- Dotto GP. p21(WAF1/Cip1): more than a break to the cell cycle? Biochim Biophys Acta. 2000;1471(1):M43–56. doi: 10.1016/s0304-419x(00)00019-6. [DOI] [PubMed] [Google Scholar]
- Sheahan S, Bellamy CO, Treanor L, Harrison DJ, Prost S. Additive effect of p53, p21 and Rb deletion in triple knockout primary hepatocytes. Oncogene. 2004;23(8):1489–1497. doi: 10.1038/sj.onc.1207280. [DOI] [PubMed] [Google Scholar]
- Kamijo T, Weber JD, Zambetti G, Zindy F, Roussel MF, Sherr CJ. Functional and physical interactions of the ARF tumor suppressor with p53 and Mdm2. Proc Natl Acad Sci USA. 1998;95(14):8292–8297. doi: 10.1073/pnas.95.14.8292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pomerantz J, Schreiber-Agus N, Liegeois NJ, Silverman A, Alland L, Chin L, Potes J, Chen K, Orlow I, Lee HW. The Ink4a tumor suppressor gene product, p19Arf, interacts with MDM2 and neutralizes MDM2's inhibition of p53. Cell. 1998;92(6):713–723. doi: 10.1016/S0092-8674(00)81400-2. [DOI] [PubMed] [Google Scholar]
- Stott FJ, Bates S, James MC, McConnell BB, Starborg M, Brookes S, Palmero I, Ryan K, Hara E, Vousden KH. The alternative product from the human CDKN2A locus, p14(ARF), participates in a regulatory feedback loop with p53 and MDM2. Embo J. 1998;17(17):5001–5014. doi: 10.1093/emboj/17.17.5001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joyce D, Albanese C, Steer J, Fu M, Bouzahzah B, Pestell RG. NF-kappaB and cell-cycle regulation: the cyclin connection. Cytokine Growth Factor Rev. 2001;12(1):73–90. doi: 10.1016/S1359-6101(00)00018-6. [DOI] [PubMed] [Google Scholar]
- Steffen M, Petti A, Aach J, D'haeseleer P, Church G. Automated modelling of signal transduction networks. BMC Bioinformatics. 2002;3(34) doi: 10.1186/1471-2105-3-34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allen EE, Fetrow JS, Daniel LW, Thomas SJ, John DJ. Algebraic dependency models of protein signal transduction networks from time-series data. Journal of Theoretical Biology. 2006;238(2):317–330. doi: 10.1016/j.jtbi.2005.05.010. [DOI] [PubMed] [Google Scholar]
- Ng SK, Tan SH. Discovering protein-protein interactions. Journal of Bioinformatics and Computational Biology. 2003;1(4):711–741. doi: 10.1142/S0219720004000600. [DOI] [PubMed] [Google Scholar]
- Jennifer AM, Dahesh S, Haynes J, Andrews BJ, Davidson AR. Protein-protein interaction affinity plays a crucial role in controlling the Sho1p-mediated signal transduction pathway in Yeast. Mol Cell. 2004;14:813–823. doi: 10.1016/j.molcel.2004.05.024. [DOI] [PubMed] [Google Scholar]
- Jeong H, Tombor B, Albert R, Oltvai ZN, Barabasi AL. The large-scale organization of metabolic networks. Nature. 2000;407(6804):651–654. doi: 10.1038/35036627. [DOI] [PubMed] [Google Scholar]
- Kim BJ, Yoon CN, Han SK, Jeong H. Path finding strategies in scale-free networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2002;65(2 Pt 2):027103. doi: 10.1103/PhysRevE.65.027103. [DOI] [PubMed] [Google Scholar]
- Spirin V, Mirny LA. Protein complexes and functional modules in molecular networks. Proc Natl Acad Sci USA. 2003;100(21):12123–12128. doi: 10.1073/pnas.2032324100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin C, Cho Y, Hwang W, Pei P, Zhang A. In: Knowledge Discovery in Bioinformatics: Techniques, Methods and Application. Xiaohua H, Yi P, editor. Hoboken, NJ: Wiley InterScience; 2006. Clustering Methods In Protein-Protein Interaction Network; pp. 319–355. [Google Scholar]
- Yu H, Kim PM, Sprecher E, Trifonov V, Gerstein M. The importance of bottlenecks in protein networks: correlation with gene essentiality and expression dynamics. PLoS Comput Biol. 2007;3(4):e59. doi: 10.1371/journal.pcbi.0030059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang S, Jin G, Zhang XS, Chen L. Discovering functions and revealing mechanisms at molecular level from biological networks. Proteomics. 2007;7(16):2856–2869. doi: 10.1002/pmic.200700095. [DOI] [PubMed] [Google Scholar]
- Zotenko E, Mestre J, O'Leary DP, Przytycka TM. Why do hubs in the yeast protein interaction network tend to be essential: reexamining the connection between the network topology and essentiality. PLoS Comput Biol. 2008;4(8):e1000140. doi: 10.1371/journal.pcbi.1000140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cormen TH, Leiserson CE, Rivest RL, Stein C. Introduction to Algorithms. Second. Cambridge, MA: The MIT Press; 2001. [Google Scholar]
- Knuth DE. Art of Computer Programming, Fundamental Algorithms. 3. Vol. 1. Reading, MA: Addison-Wesley Professional; 1997. [Google Scholar]
- Freeman LC. A set of measures of centrality based on betweenness. Sociometry. 1977;40(1):35–37. doi: 10.2307/3033543. [DOI] [Google Scholar]
- Schlitt T, Brazma A. Modelling in molecular biology: describing transcription regulatory networks at different scales. Philos Trans R Soc Lond B Biol Sci. 2006;361(1467):483–494. doi: 10.1098/rstb.2005.1806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chartrand G. Introductory Graph Theory. New York: Dover; 1985. [Google Scholar]
- Harary F Graph Theory 1994Reading MA: Addison-Wesley; 8613499 [Google Scholar]
- Chariot A, Leonardi A, Muller J, Bonif M, Brown K, Siebenlist U. Association of the adaptor TANK with the I kappa B kinase (IKK) regulator NEMO connects IKK complexes with IKK epsilon and TBK1 kinases. J Biol Chem. 2002;277(40):37029–37036. doi: 10.1074/jbc.M205069200. [DOI] [PubMed] [Google Scholar]
- Alon U. Network motifs: theory and experimental approaches. Nat Rev Genet. 2007;8(6):450–461. doi: 10.1038/nrg2102. [DOI] [PubMed] [Google Scholar]
- Milo R, Shen-Orr S, Itzkovitz S, Kashtan N, Chklovskii D, Alon U. Network motifs: simple building blocks of complex networks. Science. 2002;298(5594):824–827. doi: 10.1126/science.298.5594.824. [DOI] [PubMed] [Google Scholar]
- Kashtan N, Itzkovitz S, Milo R, Alon U. Topological generalizations of network motifs. Phys Rev E Stat Nonlin Soft Matter Phys. 2004;70(3 Pt 1):031909. doi: 10.1103/PhysRevE.70.031909. [DOI] [PubMed] [Google Scholar]
- Geva-Zatorsky N, Rosenfeld N, Itzkovitz S, Milo R, Sigal A, Dekel E, Yarnitzky T, Liron Y, Polak P, Lahav G. Oscillations and variability in the p53 system. Mol Syst Biol. 2006;2:2006 0033. doi: 10.1038/msb4100068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaltschmidt B, Kaltschmidt C, Hehner SP, Droge W, Schmitz ML. Repression of NF-kappaB impairs HeLa cell proliferation by functional interference with cell cycle checkpoint regulators. Oncogene. 1999;18(21):3213–3225. doi: 10.1038/sj.onc.1202657. [DOI] [PubMed] [Google Scholar]
- Garner E, Raj K. Protective mechanisms of p53-p21-pRb proteins against DNA damage-induced cell death. Cell Cycle. 2008;7(3):277–282. doi: 10.4161/cc.7.3.5328. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.