Abstract
Motivation
Signalling entropy measures the uncertainty or randomness in the signalling pathways of a biological system. It reflects the complexity and variability of protein interactions and can indicate how information is processed within cells. Higher signalling entropy often indicates a more dynamic and adaptive state, whereas lower entropy may imply a more stable and less responsive condition. Estimating signalling entropy has become a valuable method for studying and understanding the complexity of biological processes. This measure has the potential to shed valuable insights into various phenomena, including the mechanisms behind cell fate decisions, drug resistance, and disease progression. To examine the molecular changes within a system, signalling entropy is quantified through the integration of expression measurements and protein interaction networks. Experimental and computational issues, such as false positives and additional noise, can all compromise the accuracy of protein interaction networks. Correction methods can be used to mitigate spurious results, correct for experimental bias, and integrate data from multiple sources. However, to date, the effect of such approaches on entropy calculations, together with the impact of different underlying networks, has yet to be evaluated.
Results
Here, we investigate how the topology of distinct protein interaction networks can alter the entropy calculation. We examine the entropy derived from different protein interaction networks. Additionally, we systematically evaluate different correction strategies, outlining their benefits and drawbacks along with identifying the most effective approaches for specific types of data and biological scenarios. This protocol outlines how to optimize the reliability of entropy calculations and ultimately leads to a deeper comprehension of biological processes and disease mechanisms.
Keywords: signalling entropy, protein interaction networks, false-positive interactions, correction methods
Introduction
The concept of entropy, adopted from physics and information theory, quantifies the degree of unpredictability or randomness in a given system [1, 2]. In molecular signalling networks, entropy can be used to quantify the complexity rate and diversity of signals flowing through the network [3, 4] (Fig. 1). Promising applications of entropy-based measures of signalling dynamics include the prediction of drug responses in cancer cells and the identification of critical regulatory networks in disease models [5, 6]. The application of entropy quantification has proven to be likewise advantageous in the examination of cellular differentiation mechanisms. This process has been associated with the descent of a cell through Waddington’s epigenetic landscape consisting of several valleys, each of which indicates an alternative fate for the cell [7]. The higher up the valley a cell is positioned, the higher the number of ways it can specialize in. In this scenario, the potential of a cell to assume distinct states can be quantified with the global signalling entropy rate. As such, a matured cell will have a distinct signalling entropy compared to a multi-potent one [8].
Figure 1.

Overview of signalling entropy estimations. Starting from a biological process (e.g. cell differentiation, tumour progression), the corresponding biological samples are extracted for two or more conditions of interest. Expression data for each condition are inferred. To calculate signalling entropies, the expression patterns for each condition are integrated with protein interaction networks. These networks can be taken from different sources and filtered in a variety of ways. From this integration, signalling entropies are calculated and compared among different conditions.
Differences in network entropy have been seen across tumour stages, suggesting that it may be linked to tumour initiation and progression [9, 10]. Moreover, a novel measure, the sample-perturbed network entropy (SPNE) [11], has been recently introduced to identify critical states in tumour progression data. The warning signals provided by SPNE during tumour progression have the potential to enhance prompt and suitable clinical interventions by early detection. By correlating expression patterns with a thorough protein interaction signalling network, Banerji et al. [8] established a measure to quantify the overall signalling entropy of cellular signal transduction networks. Pathway Commons (PC) [12], STRING [13], and the Biological General Repository for Interaction Datasets (BioGRID) [14] are just some examples of the various databases currently available that provide information regarding molecular interactions (MINTs). Despite their supposed reliability, the protein–protein interactions (PPIs) collected in these databases are gathered using various sources and methods, including yeast two-hybrid and mass spectrometry experiments. The effectiveness of these methods has been demonstrated in aiding the discovery of new PPIs and in evaluating the interactions between existing proteins [15–18]. Other approaches for identifying such relations comprise text mining [19] and computational modelling; nevertheless, confirmation of these interactions typically requires additional experimental testing. Regardless of integrating multiple techniques to get a more thorough understanding of PPIs, both experimental and computational approaches for detecting molecular relations are prone to high false positives (FPs) and false negatives (FNs). The inherent noise in the data, along with the complexity of biological systems, contribute to these rates [20, 21]. In yeast two-hybrid screenings, on which most interaction databases are constructed, FP rates are estimated to reach up to 64%, whereas FN rates range from 43% to 71% [22]. Additionally, it has been shown that the scale-free topology of the interaction network contributes to the increased signalling entropy observed in cancer samples compared to the healthy samples, providing further evidence for the entropy’s dependence on the characteristics of the protein interaction network (PIN) [23]. From a more general perspective, the topological properties of regulatory networks are fairly good predictors of dynamics [24].
Here, we intend to determine a potential network architecture showing minimum interference with the global signalling entropy changes across distinct stages in biological data. First, we review extensively the existing PINs, entropy calculation strategies, and their employment in different published studies dealing with entropy calculations. Then, we move into two specific case studies. We initially examine the influence of PIN topology changes and demonstrate how the selected PIN impacts the final entropy measure on both in silico generated and real experimental data. We show that the PIN used to calculate the signalling entropy can alter the statistically significant differences among the entropies resulting from the gene expressions of distinct samples. Furthermore, by employing correction techniques for FP interactions in the PINs, we evaluate and outline how the changes, dependent on the selected PIN, can be stabilized and harmonized. We conclude our problem-solving protocol by suggesting an effective strategy when calculating network entropies to avoid biases in its estimation that could affect the final interpretation and the biological conclusions of the experiments.
Materials and methods
Background on signalling entropy calculation
As introduced above, quantifying signalling entropy rates (SRs) requires combining a protein interaction network with multiple expression measurements, thereby rendering its accuracy contingent upon the integrity of both constituents. Classically, signalling entropy has been introduced as a measure to distinguish between cellular states throughout the process of cellular differentiation [8, 30] and was subsequently also used to investigate the mechanisms involved in the initiation and progression of cancer [8, 10, 31, 32], as well as tumour heterogeneity and metastasis [9, 33]. The integration of a PIN with expression data further relies on the mathematical approach for the calculation of the signalling entropy. Below, we review adopted approaches in selecting PINs and entropy calculations.
PIN selection
As we are interested in capturing changes in signalling entropy resulting from variations in expression patterns (e.g. from multiple differentiation stages), it is imperative to minimize the impact of unreliable predicted interactions in the employed PIN, e.g. false-positive interactions. The choice of an appropriate experimental dataset is determined by the research question, but the choice of the PIN can also have a notable influence on the final outcomes [23]. The available PINs may vary in terms of the number of proteins involved, the number of interactions, the method used to detect protein interactions, and other related factors. An overview of the most commonly used PINs is given in Table 1. Here, we can observe that PIN databases can generally integrate interactions at different levels, in particular, protein and gene interactions. In addition, databases such as STRING also include predicted interactions via data mining and can integrate information from multiple databases to harmonise the existing information on protein interactions. This affects the final number of regulatory dependencies available in the PINs. Finally, different databases can integrate information from different organisms or limit their interactomes to single organisms, e.g. Human Protein Reference Database (HPRD) for human interactions. Thus, we could already observe a rather large heterogeneity in the construction and curation of interaction databases.
Table 1.
Overview of PIN databases
| Database | Protein interactions | Gene interactions | Predicted interactions | Integration of other databases | # Interactions | # Organisms |
|---|---|---|---|---|---|---|
| BioGrid [14] | ✓ | ✓ | X | X | 2 039 483 | 71 |
| STRING [13] | ✓ | ✓ | ✓ | ✓ | 67 592 464 | 14 094 |
| PC [12] | ✓ | ✓ | X | ✓ | 2 424 055 | Multiple |
| InnateDB [25] | ✓ | ✓ | ✓ | ✓ | 829 948 | 3 |
| HPRD [26] | ✓ | X | X | X | 41 327 | 1 |
| MINT [27] | ✓ | X | X | X | 133 087 | 668 |
| SIGNOR [28] | ✓ | X | X | ✓ | 35 246 | 3 |
| IntAct [29] | ✓ | X | X | X | 1 194 594 | 3527 |
| DIP [29] | ✓ | X | X | X | 81 923 | 834 |
A list of commonly used PIN databases is provided, including BioGrid, STRING, InnateDB, the HPRD, the MINT database, the SIGnaling Network Open Resource (SIGNOR), IntAct, and the Database of Interaction Proteins (DIP). For each database, we provide information on experimentally validated protein interactions (e.g. two-hybrid assay), genetic interactions (e.g. co-expression), and predicted interactions (e.g. text mining of scientific abstracts). We also report whether each database provides information from other interaction databases. Finally, we include the number (#) of interactions and organisms documented for each database. For PC, the exact number of organisms is not given. Cells coloured in violet indicate the datasets used in the use cases of the problem-solving protocol. The data displayed in the table were queried on 31 May 2023.
We have continued to revise the PINs used in studies calculating network entropies. An overview is provided in Table 2. It is noteworthy that despite the predominant use of PC in previous entropy studies, there is still no consensus on the choice of protein interaction networks. In particular, one study focused on data-driven co-expression networks rather than relying on an interaction database. In addition, different filtering techniques were used in the different studies, resulting in widely variable numbers of nodes and edges within the interaction network. In particular, some studies focus their filtering on Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations [34] or experimentally validated interactions. Others, instead, use the STRING database as a PIN and apply the internal confidence filter provided by the database, as well as filtering for human interactions only.
Table 2.
Overview of the PINs used in previously published work on signalling entropy estimation
| Publication | Year | PIN | # Proteins | # Interactions | Filtered | Filtering details |
|---|---|---|---|---|---|---|
| Teschendorff et al. [9] | 2010 | PC | 7112 | 31 678 | X | X |
| Banerji et al. [8] | 2013 | PC | 8434 | 303 600 | X | X |
| Teschendorff et al. [35] | 2014 | PC | 10 720 | 152 889 | ✓ | 4 included databases |
| Banerji et al. [33] | 2015 | PC | 8434 | 303 600 | X | X |
| Menichetti et al. [31] | 2015 | PC | 11 394 | 420 516 | ✓ | KEGG annotation |
| Cheng et al. [10] | 2016 | InnateDB | 13 579 | 113 473 | ✓ | Experimentally validated |
| Teschendorff et al. [30] | 2016 | HPRD | 8434 | 303 600 | X | X |
| Zhong et al. [11] | 2023 | STRING | – | – | X | |
| Gao et al. [36] | 2022 | Data-driven | – | – | – | – |
| Han et al. [37] | 2020 | STRING | 11 451 | 65 625 | ✓ | Human, confidence > 0.7 |
| Liu et al. [38] | 2020 | STRING | – | – | ✓ | Removal of isolated nodes, confidence > 0.8 |
For each published work, the reference and year of publication are given. Along with the employed PINs, details on the number (#) of proteins and covered interactions are also provided. Finally, we include the information on whether the selected PINs were reported to have been filtered by the original authors. If this is the case, we describe the filtering approach in detail. ‘–’ indicates that the corresponding information was unavailable within the original publication. The 4 encompassed databases (in row 3) correspond to HPRD, NCI-PID, IntAct, and MINT.
Strategies for the estimation of network entropies
As for the selection of the PINs, different studies have proposed a variety of strategies for calculating signalling entropies. An overview is provided in Table 3 and graphically depicted in Fig. 2. Likewise with selecting the experimental dataset, the methods for calculating the entropy of a system are dependent on the purpose of the study. One major characteristic distinguishing between the suggested methodologies regards the analysis of either single-sample entropies against a reference group or comparing groups of samples belonging to different classes. The single-sample approach is generally employed in detecting transition states, allowing the identification of involved biomarkers for individual cases. For instance, Zhong et al. [11] recommend examining the differences between the entropy computed on a progressive series of single samples and the entropy calculated on a set of reference samples to determine the key point of progression from a normal condition to a disease. On the other hand, one could evaluate the differences between the entropy rates obtained by two sets of samples representing two distinct states [8, 9, 23, 30, 33, 35, 39]. Another aspect to consider is whether it is more meaningful to analyse the local entropies [9, 10, 31, 35], defined for each node in the network, or the global entropy [8, 30, 33], describing the ambiguity of the whole system. While analysing the information flow around a single gene might be interesting in problems such as biomarker identification or understanding the dynamics of signalling molecules within a confined space; for investigating the overall complexity of a biological system and systemic responses to external stimuli, the utilisation of the global entropy might be beneficial. Regardless of the employed method, the overall objective is to utilise either node-entropy distribution analysis or global entropy differential analysis to investigate the differences between the entropy rates representing the involved groups.
Table 3.
Overview of signalling entropy measures employed in previously published work
| Publication | Local entropy | Global entropy | Entropy type |
|---|---|---|---|
| Teschendorff et al. [9] | ✓ | X | differential |
| Banerji et al. [8] | X | ✓ | X |
| Teschendorff et al. [35] | ✓ | ✓ | X |
| Banerji et al. [33] | X | ✓ | X |
| Menichetti et al. [31] | ✓ | X | single node |
| Cheng et al. [10] | ✓ | X | X |
| Teschendorff et al. [30] | X | ✓ | X |
| Zhong et al. [11] | ✓ | X | local SNE |
| Gao et al. [36] | ✓ | X | local SNE |
| Han et al. [37] | ✓ | X | local SNE |
| Liu et al. [38] | ✓ | X | local SNE |
For each published paper, we report whether the local or the global entropy has been employed in the final analysis. Further information on the strategy for computing either the local or the global entropy is given.
Figure 2.

Outline of the available methods for the signalling entropy calculation. Generally, the signalling entropy in a biological system is measured by combining an initial set of expression patterns with either an existing or a derived protein interaction network. Based on the biological question and the available data, one can choose to either evaluate differences in single-case sample entropies or compare local or global entropies derived from distinct states.
Calculation of the signalling entropy
Considering our review of PIN selections and entropy calculation strategies, we further defined our calculation strategy used for the case studies. First, considering the heterogeneous use of PINs, we decided to select PINs representing, at best, the variety of properties in interaction databases. Thus, we consider three PINs (PC, BioGRID, and STRING) together with their union and intersection, recalling the idea of combining information from multiple databases. Then, in order to compute the signalling entropy of a whole biological system, for Case Study 1, we considered the approach introduced by Banerji et al. [8] to measure the equilibrium global entropy and the non-equilibrium global entropy proposed by Teschendorff et al. [35]. Both measures are generated from the single node entropies, with the distinction that the first takes into account the non-normalized local entropies, whereas the second one uses the normalized ones. Briefly, the expression patterns of two neighbouring genes in the PIN,
and
, are utilized to calculate the likelihood of the interaction occurring. According to the mass-action principle [40], the interaction intensity in a sample
can be estimated by the product of the expression values
, normalized in order to have
:
![]() |
(1) |
where
denotes the set of neighbouring genes of gene
in the PIN. The probabilities of interactions are then stored in an
probability matrix
, with
representing the number of genes. The local entropy of the interaction distribution,
, for gene
, is defined to quantify the promiscuity level of its signalling within the sample:
![]() |
(2) |
As previously mentioned, to comprehensively capture the signalling flow of the entire network in a given sample, we analyse two global measures: the equilibrium entropy rate,
, and the non-equilibrium entropy rate,
. The first is defined with respect to the stationary distribution,
, of the stochastic matrix
[8], represented by
and satisfying
.
![]() |
(3) |
where
corresponds to the non-normalized local entropy and
represents the maximum reachable entropy rate and depends on the network’s characteristics. The second corresponds to the mean of the normalized local entropies, as described in [35].
![]() |
(4) |
here,
corresponds to the number of computed local entropies, i.e. nodes in the interaction network and each
is normalized to lie between 0 and 1 as:
![]() |
(5) |
with
indicating the degree of node
.
Unless the equilibrium entropy rate, corresponding to a weighted average of the non-normalized local entropies, and largely dependent on the global topology of the network, the average of the normalized local entropies keeps the discriminatory power high while being independent on the network structure.
PIN correction approaches
We showed above that some entropy calculation studies applied filters to circumvent the issue of including false-positive regulations within the integrated PINs (Table 2). However, multiple methodologies have been devised to calculate the reliability scores of the interactions presented within the PIN. To date, no investigation has been conducted regarding the impact of diverse correction methods available. Thus, we further provided and tested different correction strategies applicable to PINs. In the following, we provide a brief overview of these procedures, also shown in Fig. 3. For additional details on each individual method, refer to the Supplementary Material (Section 1.3).
Figure 3.
Correction methods for false-positive interactions are grouped by type, STRING scores (A), topology-based (B), and semantic-based (C), with a summary and mathematical details provided for each type.
Topological scoring
Topological methodologies employ network characteristics, either at a global or local scale, encompassing factors such as neighbourhood, connectivity patterns, clustering coefficients, centrality metrics, or node degrees. These approaches aim to determine reliable and meaningful interactions. In accordance with these analysis strategies, the assessment of similarity between two proteins is contingent upon their respective interacting partners. The underlying premise of these metrics is that the likelihood of an interaction between two proteins is heightened when their neighbouring proteins also interact. The three similarity coefficients that we employed in this study to analyse the network properties comprise Jaccard [41, 42], Czekanowski-Dice [43] and the inverse log score [44].
Semantic scoring
The core element of semantic scoring techniques resides in incorporating information, such as functional annotations or Gene Ontology (GO) terms, to assess the biological relevance and reliability of protein interactions. These methods aim at leveraging the comprehension of the functional properties of proteins to furnish a quantitative measure of confidence for protein interactions. In our current investigation, we apply four semantic similarity measures, Resnik [45], Lin [46], Jiang-Conrath [47], and Schlicker [48], that rely solely on GO annotations to assess the similarity between terms. Additionally, we employed Wang’s approach [49], which integrates topological features of the protein interaction network in order to enhance the precision of the interaction scores.
STRING scoring
As a last correction approach, we used the scoring system provided by the STRING database. The combined confidence scores reflect the strength of each interaction based on various sources and methods comprising experimental and computational predictions, annotations from different databases, text mining, co-expression as well as transferred knowledge from other organisms. Notably, this score has already been employed recently by some authors (Table 2).
The R-package igraph, proposed by Csardi and Nepusz [50], is employed to calculate the topological reliability score. The semantic reliability score is measured by means of the R-package GoSemSim [51] and for the STRING scoring computation we employ STRING v.11.5 [13].
Inclusion of reliability scores into the entropy calculation
Upon incorporating expression profiles with the protein interaction network as weight factors, we implement the previously outlined correction techniques, following each of the corresponding methods. Essentially, a reliability score ranging between 0 and 1, is assigned to each interaction of the network and, by employing a series of thresholds, 0.1, 0.2,..., 0.9, we selectively filterout interactions with a score below the specified threshold. We keep the filtered network and proceed to compute the signalling entropy.
Results and discussion
Case studies
Our problem-solving protocol concentrates on two specific scenarios to delineate the optimal approach for achieving a stable calculation of signalling entropy. In the first case study, we examine an in silico yeast dataset to investigate the consequences of artificial interventions in protein interaction network structures and assess their impact on entropy calculations. This case study also serves as a proof of principle for the relevance of applying the subsequent correction methods.
In the second case study, we systematically assess the effects on entropy measures resulting from alterations of the underlying biologically motivated PINs. Subsequently, we employ correction techniques to ascertain the most effective combination of PIN and correction methodology conducive to obtaining a stable entropy measure. Ultimately, we outline a procedural framework for estimating signalling entropy, as detailed in Section 6.
Case study 1 — yeast experiments
In this scenario, we evaluated the impact of PIN alterations on the entropy values, as depicted in Fig. 4. To do so, we calculated the entropies resulting from networks in which we randomly added, deleted, flipped, or rewired interactions starting from our initial PIN and compared those results to the ones from our original unperturbed system. For this purpose, two entropy calculation strategies were employed respectively, equilibrium and non-equilibrium entropies. The yeast expression data was generated with the microarray data simulator GeneNetWeaver [52]. Signalling entropies were computed using the expression data and a PIN taken from the DREAM4 challenge [53]. The details of the experimental setup for this case study are presented in the Supplementary Material (Section 1.1).
Figure 4.

Workflow of the first case study, where the impact of artificial perturbations on entropy rates was tested by applying four types of changes, adding, flipping, rewiring, and removing edges, to a reference PIN, with perturbation levels from 10% to 90%, and entropy changes visualised for both equilibrium and non-equilibrium states.
The results of the evaluation of the two employed measures, equilibrium and non-equilibrium entropy, are shown in Fig. 5A and B, respectively. For each experiment, we perturbed up to 90% of the initial interactome. For all four perturbation conditions, the calculated equilibrium entropies show outliers dropping in the entropy values. If, instead, the non-equilibrium entropies are considered, consistent trends can be observed for all perturbation settings. In this scenario, the inclusion, flipping and rewiring of edges lead to a gradual increase in entropy values, which saturates at about 80% of perturbed edges. The stagnating phenomenon might indicate that, after adding a certain percentage, the PIN attains a high level of connectivity causing any additional interactions to have minimal effect. On the other hand, the removal of edges leads to a general decrease in the entropy. Yet, the measured non-equilibrium entropies appear to be more stable without the above-mentioned entropy drops. We further examined the scale-free characteristic of each employed network. In Fig. 5E, depicting the changes of the scale parameter
, we can notice that with low disruption rate (around 10%–20%) the values of
are ranged between 2 and 3, whereas as the perturbation levels rise, the values diverge from this interval. Thus, in contrast with the distribution of the unperturbed network (Fig. 5F) which exhibits an approximate scale-free structure, we can see in Fig. 5G–J that the degree distributions of the altered systems do not adhere to a power law with a rate of 60% of modified interactions. The degree distributions for all types and levels of perturbation can be found in the Supplementary Figs S3–S6.
Figure 5.
The effects of the network perturbations on equilibrium (A) and non-equilibrium entropy (B) are shown across perturbation levels, with higher variance in equilibrium entropy and increasing non-equilibrium entropy for all but removal; changes in network size (C, D), scale parameter γ (E), and degree distribution (F–J) reveal structural alterations and the loss of scale-free properties at high perturbation levels.
The difference between the two calculation strategies is probably related to their intrinsic properties. The dips in the equilibrium entropy calculations could be related to the breakdown of the underlying largest connected component of the PIN on which the calculation is based, as shown in Fig. 5D. In particular, when removing interactions also, the number of nodes precipitates. Regardless of the perturbation type, we observe a consistent change in the entropy measures, which would affect the interpretation of the experimental results when comparing the signalling entropy within different conditions. Especially when edges are added, entropy calculations run the risk of saturating, therefore erasing differences between groups. This condition is relatable to the presence of false-positive edges within PIN databases. For instance, PINs constructed on two-hybrid screenings are reported to have a certain percentage (up to 64%) of false-positive edges [54] (in our scenario, added interactions). On these grounds, we tested the possibility of correcting entropy deviations by applying the STRING correction, to the network perturbed at 70% of added interactions. We could successfully notice a revert of the deviation effect on the entropy calculation at a threshold of 0.4. The result is shown in Fig. S1 of the Supplementary Material.
Consequently, we tested the differences of the entropy values between our initial in silico data (referred as wildtype) and a derived group (referred as dual-KO), wherein each sample we knocked out a couple of genes from the strongly connected component. Figure 6A depicts the discriminatory power of the entropy values between the two conditions measured on the unmodified network, whereas in Fig. 6B we show how the wildtype and the dual-KO are not distinguishable when calculating the entropies based on a network comprising 60% of added interactions.
Figure 6.

Network perturbations diminish discriminatory power: while wildtype and double knockout show significant entropy differences with the initial non-perturbed PIN, adding 60% new interactions neutralises this distinction.
Considering our experiment on the yeast in-silico data, we could show that rather limited alternations can potentially affect the resulting entropy calculations. Based on this, we proceed in evaluating the alterations of entropy calculations within real expression data and PINs derived by different interaction databases. Considering the higher stability of the non-equilibrium entropy calculations, being less dependent on the network’s structure, and also its computationally lighter calculation, we further performed our experiments only with this measure.
Case Study 2 — Physio-pathological experiments: differentiation & cancer
As a subsequent step, we proceed to the examination of real datasets and PINs. In this context, we assessed the efficacy of the entropy measures across three distinct datasets, concurrently evaluating and comparing the effect of various correction methods under different PINs (workflow depicted in Fig. 7).
Figure 7.

Workflow of Case Study 2, where PINs from PC, BioGRID, STRING, their union and intersection were combined with cancer and differentiation expression data to assess entropy variations between two classes, with results compared against those using semantic, topological, and STRING-based correction methods.
We analysed three datasets concerning: pancreatic ductal adenocarcinoma (PDAC, GSE15471), human embryonic stem cells (hESC, GSE30652), and hepatocellular carcinoma (HCC, GSE6764). The datasets were selected to cover both developmental and cancer-related examples. Details of the dataset compositions are reported in the Supplementary Material (Section 1.2). For the underlying PINs, we considered BioGrid, PC, and STRING, respectively, together with their union and intersection. Among the databases available via PC, we used HPRD, IntAct, MINT, and NCI-PID, according to Teschendorff et al. [35]. In this way, we included three types of databases: PC, which contains mainly experimental evidence from protein interactions only, BioGrid, which contains experimental evidence from both protein and genetic interactions; and STRING, which contains both experimental and text-mined predicted interactions (see also Table 1). This choice is motivated by the fact that we want to cover different possible strategies in choosing the underlying PIN, as reflected in the variety of PINs presented in published work on entropy measures (see Table 2). Thus, based on our result in the yeast use case, we consider the PIN choice as a crucial first step in the entropy calculations. To assess the stability of our entropy measures, we evaluated the persistence of significant differences when comparing two groups within the datasets (healthy versus cancer or stem cells versus differentiated). This strategy aims to assess the ultimate interpretability of the measured entropies, which aim to detect significant differences between cellular entities. In this way, we exclude potentially irrelevant fluctuations in the entropy measures.
First, we calculated the significance of our entropy values based only on the different underlying PINs without applying any correction method. The results are shown in Table 4. It can be seen that, especially for the HCC dataset, the significance of the entropy differences between healthy and cancerous tissues are strongly influenced by the underlying PIN. In fact, when using the BioGRID dataset, the entropy differences are no longer significant. Instead of the three datasets, the most significant differences are observed for the hESC dataset. In fact, entropy measures were initially used to distinguish differentiation landscapes. Additionally, looking at the plotted entropy differences, we could observe that STRING and the union of the databases yield the highest entropy values (supplementary Fig. S2). This is because the two PINs are indeed the largest in size. We were able to show that also, in real datasets, the choice of PIN can strongly influence the final interpretation of the entropy differences between cellular conditions. This further motivates the use of correction methods.
Table 4.
Significance analysis of the signalling entropy calculated on every combination of the three datasets with each initial PIN
| Dataset | PIN P-value | Compared Classes | |
|---|---|---|---|
| PDAC |
|
Normal versus PDAC | |
| HCC |
|
Normal versus HCC | |
| hESC |
|
hESC versus Diff |
In most of the cases, the differences are significant, excluding the HCC integrated with BioGRID.
We then evaluated the correction methods. The methods are divided, as mentioned above, into semantic, topological, and STRING correction methods. For each dataset, we applied each correction method to each PIN. In addition, we also calculated the significance level of differences within the same class by subsampling within the sample sets (e.g. healthy with healthy, cancer with cancer). The idea behind this evaluation is that a good correction method should preserve the significance levels between different classes but, at the same time, not generate false-positive significant differences within the same class.
To begin with the significant differences between the two classes, we can see that BioGRID, together with PC, are indeed the PINs that preserve fewer significant distinctions, also taking into account the correction methods (Fig. 8A). Instead, the STRING database, along with the union of the three databases, appears to be the one that best retains contrasts across groups. On the other hand, among the correction methods, the STRING filtering performs particularly well, especially in reversing the non-significant differences or increasing them, observed in the HCC dataset for the BioGRID PIN, shifting the result at almost every chosen threshold (Fig. 8A). Overall, we could observe that in general, considering all datasets and thresholds, the STRING filtering performs quite well in preserving the significant difference between classes. The worst correction method seems to be the Jaccard-topological, followed by the Wang-semantic. Considering the underlying initial PIN, instead, BioGRID is the worst initial PIN despite the use of correction methods, while STRING and the union of PINs perform better. Finally, the combination of STRING filtering and STRING/union as PINs gives the best results in terms of a lower number of errors (producing non-significant differences), as we can see in Fig. 8B.
Figure 8.

P-values for group differences, stem cells versus differentiated cells, or normal versus cancer are displayed in (A) across datasets and PINs with various correction thresholds (0.1–0.9); (B) shows the cumulative error counts (non-significant differences) for each filtering method; within-class comparisons (stem cells or healthy tissues) are detailed in (C); and the corresponding error frequency is indicated in (D).
We further evaluated the opposite scenario, considering the potential generation of false-positive significant values coming from PIN corrections. The experiment was performed by sampling within both classes separately. Nevertheless, we consider it more relevant not to induce false-positives within healthy/stem tissues. Instead, we cannot exclude the possibility of significant results when subsampling the cancer samples due to the potential effects of tumour heterogeneity and staging. For this reason, Fig. 8C depicts the
-value distributions within the healthy/stem classes. Here, we could observe that only the hESC dataset depicted a significant result within the stem population for the STRING database. Instead, all other PINs for unperturbed measures successfully retrieved non-significant differences within the same class. Interestingly, we could observe an extensive insurgence of false-positive significant results when applying either the topological or the semantic corrections. This phenomenon is particularly observed within the hESC dataset, the HCC one for the BioGRID PIN, and the PC, union, and intersection, especially for the topological corrections. Considering now the cumulative ability of the specific correction methods in preserving non-significant differences, we could again confirm that the STRING filtering is the one inducing the least switches from non-significant to significant differences. Considering instead the underlying PINs, again BioGRID and PC perform worst among the different correction methods. Consistently, STRING and union perform better for all corrections.
Protocol of entropy stabilisation
We could show that the choice of the underlying PIN does affect the interpretation of entropy differences among different biological conditions. Considering the results of our problem-solving protocol, we summarise in the following a procedural strategy to ensure stable entropy calculations within expression patterns and PINs.
Therefore, among the non-corrected underlying PINs, we suggest using STRING or the union of different PINs. However, considering the heterogeneity of the results, we also strongly advise employing a correction method. Among the ones tested, the scores provided by STRING showed by far the best properties on stabilising entropy calculations. Thus, we suggest the use of the STRING testing threshold above 0.6 at least. Ultimately, when looking to evaluate any experimental design involving another PIN, we encourage utilising a filtering method among STRING or semantic Lin or Jiang, which demonstrated nearly comparable results. This indicates that most reliable PPIs are those that facilitate the selection of biologically trustworthy interactions, beginning with the largest possible set of interactions and nodes. Additionally, our research confirms that biological reliability filters outperform other correction methods.
Conclusion
Using network entropy as a measure for the understanding of information complexity within different biologically motivated conditions is becoming increasingly common. This approach can also help identify pathways and targets of interest within the considered PINs. However, very limited, if not any, attention has been paid up to now to the stability of this measure when considering different underlying PINs. In this problem-solving protocol, we first extensively studied the effects of PIN perturbations on different entropy calculation strategies. Secondly, we moved to the analysis of real expression data in the context of physiology and cancer biology. By employing different PINs derived from widely used interaction datasets, we could indeed show that the choice of the PIN can strongly affect the final interpretation of entropy changes. Thus, we benchmarked various PIN correction methods aimed at stabilising global entropy changes among different biological classes. Our results indicate that the combination of the STRING PIN and correction is the most stable one for evaluating entropy changes. Based on these findings, we were able to propose a protocol for reliably accessing entropy changes within biological network data.
Key Points
Network signalling entropy is a widely used measure to evaluate alterations within interactomes during physiopathological processes.
The choice of underlying PIN can extensively affect the entropy calculation, and thus the final conclusions drawn from experiments.
Correction methods for false positives should be considered when approaching entropy calculations.
combining either the STRING database or the union of different PINs together with the STRING filtering provides reliable entropy measures.
Supplementary Material
Contributor Information
Ana Stolnicu, Institute of Medical Systems Biology, Ulm University, 89069 Ulm, Germany.
Nensi Ikonomi, Institute of Medical Systems Biology, Ulm University, 89069 Ulm, Germany.
Peter Eckhardt-Bellmann, Institute of Medical Systems Biology, Ulm University, 89069 Ulm, Germany.
Johann M Kraus, Institute of Medical Systems Biology, Ulm University, 89069 Ulm, Germany.
Hans A Kestler, Institute of Medical Systems Biology, Ulm University, 89069 Ulm, Germany.
Author contributions
H.A.K. conceived the project and supervized it. A.S., N.I., P.E.B., and J.M.K. performed theoretical analysis. A.S., J.M.K., and N.I. conceived the experiments, A.S., N.I., and J.M.K. analysed the results. A.S., P.E.B. implemented and revised the computational framework. A.S. and N.I. wrote the manuscript. All authors reviewed the manuscript.
Conflict of interest
The authors declare no conflict of interests.
Funding
The research leading to these results has received funding from the German Research Foundation (DFG, SFB 1506 project B6), the Federal Ministry of Education and Research (BMBF, TRANSCAN VI—PMTR-pNET, ID 01KT1901B ), as well as German Research Foundation (DFG, GRK 2254 HEIST and SFB 1074 project Z1) all to HAK.
Data availability
The data used in this work is all public.
Code availability
The code to reproduce the analyses is available upon Git repository (https://github.com/sysbio-bioinf/signallingEntropy)
Ethics approval
Not applicable
Consent to participate
Not applicable
Consent for publication
Not applicable
References
- 1. Shannon CE. A mathematical theory of communication. Bell Syst Tech J 1948;27:379–423. [Google Scholar]
- 2. Cover TM, Thomas JA. Elements of Information Theory. 2nd edn. Hoboken (NJ): John Wiley & Sons, 2006. [Google Scholar]
- 3. Maslov S, Sneppen K. Specificity and stability in topology of protein networks. Science 2002;296:910–3. [DOI] [PubMed] [Google Scholar]
- 4. Pržulj N, Corneil DG, Jurisica I. Modeling interactome: scale-free or geometric? Bioinformatics 2004;20:3508–15. [DOI] [PubMed] [Google Scholar]
- 5. Huang S, Kauffman SA. Complex gene regulatory networks - from structure to biological observables: Cell fate determination. In: Meyers RA, (ed.), Encyclopedia of Complexity and Systems Science, New York: Springer, 2009. p. 1180–1213. [Google Scholar]
- 6. Marbach D, Costello JC, Küffner R. et al. Wisdom of crowds for robust gene network inference. Nat Methods 2012;9:796–804. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Waddington CH. The Strategy of the Genes: A Discussion of Some Aspects of Theoretical Biology. London: George Allen & Unwin, 1957. [Google Scholar]
- 8. Banerji CRS, Miranda-Saavedra D, Severini S. et al. Cellular network entropy as the energy potential in Waddington’s differentiation landscape. Sci Rep 2013;3:3039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Teschendorff AE, Severini S. Increased entropy of signal transduction in the cancer metastasis phenotype. BMC Syst Biol 2010;4:104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Cheng F, Liu C, Shen B. et al. Investigating cellular network heterogeneity and modularity in cancer: a network entropy and unbalanced motif approach. BMC Syst Biol 2016;10:65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Zhong J, Ding D, Liu J. et al. SPNE: sample-perturbed network entropy for revealing critical states of complex biological systems. Brief Bioinform 2023;24:bbad028. [DOI] [PubMed] [Google Scholar]
- 12. Cerami E, Gross BE, Demir E. et al. Pathway commons, a web resource for biological pathway data. Nucleic Acids Res 2011;39:D685–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Szklarczyk D, Gable AL, Nastou KC. et al. The STRING database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res 2021;49:D605–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Stark C, Breitkreutz B-J, Reguly T. et al. BioGRID: a general repository for interaction datasets. Nucleic Acids Res 2006;34:D535–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Xing S, Wallmeroth N, Berendzen KW. et al. Techniques for the analysis of protein-protein interactions in vivo. Plant Physiol 2016;171:727–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. von Mering C, Jensen LJ, Snel B. et al. STRING: known and predicted protein-protein associations, integrated and transferred across organisms. Nucleic Acids Res 2005;33:D433–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Huttlin EL, Ting L, Bruckner RJ. et al. The BioPlex network: a systematic exploration of the human interactome. Cell 2015;162:425–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Stynen B, Tournu H, Tavernier J. et al. Diversity in genetic in vivo methods for protein-protein interaction studies: from the yeast two-hybrid system to the mammalian split-luciferase system. Microbiol Mol Biol Rev 2012;76:331–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Franceschini A, Szklarczyk D, Frankild S. et al. STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res 2012;41:D808–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Mahdavi MA, Lin Y-H. False positive reduction in protein-protein interaction predictions using gene ontology annotations. BMC Bioinformatics 2007;8:262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Bork P, Jensen LJ, von Mering C. et al. Protein interaction networks from yeast to human. Curr Opin Struct Biol 2004;14:292–9. [DOI] [PubMed] [Google Scholar]
- 22. Edwards AM, Kus B, Jansen R. et al. Bridging structural biology and genomics: Assessing protein interaction data with known complexes. Trends Genet 2002;18:529–36. [DOI] [PubMed] [Google Scholar]
- 23. Teschendorff AE, Banerji CRS, Severini S. et al. Increased signaling entropy in cancer requires the scale-free property of protein interaction networks. Sci Rep 2015;5:9646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Weidner FM, Schwab JD, Werle SD. et al. Capturing dynamic relevance in Boolean networks using graph theoretical measures. Bioinformatics 2021;37:3530–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Lynn DJ, Winsor GL, Chan C. et al. InnateDB: facilitating systems-level analyses of the mammalian innate immune response. Mol Syst Biol 2008;4:218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Peri S, Navarro JD, Kristiansen TZ. et al. Human protein reference database as a discovery resource for proteomics. Nucleic Acids Res 2004;32:D497–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Zanzoni A, Montecchi-Palazzi L, Quondam M. et al. MINT: a molecular INTeraction database. FEBS Lett 2002;513:135–40. [DOI] [PubMed] [Google Scholar]
- 28. Perfetto L, Briganti L, Calderone A. et al. SIGNOR: a database of causal relationships between biological entities. Nucleic Acids Res 2016;44:D548–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Hermjakob H, Montecchi-Palazzi L, Lewington C. et al. IntAct: an open source molecular interaction database. Nucleic Acids Res 2004;32:D452–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Teschendorff AE, Enver T. Single-cell entropy for accurate estimation of differentiation potency from a cell’s transcriptome. Nat Commun 2017;8:15599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Menichetti G, Bianconi G, Castellani G. et al. Multiscale characterization of ageing and cancer progression by a novel network entropy measure. Mol Biosyst 2015;11:1824–31. [DOI] [PubMed] [Google Scholar]
- 32. West J, Bianconi G, Severini S. et al. Differential network entropy reveals cancer system hallmarks. Sci Rep 2012;2:802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Banerji CRS, Severini S, Caldas C. et al. Intra-tumour signalling entropy determines clinical outcome in breast and lung cancer. PLoS Comput Biol 2015;11:e1004115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 2000;28:27–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Teschendorff AE, Sollich P, Kuehn R. Signalling entropy: a novel network-theoretical framework for systems analysis and interpretation of functional omic data. Methods 2014;67:282–93. [DOI] [PubMed] [Google Scholar]
- 36. Gao R, Yan J, Li P. et al. Detecting the critical states during disease development based on temporal network flow entropy. Brief Bioinform 2022;23:bbac164. [DOI] [PubMed] [Google Scholar]
- 37. Han C, Zhong J, Hu J. et al. Single-sample node entropy for molecular transition in pre-deterioration stage of cancer. Front Bioeng Biotechnol 2020;8:809. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Liu R, Chen P, Chen L. Single-sample landscape entropy reveals the imminent phase transition during disease progression. Bioinformatics 2020;36:1522–32. [DOI] [PubMed] [Google Scholar]
- 39. Teschendorff AE. A brief tutorial for estimating differentiation potency of single cells using scent and ccat. GitHub repository. 2014. Available from https://github.com/aet21/SCENT [DOI] [PubMed]
- 40. Érdi P, Tóth J. Mathematical Models of Chemical Reactions: Theory and Applications of Deterministic and Stochastic Models. Manchester: Manchester University Press, 1989. [Google Scholar]
- 41. Jaccard P. Lois de distribution florale dans la zone alpine. Bull Soc Vaudoise Sci Nat 1902;38:69–130. [Google Scholar]
- 42. Yamada T, Goto S, Kanehisa M. Extraction of phylogenetic network modules from prokayrote metabolic pathways. Genome Inform 2004;15:249–58. [PubMed] [Google Scholar]
- 43. Brun C, Chevenet F, Martin D. et al. Functional classification of proteins for the prediction of cellular function from a protein-protein interaction network. Genome Biol 2003;5:1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Adamic LA, Adar E. Friends and neighbors on the web. Social Networks 2003;25:211–30. [Google Scholar]
- 45. Resnik P. Using information content to evaluate semantic similarity in a taxonomy. In: Mellish CS (ed.), Proceedings of the 14th International Joint Conference on Artificial Intelligence, Vol. 1, pp. 448–53. San Francisco: Morgan Kaufmann Publishers Inc., 1995. [Google Scholar]
- 46. Lin D. An information-theoretic definition of similarity. In: Shavlik JW (ed.), Proceedings of the 15th International Conference on Machine Learning, pp. 296–304. San Francisco: Morgan Kaufmann Publishers Inc., 1998. [Google Scholar]
- 47. Jiang JJ, Conrath DW. Semantic similarity based on corpus statistics and lexical taxonomy. In: Chen K, Huang C, Sproat R (eds.), Proceedings of the 10th Research on Computational Linguistics International Conference, Taipei, Taiwan: ROCLING 1997, 1997, pp. 19–33. The Association or Computational Linguistics and Chinese Language Processing (ACLCLP). [Google Scholar]
- 48. Schlicker A, Domingues FS, Rahnenführer J. et al. A new measure for functional similarity of gene products based on gene ontology. BMC Bioinformatics 2006;7:1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Wang JZ, Du Z, Payattakool R. et al. A new method to measure the semantic similarity of go terms. Bioinformatics 2007;23:1274–81.> [DOI] [PubMed] [Google Scholar]
- 50. Csárdi G, Nepusz T. The igraph software package for complex network research. InterJ Complex Syst 2006;1695:1–9. [Google Scholar]
- 51. Yu G, Li F, Qin Y. et al. GOSemSim: An R package for measuring semantic similarity among GO terms and gene products. Bioinformatics 2010;26:976–8. [DOI] [PubMed] [Google Scholar]
- 52. Schaffter T, Marbach D, Floreano D. GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods. Bioinformatics 2011;27:2263–70. [DOI] [PubMed] [Google Scholar]
- 53. Greenfield A, Madar A, Ostrer H. et al. DREAM4: combining genetic and dynamic information to identify biological networks and dynamical models. PLoS One 2010;5:e13397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Kuchaiev O, Rašajski M, Higham DJ. et al. Geometric de-noising of protein-protein interaction networks. PLoS Comput Biol 2009;5:e1000454. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data used in this work is all public.










