Skip to main content
PLOS One logoLink to PLOS One
. 2012 Jul 6;7(7):e38564. doi: 10.1371/journal.pone.0038564

Structural Discrimination of Networks by Using Distance, Degree and Eigenvalue-Based Measures

Matthias Dehmer 1,*, Martin Grabner 1, Boris Furtula 2
Editor: Jesus Gomez-Gardenes3
PMCID: PMC3391207  PMID: 22792157

Abstract

In chemistry and computational biology, structural graph descriptors have been proven essential for characterizing the structure of chemical and biological networks. It has also been demonstrated that they are useful to derive empirical models for structure-oriented drug design. However, from a more general (complex network-oriented) point of view, investigating mathematical properties of structural descriptors, such as their uniqueness and structural interpretation, is also important for an in-depth understanding of the underlying methods. In this paper, we emphasize the evaluation of the uniqueness of distance, degree and eigenvalue-based measures. Among these are measures that have been recently investigated extensively. We report numerical results using chemical and exhaustively generated graphs and also investigate correlations between the measures.

Introduction

Structural analysis of graphs has been an outstanding problem in graph theory for several decades [1][4]. A challenging problem in this theory is to investigate structural features of the graphs and their characterization. Another important task is to quantify the structural features of graphs, as well as their complexity [2], [3], [5], [6]. The former relates to developing measures such as the clustering coefficient or the average distance of a graph [7]. The latter relates to deriving complexity indices for graphs, which are often called structural descriptors/measures or topological indices [8][11].

In this paper, we deal with evaluating the uniqueness, discrimination power or degeneracy of special graph measures for investigating graphs holistically (in contrast to local graph measures) [12]. A descriptor is called degenerate if it possesses the same value for more than one graph. In view of the large body of literature on structural graph measures [2], [3], [5], [13], the degeneracy problem has been somewhat overlooked in graph theory. In fact, the uniqueness of structural descriptors has been investigated in mathematical chemistry and related disciplines for discriminating the structure of isomeric structures and other chemical networks [14][16]. A detailed survey on the uniqueness of topological indices by using isomers and hexagonal graphs has been given by Konstantinova [16]. For more related work, see also [17].

To date, no complete graph invariant, i.e., a measure that is fully unique on general graphs, has been found. Indeed, some measures turned out to be complete by using special sets of graphs [15], [17], [18]. In a more general context, i.e., by using graphs without structural constraints, any topological graph measure has a certain kind of degeneracy, which also depends on the mathematical method to define the measure, see [19], [20]. A highly discriminating graph measure is desirable for analyzing graphs; hence, measuring the degree of its degeneracy is important for understanding its properties, limits and quality.

The main contribution of this paper is to investigate to what extent known degree, distance and eigenvalue-based measures are degenerate. Among the measures we examine (see Table 1) are the recently developed geometric-arithmetic indices [21], [22], the atom-bond connectivity index [23] and the Estrada index [24], which is based on the eigenvalues of a special graph-theoretical matrix [25], here the adjacency and Laplacian matrix. It turns out that some of the measures based on distances and eigenvalues are highly unique in exhaustively generated graphs (e.g., see Table 2). Using these graphs is a greater challenge than only using isomeric structures, as exhaustively generated graphs do not possess any structural constraints. However, it is clear that other distance or eigenvalue-based measures exist that possess only low discrimination power [26], implying that the uniqueness of a measure crucially depends on its mathematical composition and the graph class under consideration.

Table 1. The topological indices used for determining the value distributions and correlation plots.

Index Name Symbol
Atom-bond connectivity index [23] Inline graphic
Augmented Zagreb index [40] Inline graphic
Variable Zagreb index [41] Inline graphic
Modified Zagreb index [42] Inline graphic
Narumi-Katayama index [43] Inline graphic
Distance degree centric index [8], [44] Inline graphic
Offdiagonal complexity [45] Inline graphic
Medium articulation [46] Inline graphic
Degree-degree association index [29] Inline graphic
First geometric-arithmetic index [21] Inline graphic
Second geometric–arithmetic index [22] Inline graphic
Third geometric–arithmetic index [31] Inline graphic
Efficiency complexity [26] Inline graphic
Graph energy [47] Inline graphic
Laplacian energy [48] Inline graphic
Estrada index [24] Inline graphic
Laplacian Estrada index [49] Inline graphic
Spectral radius [10] Inline graphic
Graph index complexity [26] Inline graphic
Balaban index [19] Inline graphic
Degree information index [8] Inline graphic
Topological information content [6] Inline graphic
Vertex complexity [50] Inline graphic

Table 2. Exhaustively generated sets of non-isomorphic and generated graphs.Inline graphic, Inline graphic and Inline graphic.

N 8 N 9 N 10
Index ndv S ndv S ndv S
Degree-based Measures
Inline graphic 8520 0,233606 241793 0,073874 11539714 0,015095
Inline graphic 8520 0,233606 241777 0,073935 11539377 0,015123
Inline graphic 8522 0,233426 242009 0,073047 11542066 0,014894
Inline graphic 10500 0,055501 258286 0,010702 11704386 0,001040
Inline graphic 10496 0,055860 258293 0,010675 11704428 0,001036
Inline graphic 10974 0,012863 260925 0,000594 11716377 0,000017
Information-theoretic Measures
Inline graphic 11116 0,000090 261079 0,000004 11716570 0,000000
Inline graphic 10731 0,034722 259967 0,004263 11713337 0,000276
Inline graphic 10879 0,021409 260576 0,001930 11715462 0,000095
Inline graphic 385 0,965368 6016 0,976957 609204 0,948005
Distance-based Measures
Inline graphic 1044 0,906090 40014 0,846737 3693236 0,684785
Inline graphic 663 0,940362 15228 0,941673 673972 0,942477
Inline graphic 11076 0,003688 261020 0,000230 11716455 0,000010
Eigenvalue-based Measures
Inline graphic 1628 0,853558 47577 0,817769 2413055 0,794048
Inline graphic 751 0,932446 26457 0,898663 1460054 0,875386
Inline graphic 5098 0,541423 59542 0,771940 2338347 0,800424
Inline graphic 1013 0,908878 23393 0,910399 718156 0,938706
Inline graphic 2003 0,819825 48120 0,815689 2137087 0,817601
Non-information-theoretic Measures
Inline graphic 10950 0,015022 260861 0,000839 11716146 0,000036
Inline graphic 1779 0,839975 44652 0,828972 2098604 0,820886

Methods and Results

Uniqueness of Topological Descriptors

In this section, we present numerical results when evaluating the uniqueness of certain topological descriptors. Note that a summary of the topological indices used in this paper can be found in Table 1. As mentioned, the discrimination power of these measures has not yet been evaluated extensively on a large scale. Therefore, the results might be useful for gaining deeper insights into these measures and for enabling implications when designing novel topological descriptors. As usual, we use the measure

graphic file with name pone.0038564.e024.jpg (1)

which was called the sensitivity by Konstantinova [15], for evaluating the uniqueness of an index Inline graphic. Clearly, Inline graphic depends on a graph class Inline graphic; ndv are the values that cannot be distinguished by Inline graphic, and Inline graphic is the size of the graph set. Now, we start interpreting the results by considering Table 2 and observe that we have arranged the used descriptors into four groups. We also emphasize that the values in Table 2 have been calculated by using the graph classes Inline graphic, Inline graphic. These are the classes of exhaustively generated non-isomorphic, unweighted and connected graphs with Inline graphic vertices each. The cardinalities Inline graphic are also depicted in Table 2.

For the degree-based indices, it is not surprising that these measures have only little discrimination power, as many graphs can be realized by identical degree sequences. This effect is even stronger if the cardinality of the underlying graph set increases, see Table 2. The highest discrimination power among the indices of this class has the Inline graphic index. This is in accordance with the well-known fact that the degeneracy of topological descriptors decreases in the following order: Inline graphicNKInline graphic, see [27]. Recall that first-generation indices are integer measures derived from integer local vertex invariants such as vertex degrees or distances sums [28]. Second-generation indices are real numbers derived from integer local vertex invariants [28]. Third-generation indices are real numbers derived from real local vertex invariants [28].

Most of the information-theoretic measures (e.g., Inline graphic, Inline graphic) we have evaluated in this study are based on grouping elements (e.g., vertices, degrees, etc.) in equivalence classes [6], [8] to determine probability values. We observe that the uniqueness of these measures is also low. In contrast, the degree-degree association index Inline graphic [29] is highly discriminating for all three graph classes [30]. Surely, a reason for this is the fact that this measure is non-partition-based, as probability values have been assigned to each vertex in the graph by using the special information functional Inline graphic, see [29]. Note that Inline graphic contains almost 12 million graphs. Calculating the discrimination power of the distance-based measures, such as the second or third geometric-arithmetic indices [22], [31], leads to a somewhat surprising result: the uniqueness for Inline graphic and Inline graphic is very high, but recall that they belong to the class of so-called second-generation indices [27]. Again, we see that the composition of the graph invariant (here, distances) to define the measure is crucial.

If we compare the sensitivity values (using Equation 1) of some second-generation indices, e.g., the geometric-arithmetic indices with some of the third-generation indices (information-theoretic and eigenvalue-based measures), we observe that the uniqueness of e.g., Inline graphic, Inline graphic is unexpectedly high. In particular, the high uniqueness of Inline graphic for graphs Inline graphic, Inline graphic, is probably caused by the fact that its calculation is based on distances between edges. As the number of edges lies in the interval Inline graphic, the range of the third geometric-arithmetic index is 0 to Inline graphic [32], and the probability that two graphs have different index values is certainly larger than in the case when the number of edges would be fixed. This hypothesis can be supported by comparing the values of the sensitivity index (using Equation 1) of the Inline graphic index shown in Tables 2 and 4. Thus, the sensitivity index resulting from Inline graphic shown in Table 2 is greater than 0.94 (Inline graphic), while, if the number of edges is fixed, see Table 4, the corresponding sensitivity index is less than 0.02 (Inline graphic). Using this idea again, it can be understood why the sensitivity index of Inline graphic (see Table 2) does not decrease with the number of vertices.

Table 4. Chemical trees with Inline graphic.Inline graphic, Inline graphic, Inline graphic.

Inline graphic Inline graphic Inline graphic
Index ndv Inline graphic ndv Inline graphic ndv Inline graphic
Degree-based Measures
Inline graphic 366257 0,000169 910662 0,000070 2278593 0,000029
Inline graphic 366303 0,000044 910710 0,000018 2278640 0,000008
Inline graphic 366303 0,000044 910710 0,000018 2278640 0,000008
Inline graphic 366318 0,000003 910722 0,000004 2278657 0,000000
Inline graphic 366318 0,000003 910722 0,000004 2278657 0,000000
Inline graphic 366318 0,000003 910725 0,000001 2278657 0,000000
Information-theoretic Measures
Inline graphic 366283 0,000098 910688 0,000042 2278608 0,000022
Inline graphic 366311 0,000022 910718 0,000009 2278652 0,000003
Inline graphic 366317 0,000005 910725 0,000001 2278657 0,000000
Inline graphic 196124 0,464609 544432 0,402200 39396 0,982711
Distance-based Measures
Inline graphic 362628 0,010076 904971 0,006319 2266566 0,005307
Inline graphic 362171 0,011323 904971 0,006319 2270582 0,003544
Inline graphic 319073 0,128975 813531 0,106723 2081010 0,086739
Eigenvalue-based Measures
Inline graphic 93204 0,745566 228831 0,748738 479746 0,789461
Inline graphic 87656 0,760711 224579 0,753407 525472 0,769394
Inline graphic 544 0,998515 880 0,999034 1275 0,999440
Inline graphic 292 0,999203 509 0,999441 842 0,999630
Inline graphic 130783 0,642981 318330 0,650466 675147 0,703708
Non-information-theoretic Measures
Inline graphic 366318 0,000003 910725 0,000001 2278657 0,000000
Inline graphic 69592 0,810024 160051 0,824260 316572 0,861071

Let us turn to the uniqueness of some eigenvalue-based measures such as the graph energy Inline graphic, the Estrada index Inline graphic and the Laplacian Estrada index Inline graphic. As expected, it is high because these measures belong to the class of third-generation indices (e.g., information-theoretic measures). We point out that the sensitivity index of the graph energy Inline graphic and Laplacian energy Inline graphic could be affected by rounding errors. The reason for this is based on the fact that the difference between the values of Inline graphic and Inline graphic for some graphs is less than Inline graphic [33]. However, since the number of such graphs is very small, see [33], this does not strongly affect the computation of the uniqueness of Inline graphic and Inline graphic measured by Inline graphic and ndv. In particular, the Estrada and Laplacian Estrada indices possess high uniqueness for all three graph classes Inline graphic. To give some arguments for this, recall their definitions, namely

graphic file with name pone.0038564.e091.jpg (2)
graphic file with name pone.0038564.e092.jpg (3)

where Inline graphic and Inline graphic are the eigenvalues of the adjacency and Laplacian matrices, respectively. Knowing that Inline graphic is irrational and transcendental, it can be presumed that any power and the sum thereof is also irrational and transcendental. Hence, the graphs with the same Estrada (Laplacian Estrada) index are isospectral.

In addition, the uniqueness of these measures is quite stable, and the same holds for Inline graphic. This means that there is only very little dependency between their uniqueness and the cardinality of the underlying graph set. Clearly, this result demonstrates that certain measures/functions based on the eigenvalues of graphs possess a high discrimination power. This contradicts the widely assumed hypothesis that graph spectra are not feasible to discriminate graphs properly because of the existence of isospectral graphs, see [34], [35]. Another positive example can be found in [36] where Dehmer et al. presented spectrum-based measures based on a probability distribution of structural values with low degeneracy.

In Table 3 and Table 4, we have also evaluated the discrimination power of the measures using isomers and chemical trees. In particular, we use the isomeric classes Inline graphic and Inline graphic containing all isomers with 11 and 12 vertices, see Table 3. The numerical results are quite similar to Table 2. However, when evaluating the indices by using the classes of chemical trees Inline graphic, Inline graphic and Inline graphic, we see that the discrimination power of Inline graphic deteriorates significantly. To better understand this, note that the information functional Inline graphic relies on determining the shortest paths for all Inline graphic and, then, degree-degree associations thereof resulting in Inline graphic, see [29]. Finally, when applying this measure to trees, the reason for the deterioration of its uniqueness could be understood by the occurrence of a large number of paths possessing similar length and, hence, resulting in very similar probability values and entropies. Interestingly, the eigenvalue-based measures Inline graphic and Inline graphic possess high uniqueness, and whose values are almost independent of the cardinality of the graph sets. Thus, these measures turned out to be quite feasible to discriminate chemical trees uniquely.

Table 3. Chemical isomers with Inline graphic.Inline graphic, Inline graphic.

Inline graphic Inline graphic
Index ndv Inline graphic ndv Inline graphic
Degree-based Measures
Inline graphic 160063 0,001441 738685 0,000329
Inline graphic 160089 0,001279 738714 0,000290
Inline graphic 160093 0,001254 738721 0,000280
Inline graphic 160290 0,000025 738924 0,000005
Inline graphic 160290 0,000025 738924 0,000005
Inline graphic 160293 0,000006 738927 0,000001
Information-theoretic Measures
Inline graphic 160292 0,000012 738925 0,000004
Inline graphic 160281 0,000081 738916 0,000016
Inline graphic 160291 0,000019 738926 0,000003
Inline graphic 1479 0,990773 18852 0,974487
Distance-based Measures
Inline graphic 23548 0,853095 118000 0,840309
Inline graphic 11046 0,931089 60597 0,917993
Inline graphic 160036 0,001610 738454 0,000641
Eigenvalue-based Measures
Inline graphic 24417 0,847674 110075 0,851034
Inline graphic 19590 0,877787 88842 0,879769
Inline graphic 22982 0,856626 104151 0,859051
Inline graphic 10062 0,937228 39634 0,946363
Inline graphic 28195 0,824104 117781 0,840606
Non-information-theoretic Measures
Inline graphic 160293 0,000006 738927 0,000001
Inline graphic 21432 0,866296 91321 0,876414

Value Distributions

In order to tackle the question of what kind of degeneracy the measures possess, we plot their characteristic value distributions. The Inline graphic-axis is the absolute frequency of the graphs, with a certain index value depicted on the Inline graphic-axis. For a graph class, we use the class of exhaustively generated non-isomorphic, connected and unweighted graphs denoted by Inline graphic. We start with Figures 1 and 2 and observe the vertical strips, indicating that a large number of graphs have quite similar index values discretely distributed on a certain interval. In addition, the hull of these value distributions looks like a Gaussian curve. This means that by using Inline graphic and Inline graphic, there exist many degenerate graphs possessing quite similar index values where the hull of the distributions forms a Gaussian curve.

Figure 1. Value distribution for GA1.

Figure 1

Figure 2. Value distribution for ABC.

Figure 2

As we can see from Figures 3 , 4 , 5 , 6 , the value distribution (and in fact the distribution of degenerate graphs) when considering the information-theoretic measures is significantly different. We start with Inline graphic, and see that the value distribution is quite scattered, i.e., there are no regions in which the graphs are closely clustered. In contrast, the values of Inline graphic are rather clustered. Similarly, this also holds for Inline graphic and observe that all three measures (Inline graphic, Inline graphic and Inline graphic are highly degenerate on Inline graphic. But, the degree-degree association index Inline graphic possesses a high discrimination power (see Figure 6). In particular, we see that there exist only a very few degenerate graphs whose index values exploit the entire domain.

Figure 3. Value distribution for IC.

Figure 3

Figure 4. Value distribution for OdC.

Figure 4

Figure 5. Value distribution for MA.

Figure 5

Figure 6. Value distribution for Inline graphic.

Figure 6

The results of plotting the value distributions for the eigenvalue-based measures graph energy Inline graphic and Estrada index Inline graphic are depicted in Figures 7 and 8. We see that they possess a high discrimination power and observe the horizontal strips. This means that a certain number of graphs (e.g., 2, 4, etc.) possess index values in a certain domain. When considering Figure 7, the horizontal strip for Inline graphic indicates the low degeneracy of this measure. This is similar for the Inline graphic shown in Figure 8.

Figure 7. Value distribution for E.

Figure 7

Figure 8. Value distribution for EE.

Figure 8

Correlations Between Indices

In order to investigate the correlation ability of the topological indices, we calculate the linear correlation between them and depict the results as correlation networks. More precisely, the linear correlation between the descriptor values of two data vectors has been computed according to the method of Pearson [37]. In the depicted plots of the correlation networks, the calculated Pearson Product-Moments have then been used as edge weights for labeling the edges connecting the vertices representing the compared descriptor pairs. The correlation networks are shown in Figures 9 , 10 , 11 , 12 , 13 , 14 .

Figure 9. Left: Correlation network Inline graphic inferred from Inline graphic.

Figure 9

Right: Correlation network Inline graphic inferred from Inline graphic .

Figure 10. Left: Correlation network Inline graphic inferred from Inline graphic.

Figure 10

Right: Correlation network Inline graphic inferred from Inline graphic .

Figure 11. Correlation network Inline graphic inferred from Inline graphic.

Figure 11

Figure 12. Correlation network Inline graphic inferred from Inline graphic.

Figure 12

Figure 13. Correlation network Inline graphic inferred from Inline graphic.

Figure 13

Figure 14. Correlation network Inline graphic inferred from Inline graphic.

Figure 14

We use the graph classes Inline graphic and Inline graphic, and choose different thresholds for the correlation coefficient, resulting in different networks.

Definition 1

Let Inline graphic be a set of topological indices defined on a graph class Inline graphic and let Inline graphic. The vertex and edge set of the correlation network Inline graphic inferred from Inline graphic is defined by

graphic file with name pone.0038564.e206.jpg (4)

where Inline graphic is the correlation coefficient.

Definition 2

Let Inline graphic be a set of topological indices defined on a graph class Inline graphic and let Inline graphic. The vertex and edge set of the correlation network Inline graphic inferred from Inline graphic is defined by

graphic file with name pone.0038564.e213.jpg (5)

where Inline graphic is the correlation coefficient.

We start interpreting the results by considering the left-hand side of Figure 9. The vertices of the graph Inline graphic represent indices that are highly correlated (here, Inline graphic) by using the graph class Inline graphic. In all correlation graphs, hub vertices, i.e., those with a high degree, are colored in gray. In particular, the grayer the color of a vertex is, the higher its degree.

In Inline graphic, the first geometric-arithmetic index (Inline graphic) and other measures are highly correlated with other indices that belong to different groups, e.g., degree-based and eigenvalue-based, etc. In addition, graph energy (Inline graphic) and Estrada index (Inline graphic) are highly correlated with other measures such as the Modified Zagreb index (degree-based). By using the graph class Inline graphic, we obtain the same type of correlation network denoted by Inline graphic. Observe that the connectedness of this network is similarly high in Inline graphic, however, there exist new hubs. For instance, the Balaban Inline graphic and the augmented Zagreb index (Inline graphic) index represent such vertices, i.e., they are highly correlated with other indices from different paradigms such as degree-based and eigenvalue-based measures. Interestingly, the uniqueness (measured by ndv and Inline graphic) of, e.g., Inline graphic and Inline graphic by using Inline graphic is higher than by taking Inline graphic into account. Nevertheless, these indices (and others) possess larger neighborhoods compared to Inline graphic. This means that they contain more highly correlated vertices adjacent to Inline graphic and Inline graphic than by using Inline graphic. One would have expected this in a reverse order as the isomers (Inline graphic) are structurally more similar among each other than the graphs contained in Inline graphic. It is likely that the reasons for this are different structural characteristics captured by the underlying graphs of Inline graphic and Inline graphic.

For studying indices that are only slightly correlated, firstly consider Inline graphic in Figure 10. We see that the degree-degree association index (Inline graphic) is a hub vertex, i.e., there is only a small correlation. That means Inline graphic (by using Inline graphic) captures structural information significantly different compared to almost all other measures (representing vertices) in this network. If we consider Inline graphic as a graph set, we observe that Inline graphic has more hubs than Inline graphic. For instance, Inline graphic and Inline graphic represent hubs and therefore possess only a small correlation with other measures from different paradigms. This also implies that the structural characteristics of the graphs Inline graphic are different to those Inline graphic. Also, the hubs in Inline graphic could serve as potential candidates to be tested for solving QSAR/QSPR problems [38] as they capture structural characteristics differently (compared to classical indices) and some (e.g., efficiency complexity and offdiagonal complexity) have not yet been used in mathematical chemistry and drug design. In addition, it would be interesting to examine their ability for classifying graphs optimally by using supervised learning techniques, e.g., see [39].

To finalize this section, we consider Figures 11, 12, 13, 14. We have also plotted the evolution of the correlation networks for Inline graphic, and have obtained the networks Inline graphic and Inline graphic for both Inline graphic and Inline graphic, respectively. From Figure 11, we see that by using Inline graphic, the measures Inline graphic and Inline graphic are highly uncorrelated (Inline graphic). In addition, the degree-degree association index Inline graphic and Inline graphic are highly uncorrelated by using Inline graphic (Inline graphic). If we now choose Inline graphic for Inline graphic and Inline graphic, the resulting networks (see Figures 13 and 14) also show highly uncorrelated indices. Starting with Inline graphic (see Figure 13), far more indices are highly uncorrelated (Inline graphic) compared with Figure 11. These indices belong to different paradigms (degree-based, information-theoretic, etc.). But when considering the graph class Inline graphic (see Figure 14), only the degree-degree association index Inline graphic is highly uncorrelated (Inline graphic) with many other indices. It is clear that the differences between these correlation networks are clearly induced by the structural differences (factors such as cyclicity and connectedness, which contribute to the complexity of the graphs) of the graph classes. Note that we obtained a similar result by comparing Inline graphic and Inline graphic (instead of Inline graphic and Inline graphic. Figure 14 expresses that by using trees, Inline graphic captures structural information significantly different than many other non-information-theoretic indices such as Inline graphic, Inline graphic, etc. We hypothesize that this result also holds for other tree classes as well. As mentioned above, the index Inline graphic could be used to characterize graphs for problems in structural chemistry or QSAR, with the aim that it solves a particular problem (e.g., QSAR/QSPR) better than existing indices which have already been used.

Summary and Conclusion

In this paper, we have explored to what extent degree and eigenvalue-based measures are degenerate. To tackle this problem, we used exhaustively generated undirected, connected and non-isomorphic graphs and chemical graphs. Interestingly, we found that some recently developed distance-based measures, e.g., Inline graphic, have a much better uniqueness than measures that are known to be highly unique for chemical graphs, e.g., the Balaban Inline graphic index. Note that the results for the Balaban Inline graphic index by using the classes Inline graphic, Inline graphic, have been reported in an earlier paper [30]. Equally, some of the eigenvalue-based measures such as Inline graphic and Inline graphic possess high discrimination power for all graph classes that we examined in this paper. This shows that such measures for discriminating graphs structurally can be feasible, despite the existence of isospectral graphs. A strong point of all measures (except the topological information content for large graphs, as it relies on determining their automorphism groups) used in this study is their polynomial time complexity. Hence, they could also be applied to large complex networks. First studies of examining the uniqueness of structural measures by using gene networks inferred from high-throughput data are under development. We will also examine the relationship between the uniqueness of a measure and the ability to classify graphs meaningfully.

Footnotes

Competing Interests: The authors have declared that no competing interests exist.

Funding: MD and MG thank the Austrian Science Funds and the Standortagentur Tirol for supporting this work (project P22029-N13). The authors are also grateful to ‘Zentraler Informatikdienst’ of the Technical University of Vienna for providing computing resources to perform large scale computations on the Phoenix Cluster. BF thanks the Serbian Ministry of Education and Science for partial support of this work (project 174033) and Ivan Gutman for valuable discussions and suggestions concerning this paper. The authors thank Kurt Varmuza for fruitful discussions. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Cvetković DM, Doob M, Sachs H. Spectra of Graphs. Theory and Application. Deutscher Verlag der Wissenschaften. Berlin, Germany. 1980.
  • 2.Dehmer M editor. Structural Analysis of Complex Networks. Birkhäuser Publishing. 2011.
  • 3.Harary F. Graph Theory. Addison Wesley Publishing Company. Reading, MA, USA. 1969.
  • 4.Kuratowski K. Sur le problème des courbes gauches en topologie. Fund Math Vol. 1930;15:271–283. [Google Scholar]
  • 5.Dehmer M, Emmert-Streib F editors. Analysis of Complex Networks: From Biology to Linguistics. Wiley VCH Publishing. 2009.
  • 6.Mowshowitz A. Entropy and the complexity of the graphs I: An index of the relative complexity of a graph. Bull Math Biophys. 1968;30:175–204. doi: 10.1007/BF02476948. [DOI] [PubMed] [Google Scholar]
  • 7.Newman MEJ, Barabási AL, Watts DJ. The Structure and Dynamics of Networks. Princeton Studies in Complexity. Princeton University Press. 2006.
  • 8.Bonchev D. Information Theoretic Indices for Characterization of Chemical Structures. Research Studies Press, Chichester. 1983.
  • 9.Dehmer M, Mowshowitz A. A history of graph entropy measures. Inform Sciences. 2011;1:57–78. [Google Scholar]
  • 10.Todeschini R, Consonni V, Mannhold R. Handbook of Molecular Descriptors. Wiley-VCH. Weinheim, Germany. 2002.
  • 11.Diudea MV, Gutman I, Jäntschi L. Molecular Topology. Nova Publishing. New York, NY, USA. 2001.
  • 12.Emmert-Streib F, Dehmer M. Networks for systems biology: Conceptual connection of data and function. IET Syst Biol. 2011;5:185–207. doi: 10.1049/iet-syb.2010.0025. [DOI] [PubMed] [Google Scholar]
  • 13.da F Costa L, Rodrigues F, Travieso G. Characterization of complex networks: A survey of measurements. Adv Phys. 2007;56:167–242. [Google Scholar]
  • 14.Bonchev D, Mekenyan O, Trinajstić N. Isomer discrimination by topological information approach. J Comp Chem. 1981;2:127–148. [Google Scholar]
  • 15.Konstantinova EV. The discrimination ability of some topological and information distance indices for graphs of unbranched hexagonal systems. J Chem Inf Comput Sci. 1996;36:54–57. [Google Scholar]
  • 16.Konstantinova EV. On some applications of information indices in chemical graph theory. In: Ahlswede R, Bäumer L, Cai N, Aydinian H, Blinovsky V, et al., editors, General Theory of Information Transfer and Combinatorics, Springer, Lecture Notes of Computer Science. 2006. pp. 831–852.
  • 17.Diudea MV, Ilić A, Varmuza K, Dehmer M. Network analysis using a novel highly discriminating topological index. Complexity. 2011;16:32–39. [Google Scholar]
  • 18.Xu CYHL. On highly discriminating molecular topological index. J Chem Inf Comput Sci. 1996;36:82–90. [Google Scholar]
  • 19.Balaban AT. Highly discriminating distance-based topological index. Chem Phys Lett. 1982;89:399–404. [Google Scholar]
  • 20.Balaban AT. Numerical modelling of chemical structures: Local graph invariants and topological indices. In: King RB, Rouvray DH, editors, Graph Theory and Topology, Elsevier. 159–176. Amsterdam, The Netherlands. 1987.
  • 21.Vukičević D, Furtula B. Topological index based on the ratios of geometrical and arithmetical means of end–vertex degrees of edges. J Math Chem. 2009;46:1369–1376. [Google Scholar]
  • 22.Fath-Tabar G, Furtula B, Gutman I. A new geometric–arithmetic index. J Math Chem. 2010;47:477–486. [Google Scholar]
  • 23.Estrada E, Torres L, Rodríguez L, Gutman I. An atom–bond connectivity index: Modelling the enthalpy of formation of alkanes. Indian J Chem. 1998;37A:849–855. [Google Scholar]
  • 24.Estrada E. Characterization of the folding degree of proteins. Bioinformatics. 2002;18:697–704. doi: 10.1093/bioinformatics/18.5.697. [DOI] [PubMed] [Google Scholar]
  • 25.Janežić D, Miležević A, Nikolić S, Trinajstić N. Graph-Theoretical Matrices in Chemistry. Mathematical Chemistry Monographs. University of Kragujevac and Faculty of Science Kragujevac. 2007.
  • 26.Kim J, Wilhelm T. What is a complex graph? Physica A. 2008;387:2637–2652. [Google Scholar]
  • 27.Balaban AT. Can topological indices transmit information on properties but not on structures? J Comput Auid Mol Des. 2005;19:651–660. doi: 10.1007/s10822-005-9010-6. [DOI] [PubMed] [Google Scholar]
  • 28.Balaban AT, Mills D, Kodali V, Basak SC. Complexity of chemical graphs in terms of size, branching and cyclicity. SAR QSAR Environ Res. 2006;17:429–450. doi: 10.1080/10629360600884421. [DOI] [PubMed] [Google Scholar]
  • 29.Dehmer M, Emmert-Streib F, Tsoy Y, Varmuza K. Quantifying structural complexity of graphs: Information measures in mathematical chemistry. In: Putz M, editor, Quantum Frontiers of Atoms and Molecules, Nova Publishing. 2011. pp. 479–498.
  • 30.Dehmer M, Grabner M, Varmuza K. Information indices with high discrimination power for arbitrary graphs. PLoS ONE. 2012;7:e31214. doi: 10.1371/journal.pone.0031214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Zhou B, Gutman I, Furtula B, Du Z. On two types of geometric-arithmetic index. Chem Phys Lett. 2009;482:153–155. [Google Scholar]
  • 32.Das KC, Gutman I, Furtula B. Survey on geometric–arithmetic indices of graphs. MATCH Commun Math Comput Chem. 2011;65:595–644. [Google Scholar]
  • 33.Miljković O, Furtula B, Radenković S, Gutman I. Equienergetic and almost–equienergetic trees. MATCH Commun Math Comput Chem. 2009;61:451–461. [Google Scholar]
  • 34.Ivanciuc O, Ivanciuc T, Diudea MV. Polynomials and spectra of molecular graphs. Roum Chem Quarterly Rev. 1999;7:41–67. [Google Scholar]
  • 35.Randić M, Vracko M, Novic M. Eigenvalues as molecular descriptors. In: Diudea MV, editor, QSPR/QSAR Studies by Molecular Descriptors, Nova Publishing. 93–120. Huntington, NY, USA. 2001.
  • 36.Dehmer M, Müller L, Graber A. New polynomial-based molecular descriptors with low degeneracy. PLoS ONE. 2010;5:e11393. doi: 10.1371/journal.pone.0011393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Backhaus K, Erichson B, Plinke W, Weiber R. Multivariate Analysemethoden. Springer. Berlin, Germany. 2003.
  • 38.Devillers J, Balaban AT. Topological Indices and Related Descriptors in QSAR and QSPR. Gordon and Breach Science Publishers. Amsterdam, The Netherlands. 1999.
  • 39.Müller LAJ, Kugler KG, Graber A, Dehmer M. A network-based approach to classify the three domains of life. Biology Direct. 2011;6:140–141. doi: 10.1186/1745-6150-6-53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Furtula B, Graovac A, Vukičević D. Augmented Zagreb index. J Math Chem. 2010;48:370–380. [Google Scholar]
  • 41.Miličević A, Nikolić S. On variable Zagreb indices. Croat Chem Acta. 2004;77:97–101. [Google Scholar]
  • 42.Nikolić S, Kovačević G, Miličević A, Trinajstić N. The Zagreb indices 30 years after. Croat Chem Acta. 2003;76:113–124. [Google Scholar]
  • 43.Narumi H, Katayama M. Simple topological index. a newly devised index characterizing the topological nature of structural isomers of saturated hydrocarbons. Mem Fac Engin Hokkaido Univ. 1984;16:209–214. [Google Scholar]
  • 44.Bonchev D. The concept for the center of a chemical structure and its applications. J Mol Struct: Theochem. 1989;185:155–168. [Google Scholar]
  • 45.Claussen JC. Offdiagonal complexity: A computationally quick network complexity measure - Application to protein networks and cell division. In: Deutsch A, de la Parra RB, de Boer R, Diekmann O, Jagers P, et al., editors, Mathematical Modeling of Biological Systems, Volume II, Boston: Birkhäuser. 2007. pp. 303–311.
  • 46.Wilhelm T, Hollunder J. Information theoretic description of networks. Physica A. 2007;388:385–396. [Google Scholar]
  • 47.Gutman I, Li X, Zhang J. Graph energy. In: Dehmer M, Emmert-Streib F, editors, Analysis of Complex Networks: From Biology to Linguistics, Wiley-VCH. 2009. pp. 145–174.
  • 48.Gutman I, Zhou B. Laplacian energy of a graph. Linear Algebra Appl. 2006;414:29–37. [Google Scholar]
  • 49.Fath-Tabar GH, Ashrafi AR, Gutman I. Note on Estrada and L-Estrada indices of graphs, volume CXXXIX. 1–16 pp. 2009.
  • 50.Raychaudhury C, Ray SK, Ghosh JJ, Roy AB, Basak SC. Discrimination of isomeric structures using information theoretic topological indices. J Comput Chem. 1984;5:581–588. [Google Scholar]

Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES