Skip to main content
PLOS One logoLink to PLOS One
. 2019 Nov 14;14(11):e0223745. doi: 10.1371/journal.pone.0223745

Measuring the complexity of directed graphs: A polynomial-based approach

Matthias Dehmer 1,2,3,*, Zengqiang Chen 2, Frank Emmert-Streib 4,5, Shailesh Tripathi 1, Abbe Mowshowitz 6, Alexei Levitchi 1, Lihua Feng 7, Yongtang Shi 8,*, Jin Tao 9,10
Editor: Irene Sendiña-Nadal11
PMCID: PMC6855486  PMID: 31725742

Abstract

In this paper, we define novel graph measures for directed networks. The measures are based on graph polynomials utilizing the out- and in-degrees of directed graphs. Based on these polynomial, we define another polynomial and use their positive zeros as graph measures. The measures have meaningful properties that we investigate based on analytical and numerical results. As the computational complexity to compute the measures is polynomial, our approach is efficient and can be applied to large networks. We emphasize that our approach clearly complements the literature in this field as, to the best of our knowledge, existing complexity measures for directed graphs have never been applied on a large scale.

1 Introduction

Graph complexity measures have been studied extensively [14]. Although a large number of complexity measures have been defined, few deal specifically with directed graphs. However, many real-world networks such as transportation networks [5] and biological networks [6] are directed graphs whose edges express critical interactions, flows and so forth. Examples of complexity measures for undirected graphs include treewidth [2], cycle rank [2] and numerous so-called topological indices, see [4, 7]. Some of the classical graph complexity indices like the distance-based Wiener index [8] or the graph entropy measure based on vertex orbits due to Mowshowitz [9] can be computed for directed graphs as well. For example, Knor et al. [10] studied the Wiener Index on directed graphs. Other classical and distance-based measures like the Szeged index [11] could also be applied to directed graphs. But to the best of our knowledge, there is no body of literature that focuses on comparing structural graph measures for undirected and directed graphs.

Measures for analyzing directed graphs [12] include DAG-width [3], directed treewidth [13] and girth [1]. Treewidth and directed treewidth are both based on a game-theory applied to special graph decompositions. It might be difficult to apply these measures to large real-world networks. Also, the girth of a directed graph has been defined as the minimum length of a directed cycle [1]. If the graph is acyclic, the girth is infinite [1]. Another technique is due to Bertz et al. [14]; they investigate the complexity of digraphs by identifying all possible subgraphs with a certain number of vertices representing patterns such as trees, paths, rings etc. Degree sequences are then used to quantify the complexity or diversity of the digraphs [14]. Hunter and Kreutzer [15] investigate the meaning of several methods for determining the complexity of directed graphs and point out differences between undirected and directed graphs. Berwanger and Grädel [16] use special tree-decompositions and define the graph measure entanglement and its relationship to treewidth. Estrada and Hatano [17] define the measures reciprocity and returnability based on eigenvalues of special graph-theoretical matrices. Heterogeneity measures, interpreted as irregularity based on differences between in-degrees and out-degrees, have been developed by Ye et al. [18]. We emphasize that in this paper, we put the emphasis on examining complexity measures for analyzing complex networks. Another important branch of Quantitative Graph Theory [19] relates to measure the similarity between networks. See [20] for an up-to-date review to survey this area.

In this paper, we propose an approach that departs from the contributions sketched above. Based on the occurrences of out- and in-degrees of directed graphs, we define certain graph polynomials. We show that every directed graph can be characterized by an out- and in-degree polynomial. In order to obtain positive zeros, we define modified graph polynomials and show they must possess a unique, positive zero in the interval (0, 1), depending on certain parameters. So, we analyze properties of these polynomials and prove interrelations between their zeros. Based on these zeros, we define graph complexity measures and investigate issues such as the correlation between the measures and the homogeneity of the zeros which are associated with a graph.

2 Methods

2.1 New complexity measures for directed graphs

In this section, we introduce some preliminaries. The directed graphs [21] considered here are without loops and multiple edges.

Definition 2.1 Let G = (V, E), EV × V, |V| < ∞ be a directed graph.

N+(v){v˜V\{v}|(v,v˜)E}isthesetofout-neighborsofv, (1)
N(v){u˜V\{v}|(u˜,v)E}isthesetofin-neighborsofv, (2)
δout(v)|N+(v)|,
δin(v)|N(v)|. (3)

Definition 2.2 Let G = (V, E), EV × V, |V| < ∞ be a directed graph.

N0+{vV|δout(v)=0},(4)N0+{vV|δout(v)=1},(5)(6)Nδoutmax+{vV|δout(v)=δoutmax}.(7)

Definition 2.3 Let G = (V, E), EV × V, |V| < ∞ be a directed graph.

N0{vV|δin(v)=0},(8)N1{vV|δin(v)=1},(9)(10)Nδinmax{vV|δin(v)=δinmax}.(11)

Now, we define two special graph polynomials with real coefficients. Note that the coefficients capture structural information of the given graph.

Definition 2.4 We define the coefficients of the graph polynomial PG,out(x) by

akout|Nδoutmax+|,(12)ak1out|Nδoutmax1+|,(13)(14)a1out|N1+|,(15)a0out|N0+|.(16)

Finally, PG,out(x)akoutxδoutmax++a1outx+a0outx0.

Similarly,

Definition 2.5 we define the coefficients of the graph polynomial PG,in(x) by

akin|Nδinmax+|,(17)ak1in|Nδinmax1+|,(18)(19)a1in|N1+|,(20)a0in|N0+|.(21)

Finally, PG,in(x)akinxδinmax++a1inx+a0inx0.

Obviously PG,out(x) and PG,in(x) have no positive zeros since their sequences of coefficients have no sign changes. This follows from Descartes’ Rule of Signs, see [22]. In the following, we establish the conditions under which two associated polynomials have a unique, positive zero ∈ (0, 1).

Theorem 2.1 Let G = (V, E) be a directed graph. Now we define the polynomials

P*G,out(x)αoutPG,out(x), (22)
P*G,in(x)αinPG,in(x). (23)

There exist parameters αout and αin, such that the polynomials P*G,out(x) and P*G,in(x) have a unique, positive zero δout(G) and δin(G), respectively. In fact, the values of δout(G) and δin(G) depend on αout and αin, respectively.

Proof: In order to simplify the notation, we write the polynomial as

PG(x)=akxk++a1x+a0,aiZ. (24)

Also, we need to define

PG(x)=α-PG(x)=α-(akxk++a1x+a0). (25)

Observe that limxPG(x)=-. Assuming PG(0)>0 yields the inequality

α-a0>0orα>a0. (26)

Since the sequence of coefficients of PG(x) has only one sign change, Descartes’ Rule of Signs tells us that PG(x) has a unique zero δ. Now δ ∈ (0, 1) if

PG(1)=α-(ak++a1+a0)<0, (27)

and, finally

α<ak++a1+a0. (28)

Thus the inequalities (26) and (28) provide a range for δ ∈ (0, 1). It is evident that δ depends on the choice of α needed to satisfy the Inequalities (26) and (28). Finally, the theorem holds for the two different polynomials represented by the Eqs (22) and (23) along with the corresponding parameters αout and αin.

In the following, we elaborate briefly on the problem of choosing αout and αin. Again, consider the Inequalities (26) and (28), whose parameters can be real numbers or positive integers. If we choose real numbers, we get an infinite number of polynomials P*G,out(x) and P*G,in(x) whose roots lie in the interval (0, 1). If αout and αin are taken to be positive integers, the set of possible polynomials is finite. To determine the effect of the parameters on the roots, we appeal to the continuity theorem for complex and real polynomials, see [22]. This theorem states that the zeros of a polynomial are continuous functions of the coefficients of the polynomial, which mean a small change in the coefficients will cause only a small change in value of the zeros [22]. It seems to be unclear who was the first who proved the continuity theorem. Yet, it appears that a proof was already given by Weber in 1895, see [23]. Several other proofs of this statement have also been given independently, see [22].

Suppose, G is a directed graph and we wish to apply Theorem (2.1). To do this we have to determine the sets {αout[1],,αout[p]} and {αin[1],,αin[q]}, if we choose positive integers. The following ordering can be obtained by permuting the indices

αout[1]<αout[2]<αout[p], (29)

and

αin[1]<αin[2]<αin[q]. (30)

The sets of roots {δout1(G),,δoutp(G)} and {δin1(G),,δinq(G)} in the interval (0, 1) can also be obtained. Applying the continuity theorem may lead to a simplification. For instance, we always choose the minimum value namely αout[1]Z and αin[1]Z satisfying the Inequalities (26) and (28). Consequently, we could reduce the problem to the zeros {δout1(G)} and {δin1(G)}. In Section (3.5), we investigate numerically the effect of parameters αout and αin on the zeros of P*G,out(x) and P*G,in(x) numerically.

2.2 Graph complexity measures

In this section, we define some complexity measures on directed graphs based on the findings of Section (2.1). Recall the two polynomials, defined earlier, based on out- and in-degrees (see the Definitions (2.4), (2.5), and the modified polynomials with unique positive zeros in the interval (0, 1). Here, we argue that these zeros can serve as measures of the structural complexity of a directed graph. These measures are similar to those defined as a function of the eigenvalues of certain graph polynomials, see, e.g., [24, 25]. The eigenvalue based measures, represented by Eqs (33)(36), have been defined with an eye to reducing their degeneracy, see also [2426]. Degeneracy implies that a measure is unable to distinguish between non- isomorphic graphs, [4, 26], and is thus an undesirable property. Taking account of this we define the following measures.

Definition 2.6

I1(G)=δout(G), (31)
I2(G)=δin(G), (32)
I3(G)=12δout(G)+12δin(G) (33)
I4(G)=12δout(G)+12δin(G), (34)
I5(G)=|ln(δout(G))|+|ln(δin)(G)|, (35)
I7(G)=|E||V|2-|V| (36)

δout(G) and δin(G) are unique, positive zeros of the respective out- and in-degree polynomials. I7 is the well-known edge density [27].

2.3 Examples

In the previous section, we briefly discussed how to find the parameters αout and αin by using Theorem (2.1). Now, we calculate the polynomials P*G,out(x) and P*G,in(x) as well as their roots in some special cases. Consider the graphs shown in Fig (1). For G1 we determine

N0+={v7,v8,v9},N1+={},N2+={},N3+={v6},N4+={v4,v5},N5+={},N6+={v3},N7+={v1,v2},N0={v1,v2},N1={},N2={v3},N3={v4},N4={v5},N5={v6,v7},N6={v8,v9}.

and

a7out=2,a6out=1,a5out=0,a4out=2,a3out=1,a2out=0,a1out=0,a0out=3,
a6in=2,a5in=2,a4in=1,a3in=1,a2in=1,a1in=0,a0in=2.

Fig 1. Two example graphs from G1.

Fig 1

From definitions (2.4), (2.5), we obtain

PG1,out(x)=2x7+x6+2x4+x3+3,
PG1,in(x)=2x6+2x5+x4+x3+x2+2.

To determine the range of αout, we use the Inequalities (26) and (28) and infer

αout>3andαout<2+2+1+1+1+2=9. (37)

According to Theorem (2.1), P*G,out(x) and P*G,in(x) have a unique positive zero in the interval (0, 1) if 3 < αout < 9. If we choose positive integers, we obtain the set {4,5,6,7,8} as valid candidates. In Section (2.1) we explained that due to the continuity of the zeros, it makes sense to choose the minimum value of this set in order to calculate the zero. Thus,

P*G,out(x)=αout-PG,out(x)=4-(2x7+x6+2x4+x3+3)=0, (38)

gives δout(G1) ≐ 0.683953. Following the same procedure, we get

αin>2andαin<9. (39)

This leads to

P*G1,in(x)=αin-PG1,in(x)=3-(2x6+2x5+x4+x3+x2+2)=0. (40)

Solving Eq (40) finally gives δin ≐ 0.608309. Similarly, for G2 in Fig (1),

PG2,out(x)=8x+1, (41)
PG2,in(x)=8x+1. (42)

We see that PG2,out(x)=PG2,in(x). Also, we infer

αout=αin>1andαout=αin<9. (43)

So, αout = αin ∈ {2, 3, 4, 5, 6, 7, 8}. Finally,

P*G2,out(x)=αoutPG2,out(x)=2(8x+1)=0, (44)
P*G2,in(x)=αinPG2,in(x)=2(8x+1)=0. (45)

Hence, δout = δin ≐ 0.125. These findings are summarized in Table (1).

Table 1. Characteristics of the example graphs G1 and G2.

Op. G1 = (V, E), |V| = 9, |E| = 31 Op. G2 = (V, E), |V| = 9, |E| = 8
PG,out(x) = 2x7 + x6 + 2x4 + x3 + 3 PG,out(x) = 8x + x
PG,out*(x) = 1 − (2x7 + x6 + 2x4 + x3) PG,out*(x) = 1 − 8x
αout {4, 5, 6, 7, 8} αout {2, 3, 4, 5, 6, 7, 8}
PG,in(x) = 2x6 + 2x5 + x4 + x3 + x2 + 2 PG,in*(x) = 8x + 1
PG,in*(x) = 1 − (2x6 + 2x5 + x4 + x3 + x2) PG,in*(x) = 1 − 8x
αin {3, 4, 5, 6, 7, 8} αin {2, 3, 4, 5, 6, 7, 8}
I1 = 0.683953 I1 = 0.125
I2 = 0.608309 I2 = 0.125
I3 = 0.646131 I3 = 0.125
I4 = 0.803478 I4 = 0.353553
I5 = 0.876937 I5 = 4.158883
I7 = 0.430555 I7 = 0.111111

Op.—operator

3 Numerical results

3.1 Software and computation

For the work of this paper, we used R [28] to generate the numerical results. To generate the classes of directed graphs, we used igraph and its functions [29], see Section (3.2). Also, we performed tests using igraph to ensure the graphs are pairwise non-isomorphic as well as connected. Moreover, the packages graph and QuACN were used to determine the in-degree and out-degree distribution of the generated graphs [30, 31].

3.2 Definition of graph classes

In this section, we define the graph classes we used for performing the numerical analysis. Note that we always performed 1000 repetitions when generating the graphs as we deal with random graphs.

Definition 3.1 The class G1 contains 20.000 directed, connected randomly chosen graphs G = (V, E) with 9 vertices.

Note that we have relied on the Erdős—Rényi model [32] to generate these digraphs where min(|E|) = 8 and max(|E|) = 36. The number of edges and their direction were randomly selected.

Definition 3.2 G21 contains 500 directed, connected, hierarchical graphs G = (V, E) which are randomly generated. The vertex and the edge sets are given by [33]

V={v01,,v0|V0|,v11,,v1|V1|,v21,,v2|V2|,v31,,v3|V3|},EE1E2. (46)

These graphs have four levels. Level 0 is the root level. |Vi|, 0 ≤ i ≤ 3 is the number of vertices on level i. E1 represents the set of edges which jump exactly one level. E2 is the set of jump edges which over-jump at least one level. Here, all edges move upwards, which means from level 3 to the root level. Thus, 5 ≤ |V| ≤ 30.

Note that these graphs are usually called directed universal graphs and have been introduced in [33]. An example of such a graph is depicted in Fig (2).

Fig 2. An example graph GG21 where |V| = 23, |E| = 35.

Fig 2

Definition 3.3 G22 contains 500 directed connected hierarchical graphs G = (V, E) based on Definition (3.2), where |V| = 20 and 8 ≤ |E| ≤ 30.

The reason we choose hierarchical (random) graphs for our analysis is that they appear in many real world applications, see [33]. Hierarchical graphs appear in many disciplines such as biology, management and manufacturing, see [34, 35]. Noteworthy are BOM-structures (Bill of Material) [34, 35]. These graphs have been widely used to analyze production systems and for representing optimizational tasks.

3.3 Correlation analysis

In this section, we discuss correlations between the graph measures applied to the classes of graphs defined above. The discussion is limited to the results shown in Figs (3) and (4). Other correlations have been found but are not presented explicitly here. We begin with results shown in Fig (3) for graph class G1. Observe that there are many degenerate cases, i.e., non-isomorphic graphs having the same measure values. Interestingly, the two zeros of the polynomials P*G,out(x) and P*G,in(x) represented by the measures I1 and I2 are rather weakly correlated. See Fig (3a). Thus, these indices capture structural information differently on random graphs with 9 vertices. A plausible explanation for the values of the Spearman correlation being higher for the random graphs of G1 (shown by Fig (3d), (3g) and (3h) is that I3, I4, I5 also depend on I1. For some combinations of I1 vs. Ij, 2 ≤ j ≤ 5, the correlation is also weak and, hence, the associated measures give quite different values. For instance, this is the case for I1 and I2 on G21. A possible reason for the weak correlation and for the wide spread shown on the scatter plots is that the underlying hierarchical graphs have a more distinct structure compared to the completely random graphs G1. This also implies that the degeneracy is much lower compared with G1, which also holds for the scatter plots shown in Fig (3) for G22. All the values of the Spearman correlation are given in Table (2).

Fig 3. Scatter plots for I1 vs. Ij, 2 ≤ j ≤ 5.

Fig 3

Fig 4. Scatter plots for I7 vs. Ij, 2 ≤ j ≤ 5.

Fig 4

Table 2. Spearman correlation of the graph measures for G1, G21 and G22.

G1 G21 G22
I1 I2 I3 I4 I5 I1 I2 I3 I4 I5 I1 I2 I3 I4 I5
I2 0.3709 I2 0.0046 I2 -0.0146
I3 0.8221 0.8194 I3 0.8259 0.5106 I3 0.8403 0.4818
I4 0.8203 0.8199 0.9989 I4 0.7735 0.5956 0.9898 I4 0.7948 0.5626 0.9911
I5 -0.8171 -0.8188 -0.9959 -0.9990 I5 -0.7013 -0.6794 -0.9559 -0.9873 I5 -0.7316 -0.6422 -0.9626 -0.9894
I7 0.6112 0.6218 0.7439 0.7541 -0.7624 I7 0.0140 -0.0666 -0.0135 -0.0215 0.0286 I7 0.1825 0.4427 0.3741 0.4168 -0.4570

Similar results are obtained for all other combinations of Ii vs. Ij, 2 ≤ j ≤ 5, 2 ≤ i ≤ 6, which is the reason they are not shown explicitly here. Finally, consider Fig (4). These scatter plots as well as the values in Table (2), show that the edge density I7 has no structural relationship with all other measures. The left column of Fig (4), also shows that I7 is highly degenerated. This is not surprising given its definition. As indicated earlier, the measures applied to graph classes containing hierarchical graphs have fewer degeneracies.

3.4 Extremal graphs/relations

Now we take a closer look at graphs that attain maximum or minimum values under the graph measures of Definition (2.6). Fig (5) shows two graphs for which max(I1) and min(I1) obtains. Take graph G3 as an example. To calculate PG3,out*(x), we apply Theorem (2.1) and solve the equation

αout-(x8+8)=0. (47)

Fig 5. Two graphs G1 maximizing and minimizing I1 and I2.

Fig 5

According to Theorem (2.1), we know that Eq (47) has a unique, positive zero in the interval (0, 1) which obviously depends on the parameter αout. Solving Eq (47) gives x=αout-88. So, to maximize graph measure I1 we have to determine maxαout{αout-88}. Note that Theorem (2.1) also gives 8 < αout < 9. In Table (3), we set αout = 8.5 in order to compute the values of the graph complexity measures. Clearly, this is the maximum value of I1 for the given parameter. The case min(I1) for G4 can be shown analogously.

Table 3. Polynomials, parameters and graph measures for calculating I1 and I2 for G3, G4G1.

Op. G3 = (V, E), |V| = 9, |E| = 8 Op. G4 = (V, E), |V| = 9, |E| = 8
PG,out(x) = 1x8 + 8 PG,out(x) = 8x + 1
PG,out*(x) = 0.5 − x8 PG,out*(x) = 1 − 8x
αout {8.5} αout {2, 3, 4, 5, 6, 7, 8}
PG,in(x) = 8x + 1 PG,in(x) = x7 + x + 7
PG,in*(x) = 1 − 8x PG,in*(x) = 1 − (x7 + x)
αin {2, 3, 4, 5, 6, 7, 8} αin {8}
I1 = 0.917004 I1 = 0.125
I2 = 0.125 I2 = 0.796544
I3 = 0.521002 I3 = 0.460772
I4 = 0.655578 I4 = 0.623023
I5 = 2.166084 I5 = 2.306914
I7 = 0.111111 I7 = 0.111111

Op.—operator

Now, we are in position to begin our analysis of extremal conditions and relations between digraphs.

Theorem 3.1 Let G = (VG, EG) and H = (VH, EH), be two digraphs. Define

PG(x)α(a1Gx+a0G),aiG, (48)
PH(x)α(anHxn+an1Hxn1++a1Hx+a0H),aiH, (49)

and assume that there exist graphs G and H with the given polynomials. Also, assume that the conditions of Theorem (2.1), namely,

a0G<α<a1G+a0G, (50)
a0H<α<anH+an1H++a1H+a0H, (51)

are satisfied, and there exists α satisfying the Inequalities (50) and (51). The equation

α-a0Ga1G=δα(G)>δα(H), (52)

holds if

α<anH(a1G)n(αa0G)n+an1H(a1G)n1(αa0G)n1++a1Ha1G(αa0G)+a0H. (53)

Proof: Consider the polynomials represented by the Eqs (48) and (49). Since we assume the Inequalities (50) and (50) are valid, Theorem (2.1) assures us that δα(G) and δα(H) lie in the interval (0, 1). The notation δα makes explicit the dependence of the zeros on the parameter α. Clearly, PG(x)=α-(a1Gx+a0G)=0 implies δα(G)=α-a0Ga1G. Assuming PH(δα(G)) < 0, we conclude that Inequality (52) must be satisfied. But

PH(δα(G))=α[anH(a1G)n(αa0G)n+an1H(a1G)n1(αa0G)n1++a1Ha1G(αa0G)+a0H]<0, (54)

implies the Inequality (53).

Corollary 3.1 Referring to Theorem (3.1), we assume that for the two polynomials represented by the Eqs (48) and (49) the Inequalities (50) and (50) are satisfied. If a1H=a1G and a0G=a0H, then Inequality (52) always holds.

Proof: Setting a1H=a1G and a0G=a0H in Inequality (53), we obtain

anH(a1G)n(αa0G)n+an1H(a1G)n1(αa0G)n1++a2H(a1G)2(αa0G)2>0. (55)

As α>a0G, Inequality (55) is always satisfied.

Finally, we state the following theorem.

Theorem 3.2 Let G = (VG, EG) and H = (VH, EH) be two digraphs, and define

PG(x)α(anGxn+an1Gxn1++a1Gx+a0G),aiG), (56)
PH(x)α(anHxn+an1Hxn1++a1Hx+a0H),aiH. (57)

assuming that there are graphs G and H with the given polynomials. Now,

a0G<α<anG++a1G+a0G, (58)
a0H<α<anH++a1H+a0H. (59)

Choose α such that it satisfies the Inequalities (58) and (59). If

anG<aiHfor1in, (60)

then

δα(G)>δα(H). (61)

Both δα(G) and δα(H) lie in (0, 1).

Proof: From the Inequality-System (60), we derive

PG(x)=α(anGxn+an1Gxn1++a1Gx+a0G)>α(anHxn+an1Hxn1++a1Hx+a0H)=PH(x). (62)

But PG(x) > PH(x) implies Inequality (61).

Consider the two graphs G4 (see Fig (5)) and G (see Fig (2)). We use these examples to demonstrate Theorem (3.1). In this demonstration, we consider only the out-degree polynomial and obtain

P*G4,out(x)=α(8x+1), (63)
P*G,out(x)=α(x8+x7+5x2+10x+6). (64)

Note that G4 and G2 (see Fig (1) and Table (1)) have the same polynomial PG4,out(x) and PG2,out(x) but the two graphs are non-isomorphic. We call such polynomials degenerate. The Inequalities (26) and (28) give for G4, 1 < α < 9 and for G, 6 < α < 23. So, if we choose, e.g., α = 7, these two inequalities are satisfied. So, we need to check whether the Inequality (53)

7<188(71)8+187(71)7+585(71)5+108(71)+6, (65)

holds in this case. Since 7 < 14.920, Theorem (3.1) says

δα=7(G4)>δα=7(G). (66)

From Eqs (63) and (64) with α = 7, we finally obtain

δα=7(G4)=34andδα=7(G)0.099995. (67)

The two graphs shown by Fig (6A) provide another illustration of Theorem (3.2). Again, we deal only with the out-degree polynomials. From Fig (6), we determine

P*G5,out(x)=α(x3+x2+2x+4), (68)
P*G6,out(x)=α(2x3+2x2+3x+6). (69)

Fig 6. The two example graphs G5 and G6.

Fig 6

The two graphs have a different number of vertices and edges but their underlying out-degree polynomials have the same degree. Moreover, the Inequality-System (60) is satisfied. For Inequalities 4 < α < 8 for G5 and 6 < α < 13, we can choose α = 7. Calculating the zeros of the polynomials represented by the Eqs (68) and (69) gives

δα=7(G5)0.843734>δα=7(G6)0.271069. (70)

3.5 Homogeneity of The Zeros—Influence of αout and αin

In this section, we briefly investigate the influence of αout and αin. To measure the divergences between the resulting zeros, we define a homogeneity measure.

For this discussion we restrict attention to out-degrees; analogous results hold for in-degrees. Let G be a directed graph with associated polynomials PG,out(x) and P*G,out(x). Suppose the Inequalities (26) and (28), yield koutG possible values αoutG,1,,αoutG,koutG. Thus, δoutG,1,δoutG,2,,δoutG,koutG are possible roots of P*G,out(x) = 0.

To define the homogeneity of the set

SoutG={δoutG,1,δoutG,2,,δoutG,koutG}. (71)

we use a real valued distance measure, namely,

d(x,y)=|x-y|,x,yR. (72)

The homogeneity of SoutG is defined by

h(SoutG)=1koutG(koutG-1)i=1koutGj=1koutGd(δoutG,i,δoutG,j). (73)

The value for in-degrees is defined similarly. A high h-score indicates that the set SG is inhomogeneous while a small value of h gives a high homogeneity rank of SG. The definition is illustrated in Fig (7).

Fig 7. Distributions of the h(SoutG) and h(SinG) values for all three graph classes.

Fig 7

The homogeneity values are plotted against the number of polynomials in Fig (7). First, observe that the distributions of the homogeneity values for out-degrees and in-degrees look very similar. Also, we observe that the differences between the zeros is quite small, which implies that homogeneity is high. This can be seen from the value-range in Fig (7). This result is not surprising as we explained in Section (2.1), i.e., the zeros of a polynomial are continuous functions of the coefficients of the polynomial. In fact, if we vary αout or αin, we see that the coefficients of the resulting polynomials are quite similar, when only the constant terms αout and αin are changed, see the Eqs (22) and (23). Therefore, the small differences between the roots (and the high homogeneity values) reflect the continuity theorem for complex and real polynomials, see [22].

3.6 Computational complexity

In this section, we briefly sketch some ideas to determine the computational complexity to compute δ. Note that calculating the vertex degrees requires polynomial time, i.e., O(n2) in case n is the order of an input graph. Assigning the out- and in-degrees to the monomials xi can be achieved in constant time and adding up those terms requires linear time complexity, i.e., O(k); k is the degree of the polynomial. Altogether, we see that we are able to construct an efficient algorithm to compute δ.

4 Summary and conclusion

In this paper, we have introduced new complexity measures for real networks. One reason for developing an alternative to degree-based measures such as the Zagreb indices [4] or entropies based on vertex degrees [36] is their high degeneracy, meaning that many pairwise non-isomorphic graphs have the same measured value. Similar to [37], we developed a polynomial-based approach for measuring the complexity of directed graphs. To the best of our knowledge, there are very few measures for directed graphs, e.g., treewith and girth, and these are true complexity measures for encoding structural information of a directed graph. Following [37], we developed graph polynomials based on the out- and in-degrees of directed graphs and constructed modified polynomials which posses a unique, positive zero in the interval (0, 1), depending on up to two parameters αout and αout. These zeros δout and δin can be interpreted as complexity measures. Interestingly, we start with a graph invariant and construct polynomials which are associated with the graph. However, the graph measures defined here are algebraic quantities representing the zeros of polynomials.

Analytical results showing relationships between the graph measures have been demonstrated; we have obtained numerical results that show correlations between the graph measures, and have investigated the homogeneity of the zeros (graph measures). We compared our graph measures with the well-known edge density and found that our measures capture structural information differently. Also some of our measures seem to have useful properties, e.g., they possess a high discrimination power for graphs with a distinct graph topology. Our approach to analyzing the complexity of directed graphs is promising in that low computational complexity (i.e., vertex degrees of a directed graph can be determined in polynomial time) allows for applying the polynomial based measures to large networks.

As part of our ongoing research, we plan to continue investigating extremal properties of the measures. Also, we should like to perform a correlation analysis with other measures on a large scale, if we can find ones that can be computed in polynomial time. Existing measures based on game theory are computationally complex, see, e.g., [13].

Data Availability

All relevant data are within the manuscript. The generation process of the data is described in the paper as we deal with random graphs it make no sense to upload the data.

Funding Statement

Matthias Dehmer thanks the Austrian Science Funds for supporting this work (project P 30031). Lihua Feng was supported by the National Natural Science Foundation of China (Nos. 11671402, 11871479), the Hunan Provincial Natural Science Foundation (2016JJ2138, 2018JJ2479) and the Mathematics and Interdisciplinary Sciences Project of Central South University. Jin Tao was supported by Academy of Finland (no. 315660).

References

  • 1. Bang-Jensen J, Gutin G. Digraphs: Theory, Algorithms and Applications. 1st ed Springer-Verlag; 2007. [Google Scholar]
  • 2. Gruber H. Digraph complexity measures and applications in formal language theory. Discrete Mathematics & Theoretical Computer Science. 2012;14:189–204. [Google Scholar]
  • 3. Obdržálek J. DAG-width—connectivity measure for directed graphs. Symposium on Discrete Algorithms, ACM-SIAM. 2006; p. 814–821. [Google Scholar]
  • 4. Todeschini R, Consonni V. Handbook of Molecular Descriptors. Wiley-VCH; 2002. [Google Scholar]
  • 5. Rodrigue JP, Comtois C, Slack B. The Geography of Transport Systems. Taylor & Francis; 2013. [Google Scholar]
  • 6. Junker BH, Schreiber F. Analysis of Biological Networks Wiley Series in Bioinformatics. Wiley-Interscience; 2008. [Google Scholar]
  • 7. Emmert-Streib F, Dehmer M. Networks for Systems Biology: Conceptual Connection of Data and Function. IET Systems Biology. 2011;5:185–207. 10.1049/iet-syb.2010.0025 [DOI] [PubMed] [Google Scholar]
  • 8. Wiener H. Structural Determination of Paraffin Boiling Points. Journal of the American Chemical Society. 1947;69(17):17–20. 10.1021/ja01193a005 [DOI] [PubMed] [Google Scholar]
  • 9. Mowshowitz A. Entropy and the complexity of graphs: I. An index of the relative complexity of a graph. Bulletin of Mathematical Biophysics. 1968;30:175–204. 10.1007/bf02476948 [DOI] [PubMed] [Google Scholar]
  • 10. Knor M, Skrekovski R, Tepeh A. Some remarks on Wiener index of oriented graphs. Applied Mathematics and Computation. 2016;273:631–636. 10.1016/j.amc.2015.10.033 [DOI] [Google Scholar]
  • 11. Klavžar S, Rajapakse A, Gutman I. The Szeged and the Wiener index of graphs. Applied Mathematics Letters. 1996;9(5):45–49. 10.1016/0893-9659(96)00071-7 [DOI] [Google Scholar]
  • 12. Harary F. Structural models An introduction to the theory of directed graphs. Wiley; 1965. [Google Scholar]
  • 13. Johnson T, Robertson N, Seymour PD, Thomas R. Directed tree-width. Journal of Combinatorial Theory, Series B. 2001;82:138–154. 10.1006/jctb.2000.2031 [DOI] [Google Scholar]
  • 14. Bertz SH, Pereira GZ, Zamfirescu CMD. Complexity and Diversity of Digraphs In: Minai AA, Braha D, Bar-Yam Y, editors. Unifying Themes in Complex Systems. Springer; 2011. p. 31–40. [Google Scholar]
  • 15. Hunter P, Kreutzer S. Digraph measures: Kelly decompositions, games, and orderings. Theoretical Computer Science. 2008;399:206–219. 10.1016/j.tcs.2008.02.038 [DOI] [Google Scholar]
  • 16. Berwanger D, Grädel E. Entanglement—A Measure for the Complexity of Directed Graphs with Applications to Logic and Games In: Proceedings of LPAR 2004, Montevideo, Uruguay: vol. 3452 of LNCS. Springer-Verlag; 2005. p. 209–223. [Google Scholar]
  • 17. Estrada E, Hatano N. Returnability in complex directed networks (digraphs). Discrete Mathematics & Theoretical Computer Science. 2009;430:1886–1896. [Google Scholar]
  • 18. Ye C, Wilson RC, Comin CH, da F Costa L, Hancock ER. Entropy and Heterogeneity Measures for Directed Graphs In: Hancock E, Pelillo M, editors. Similarity-Based Pattern Recognition. SIMBAD 2013. Lecture Notes in Computer Science. vol. 7953 Springer; 2013. p. 219–234. [Google Scholar]
  • 19. Dehmer M, Emmert-Streib F. Quantitative Graph Theory Theory and Applications. CRC Press; 2014. [Google Scholar]
  • 20. Emmert-Streib F, Dehmer M, Shi Y. Fifty years of graph matching, network alignment and network comparison. Information Sciences. 2016;346-347:180–197. 10.1016/j.ins.2016.01.074 [DOI] [Google Scholar]
  • 21. Harary F. Graph Theory. Addison Wesley Publishing Company; 1969. [Google Scholar]
  • 22. Marden M. Geometry of Polynomials. 2nd ed American Mathematical Society; 1966. [Google Scholar]
  • 23. Weber H. Lehrbuch der Algebra, Vol. 1; 1895. [Google Scholar]
  • 24. Dehmer M, Shi Y, Mowshowitz A. Discrimination power of graph measures based on complex zeros of the partial Hosoya polynomial. Applied Mathematics and Computation. 2015;250:352–355. 10.1016/j.amc.2014.10.048 [DOI] [Google Scholar]
  • 25. Dehmer M, Moosbrugger M, Shi Y. Encoding structural information uniquely with polynomial-based descriptors by employing the Randić matrix. Applied Mathematics and Computation. 2015;268:164–168. 10.1016/j.amc.2015.04.115 [DOI] [Google Scholar]
  • 26. Konstantinova EV. The Discrimination Ability of Some Topological and Information Distance Indices for Graphs of Unbranched Hexagonal Systems. J Chem Inf Comput Sci. 1996;36:54–57. 10.1021/ci9502461 [DOI] [Google Scholar]
  • 27. da F Costa L, Rodrigues F, Travieso G. Characterization of complex networks: A survey of measurements. Advances in Physics. 2007;56:167–242. 10.1080/00018730601170527 [DOI] [Google Scholar]
  • 28. R Core Team. R: A Language and Environment for Statistical Computing; 2018. Available from: https://www.R-project.org/. [Google Scholar]
  • 29. Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal. 2006;Complex Systems:1695. [Google Scholar]
  • 30.Gentleman R, Whalen E, Huber W, Falcon S. graph: A package to handle graph data structures; 2017.
  • 31. Mueller LAJ, Kugler KG, Dander A, Graber A, Dehmer M. QuACN: An R package for analyzing complex biological networks quantitatively. Bioinformatics. 2011;27:140–141. 10.1093/bioinformatics/btq606 [DOI] [PubMed] [Google Scholar]
  • 32. Erdős P, Rényi A. On random graphs. Publicationes Mathematicae. 1959;6:290–297. [Google Scholar]
  • 33. Dehmer M, Emmert-Streib F. Structural similarity of directed universal hierarchical graphs: A low computational complexity approach. Applied Mathematics and Computation. 2007;194:7–20. 10.1016/j.amc.2007.04.006 [DOI] [Google Scholar]
  • 34. Hopp WJ, Spearman ML, Physics F. Foundations of Manufacturing Management New York: Irwin; 2011. [Google Scholar]
  • 35. Ptak C, Smith C. Orlicky’s Material Requirements Planning 3/E McGraw-Hill’s AccessEngineering. Mcgraw-hill; 2011. Available from: https://books.google.at/books?id=CMKNhCdROt4C. [Google Scholar]
  • 36. Bonchev D. Information Theoretic Indices for Characterization of Chemical Structures. Research Studies Press, Chichester; 1983. [Google Scholar]
  • 37.Dehmer M, Chen Z, Emmert-Streib F, Mowshowitz A, Varmuza K, Jodlbauer H, et al. The Orbit-polynomial: A novel Measure of Symmetry in Graphs. submitted for publication. 2019;.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All relevant data are within the manuscript. The generation process of the data is described in the paper as we deal with random graphs it make no sense to upload the data.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES