Abstract
This paper uses the reliability polynomial, introduced by Moore and Shannon in 1956, to analyze the effect of network structure on diffusive dynamics such as the spread of infectious disease. We exhibit a representation for the reliability polynomial in terms of what we call structural motifs that is well suited for reasoning about the effect of a network’s structural properties on diffusion across the network. We illustrate by deriving several general results relating graph structure to dynamical phenomena.
I. INTRODUCTION
Characterizing networks in a way that is directly relevant to diffusion phenomena on the network is important, but difficult. We argue that the Network Reliability Polynomial introduced by Shannon and Moore [1] is a characterization that folds together static measures like degree, modularity, and measures of centrality into precisely the combinations that are most relevant to the dynamics. [2] Conversely, knowledge of reliability can be used to infer structure, in the sense of network tomography. [3] Furthermore, reliability is a useful concept for reasoning more generally about the consequences of structural changes. Colbourn [4] gives a comprehensive introduction to network reliability, while Youssef [5] provides a brief derivation of the form of the reliability polynomial.
A network’s reliability is the probability that it continues to function after sustaining damage to its component edges and/or vertices. Reliability depends on a parameterized damage model D(x⃗), which specifies the probability of damaging a particular set of components, and a parameterized property 𝒫(α⃗), which specifies what it means for the network to function. Canonical problems in diffusion over fixed networks can be cast in terms of reliability. For example, a damage model D(x) under which edges (respectively, vertices) fail independently with probability 1 − x represents bond (respectively, site) percolation. The choice of 𝒫 then selects the dynamical phenomenon of interest. For example, the property “contains a connected component including at least a fraction α of the vertices” is appropriate for studying the existence of a giant component. For ease of notation, we often express 𝒫 in terms of a corresponding reliability rule r𝒫, a binary function indicating whether property 𝒫 holds for a given graph. We say that the rule accepts a graph g if and only if r(g) = 1, that is g has the property 𝒫.
Here we introduce four different reliability rules. The first three are the most commonly used rules, followed by the last one that could be of use to study percolation problems. To make it easier to understand, we refer to real world problems like designing reliable communication networks and epidemiology which benefit from each of these rules:
Two-Terminal: a graph is accepted if it contains at least one directed path from a distinguished vertex S (the source) to another distinguished vertex T (the terminus). Reliability under this rule is the probability that the specified source can send a message to the specified terminus in a damaged network. [6]
K-Terminal: a graph is accepted if every vertex is in a connected component that includes at least one of K specified vertices. For example, consider a set of K nodes as service centers. Then reliability under this rule is the probability that every operational site in a damaged network is connected to at least one service center. [7]
All-Terminal: a graph is accepted if it consists of a single connected component. The reliability under this rule, also known as system reliability is the probability that every pair of nodes in a damaged network can communicate with each other, or alternatively, that any vertex in a damaged network can broadcast to all the other vertices. [8]
-
EAR-α: To understand this rule we discuss the application of bond percolation for the study of the spread of infectious diseases on networks. Such a bond percolation disease model was explained by [9]
The probability that an edge does not fail represents the transmissibility, i.e., the conditional probability of transmitting infection from one person to another, conditioned on the source being infectious and the destination being susceptible. One of the most important properties of disease dynamics is the attack rate, defined as the fraction of the population infected in an outbreak. Most models of infectious disease exhibit a sharp transition in the attack rate at a critical value of transmissibility. Indeed, this is a percolation phase transition.
Using reliability we can find the relationship between critical transmissibility and all structural information of a network contained in its edge list. In this case the reliability is the probability that the expected attack rate for an outbreak seeded in a single person chosen uniformly at random from the population is at least α. Hence, we call this rule EAR-α.
These rules are all coherent. That is, any graph formed by adding an edge to an accepted graph is also accepted.
In a companion to this paper [5], we have shown how the concept of network reliability together with an efficient, scalable estimation scheme can shed light on complicated dynamical trade-offs between local structural properties such as assortativity-by-degree and the number of triangles.
Here we introduce a different representation of the reliability polynomial that highlights the role certain network structures play in dynamical phenomena.
We show how coefficients of the reliability polynomial can be interpreted in terms of topological motifs in the network and their overlaps. Conversely, we illustrate how knowledge of these motifs and their overlaps can be used to infer important constraints on the dynamics of diffusion processes on the network.
The representation in [5] is well-suited for computational analysis of networks with up to 108 edges, but is analytically tractable only for small networks; the representation presented in the current work is analytically tractable, but computationally feasible only for small networks because of its combinatorial complexity. Thus the results of this paper exactly complement those of the previous paper.
II. RELIABILITY POLYNOMIALS
We use the common notation of G(V,E) for a graph with V vertices and E edges. The graph may be directed or undirected, and it is possible to have multiple edges between two vertices.
The vertices and edges may be labeled. The general case of directed edges and labeled vertices and edges is powerful enough to represent extremely complex networks such as interdependent infrastructure networks. Here, without loss of generality, we restrict ourselves to homogeneous networks represented as undirected, unlabeled graphs.
A. Definition and a common representation
The reliability R(G, 𝒫(α⃗), D(x⃗)) of a network G with respect to the property 𝒫 under damage model D is the probability that a subgraph of G chosen with probability given by D has property 𝒫, the binary rule r𝒫 examines whether subgraph has property 𝒫 or not, if it has then:
otherwise
i.e., reliability can be interpreted as the expected value of the reliability rule operator over different subgraphs of G. We will explicitly include the dependence on the network G and the property 𝒫 in notation such as R(G, 𝒫(α⃗), D(x⃗)) only when we wish to distinguish the reliability of two different graphs or two different properties. Moreover, we will not include the damage model itself, but only the values of its parameters x⃗. Finally, for a homogeneous network in which all edges (or all vertices) fail with the same probability, x⃗ is a scalar, x. Thus we can write the reliability simply as R(x).
| (1) |
For the independent-edge damage model, in which the probability of selecting a subgraph g ⊆ G depends only on the number of its edges, |g| = k, and is xk(1 − x)E − k, we have:
| (2) |
We can re-write 2in terms of sum over subgraphs of different sizes, introduced by Alon et al.[10] as motifs:
| (3) |
Rk is the number of subgraphs of G with exactly k edges that are accepted by the rule. For computational convenience, we often prefer to work with normalized coefficients
| (4) |
Pk is the fraction of subgraphs of kG with exactly k edges that are accepted by the rule. Pk ≤ Pk+1 for a coherent rule. Pk can be estimated efficiently via monte Carlo simulation. [5]
Substituting Rk coefficients in Equation 2 with Pks from 4, we can see the resemblance to binomial distribution, since Pk ≤ 1 it is clear that
| (5) |
Therefore, R(x) : [0, 1] → [0, 1] is a continuous polynomial with only a finite, but possibly large, number of coefficients Rk, k ∈ {0, …, E}. That is, the reliability can be thought of as a vector in an E + 1-dimensional vector space, and the Rk’s as the components of the vector in the basis xk(1 − x)E−k. There are, of course, many other bases we could choose for this space. An orthogonal basis, such as the first E + 1 Legendre polynomials, might have useful estimation properties. Here we use another non-orthogonal basis – the functions xk ı.e. the Taylor series expansion – because of its simplicity and its attractive interpretation. There is a unique mapping from coefficients in one basis to those in the other, which can be derived by expanding the factor (1 − x)E−k in Equation ??:
| (6) |
where
| (7) |
The Nl coefficients are signed integers. In section IIC we will explain how we can interpret these coefficients.
B. Structural motifs
We can express the reliability polynomial in terms of overlaps among certain distinguished subgraphs. These subgraphs are the 𝒫-minimal subgraphs of G. A graph g is 𝒫-minimal if and only if:
g has property 𝒫; and
there is no proper subgraph g′ ⊂ g that has property 𝒫.
Obviously, whether a graph is 𝒫-minimal or not depends on the property 𝒫, or equivalently here, the reliability rule. For example, 𝒫-minimal graphs under a TwoTerminal rule are paths from S to T with no extraneous edges, i.e., no loops or dead ends; under the All-Terminal rule, they are spanning trees. In general, the rule selects a distinctive topological pattern (e.g. path, spanning tree) that may occur many times in a given graph, i.e., if we consider all subgraphs of a distinctive pattern to be the motifs introduced by Alon et al.[10] then reliability rule selects a subset of these motifs that has the property 𝒫. We refer to subsets generated by a particular rule as structural motifs because, as we will demonstrate, they are the structural elements of the network that completely determine the occurrence of dynamical phenomena of interest, as specified by 𝒫. One advantage of using this representation is that the contribution of the structural motifs to the reliability is known exactly for all 𝒫 and all D, as is shown below.
To describe this, we apply two terminal reliability rule with source and terminus nodes as its parameters on a small toy network, depicted in Figure 1. For this rule structural motifs are simple paths (including but not limited to shortest paths) connecting these two nodes. The structural motifs and their overlap can be seen in Figure 1.
FIG. 1.
Example calculation of two-terminal reliability in the toy network in the left panel. The second panel from the left shows the four motifs; the third shows all unions of two motifs. There are unions of 3 motifs, all of which give the entire graph as shown on the top of the right panel; there is one union of all motifs, which is also the entire graph. The number next to each union of motifs gives its size.
In another example, we consider the two-terminal reliability rule on a two dimensional grid with 4 nodes in each dimension. For a given source node we look at two different terminus nodes to illustrate the dependence of structural motifs on parameters of the reliability rule. Several structural motifs of different sizes for each set of selected sources and termini are shown in Figure 2.
FIG. 2.
Structural motif for the two-terminal reliability rule are shown for same source node S and two different target nodes T1 and T2. We can see that structural motifs of the same network varies with the parameters of property 𝒫. In the left grid three structural motifs of sizes 6 are shown: note that black dashed path and gray line have all but two edges in common. For the grid on the right, black normal, black dashed and gray paths represent motifs of sizes 3, 5, 9 respectively. The last two have two edges in common.
C. Contribution of structural motifs to R(x)
The sizes of each motif and their unions completely determine a network’s reliability. We demonstrate this using an Inclusion-Exclusion argument motivated by a series of straightforward examples. A detailed proof and more examples are provided in Appendices A and B.
1. Example 1: A single structural motif
Suppose the network G contains only one structural motif and that it is a set of k0 edges. For example, for Two-Terminal reliability, suppose that there is exactly one path between S and T, and that it has length k0. Then the motif will occur exactly once among all sub-graphs of size k0. For k > k0, we must “use up” k0 edges to build the structural motif. This leaves E − k0 other edges from which, because the rule is assumed to be coherent, any set of k−k0 produces an acceptable subgraph of size k. Hence for this case
| (8) |
2. Example 4: N disjoint structural motifs
Suppose the graph has exactly N structural motifs, that all have k0 edges, and that the N edge sets are disjoint. Arguing as above, with the convention that , gives:
| (9) |
3. Example 5: Two overlapping structural motifs
Suppose the graph has exactly two structural motifs, that both have k0 edges, and that the number of edges in the union of the two is k0+Δ. Arguing as in Example 1, we get a similar result, with 2k0 replaced by k0 + Δ:
| (10) |
4. The general case
Suppose the graph has exactly N structural motifs. As above, its reliability polynomial will be determined by the size of each structural motif and the overlaps among them. Define as the number of combinations of l structural motifs whose union contains exactly k edges. Also, define
| (11) |
Then arguing as above gives
| (12) |
In Appendix C, we present constraints on Nk.
Using Equation 11 and 12 we can determine the reliability coefficients for the two-terminal reliability rule, by analyzing the four motifs as shown in Figure 1. Results of this computation is shown in Table I.
TABLE I.
By inspection, we have the values for given in the table on the right, and the values of Nk and Rk as given by Equations 11 and 12.
| l | k |
|
Nk | Rk | |
|---|---|---|---|---|---|
| 1 | 3 | 3 | 3 | 3 | |
| 4 | 0 | 12 | |||
| 1 | 5 | 1 | |||
| 2 | 5 | 2 | −1 | 17 | |
| 2 | 6 | 3 | −3 | 7 | |
| 2 | 7 | 1 | |||
| 3 | 7 | 4 | |||
| 4 | 7 | 1 | 2 | 1 | |
For two dimensional grid, we study only the left case of two cases in Figure 2. For two terminal reliability from S to T there exist 184 structural motifs of sizes 6 to 14, thus it is not trivial to draw all structural motifs and their overlap. We provide computation for Nk and Rk for k ≤ 10 in Table II.
TABLE II.
Number of structural motifs and their overlap for the two terminal reliability rule on the left grid in Figure 2.
| k |
|
|
|
Nk | Rk | |||
|---|---|---|---|---|---|---|---|---|
| 5 | 0 | 0 | 0 | 0 | 0 | |||
| 6 | 20 | 0 | 0 | 20 | 20 | |||
| 7 | 0 | 0 | 0 | 0 | 360 | |||
| 8 | 36 | −30 | 0 | 6 | 3066 | |||
| 9 | 0 | −84 | 0 | −84 | 16332 | |||
| 10 | 48 | −146 | 144 | 10 | 60670 |
D. Reliability in terms of structural motifs
Given the rather complicated relationship between Rk and Nk in Equation 12, it is somewhat surprising that R(x) can be expressed very simply in terms of Nk. Consider the contribution of a single structural motif of size k0 to R(x). Using Equation 12, . This set of coefficients determines R(x)
| (13) |
Since the effect of each structural motif, and each motif overlap, is additive on Rk, we can reduce the general case to sums like the above, so we immediately find:
| (14) |
Thus the Nk defined in Equation 11 are indeed the same coefficients as those introduced in Equation 7.
E. Alternative damage models
The reasoning above is all done in the context of the usual edge damage model introduced by Moore and Shannon. This damage model is appropriate for studying bond percolation. An entirely analogous set of arguments applies to a vertex damage model, in which a set of k vertices is chosen uniformly at random, producing a unique subgraph containing all the edges whose endpoints are both in the selected set of vertices. This damage model is appropriate for studying site percolation. Coefficients analogous to Pk and Nk can be derived (substituting the number of vertices V for the number of edges E wherever it appears) and structural motifs can be defined in terms of vertex removal instead of edge removal. The physical interpretation of Nk in terms of these structural motifs is the same. It is likely that there are many other damage models with these properties. Here, we consider only the edge damage model, because it serves to illustrate the role of structural motifs and its analysis is simpler.
III. ESTIMATES AND BOUNDS ON RELIABILITY FOR SPECIAL CASES
In this section, we illustrate how the study of structural motifs and their overlaps helps understand network reliability under several different rules.
A. Exact expressions
If the network contains m structural motifs that are disjoint, and they all have the same size k0, the only non-zero coefficients are , yielding
| (15) |
If the network contains m structural motifs, every pair overlaps in all but one edge, and they all have the same size k0, the only non-zero coefficients are , yielding
| (16) |
We can use the identities given in Eq. C4 and C5 to evaluate Nk for the following case: there are f structural motifs; they all have the same size k0; and all unions of structural motifs have one of only one or two other sizes. Although this case is somewhat artificial, note that the first two conditions are satisfied for any network under the AR-α rule. It seems likely that the last restriction can be relaxed if additional combinatorial identities are brought to bear on the problem. First suppose that the only nonzero coefficients Nk are for k0 and k1. Then we must have the following:
| (17) |
These simultaneous equations admit a solution only for m = 2, for which Nk1 = 1 − m and hence
| (18) |
Note that k1 is not determined by this argument; however, it is easy to see that k0 + 1 ≤ k1 ≤ mk0. Now consider the above case, but with three nonzero coefficients instead of two. We have:
| (19) |
If we look for solutions with Nk2 ≥ 1, we must have m ≥ 3. Then the solution is
| (20) |
This gives
| (21) |
B. Perturbative estimates of reliability
Since R(x) is defined for x in the interval [0, 1], it is tempting to think that the lowest-order term in x that appears in the reliability polynomial, i.e., Nkminxkmin, is a good estimate of its value. Note that , since any union of 2 or more structural motifs must contain more than kmin edges. Moreover Nkmin = Rkmin. Unfortunately, because the coefficients Nk may grow combinatorially and may be either positive or negative, the leading order coefficient may not be sufficient to determine behavior of the reliability polynomial far from 0. For example, if there are very few different structural motifs with kmin edges (specifically, if Nkmin ≪ N1+kminx), the contribution of Nkmin may be overwhelmed by larger structural motifs. Nevertheless, evaluating the lowest-order term provides insight into the relationship between graph structure and reliability.
- [wide]All-Terminal reliability: Recall that the structural motifs for the All-Terminal rule are spanning trees. Each such tree has exactly V − 1 edges. NV − 1 is thus the number of spanning trees, so the lowest-order term in the reliability polynomial is NV − 1xV − 1. The (Kirchhoff) Matrix Tree Theorem [11] gives NV − 1 in terms of a cofactor of the graph Laplacian matrix. AR-α reliability: The structural motifs for the AR-α reliability rule are trees that contain at least αV vertices. Letting t be the number of such trees, the leading order term in R(x) is txαV − 1. Higher-order terms depend on how the trees overlap. We can use this to establish a tight lower bound on R(x) for one particular choice of α. The lower bound is generated by graphs that minimize the coefficient of the next higher order term xαV. This in turn requires that as many as possible of the motifs overlap in all but one edge. For example, beginning with a single tree, we can change one edge to any other edge that is not already in the tree and does not create a loop in the tree. There are at most E − (αV − 1) ways to do this, depending on the graph. Thus there is a graph with t trees, each of which contains α vertices, each of which differs from any other by exactly 2 edges, if and only if t ≤ E + 2 − αV. In this case, R(x) = xαV − 2(1 − x)t. As far as we know, this particular tree structure occurs only for αV = E − 1. The graph in which it occurs has a central vertex of degree t connected to t linear chains of length E/t (thus t must divide E evenly). The trees contain every edge except the last edge on one of the chains. EAR-α reliability: Satisfying the EAR-α rule demands that the sum of squared component sizes equals or exceeds αV2. What are the structural motifs for this rule? Consider a partition Π of V, i.e., a set of positive integers πi whose sum is V. The number of elements in Π varies from one partition to another. Then πi could represent the number of vertices in the ith connected component. Furthermore, if each component is a tree, the number of edges in the ith component is just πi − 1, hence the number of edges in the entire subgraph is ∑i(πi − 1) = V − C, where C = |Π| is the number of components. There are many ways to assign vertices to components, even for a single Π. Each will generate a different structural motif, as long as the reliability condition is satisfied. The smallest number of edges results from a subgraph with the largest number of components. The result is that kmin, the size of the smallest structural motif, is the size of a subgraph with all isolated vertices except for one large tree with υ vertices. kmin can be determined by the constraint
or(22)
Thus , and Nkmin is the number of different trees that can be made with kmin edges.(23)
IV. STRUCTURAL MOTIFS TO FIND EDGE IMPORTANCE
In section IIB we demonstrated how structural motifs for 2 dimensional grid depend on parameters of the two terminal reliability rule i.e. S and T. Here we explain how this fact can effect edge importance based on reliability rules. We computed reliability for two-terminal reliability rule for two case on two dimensional grids in Figure 2. It is clear that grid is more reliable for the reliability rule parameters in the rightı.e. it is more probable to have a path from S to T2 than to T1. This can be seen in Figure 3. Next we remove two out of three edges on the shortest path connecting S and T2 and we compute the reliability for both cases again. We see that reliability of the grid decreases more for the left case as expected. This result suggests to employ structural motifs for finding most important edges in a way that reflects the choices of parameters for the reliability rule.
FIG. 3.
Reliability polynomial for two terminal reliability black curve represents reliability for S − T1, gray curve for S − T2 and dashed curves represent the reliability polynomial after removing two edges.
The reliability polynomials of different graphs may intersect multiple times. [12] This means that, for different values of the damage model’s parameters x⃗, the relative reliability of the two graphs switches signs. If both graphs are subgraphs of the network of interest with the same number of elements removed, then clearly the Birnbaum importance ranking of the elements is different for different parameter values. This is perhaps surprising, but it is an important feature of this approach compared to, for example, a graph statistic that is independent of x⃗.
For example, consider the graph in Figure 4 under S − T reliability for the indicated S and T. First, we write down the reliability for the graph by inspection from its structural motifs. There are three motifs, A ≡ S12T, B ≡ S354T, and C ≡ S364T. The first has size 3; the second and third, size 4. The second and third overlap in two edges, but are disjoint from the first. There are no edges that do not appear in any structural motif. Taken together, this gives:
| (24) |
(Note that R(x) satisfies the constraints ∑ Nk = 1 and ∑ |Nk| = 2m − 1, where m = 3 is the number of structural motifs.) By symmetry, we expect the three edges S1, 12, and 2T to be equally important, and also the pair S3 and 4T, and finally the four edges 35, 36, 54, and 64. Which edge is most important? A moment's thought shows that any edge from the last four is less important than any other edge. The real choice is between S1, which is part of a single structural motif of size 3, and S3, which is part of two structural motifs of size 4. We consider the reliability R1(x) of the graph after removing edge S1, leaving motifs B and C, and the reliability R2(x) after removing edge S3, leaving only motif A. Again by inspection, these are:
| (25) |
| (26) |
By definition, the importance of the edges is IS1(x) ≡ R(x) − R1(x) and IS2(x) ≡ R(x) − R2(x). Hence, the rank of the edges switches if the polynomial IS1(x) − IS2(x) = R2(x) − R1(x) changes sign. In fact, this polynomial has a zero in the interval [0, 1]. That is,
| (27) |
FIG. 4.
Toy graph illustrating the possible x-dependence of an edge’s importance.
V. APPLYING RELIABILITY CONCEPTS TO OTHER NETWORK ANALYSIS PROBLEMS
The representation of the reliability polynomial in terms of structural motifs provides a convenient organizing principle for thinking about general network analysis problems. As one example, consider the tradeoffs between two systems: one with only a few completely redundant reliable subsytems and another with more, but only partially redundant, ones. To study this we consider two extreme cases of overlap. One contains r1 structural motifs of size k1, any two of which differ by only two edges. They are thus built using a total of 2r1 + k1 − 2 edges. The reliability of this combination can be written as:
| (28) |
Using the same number of edges we can construct motifs of size k2 that are completely disjoint. The reliability of this combination of motifs is:
| (29) |
Knowing the reliability for these two cases, we are able to compare the reliability of networks with different configurations of structural motifs of different sizes. As an example we compared the reliability of a network composed of 20 motifs with 18 edges that are different from one another only in two edges with a network of 4 completely disjoint motifs of size 6. Figure 5 shows the reliability curves for these two networks and their difference as a function of x. The analysis shows that the network of disjoint motifs is more reliable for smaller values of x while the opposite is true for larger x values.
FIG. 5.
Comparing the reliability of a network with many overlapping structural motifs with that of a network with a few disjoint motifs.
This approach could also be used to estimate the number of spanning trees in a graph. A spanning tree is a subgraph of the network that includes all vertices [13–15]; the number of spanning trees can be estimated by evaluating the All-Terminal reliability. Another problem that can be addressed using this method is to identify chordless loops of various sizes in a network. A chordless loop is a sequence of vertices with more than three vertices if for all i = 1, ⋯, k there is exactly one link from vertex υi to υi+1 and there is no other link between any two of the vertices in this sequence [16]. Recent studies on ecological networks have discovered the existence of many chordless cycles in these networks [17], therefore enumeration of all chordless cycles can make a significant impact on understanding the structure of these networks. An appropriately-designed reliability rule can be used to count the number of chordless cycles of different sizes.
VI. CONCLUSION AND FUTURE WORK
In this paper we focused on the representation of the reliability polynomial in terms of structural motifs. We have shown that network reliability is simply related to the number of edges in unions of structural motifs Nk (14). Whereas the coefficients Pk of xk(1 − x)E−k are easy to estimate numerically but hard to work with analytically, the coefficients Nk of xk are hard to estimate numerically but easy to work with analytically. To demonstrate this, we have derived closed-form expressions for Nk for several types of graphs. The resulting expressions were confirmed by numerical estimation. We anticipate that this approach can lead us to a measure of edge centrality that relates the importance of an edge to the frequency of its appearance in different structural motifs [18]. While we can use numerical simulation to study specific large, realistic networks – including epidemiology on social networks [19–21] – we can use the notion of structural motifs to understand the differences between networks that are discovered in simulation. We expect this approach to be particularly useful in studying the stability and robustness of interconnected networks [22–27].
ACKNOWLEDGMENTS
This research was partially supported by NSF NetSE Grant CNS-1011769, DTRA R&D Grant HDTRA1-0901-0017, and DTRA CNIMS Grant HDTRA1-07-C-0113. The work described in this paper was funded by the National Institute of General Medical Sciences of the National Institutes of Health under NIH MIDAS Grant 2U01GM070694-09. We would like to acknowledge many useful comments from our external collaborators and members of the Network Dynamics and Simulation Science Laboratory (NDSSL) particularly M. Marathe and A. Vullikanti. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or DTRA.
Appendix A: Inclusion-Exclusion proof of reliability polynomial
We begin by rewriting the expansion
| (A1) |
as the following,
| (A2) |
here xA is short for x|A| where |A| refers to size of the subset A ⊆ E that characterizes the edge-induced subgraph A. (1 − x)E−A, then, has a similar interpretation. We note that (A1) is equivalent to (A2) simply because Rk is counting the number of edge-induced subgraphs A ⊆ E of size k that satisfy the reliability rule r; for these subgraphs r(A) = 1 and the equivalence follows –for those that do not, r(A) = 0.
If the reliability rule is coherent, there is a unique family of minimal subgraphs A ⊆ E such that for every subgraph A′ that is a proper subgraph of either one of them, r(A′) = 0. This means that every subgraph that is accepted by the reliability rule r, contains at least one of these minimal subgraphs (entirely). Therefore, we can see that every reliability rule is in 1-1 correspondence with a certain family of (edge-induced) subgraphs A of E. As a result, we can now define the reliability rule in terms of the very family of minimal subgraphs that were obtained from this rule,
| (A3) |
here 𝒜 represents this family.
We now show that R(x) stands for the probability that at least one of these minimal subgraphs is operating in the sense that all of its corresponding edges function. Put differently, we show that
| (A4) |
where Ai’s are different members of the above family of minimal subgraphs indexed by i; the second line is understood as a shorthand notation for the first.
Let 〈A1 ∨ A2〉 stand for the family of all subgraphs that contain all of the edges in at least one of the subgraphs A1 or A2. Also let denote the family of all subgraphs that lack at least one edge from each of the subgraphs A1 and A2. Next, we define 〈A1 ∧ A2〉 as the family of all subgraphs that contain all of the edges in both A1 and A2. And finally, let represent the family of all subgraphs that contain all of the edges in subgraph A1 and lack at least one edge from A2. We also utilize unambiguous generalizations of these notations at a later inductive step.
Now suppose that the reliability rule r is such that only one minimal subgraph, namely A1, is accepted and, as a result, R(x) becomes
| (A5) |
| (A6) |
where equation (A5) can be easily obtained by induction on the edges in subgraph E − A1. We can take (A6) as the base case for an inductive proof of the equivalence of expressions (A2) and (A4) and proceed to the induction step. However, to break in the notation we introduced above, we also prove the case of two minimal subgraphs, and then proceed to the inductive step. Therefore, suppose that we have two minimal subgraphs A1 and A2 and we want to prove the following,
| (A7) |
The right hand side of this euqality is precisely the expansion of R(x) for a reliability rule that is in 1-1 correspondence with the family of two minimal subgraphs A1 and A2; therefore our claim is proved for this case if the above equality holds. We start from the left hand side and note that
| (A8) |
Now, we make the following observations,
| (A9) |
| (A10) |
From these expansions, it immediately follows for (A8) that
| (A11) |
This proves the base case of a family with only two minimal subgraphs A1 and A2. The inductive step is quite similar,
| (A12) |
| (A13) |
| (A14) |
where r is the reliability rule that corresponds to the family of minimal subgraphs {A1, ⋯, An}. In (A13), represents the family of all subgraphs that contain all of the edges in at least one of the subgraphs A1 through An−1 and lack at least one edge from the subgraph An. Next, stands for the family of all subgraphs that contain all of the edges in subgraph An and lack at least one edge from each and every subgraph A1 through An−1. Lastly, 〈∨i<n Ai ∧ An〉 refers to the family of all subgraphs that contain all of the edges in both An and at least on of the subgraphs A1 through An−1. Equation (A14), then, follows from the fact that the combination of these three possibilities is precisely what 〈∨i<n Ai〉 stands for.
So far, we have established that
| (A15) |
and now we proceed to prove a final equivalent expansion, namely
| (A16) |
| (A17) |
where denotes the number of combinations of l minimal subgraphs whose union contains exactly k edges. To show that the above holds, we appeal to equation (A15) which can be now, by account of inclusion-exclusion principle, expanded as
| (A18) |
this clearly coincides with the expansion (A16).
Appendix B: Additional examples of contribution of structural motifs to R(x)
a. Example 2: Two disjoint structural motifs
Suppose the network G contains exactly two structural motifs, that both have k0 edges, and that no edge is in both. Arguing as in Example 1, for k < 2k0, Rk is simply twice what it is for the case of a single structural motif. But when k = 2k0, the subgraph that consists of the union of the two structural motifs will have been counted twice instead of once. Similarly, for k > 2k0, the number of graphs overcounted is given by assigning 2k0 of the edges and choosing the remaining k − 2k0 in the subgraph from among the remaining E − 2k0 in the graph G. Hence:
| (B1) |
b. Example 3: Three disjoint structural motifs
Suppose the network contains exactly three structural motifs, that all three have k0 edges, and that the three edge sets are disjoint. Again, when k0 ≤ k < 2k0, each motif generates different reliable subgraphs, and for 2k0 = k, three of these subgraphs are counted twice. But in this case, when k reaches 3k0, the subgraph consisting of all three motifs is first included three times (once for each motif), then excluded three times (once for each pair of motifs) with the net result that it must be included again:
| (B2) |
Appendix C: Constraints on coefficients
Several constraints apply to Nk. A union of l motifs can have size k only if all possible unions of l − 1 of the same motifs have size less than k. This leads to a set of constraints of the form
| (C1) |
In addition, a union of l motifs can have size k only if all possible unions of l − 1 and l − 2 of the same motifs have size less than k. For instance, has the following upper bound
| (C2) |
Overall, since all unions of l structural motifs must be included in G, we have
| (C3) |
where f is the total number of structural motifs. Finally, the facts that and imply that
| (C4) |
and
| (C5) |
Contributor Information
Yasamin Khorramzadeh, Network Dynamics and Simulation Science Laboratory, Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia 24061, USA and Department of Physics, Virginia Tech, Blacksburg, Virginia 24061, USA.
Mina Youssef, Network Dynamics and Simulation Science Laboratory, Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia 24061, USA.
Stephen Eubank, Network Dynamics and Simulation Science Laboratory, Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia 24061, USA, Department of Physics, Virginia Tech, Blacksburg, Virginia 24061, USA and Department of Population Health Sciences, Virginia Tech, Blacksburg, Virginia 24061, USA.
Shahir Mowlaei, Department of Physics, Virginia Tech, Blacksburg, Virginia 24061, USA and Network Dynamics and Simulation Science Laboratory, Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia 24061, USA.
References
- 1.Moore E, Shannon C. Journal of the Franklin Institute. 1956;262:191. [Google Scholar]
- 2.Newman M, Barabsi A, Watts D. Princeton University Press; 2006. [Google Scholar]
- 3.Ganesh A, Massouli L, Towsley D. Proceedings of the IEEE INFOCOM. Miami, FL: 2005. p. 05. [Google Scholar]
- 4.Colbourn CJ. The Combinatorics of Network Reliability. Oxford University Press; 1987. [Google Scholar]
- 5.Youssef M, Khorramzadeh Y, Eubank S. Physical Review E. 2013;88 doi: 10.1103/PhysRevE.88.052810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Jane C-C, Shenb W-H, Laihc Y-W. European Journal of Operational Research. 2009;195:427. [Google Scholar]
- 7.Elmallah ES. Networks. 1992;22:369. [Google Scholar]
- 8.Rong-Hong J. Computers and Operations Research. 1993;20:25. [Google Scholar]
- 9.LM S, CP W, IM S, Simon C KJ. Mathematical Biosciences. 2002:180. doi: 10.1016/s0025-5564(02)00131-1. [DOI] [PubMed] [Google Scholar]
- 10.Alon U. Nature Reviews Genetics. 2007;8:450–461. doi: 10.1038/nrg2102. [DOI] [PubMed] [Google Scholar]
- 11.Kirchhoff G. Ann. Phys. Chem. 1847;72:497. [Google Scholar]
- 12.Brown J, Koc Y, Kooij R. Proceedings of the 3rd International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT), 2011. Budapest, Hungary: 2011. Oct 5–7, p. 2011. [Google Scholar]
- 13.Chaiken S. SIAM J. ALG. DISC. METH. 1982;3 [Google Scholar]
- 14.Cvetkovi DM, Doob M, Sachs H. Spectra of Graphs: Theory and Applications. Wiley: 1998. [Google Scholar]
- 15.Buekenhout F, Parker M. Disc. Math. 1998;186:69. [Google Scholar]
- 16.Sokhn N, Baltensperger R, Bersier L-F, Hennebert J, Ultes-Nitschey U. Identification of Chordless Cycles in Ecological Networks. Springer; 2013. pp. 316–324. [Google Scholar]
- 17.Huxham M, Beaney S, Raddaelli D. 1996;76:284. [Google Scholar]
- 18.Eubank S, Youssef M, Khorramzadeh Y. Proceedings of the second workshop on complex networks and their applications, SITIS 2013. Tokyo, Japan: 2013. [Google Scholar]
- 19.Eubank S, Guclu H, Kumar VSA, Marathe M, Srinivasan A, Toroczkai Z, Wang N. Nature. 2004;429(6988):180. doi: 10.1038/nature02541. [DOI] [PubMed] [Google Scholar]
- 20.Parikh N, Youssef M, Swarup S, Eubank S. Scientific Reports. 2013;3 doi: 10.1038/srep03152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Valle SYD, Hyman JM, Hethcote HW, Eubank SG. Social Network. 2007;29:539–554. [Google Scholar]
- 22.Gao J, Buldyrev SV, Stanley HE, Havlin S. Nat. Phys. 2012;8:40–48. [Google Scholar]
- 23.Buldyrev SV, Parshani R, Paul G, Stanley HE, Havlin S. Nature. 2009;464:1025–1028. doi: 10.1038/nature08932. [DOI] [PubMed] [Google Scholar]
- 24.Trajanovski S, Martn-Hernndez J, Winterbach W, Van Mieghem P. Journal of Complex Networks. 2013;1:44. [Google Scholar]
- 25.Van Mieghem P. Computer Communications. 2012;35:1494. [Google Scholar]
- 26.Youssef M, Kooij R, Scoglio C. Journal of Computational Science. 2011;2:286. [Google Scholar]
- 27.Ellens W, Kooij R. 2013 arXiv:1311.5064 [cs.DM] [Google Scholar]





