Abstract
Current network models assume one type of links to define the relations between the network entities. However, many real networks can only be correctly described using two different types of relations. Connectivity links that enable the nodes to function cooperatively as a network and dependency links that bind the failure of one network element to the failure of other network elements. Here we present an analytical framework for studying the robustness of networks that include both connectivity and dependency links. We show that a synergy exists between the failure of connectivity and dependency links that leads to an iterative process of cascading failures that has a devastating effect on the network stability. We present exact analytical results for the dramatic change in the network behavior when introducing dependency links. For a high density of dependency links, the network disintegrates in a form of a first-order phase transition, whereas for a low density of dependency links, the network disintegrates in a second-order transition. Moreover, opposed to networks containing only connectivity links where a broader degree distribution results in a more robust network, when both types of links are present a broad degree distribution leads to higher vulnerability.
Keywords: percolation, critical phenomena, complex networks, scale free networks, Erdős–Rényi networks
Many friendships between individuals in a social network, numerous business connections in a financial network, or multiple cables between Internet routers are all examples of networks with a high density of connectivity links (1–11). Such networks are regarded as very stable to attacks because, even after a failure of many nodes, the network still remains connected. In contrast, dependencies between the network nodes endanger the network stability because the failure of several nodes may lead to the immediate failure of many others. As an example, consider a financial network: Each company has trading and sales connections with other companies (connectivity links). These connections enable the companies to interact with each other and function together as a global financial market. But there are also dependencies relations between companies; several companies that belong to the same owner depend on one another. If one company fails, the owner might not be able to finance the other companies that will fail too. Such dependencies jeopardize the network stability and are the possible cause of many major financial crises. Another example is an online social network (Facebook or Twitter): Each individual communicates with his friends (connectivity links), thus forming a social network through which information and rumors can spread. However, many individuals will only participate in a social network if other individuals with common interests also participate (dependency links) in that social network.
The effect of failing nodes on the network stability has been studied separately for networks containing only connectivity links (12–17) and for networks containing only dependency links (18–22). The fundamental difference between connectivity and dependency links is that for dependency links when a node fails, his direct neighbors also fail (with some probability), but for connectivity links a node fails only when it (or the cluster it is in) becomes completely disconnected from the network. Percolation theory is a major tool for studying the stability of networks connected only by connectivity links. In a percolation process on a network of size N, a fraction 1 - p of the network nodes are removed. If the remaining fraction of nodes, p, is larger than a critical value (p > pc), a spanning cluster connecting order N nodes exists; if, however, p < pc, the network collapses into small clusters. At p = pc the network undergoes a second-order phase transition (12–17).
Previous studies of networks containing dependencies can be divided into two categories: (i) Failures due to overloads in networks containing a flow of a physical quantity. For example, disturbances in power transmission systems or congestion instabilities in transportation networks and Internet traffic (18–21). These models show that when one node is overloaded and the traffic cannot be routed through it, choosing alternative paths will cause other nodes to also become overloaded. This process may develop into a series of cascading failures that can disable the entire network. (ii) Models that are based on local dependencies, such as decision making of interacting agents (22). In these models the state of a node depends on the state of its neighbors and therefore a failing node will cause its neighbors to also fail and so on.
Results
Here we present an analytical framework for studying the robustness of networks that include both connectivity and dependency links. When nodes fail in a network containing both types of links, two different processes occur. (i) Connectivity links are disconnected, causing other nodes and clusters to disconnect from the network (percolation process). (ii) Failing nodes cause other nodes that depend on them to also fail even though they are still connected to the network via connectivity links (dependency process). We show that the synergy between the percolation process and the dependency process leads to a cascade of failures that can fragment the entire network (Fig. 1). We find that the density of dependency links, q, plays a key role in determining the robustness of such networks. For networks containing connectivity links and a high density of dependency links, an initial failure of even a small fraction of the network nodes disintegrates the network in a form of a first-order phase transition. If, however, the fraction of dependency links is reduced below a certain threshold, qc, the network disintegrates in a form of a second-order phase transition.
The cascading process leading to a first-order transition exists for a wide range of topologies including lattices, Erdős–Rényi (ER), and scale free (SF) networks, indicating it is a general property of many networks (Fig. 2A). Comparing networks with both connectivity and dependency links but with different topologies, reveals a relation between topology and the robustness to random failure: Networks with a broader degree distribution of connectivity links are more vulnerable to random failure in the presence of dependency links. This relation is opposed to the known result for networks containing only connectivity links, where networks with a broader degree distribution are significantly more robust to random failures. Fig. 2 A and B show that, when comparing ER and SF networks with the same average degree, SF networks with a high density of dependent nodes are more vulnerable to random failures then ER networks.
Formalism
Next we present an analytical approach for studying the robustness to random failure of networks containing the two types of links. Without loss of generality, we define a model in which only pairs of nodes depend on one another, forming dependency groups of size two. When the dependency group contains more than two nodes, the cascade effect is even more extreme and the transition from the regular second-order percolation transition to a first-order transition occurs even for more stable networks (SI Text). Therefore, the properties we present for the case of dependency groups of size two are also valid in the general case of larger dependency groups (see SI). The model is defined as follows: A network containing N nodes is randomly connected by connectivity links with a degree distribution P(k) and an average degree 〈k〉. In addition, pairs of nodes are connected by dependency links as follows: (i) A node can only have one dependency link. (ii) If node i depends on node j, then node j depends on node i. For this model, we denote by q the fraction of nodes that have dependencies.
We start by presenting the formalism describing the iterative process of cascading failures for the simple case of q = 1 (see SI). Each iteration (step) includes failures that are the result of the percolation process and failures that are the result of the dependency process. The goal of the formalism is to describe the accumulated process up to step n as an equivalent single random removal, rn, from the original network. The remaining fraction of nodes after such a removal is βn = 1 - rn. The new network, after the removal of a fraction rn of the nodes, has a giant component consisting of a fraction g(βn) of the remaining nodes which is a fraction αn+1 = βng(βn) from the original network. The iterative process is initiated by the removal of a fraction r0 = 1 - p of the network nodes. The remaining part of the network is β0 = p. This initial removal will cause additional nodes to disconnect from the giant cluster due to the percolation process. The fraction of nodes that remain functional after the percolation process is α1 = β0g(β0). Each node from the nonfunctional part (1 - α1) will cause the node that depends on it to also fail (dependency process). The probability that a node depending on a nonfunctional node has survived until now is α1. Therefore the fraction of new nodes that will fail due to dependencies is δ1 = (1 - α1)α1. The accumulated failure, including the initial failure of 1 - β0 and δ1, is equivalent to a random removal of r1 = (1 - β0) + (1 - α1)β0 from the original network (see SI Text). The remaining fraction of nodes after the new removal is therefore . The remaining functional part of the giant component is now α2 = β1g(β1). To calculate the fraction δ2 of nodes that are disconnected due to dependencies at the second stage, recall that at the previous stage a fraction δ1 failed from α1. The remaining part of α1 was therefore . Thus , which is equivalent to a random removal of from the original network. The remaining fraction of nodes is . Following this approach, we can construct the sequence, βn, of the remaining fraction of nodes in the network after each iteration.
Following a similar approach for the general case of 0 ≤ q ≤ 1 (see SI Text) yields the sequence βn = qp2g(βn-1) + p(1 - q). Given, βn, the fraction of nodes in the giant cluster is αn+1 = βng(βn) = p{1 - q[1 - pg(βn)]}g(βn). Fig. 3A compares theory and simulations of αn, for the case of an ER network.
To determine the state of the system at the end of the cascade process, we analyze βn at the limit of n → ∞. This limit must satisfy the equation βn = βn+1 because, at the end of the process, the cluster is not further fragmented. Denoting βn = βn-1 = x we arrive at the equation
[1] |
This equation can be solved graphically as the intersection of a straight line y = x and a curve y = p2qg(x) + p(1 - q). When p is small enough, the curve increases very slowly and does not intersect with the straight line (except at the origin which corresponds to the trivial solution). The critical case for which the nontrivial solution emerges, corresponds to the case when the line touches the curve at a single point x and in this point we have the condition , which together with Eq. 1 gives the solution for the critical fraction of failing nodes that will fragment the network and the critical size of the giant component.
Analytical Solution
An exact analytical solution can be obtained using the apparatus of generating functions. As in refs. 23–25, we introduce the generating function of the degree distribution . Analogously, we also introduce the generating function of the underlying branching process, . A random removal of a fraction 1 - p of nodes will change the degree distribution of the remaining nodes, so the generating function of the new distribution is equal to the generating function of the original distribution with the argument ξ replaced by 1 - p(1 - ξ) (23). The fraction of nodes that belong to the giant component after the removal of 1 - p nodes is g(p) = 1 - G0[1 - p(1 - f)], where f = f(p) satisfies a transcendental equation f = G1[1 - p(1 - f)] (25).
In the case of an ER network with a Poisson degree distribution (12–14), the problem can be solved explicitly because G1(ξ) = G0(ξ) = exp(〈k〉(ξ - 1)). Accordingly, g(x) = 1 - f and f = exp[〈k〉x(f - 1)] where x is defined in Eq. 1. The fraction of nodes in the giant component at the end of the cascade process is then given by α∞ = β∞g(β∞) = p{1 - q[1 - p(1 - f)]}(1 - f). The equation f = f(q,p,k) has a trivial solution at f = 1. The nontrivial solutions of f can be presented by the crossing points of the two curves in a system of equations that are given with respect to x and f:
[2] |
For the trivial solution at f = 1, the size of the giant component is zero (α∞ = 0). For the solutions that are the crossing points of the two curves, f < 1, i.e., α∞ > 0. Thus, the case where the curves tangentially intersect corresponds to a first-order phase transition point (p = pI) where α∞ abruptly jumps from a finite size above pI to zero below pI (26). The condition for the first-order transition is that the derivatives of the equations of system [2] with respect to f are equal. Together with system [2] this yields
[3] |
However, for a solution of system [2] where f → 1 (α∞ = 0) there is no jump in the size of the giant cluster and thus the transition is a second-order transition (p = pII). Solving system [2] for f → 1 yields
[4] |
The analysis of Eqs. 3 and 4 shows that the first-order transition at p = pI occurs for networks with a high density of dependency links (q > qc), whereas the second-order transition at p = pII occurs for networks with a low density of dependency links (q < qc). This behavior is confirmed by Fig. 4A which compares theory and simulations for pII(q) and pI(q). The critical value of qc (and pc) for which the phase transition changes from first order to second order is obtained when the conditions for both the first- and second-order transitions are satisfied simultaneously. Applying both conditions, we obtain
[5] |
Simulations
Next, we support our analytical results by simulations. Finding the transition point via simulations is always a difficult task that requires high precision. In the case of the first-order transition, we are able to calculate the transition point with good precision by identifying the special behavior characterizing the number of iterations (NOI) in the cascading process. At the first-order transition point, the NOI scales as N1/4 (see SI Text), which is also demonstrated by the long plateau in Fig. 3A. This number sharply drops as the distance from the transition point is increased, because away from the transition point, pI, the NOI scales as log N/(p - pI) (see SI Text). Thus, plotting the NOI as a function of p provides a useful and precise method for identifying the transition point pI at the first-order region. For the second-order region, a similar behavior exists for the size of the second largest cluster which also reaches its maximum at the transition point (17). Fig. 3B presents simulation results of the NOI. The transition point, pI, can easily be identified by the sharp peak characterizing the transition point. Fig. 3B, Inset, presents a similar behavior for the size of the second largest cluster near the second-order transition point, pII. Fig. 4A compares simulation results and theory for the transition points pI(q) at the first-order region (solid line) and pII(q) at the second-order region (dashed line). The transition points were obtained using the NOI and the second cluster size techniques, respectively. The theoretical results for different values of q and 〈k〉 were calculated by solving system [2] together with Eqs. 3 or 4, respectively. Fig. 2B compares the values of the transition points pI(q) and pII(q), respectively, between SF and ER networks with the same average degree. For networks with a small fraction of dependencies (second-order transition region), SF networks are more robust to random failure (lower pII). For networks with a high fraction of dependencies (first-order transition region), SF networks become more vulnerable (higher pI). Fig. 4B compares simulation and theory for α∞, the fraction of nodes in the giant cluster at the transition point. Above qc, α∞ is finite characterizing a first-order transition, whereas below qc, α∞ is zero as expected for a second-order transition.
Discussion
Here we argue that, in order to properly model real networks, two different types of links are needed: connectivity links and dependency links. We present an analytical formalism for a general network model including both connectivity and dependency links. According to our model, networks with high density of dependency links are extremely vulnerable to random failure and when a critical fraction of nodes fail the network disintegrates in a form of a first-order phase transition. Networks with a low density of dependency links are significantly more robust and disintegrate in a form of a second-order phase transition. In the limit of zero fraction of dependency links, our general solution yields the known results for networks with only connectivity links. Our framework also provides an analytical solution for the critical density of dependency links for which the phase transition changes from a first-order to a second-order percolation transition. We develop a powerful simulation method to accurately estimate the transition point, based on the unique behavior of the NOI (number of iterations in the iterative process of cascading failures) that diverges at the first-order transition point. Using this method, we are able to provide very accurate simulation results supporting our analytical results.
Supplementary Material
Acknowledgments.
We thank the European Epidemic Forecast Infrastructure Framework Project, the Israel Science Foundation, the Office of Naval Research, and the Defence Threat Reduction Agency for financial support. S.V.B. thanks the Office of the Academic Affairs of Yeshiva University for funding the Yeshiva University high-performance computer cluster and acknowledges the partial support of this research through the Dr. Bernard W. Gamson Computational Science Center at Yeshiva College.
Footnotes
The authors declare no conflict of interest.
*This Direct Submission article had a prearranged editor.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1008404108/-/DCSupplemental.
References
- 1.Watts D-J, Strogatz S-H. Collective dynamics of ‘small-world’ networks. Nature (London) 1998;393:440–442. doi: 10.1038/30918. [DOI] [PubMed] [Google Scholar]
- 2.Barabási A-L, Albert R. Emergence of scaling in random networks. Science. 1999;286:509–512. doi: 10.1126/science.286.5439.509. [DOI] [PubMed] [Google Scholar]
- 3.Albert R, Barabási A-L. Statistical mechanics of complex networks. Rev Mod Phys. 2002;74:47–97. [Google Scholar]
- 4.Pastor-Satorras R, Vespignani A. Evolution and Structure of the Internet: A Statistical Physics Approach. Cambridge, UK: Cambridge Univ Press; 2006. pp. 69–208. [Google Scholar]
- 5.Dorogovtsev S-N, Mendes J-F-F. Evolution of Networks: from Biological Nets to the Internet and WWW. New York: Oxford Univ Press; 2003. pp. 1–219. [Google Scholar]
- 6.Barrat A, Barthelemy M, Vespignani A. Dynamical Processes on Complex Networks. Cambridge, UK: Cambridge Univ Press; 2009. pp. 24–125. [Google Scholar]
- 7.Rinaldi SM, Peerenboom JP, Kelley TK. Identifying, understanding and analyzing critical infrastructure interdependencies. IEEE Contr Syst Mag. 2001;21:11–25. [Google Scholar]
- 8.Vespignani A. Predicting the behavior of techno-social systems. Science. 2009:425–428. doi: 10.1126/science.1171990. [DOI] [PubMed] [Google Scholar]
- 9.Newman M-E-J, Barabási A-L, Watts D-J. The Structure and Dynamics of Networks. Princeton: Princeton Univ Press; 2006. [Google Scholar]
- 10.Newman M-E-J. The structure and function of complex networks. Phys Rev E. 2001;64:026118. [Google Scholar]
- 11.Cohen R, Havlin S. Cambridge, UK: Cambridge Univ Press; Complex Networks: Structure, Robustness and Function; pp. 97–142. [Google Scholar]
- 12.Erdős P, Rényi A. On random graphs I. Publ Math. 1959;6:290–297. [Google Scholar]
- 13.Erdős P, Rényi A. On the evolution of random graphs. Inst Hung Acad Sci. 1960;5:17–61. [Google Scholar]
- 14.Bollobás B. Random Graphs. Orlando, FL: Academic; 1985. pp. 1–125. [Google Scholar]
- 15.Cohen R, Erez K, Ben-Avraham D, Havlin S. Resilience of the Internet to random breakdown. Phys Rev Lett. 2000;85:4626–4628. doi: 10.1103/PhysRevLett.85.4626. [DOI] [PubMed] [Google Scholar]
- 16.Callaway D-S, Newman M-E-J, Strogatz S-H, Watts D-J. Network robustness and fragility: Percolation on random graphs. Phys Rev Lett. 2000;85:5468–5471. doi: 10.1103/PhysRevLett.85.5468. [DOI] [PubMed] [Google Scholar]
- 17.Bunde A, Havlin S. Fractals and Disordered Systems. 2nd Ed. New York: Springer; 1996. [Google Scholar]
- 18.Motter A-E, Lai Y-C. Cascade-based attacks on complex networks. Phys Rev E. 2002;66:065102. doi: 10.1103/PhysRevE.66.065102. [DOI] [PubMed] [Google Scholar]
- 19.Motter A-E. Cascade control and defense in complex networks. Phys Rev Lett. 2004;93:098701. doi: 10.1103/PhysRevLett.93.098701. [DOI] [PubMed] [Google Scholar]
- 20.Sachtjen M-L, Carreras B-A, Lynch V-E. Disturbances in a power transmission system. Phys Rev E. 2000;61:4877. doi: 10.1103/physreve.61.4877. [DOI] [PubMed] [Google Scholar]
- 21.Moreno Y, Pastor-Satorras R, Vázquez A, Vespignani A. Critical load and congestion instabilities in scale-free networks. Europhys Lett. 2003;62:292. [Google Scholar]
- 22.Watts D-J. A simple model of global cascades on random networks. Proc Natl Acad Sci USA. 2002;99:5766. doi: 10.1073/pnas.082090499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Newman M-E-J. Spread of epidemic disease on networks. Phys Rev E. 2002;66:016128. doi: 10.1103/PhysRevE.66.016128. [DOI] [PubMed] [Google Scholar]
- 24.Buldyrev S-V, Parshani R, Paul G, Stanley H-E, Havlin S. Catastrophic cascade of failures in interdependent networks. Nature. 2010;464:1025–1028. doi: 10.1038/nature08932. [DOI] [PubMed] [Google Scholar]
- 25.Shao J, Buldyrev S-V, Braunstein L-A, Havlin S, Stanley H-E. Structure of shells in complex networks. Phys Rev E. 2009;80:036105. doi: 10.1103/PhysRevE.80.036105. [DOI] [PubMed] [Google Scholar]
- 26.Achlioptas D, D’Souza R-M, Spencer J. Explosive percolation in random networks. Science. 2009;323:1453–1455. doi: 10.1126/science.1167782. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.