Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2010 Apr 12;107(17):7640–7645. doi: 10.1073/pnas.0912983107

Multifractal network generator

Gergely Palla a, László Lovász b, Tamás Vicsek a,c,1
PMCID: PMC2867894  PMID: 20385847

Abstract

We introduce a new approach to constructing networks with realistic features. Our method, in spite of its conceptual simplicity (it has only two parameters) is capable of generating a wide variety of network types with prescribed statistical properties, e.g., with degree or clustering coefficient distributions of various, very different forms. In turn, these graphs can be used to test hypotheses or as models of actual data. The method is based on a mapping between suitably chosen singular measures defined on the unit square and sparse infinite networks. Such a mapping has the great potential of allowing for graph theoretical results for a variety of network topologies. The main idea of our approach is to go to the infinite limit of the singular measure and the size of the corresponding graph simultaneously. A very unique feature of this construction is that with the increasing system size the generated graphs become topologically more structured. We present analytic expressions derived from the parameters of the—to be iterated—initial generating measure for such major characteristics of graphs as their degree, clustering coefficient, and assortativity coefficient distributions. The optimal parameters of the generating measure are determined from a simple simulated annealing process. Thus, the present work provides a tool for researchers from a variety of fields (such as biology, computer science, biology, or complex systems) enabling them to create a versatile model of their network data.

Keywords: complex networks, sparse graphs, singular measures


As our methods of studying the features of our environment are becoming more and more sophisticated, we also learn to appreciate the complexity of the world surrounding us. The corresponding systems (including natural, social, and technological phenomena) are made of many units, each having an important role from the suitable functioning of the whole. An increasingly popular way of grabbing the intricate structure behind such complex systems is a network or graph representation in which the nodes correspond to the units and the edges to the connections between the units of the original system (13). It has turned out that networks corresponding to realistic systems can be highly nontrivial, characterized by a low average distance combined with a high average clustering coefficient (4), anomalous degree distributions (5, 6), and an intricate modular structure (79). A better understanding of these graphs is expected and, in many cases has been shown, to be efficient in designing and controlling complex systems ranging from power lines to disease networks (10).

As increasingly complex graphs are considered, a need for a better representation of the graphs themselves has arisen as well. Sophisticated visualization techniques emerged (11), and a series of parameters have been introduced over the years (13). Very recently one of us (L.L.) proved that, in the infinite network size limit, a dense graph’s adjacency matrix can be well represented by a continuous function W(x,y) on the unit square (12, 13). A similar approach was introduced by Bollobás et al. (14, 15) and used to obtain convergence and phase transition results for inhomogeneous random (including sparse) graphs. This two-variable symmetric function (which can have a very simple form for a variety of interesting graphs and was supposed to be either continuous or almost everywhere continuous) predicts the probability whether two nodes are connected or not. (The nontrivial relations between the limiting objects of graph sequences and 2D functions are discussed in more detail in SI Text). In this paper we develop the above ideas further in order to obtain simple and analytically treatable models of random graphs with a level of structuredness growing together with their size. Thus, we make an important conceptual step forward by acknowledging a rather natural expectation: The internal organization of larger networks is more complex than those of the smallest ones (e.g., the social contacts in large universities are much more structured than in an elementary school, which is in part because of the underlying hierarchical organization of almost every large networks we know of).

In a sense, using a function to represent a network is very much like using a model to describe a network. Models in the context of networks have been playing a crucial role because they are ideal from the point of view of singling out the simplest aspects of complex structures and, thus, are extremely useful in understanding the underlying principles. Models are also very useful from the point of testing hypotheses about measured data. Indeed, many important and successful models have been proposed over the past 10 years to interpret the various aspects of real world networks. However, a considerable limitation of these models is that they typically explain a particular aspect of the network (clustering, a given degree distribution, etc.), and for each new—to be explained—feature a new model had to be constructed.

In the recent years, generating graphs with desired properties has attracted great interest. A few remarkable methods have been proposed, including various hidden variable models (16, 17). The basic idea of this general framework is to characterize each node by a hidden variable h drawn from a given probability distribution ρ(h) and link the pair of nodes I and J with a probability given by a symmetric function r(hI,hJ). By appropriate choice of ρ(h) and r(h,h), one can generate random networks with an a priori specified degree distribution and degree correlation structure. Furthermore, nonequilibrium growing networks can be mapped to networks with hidden variables depending on the age of the nodes. The hidden variable methods are also related to the systematic study of the entropy of randomized network ensembles with fixed degree distribution, degree correlations, or community structure by Bianconi (18) (e.g., the ensemble of networks with a given degree sequence corresponds to a hidden variable model where the hidden variables are given by Lagrangian multipliers of the node connectivities).

Another systematic approach for analyzing network topologies was introduced by Mahadevan et al., by using the dK series of probability distributions (19). These distributions specify all degree correlations within d-sized subgraphs of a given graph, with 0K reproducing the average degree, 1K the degree distribution, 2K the joint degree distribution, etc. Several methods for generating random graphs having a predefined finite dK series were also given in (19) (with typically d ≤ 3). Most important of these techniques is based on rewiring of the links, because this turned out to be the only efficient tool in practice.

The concept of characterizing a network via the frequencies of given subgraphs (forming a series with increasing size) is at the heart of the exponential random graph model as well (2022). In this approach a possible subgraph g (e.g., a pair of connected nodes, a “two-star” of a pair of links sharing a node, a triangle, etc.) is assigned a parameter ηg related to the frequency of the subgraph, and the probability of a given network configuration is assumed to be proportional to Inline graphic, where ng denotes the number of subgraphs occurring in the network. The η parameters for a studied network are usually estimated by using maximal likelihood techniques. For the particular case of the two-star model a nonperturbative analytical solution was derived by Park and Newman (23), showing an interesting phase transition between high and low density phases.

The dK-series method and the exponential random graph model can be viewed as bottom-up approaches: In the first-order approximation of the studied network we concentrate on the frequency of the most simple object (an edge); when this is reproduced correctly we move on to a slightly more complex subgraph, and so on. The series of subgraphs from small/simple to large/complex are ordered into a sort of hierarchy. However, in a realistic scenario we stop in the above process at a relatively early stage, because on one hand most important properties of the networks are usually reproduced already, on the other hand including “higher order” subgraphs becomes computationally very expensive.

Hierarchy, self-similarity, and fractality are very important concepts when describing complex systems in nature and society and turned out to be relevant in network theory as well (2426). Very recently, two important network models have been introduced that are intrinsically hierarchical yet show general features. Avetisov et al. proposed in ref. 27 the construction of random graphs having an adjacency matrix equivalent to a p-adic randomized locally constant Parisi matrix, one of the key objects in the theory of spin glasses (28) This symmetric matrix has a hierarchic structure, and its elements are Bernoulli distributed random variables (taking the value of 1 with probability qγ and the value 0 with probability 1 - qγ, where γ counts the hierarchy levels). An interesting feature of this construction is that any subgraph belonging to a specific hierarchy level γ is equivalent to an Erdös–Rényi random graph (29); nevertheless, the overall degree distribution can be scale-free.

The Kronecker-graph approach introduced by Leskovec et al. is centered around hierarchic adjacency matrices as well; however, in this case the self-similar structure is achieved by Kronecker multiplication as follows (30). Starting from a small adjacency matrix A1 (where Inline graphic if nodes i and j are linked; otherwise, Inline graphic), at every iteration we replace each current matrix element by A1 multiplied by the matrix element itself, hence enlarging the matrix by a factor given by the size of A1. In the stochastic version of this model the elements of A1 are replaced by real numbers between 0 and 1, and at the final stage of the multiplication process we draw a link for each pair of nodes with a probability given by the corresponding element in the obtained stochastic adjacency matrix. According to the results, the Kronecker graphs obtained in this approach can mimic several properties of real networks (heavy tails in the degree distribution, and in the eigenvalue spectra, small diameter, densification power law) simultaneously. Furthermore, in ref. 31 Leskovec and Faloutsos presented a scalable method for fitting real networks with Kronecker graphs.

We note that link probability matrices similar to the previous examples can be also used for community detection as pointed out by Nepusz et al. in refs. 32 and 33. In their approach [inspired by Szemerédy’s regularity lemma (34)] the diagonal elements of the matrix give the link density inside the corresponding communities, whereas the off-diagonal elements correspond to the link probabilities between the groups.

In summary, a plausible classification of the emerging graph generating procedures/approaches involves the following types: generating graphs as (i) stochastic growth processes (e.g., ref. 1), (ii) as a process of connecting or rewiring nodes according to prescribed probabilities (4, 16, 17, 3538), (iii) accepting varying configurations with a prescribed probability (1922), (iv) by deterministically or stochastically obtaining its adjacency matrix from simpler initial matrix (27, 30, 31), and (v) from a function W(x,y) on the unit square providing a value for the probabilities of node pair connections (1214).

Rewiring and the related construction techniques do not provide a clue how a complex network emerges from a simple rule. On the other hand, generating a graph from a fixed function/measure does not result in networks with increasing structuredness. Our approach can be considered as a combination of (iv) and (v) (thus, combining their advantages), assuming that in the infinitely large network limit the right representation is a singular measure (nowhere continuous function).

Thus, here we discuss a method to construct random graphs inheriting features from real networks. The main idea of our approach is to replace W(x,y) by a fractal (singular) measure (also called multifractal) and go to the limit of infinitely fine resolution of the measure and the infinitely large size of the generated graph simultaneously. Consequently, the structuredness of the obtained network is increasing with the size. Another advantage of this approach is that the statistical features characterizing the network topology, e.g., the degree distribution, clustering coefficient, degree correlations, etc., can be simply calculated analytically. For generating networks with a given prescribed statistical feature (e.g., a given degree distribution), the optimal parameters of the generating measure defining the multifractal can be determined from a simple simulated annealing process.

Model

The network generation has three main stages in our approach: We start by defining a generating measure on the unit square, next we transform the generating measure through a couple of iterations into a link probability measure, and finally, we draw links between the nodes by using the link probability measure. The generating measure is defined as follows. We identically divide both the x and the y axis of the unit square to m (not necessarily equal) intervals, splitting it to m2 rectangles, and assign a probability pij to each rectangle (i,j∈[1,m] denote the row and column indices). The probabilities must be normalized Inline graphic and symmetric pij = pji. Next, the link probability measure is obtained by recursively multiplying each rectangle with the generating measure k times (which is equivalent to taking the kth tensorial product of the generating measure). The above procedure is in complete analogy with the standard process of generating a multifractal, resulting in m2k rectangles, each associated with a linking probability pij(k) equivalent to a product of k factors from the original generating pij given as

graphic file with name pnas.0912983107eq26.jpg [1]

In our convention k = 1 stands for the generating measure; thus, a link probability measure at k = 1 is equivalent to the generating measure itself. The indices of the factors in [1] are given by

graphic file with name pnas.0912983107eq27.jpg [2]

where ⌊a/b⌋ denotes the quotient (integer part) of a/b, the term Inline graphic stands for subsequent calculation of the remainder after the division by mk-r, and an analogous formula can be written for the indices jq as well. (For q = 1, Eq. 2 simplifies to iq = ⌊(i - 1)/mk-1⌋+1; in SI Text we show the explicit form of the above expressions in a specific example.)

Finally, we distribute N points independently, uniformly at random on the [0,1] interval, and link each pair with a probability given by the pij(k) at the given coordinates. The above process of network generation is illustrated in Fig. 1, whereas in Fig. 2. we show a small network obtained with this method. We note that the above method can be viewed as a special case of the hidden variable models, where the coordinates of the nodes correspond to the hidden variables and the link probability measure pij(k) to the r(h,h) function. However, when k is increased, the complexity of this r(h,h) function is increasing exponentially.

Fig. 1.

Fig. 1.

Schematic illustration of the multifractal graph generator. (A) The construction of the link probability measure. We start from a symmetric generating measure on the unit square defined by a set of probabilities pij = pji associated to m × m rectangles (shown on the left). In the example shown here m = 2, the length of the intervals defining the rectangles is given by l1 and l2, respectively, and the magnitude of the of the probabilities is indicated by both the height and the color of the corresponding boxes. The generating measure is iterated by recursively multiplying each box with the generating measure itself as shown in the center and on the right, yielding mk × mk boxes at iteration k. The variance of the height of the boxes (corresponding to the probabilities associated to the rectangles) becomes larger at each step, producing a surface that is getting rougher and rougher; meanwhile, the symmetry and the self-similar nature of the multifractal are preserved. (B) Drawing linking probabilities from the obtained measure. We assign random coordinates in the unit interval to the nodes in the graph and link each node pair I,J with a probability given by the probability measure at the corresponding coordinates.

Fig. 2.

Fig. 2.

A small network generated with the multifractal network generator. (A) The generating measure (on the left) and the link probability measure (on the right) after k = 3 iterations. (Note that k = 1 corresponds to the generating measure in our convention.) (B) A network with 500 nodes generated from the link probability measure. The colors of the nodes were chosen as follows. Each row in the final linking probability measure was assigned a different color, and the nodes were colored according to their position in the link probability measure. (Thus, nodes falling into the same row have the same color.)

Our construction could be made more general by replacing the “standard” multifractal with the kth tensorial product of a symmetric 2D function 0 ≤ W(x,y) ≤ 1 defined on the unit square. Although the resulting Wk(x1,…,xk,y1,…,yk) = W(x1,y1)…W(xk,yk) function is [0,1]2k → [0,1] instead of [0,1]2 → [0,1], with the help of a measure preserving bijection between [0,1] and [0,1]k it could be used to generate random graphs in the same manner as with our multifractal.

The diversity of the linking probabilities pij(k) (and, correspondingly, the structuredness of the generated graph) is increasing with the number of iterations, just like in the case of a standard multifractal. In order to keep the generated networks sparse, we must ensure that the average degree 〈d〉 of the nodes does not change between subsequent iterations, which can be achieved by an appropriate choice of the number of nodes as a function of k, using the following relation:

graphic file with name pnas.0912983107eq28.jpg [3]

where aij(k) denotes the area of the box i,j at iteration k. In the special case of equal-sized boxes aij(k) = m-2k, and because of the normalization of the linking probabilities the above expression simplifies to 〈d〉 = Nm-2k. Thus, to keep the average degree constant when increasing the number of iterations for a given generating measure, the number of nodes has to be increased exponentially with k.

Statistical Methods

One of the main advantages of our model is that the statistical properties characterizing the network topology can be calculated analytically. An important observation concerning our model is that nodes having coordinates falling into the same row (column) of the link probability measure are statistically identical, which means that, e.g., the expected degree or clustering coefficient of the nodes in a given row is the same. Consequently, the distributions related to the topology are composed of subdistributions associated with the individual rows.

Let us concentrate on the degree distribution first, which can be expressed as

graphic file with name pnas.0912983107eq29.jpg [4]

where Inline graphic denotes the subdistribution of the nodes in row i and li(k) corresponds to the width of the row (giving the ratio of nodes in row i compared to the number of total nodes). These Inline graphic can be calculated by using the generating function formalism as shown in the Appendix, resulting in

graphic file with name pnas.0912983107eq30.jpg [5]

where Inline graphic denotes the average degree of nodes in row i. Eqs. 45 are analogous to the results for the degree distribution in a general hidden variable model derived in ref. 17. Even though the degree distribution of nodes in a given row follows a Poisson distribution according to [5], the overall degree distribution of the generated graph can show nontrivial features, as will be demonstrated later.

Similarly to the degree distribution, the clustering coefficient and the average nearest neighbors degree can be calculated analytically as well in a rather simple way (as given in SI Text). According to Fig. 3 BD, the analytical results for the quantities above are in very good agreement with the empirical distributions (obtained by generating a number of sample graphs for the chosen parameters). The use of analytic formulas instead of empirical distributions can significantly speed up the optimization of the generating measure with respect to some prescribed target property.

Fig. 3.

Fig. 3.

Comparison between the analytical and empirical result for a randomly chosen generating measure. (A) The generating measure (Left) and the link probability measure (Right) after k = 4 iterations. The number of rows in the generating measure was set to m = 3 with equal box lengths li = 1/3; the corresponding initial linking probabilities pij were chosen randomly. (B) The degree distribution obtained by averaging over 100 samples with N = 5,000 nodes each (symbols), plotted together with the analytical result obtained from Eq. 5 (continuous line), showing very good agreement. The error bars (showing the standard error of the mean) are smaller than the symbols. (C) The average clustering coefficient 〈C〉 of nodes falling into the same row of the final link probability measure plotted in function of the row index. Similarly to the previous panel, the symbols correspond to the empirical result (the error bars show the standard error of the mean), whereas the continuous line was obtained from analytical calculations. (D) The nearest neighbors average degree in function of the node degree obtained both empirically by averaging over the samples (symbols, with the error bars corresponding to the standard error of the mean) and analytically (continuous line).

Results

Depending on the choice of the generating measure and the box boundaries, our method is capable of producing graphs with diverse properties. However, to generate a random graph with prescribed features in our approach, we need to optimize the generating measure with respect to the given requirements. Let us suppose that the number of nodes in the graph to be generated is given. In this case we have two parameters: the number of boxes in the generating measure (given by m2) and the number of iterations k. The actual pij and box boundaries are “self-adjusting,” as we shall describe in the following.

Let us denote the property to which we are optimizing the generating measure by Inline graphic. A conceptually simple example is when our goal is to obtain a network with a given degree distribution; in this case Inline graphic is equivalent to p(d). In principle, Inline graphic depends on pij, li, k, and N (and in an implicit way on m, through the box sizes and linking probabilities). However, as m, k, and N are kept constant, we discard them from the notation and write the “value” of the property corresponding to a given choice of pij and li as Inline graphic. [Note that in most cases Inline graphic is actually a high dimensional object, e.g., a degree distribution, and not a real number.] The target value of the property to which we would like the system to converge is denoted by Inline graphic.

In order to be able to make the studied property of the generated network converge to the goal Inline graphic, we have to define a way to judge the quality of the actual Inline graphic. In other words, we have to define a sort of distance or similarity between Inline graphic and Inline graphic. This distance/similarity measure can be used as an energy function during a so-called simulated annealing procedure, and we shall denote it by Inline graphic. The actual form of this function depends on the actual choice of the property; e.g., in the case of optimizing the degree distribution, a plausible choice is the sum of the relative differences between the degree distributions:

graphic file with name pnas.0912983107eq31.jpg [6]

where d runs over the degrees, ρ(k)(d) is the value of the actual degree distribution at degree d, and ρ(d) is the value of the target degree distribution at the same degree. In the simulating annealing we also define a temperature T, which is decreased slowly during the process. The process itself consist of many Monte Carlo steps, and in one step we try to change one of the linking probabilities or one of the box boundaries by a small amount, following the Metropolis algorithm (39). If the energy E2 after the change is smaller than the energy E1 before, the change is accepted. In the opposite case, the change is accepted by a probability given by P = exp[-(E2 - E1)/T].

The above procedure can be generalized in principle to optimizing with respect to multiple properties simultaneously as well. Another option is to use maximum likelihood techniques in a similar fashion to the KronFit algorithm shown in ref. 31, making it possible to optimize with respect to a given network as a whole. However, such an approach would be more complicated both at the level of formulation and at the level of implementation. Thus, for simplicity here we consider the optimization of the different properties separately, using the simple scenario for the optimization described above. In Fig. 4A we show the results for optimizing the generating measure with respect to various target degree distributions. Although the three chosen targets are rather different (a scale-free distribution, a log-normal one, and a bimodal distribution), our method succeeded in finding a setting of pij and li producing a degree distribution sufficiently close to the target. Similarly, in Fig. 4B the results from optimizing with respect to three clustering coefficient distributions are displayed, showing again a reasonable agreement between the targets and the results.

Fig. 4.

Fig. 4.

Optimizing the generating measure with respect to different target properties. During the optimization process the number of nodes N, the number of rows in the generating measure m, and the number of iterations k are kept constant; only the probabilities pij and the length of the intervals li defining the generating measure are adjusted. The typical value of the constant parameters in our experiments were N = 10,000–20,000, m = 3–4, and k = 3–5. (A) Optimizing with respect to different degree distributions. The target distributions are shown with circles, whereas the corresponding results at the end of the optimization procedure are marked by squares. The black symbols come from an experiment where the target was a power-law degree distribution (the Inset shows this on log scale), and the gray symbols correspond to a setting with a log-normal target, whereas the white symbols show the results of an experiment with a bimodal target distribution. (B) Optimizing with respect to different clustering coefficient distributions. Similarly to the previous panel, the circles correspond to the target distribution, whereas the squares to the result of the optimization, and the different colors mark three different settings of the target distribution.

Discussion

Our approach raises a number of fundamental graph theoretical and practical questions. Should we expect that large real graphs converge to some limiting network in a strict sense of the convergence? Or, alternatively, their structure cannot be mapped onto a fixed function, and only an ever-changing (with the size of the network) measure (in the infinite network size limit becoming singular) can be used to reflect the underlying structural complexity? This picture would be in contrast with the consequences of the renowned Szemerédi lemma (34) valid for arbitrary dense graphs.

Although it can be shown analytically (see SI Text) that in the infinitely large network size limit our construction converges to a relatively simple graph, the convergence to this structure is extremely slow. According to our numerical studies, there is a very extensive region between the small and infinite regimes in which a well-defined, increasingly complex structure emerges as our method is applied. Details about aspects of the slowness of convergence involving an extremely slow growth of the relative number of isolated nodes and the appearance of oscillations are given in SI Text.

In summary, our results demonstrate that it is possible to use simple models to construct large graphs with arbitrary distributions of their essential characteristics, such as degree distribution, clustering coefficient distribution, or assortativity. In turn, these graphs can be used to test hypotheses or as models of actual data. The combination of the tensorial product of a simple generating measure and simulated annealing technique leads to small (in practice, 3 × 3 to 5 × 5) matrices representing the most relevant statistical features of observed networks. A very unique feature of this construction is that with the increasing systems size the generated graphs become topologically more structured. In addition, the multifractal measure we propose is likely to result in networks displaying aspects of self-similarity in the spirit of the related findings by Song et al. (25).

Appendix

The degree distribution of the nodes falling in row i of the link probability measure, Inline graphic, can be calculated as follows. In our construction we draw links for a node in row i pointing to nodes in row j altogether nj(k) times with a probability pij(k), where nj(k) is the number of nodes in row j, given by nj(k) = Nlj(k). The distribution of the number of links from a node in row i to nodes in row j can be approximated by a Poisson distribution when nj is sufficiently large as

graphic file with name pnas.0912983107eq32.jpg [7]

where 〈dij〉 denotes the average number of links from a node in row i to nodes in row j given by 〈dij〉 = nj(k)pij(k). The degree of a node in row i is given by the sum over the links towards the other rows as Inline graphic. Therefore, the generating function of Inline graphic is the product of the generating functions of the Inline graphic distributions:

graphic file with name pnas.0912983107eq33.jpg [8]

where Inline graphic is defined as

graphic file with name pnas.0912983107eq34.jpg [9]

(A summary of the most important properties of the generating functions is given in SI Text). By substituting [9] into [8] we arrive at

graphic file with name pnas.0912983107eq35.jpg [10]

where we used the fact that, because of the independence of the links, the expected degree of a node in row i can be written as Inline graphic. The degree distribution of the nodes falling into row i can be obtained by transforming back the generating function in [10], resulting in

graphic file with name pnas.0912983107eq36.jpg [11]

Because 〈dij〉 = nj(k)pij(k), the expected degree of a node in row i, denoted by 〈di〉 in the above expression, can be given as

graphic file with name pnas.0912983107eq37.jpg [12]

The distributions associated with the clustering coefficients and the degree correlations can be derived analogously as described in SI Text.

Supplementary Material

Supporting Information

Acknowledgments.

We thank Gábor Elek for fruitful discussions. This work was partially supported by the Hungarian National Science Fund (OTKA K68669, K75334, and T049674), the National Research and Technological Office (NKTH, Textrend), and the János Bolyai Research Scholarship of the Hungarian Academy of Sciences.

Footnotes

The authors declare no conflict of interest.

*This Direct Submission article had a prearranged editor.

This article contains supporting information online at www.pnas.org/cgi/content/full/0912983107/DCSupplemental.

References

  • 1.Albert R, Barabási A-L. Statistical mechanics of complex networks. Rev Mod Phys. 2002;74:47–97. [Google Scholar]
  • 2.Mendes JFF, Dorogovtsev SN. Evolution of Networks: From Biological Nets to the Internet and WWW. Oxford: Oxford Univ Press; 2003. [Google Scholar]
  • 3.Newman MEJ, Barabási A-L, Watts DJ. The Structure and Dynamics of Networks. Princeton: Princeton Univ Press; 2006. [Google Scholar]
  • 4.Watts DJ, Strogatz SH. Collective dynamics of “small-world” networks. Nature. 1998;393:440–442. doi: 10.1038/30918. [DOI] [PubMed] [Google Scholar]
  • 5.Faloutsos M, Faloutsos P, Faloutsos C. On power-law relationships of the Internet topology. Comput Commun Rev. 1999;29:251–262. [Google Scholar]
  • 6.Barabási A-L, Albert R. Emergence of scaling in random networks. Science. 1999;286:509–512. doi: 10.1126/science.286.5439.509. [DOI] [PubMed] [Google Scholar]
  • 7.Girvan M, Newman MEJ. Community structure in social and biological networks. Proc Natl Acad Sci USA. 2002;99:7821–7826. doi: 10.1073/pnas.122653799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Palla G, Derényi I, Farkas I, Vicsek T. Uncovering the overlapping community structure of complex networks in nature and society. Nature. 2005;435:814–818. doi: 10.1038/nature03607. [DOI] [PubMed] [Google Scholar]
  • 9.Lancichinetti A, Fortunato S, Kertész J. Detecting the overlapping and hierarchical community structure of complex networks. New J Phys. 2009;11:033015. [Google Scholar]
  • 10.Goh K-I, et al. The human disease network. Proc Natl Acad Sci USA. 2007;104:8685–8690. doi: 10.1073/pnas.0701361104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Dogrusoz U, Duncan CA, Gutwenger C, Sander G. Graph drawing contest report. Lect Notes Comp Sci. 2009;5417:453–458. [Google Scholar]
  • 12.Lovász L, Szegedy B. Limits of dense graph sequences. J Comb Theory B. 2006;96:933–957. [Google Scholar]
  • 13.Borgs C, Chayes J, Lovász L, Sós VT, Vesztergombi K. Convergent sequences of dense graphs I: Subgraph frequencies, metric properties and testing. Adv Math. 2008;219:1801–1851. [Google Scholar]
  • 14.Bollobás B, Janson S, Riordan O. The phase transition in inhomogeneous random graphs. Random Struct Algor. 2007;31:3–122. [Google Scholar]
  • 15.Bollobás B, Riordan O. Random graphs and branching process. In: Bollobás B, Kozma R, Kozma R, Miklós D, editors. Handbook of Large-Scale Random Networks. Berlin: Springer; 2002. pp. 15–115. [Google Scholar]
  • 16.Caldarelli G, Capocci A, De Los Rios P, Muñoz MA. Scale-free networks from varying vertex intrinsic fitness. Phys Rev Lett. 2002;89:258702. doi: 10.1103/PhysRevLett.89.258702. [DOI] [PubMed] [Google Scholar]
  • 17.Boguñá M, Pastor-Satorras R. Class of correlated random networks with hidden variables. Phys Rev E. 2003;68:036112. doi: 10.1103/PhysRevE.68.036112. [DOI] [PubMed] [Google Scholar]
  • 18.Bianconi G. The entropy of randomized network ensembles. Europhys Lett. 2008;81:28005. [Google Scholar]
  • 19.Mahadevan P, Krioukov D, Fall K, Vahdat A. Systematic topology analysis and generation using degree correlations. Comp Comm Rev. 2006;36:135–146. [Google Scholar]
  • 20.Frank O, Strauss D. Markov graphs. J Am Stat Assoc. 1986;81:832–842. [Google Scholar]
  • 21.Wasserman S, Pattison PE. Logit models and logistic regressions for social networks: I. An introduction to Markov graphs and p*. Psychometrika. 1996;61:401–425. [Google Scholar]
  • 22.Robins G, Snijders T, Wang P, Handcock M, Pattison P. Recent developments in exponential random graph (p*) models for social networks. Soc Networks. 2007;29:192–215. [Google Scholar]
  • 23.Park J, Newman MEJ. Solution of the two-star model of a network. Phys Rev E. 2004;70:066146. doi: 10.1103/PhysRevE.70.066146. [DOI] [PubMed] [Google Scholar]
  • 24.Barabśsi AL, Ravasz E, Vicsek T. Deterministic scale-free networks. Physica A. 2001;299:559–564. [Google Scholar]
  • 25.Song CM, Havlin S, Makse HA. Self-similarity of complex networks. Nature. 2005;433:392–395. doi: 10.1038/nature03248. [DOI] [PubMed] [Google Scholar]
  • 26.Clauset A, Moore C, Newman MEJ. Hierarchical structure and the prediction of missing links in networks. Nature. 2008;453:98–101. doi: 10.1038/nature06830. [DOI] [PubMed] [Google Scholar]
  • 27.Avetisov VA, Chertovich AV, Nechaev SK, Vasilyev OA. On scale-free and polyscale behaviors of random hierarchical network. J Stat Mech. 2009;2009:P07008. [Google Scholar]
  • 28.Mezard G, Parisi G, Virasoro M. Spin Glass Theory and Beyond. Singapore: World Scientific; 1987. [Google Scholar]
  • 29.Erdös P, Rényi A. On the evolution of random graphs. Publ Math Inst Hung Acad Sci. 1960;5:17–61. [Google Scholar]
  • 30.Leskovec J, Chakrabarti D, Kleinberg J, Faloutsos C. Realistic, mathematically tractable graph generation and evolution, using Kronecker multiplication. Lect Notes Comput Sci. 2005;3721:133–145. [Google Scholar]
  • 31.Leskovec J, Faloutsos C. Scalable modeling of real graphs using Kronecker multiplication; Proceedings of the 24th International Conference on Machine Learning; New York: Association for Computing Machinery; 2007. pp. 497–504. [Google Scholar]
  • 32.Nepusz T, Négyessy L, Tusnády G, Bazsó F. Reconstructing cortical networks: Case of directed graphs with high level of reciprocity. In: Bollobás B, Kozma R, Miklós D, editors. Handbook of Large-Scale Random Networks. Berlin: Springer; 2008. pp. 325–368. [Google Scholar]
  • 33.Nepusz T, Bazsó F. Likelihood-based clustering of directed graphs; IEEE Proceedings of the Third International Symposium on Computational Intelligence and Intelligent Informatics; Piscataway, NJ: Institute of Electrical and Electronic Engineers; 2007. pp. 189–194. [Google Scholar]
  • 34.Szemerédy E. Problémes Combinatoires et Théories des Graphes. Paris: Centre National de la Recherche Scientifique; 1978. Regular partitions of graphs; pp. 399–401. [Google Scholar]
  • 35.Molloy M, Reed B. A critical point for random graphs with a given degree sequence. Random Struct Algor. 1995;6:161–179. [Google Scholar]
  • 36.Molloy M, Reed B. The size of the giant component of a random graph with a given degree sequence. Combinatorica. 1998;7:295–305. [Google Scholar]
  • 37.Chung F, Lu L. The average distances in random graphs with given expected degrees. Proc Natl Acad Sci USA. 2002;99:15879–15882. doi: 10.1073/pnas.252631999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Serrano MÁ, Boguñá M. Tuning clustering in random networks with arbitrary degree distributions. Phys Rev E. 2005;72:036133. doi: 10.1103/PhysRevE.72.036133. [DOI] [PubMed] [Google Scholar]
  • 39.Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E. Equations of state calculations by fast computing machines. J Chem Phys. 1953;21:1087–1092. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES