Abstract
Identifying clusters, namely groups of nodes with comparatively strong internal connectivity, is a fundamental task for deeply understanding the structure and function of a network. By means of a lumped Markov chain model of a random walker, we propose two novel ways of inferring the lumped Markov transition matrix. Furthermore, some useful results are derived from the analysis of the properties of the lumped Markov process. To find the best partition of complex networks, a novel framework including two algorithms for network partition based on the optimal lumped Markovian dynamics is derived. The algorithms are constructed to minimize the objective function under this framework. Simulation experiments demonstrate that our algorithms can efficiently determine the probabilities with which a node belongs to different clusters during the learning process and naturally support fuzzy partitions. Moreover, they are successfully applied to a real-world network, the social interactions between members of a karate club.
Introduction
The theory of network science has significantly improved our understanding of complex systems. In the last fifteen years, there has been an explosive growth of interest and activity concerning the structure and dynamics of complex networks. One of the most appealing but challenging tasks in network science is cluster analysis, which aims at revealing possible partitions of a network into subsets of nodes (clusters, or communities) [1]. Markov chains are frequently used as analytic models in the quantitative evaluation of stochastic systems. Examples of their use may be found in areas as diverse as the computer, biological, physical and social sciences, as well as in business, economics and engineering [2]–[4]. The fundamental property characterizing the model, referred to as the Markov property, is that, given the present, the past and future transitions of the system are independent of each other [5], [6]. The information that is often sought from this model is either the transient or the stationary probability of the system being in a given state. When the number of states is small, it is relatively easy to obtain the transient and stationary solutions allowing the prediction of the system behavior. However, as models become more complex, the process of obtaining these solutions becomes much more difficult. There is also a wide class of situations where the modeler does not need information about each state of the system but only about classes of states. This leads to the consideration of a new process, called the aggregated or lumped process, whose states are the state classes of the original Markov chain. The new stochastic process need not be Markovian. In order to utilize the full power of Markov chain theory, it is important to be able to claim that, for a given initial distribution, the aggregated process has the Markov property.
In a previous paper [7], a k-means approach was proposed to partition networks based on the optimal prediction theory developed by Chorin and coworkers [8], [9]. The basic idea is to associate the network with the random walker Markovian dynamics [10], then introduce a metric on the space of Markov chains (stochastic matrices), and optimally reduce the chain under this metric. The final minimization problem is solved by an analogy to the traditional k-means algorithm [11], [12] in clustering analysis. This approach is motivated by the diffusion maps [13] and the MNCut algorithm in imaging science [5].
Broadly speaking, from the viewpoint of applied mathematics, the study of complex networks is the study of dynamical systems on graphs. The graph structure may be fixed or time-varying; the dynamical system may be deterministic or stochastic. Since these networks are typically very complex, it is of great interest to see whether they can be reduced to much simpler systems [5]–[7], [13]–[20]. In a broader sense, this is also closely related to the model reduction theory of differential equations [21], [22]. The concept of lumpability on hard partitions is a useful tool for analyzing the dynamics of a network from a coarse-grained view. The lumped Markov chain model of the random walker (i.e., a reduced-order Markov chain in which the clusters of the original network become nodes) is easily derived from the original (high-order) Markov chain model. The notion of proximity of 'nodes' in the lumped Markov chain reflects the intrinsic geometry of the meta-node set in terms of the connectivity of the clusters in a diffusion process.
In the current paper, we first analyze the properties of the lumped Markov chain and propose two novel ways of inferring the lumped Markov transition matrix. Some useful results are derived from the analysis of the properties of the lumped Markov process. Furthermore, we construct two algorithms, the steepest descent method with projection (SDP) and the reduced conjugate gradient method with projection (CGP), by minimizing the objective function under the generalized framework of this paper. According to the two choices of projection operators P1 and P2, we obtain the formulations SDP1, SDP2, CGP1 and CGP2, which have been applied to an artificial network, the ad hoc network, as well as to a real-world network, the karate club network. The proposed algorithms are easy to implement with reasonable computational effort, and the final results make sense in the considered models. These experiments demonstrate that the algorithms perform reliably during the learning process and lead to good clustering results.
Materials and Methods
We will start with a brief review of Markov random walks on complex networks [8], [9]. Let $G = (S, W)$ be a network with $N$ nodes and $M$ edges, where $S$ is the node set, $W = (w_{xy})$ is the weight matrix and $w_{xy}$ is the weight of the edge connecting the nodes $x$ and $y$. A simple example of the weight matrix is given by the adjacency matrix: $w_{xy} = 0$ or $1$, depending on whether $x$ and $y$ are connected. We can relate this network to a discrete-time Markov chain with stochastic matrix $p$ with entries $p(x,y)$ given by

$$p(x,y) = \frac{w_{xy}}{d(x)}, \qquad d(x) = \sum_{z \in S} w_{xz}, \tag{1}$$

where $d(x)$ is the degree of the node $x$ [10]–[13], [23]. This Markov chain has the stationary distribution

$$\mu(x) = \frac{d(x)}{\sum_{z \in S} d(z)}, \tag{2}$$

and it satisfies the detailed balance condition

$$\mu(x)\, p(x,y) = \mu(y)\, p(y,x). \tag{3}$$
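For concreteness, the quantities in Eqs.(1)–(3) are easy to compute directly from the weight matrix. The following is a minimal NumPy sketch (the function name and the toy graph are ours, purely for illustration):

```python
import numpy as np

def random_walk_chain(W):
    """Transition matrix p of Eq.(1) and stationary distribution mu of Eq.(2),
    built from a symmetric weight (or adjacency) matrix W."""
    W = np.asarray(W, dtype=float)
    d = W.sum(axis=1)                  # degrees d(x)
    p = W / d[:, None]                 # p(x, y) = w_xy / d(x)
    mu = d / d.sum()                   # mu(x) = d(x) / sum_z d(z)
    return p, mu

# quick sanity check of the detailed balance condition, Eq.(3)
W = np.array([[0, 1, 1, 0],
              [1, 0, 1, 0],
              [1, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
p, mu = random_walk_chain(W)
assert np.allclose(mu[:, None] * p, (mu[:, None] * p).T)
```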
We take a partition of $S$ as $S = \bigcup_{k=1}^{K} S_k$ with $S_k \cap S_l = \emptyset$ if $k \neq l$. Our aim is to aggregate the nodes in each cluster in order to give the exact expression of the lumped Markov chain. To do so, we first regard each set $S_k$ in the state space $\hat{S} = \{S_1, \ldots, S_K\}$ as corresponding to a node of a $K$-node network $\hat{G}$, and the weight $\hat{w}_{kl}$ on the edge that connects $S_k$ and $S_l$ is defined as

$$\hat{w}_{kl} = \sum_{x \in S_k} \sum_{y \in S_l} \mu(x)\, p(x,y), \tag{4}$$

where the sum involves all the transition probabilities between $S_k$ and $S_l$. From the detailed balance condition (3), it can be verified that $\hat{w}_{kl} = \hat{w}_{lk}$. By setting $\hat{d}_k = \sum_{l} \hat{w}_{kl} = \sum_{x \in S_k} \mu(x)$, one can define the lumped Markov chain on the graph $\hat{G}$ with stationary distribution $\hat{\mu}_k = \hat{d}_k$ and lumped transition probabilities

$$\hat{p}_{kl} = \frac{\hat{w}_{kl}}{\hat{d}_k} = \frac{1}{\hat{\mu}_k} \sum_{x \in S_k} \sum_{y \in S_l} \mu(x)\, p(x,y). \tag{5}$$

It can be easily shown that $\hat{p} = (\hat{p}_{kl})$ is a stochastic matrix on the state space $\hat{S}$ and satisfies a detailed balance condition with respect to $\hat{\mu}$, i.e.

$$\hat{\mu}_k\, \hat{p}_{kl} = \hat{\mu}_l\, \hat{p}_{lk}. \tag{6}$$

This construction of a lumped Markov chain is shown in Figure 1.
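Under the form of Eqs.(4)–(5) given above, the lumped chain of a hard partition can be assembled with a few matrix products; a small sketch (the helper name and the label encoding are ours):

```python
import numpy as np

def lump_chain(p, mu, labels, K):
    """Lumped weights of Eq.(4) and lumped transition matrix of Eq.(5);
    labels[x] = k means that node x belongs to cluster S_k."""
    ind = np.eye(K)[labels]                    # N x K indicator matrix of the partition
    w_hat = ind.T @ (mu[:, None] * p) @ ind    # w_hat_kl = sum_{x in S_k, y in S_l} mu(x) p(x,y)
    mu_hat = w_hat.sum(axis=1)                 # d_hat_k = mu_hat_k
    p_hat = w_hat / mu_hat[:, None]            # Eq.(5)
    return p_hat, mu_hat
```

One can check numerically that the resulting $\hat{w}$ is symmetric and that $\hat{p}$ satisfies the lumped detailed balance condition (6).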
Figure 1. Lumped Markov process of a network.
For a given partition of a network $G$, we define a lumped network $\hat{G}$ by aggregating all nodes belonging to a subset $S_k$ into a meta-node. The new weights $\hat{w}_{kl}$ are computed by weight-averaging the transition probabilities between the nodes of $S_k$ and those of $S_l$, and the lumped Markov chain with transition probabilities $\hat{p}_{kl}$ can also be obtained.
To give a clearer probabilistic expression, we denote by $\rho_k(x)$ the probability with which the node $x$ belongs to the $k$-th cluster. Naturally we need the assumption that

$$\sum_{k=1}^{K} \rho_k(x) = 1, \qquad \rho_k(x) \ge 0, \tag{7}$$

for all $x \in S$. Thus, the lumped Markov transition matrix can be rewritten based on $\rho$ as

$$\hat{p}_{kl} = \frac{\sum_{x, y \in S} \rho_k(x)\, \mu(x)\, p(x,y)\, \rho_l(y)}{\sum_{x \in S} \mu(x)\, \rho_k(x)}. \tag{8}$$
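With a soft membership matrix, the same computation goes through; the sketch below follows the form of Eq.(8) as written above (function and variable names are ours):

```python
import numpy as np

def fuzzy_lumped_matrix(p, mu, rho):
    """Fuzzy lumped transition matrix in the form of Eq.(8); rho is the N x K
    matrix whose entry rho[x, k] is the probability that node x lies in cluster k."""
    mu_hat = rho.T @ mu                        # aggregated stationary weights
    w_hat = rho.T @ (mu[:, None] * p) @ rho    # soft analogue of Eq.(4)
    p_hat = w_hat / mu_hat[:, None]            # row-normalize, as in Eq.(8)
    return p_hat, mu_hat
```

When `rho` is a hard indicator matrix, this reduces to the lumped chain of the previous construction.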
Another approach to deriving the lumped Markov transition matrix is based on the lifted transition matrix. Any lumped stochastic matrix $\hat{p}$ can be naturally lifted to the space of stochastic matrices on the original state space $S$ via

$$\tilde{p}(x,y) = \sum_{k,l=1}^{K} \rho_k(x)\, \hat{p}_{kl}\, \frac{\mu(y)\, \rho_l(y)}{\hat{\mu}_l}, \tag{9}$$

where

$$\hat{\mu}_k = \sum_{x \in S} \mu(x)\, \rho_k(x). \tag{10}$$

The basic idea in [24], [25] is to introduce a metric, also called the Hilbert-Schmidt norm, on the space of stochastic matrices. The optimal partition and the corresponding reduced Markov chain are found by minimizing

$$\min_{\rho,\, \hat{p}}\; \| p - \tilde{p} \|^2, \tag{11}$$

where

$$J(\rho, \hat{p}) = \| p - \tilde{p} \|^2 = \sum_{x,y \in S} \frac{\mu(x)}{\mu(y)}\, \big( p(x,y) - \tilde{p}(x,y) \big)^2, \tag{12}$$

subject to the constraint Eq.(7).
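Putting Eqs.(9)–(12) together, the lifted matrix and the objective can be evaluated as follows; this is a sketch under our reading of the $\mu$-weighted Hilbert-Schmidt metric in Eq.(12), with illustrative names:

```python
import numpy as np

def lifted_matrix(p_hat, rho, mu, mu_hat):
    """Lifted stochastic matrix p_tilde of Eq.(9):
    p_tilde(x,y) = sum_{k,l} rho_k(x) p_hat_kl mu(y) rho_l(y) / mu_hat_l."""
    right = (mu[:, None] * rho) / mu_hat       # entry (y, l): mu(y) rho_l(y) / mu_hat_l
    return rho @ p_hat @ right.T

def objective_J(p, p_tilde, mu):
    """Objective of Eqs.(11)-(12): mu-weighted squared distance between p and p_tilde."""
    diff = p - p_tilde
    return float(np.sum((mu[:, None] / mu[None, :]) * diff ** 2))
```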
To minimize the objective function J in Eq.(12), we define

$$\hat{p}_{kl} = \frac{1}{\hat{\mu}_k} \sum_{x, y \in S} \rho_k(x)\, \mu(x)\, p(x,y)\, \rho_l(y), \tag{13}$$

which has a similar form as in hard clustering. Then $\hat{p}$ is indeed a stochastic matrix because $\sum_{l} \hat{p}_{kl} = 1$ for all $k$. Furthermore, $\hat{p}$ satisfies the detailed balance condition with respect to $\hat{\mu}$, that is

$$\hat{\mu}_k\, \hat{p}_{kl} = \hat{\mu}_l\, \hat{p}_{lk}. \tag{14}$$
With the above background, we have the following basic result.

Lemma 1. The expressions of the partial derivatives $\partial J / \partial \rho$ and $\partial J / \partial \hat{p}$ of the objective in Eq.(12) are given by

[Eq. (15a)]

[Eq. (15b)]

where the matrices appearing in these formulas have entries given by

[Eq. (17)]

and the diagonal matrices involved have entries

[Eq. (17a)]

[Eq. (17b)]

where $\delta_{xy}$ and $\delta_{kl}$ are both Kronecker delta symbols.
The proof of Lemma 1 can be found in Appendix_S1.pdf. Eq.(15a) and Eq.(15b) give the partial derivatives of the objective function in Eq.(12), which are the key quantities for constructing the gradient methods.
Based upon this formulation, we can find the optimal $\hat{p}$ for any fixed partition $\rho$. From the optimality condition

$$\frac{\partial J}{\partial \hat{p}_{kl}} = 0, \tag{18}$$

we can obtain Eq.(8). This indicates that when the partition is known, the minimizer of Eq.(11) is unique and is given by Eq.(8), and the corresponding lifted matrix $\tilde{p}$ is the stochastic matrix in the class Eq.(9) which provides the best rank-$K$ approximation of the original one under the metric Eq.(11).
SDP
In this section, based on the lumped Markov transition matrix and the results derived above, we look for ways to uncover partitions. An obvious choice for solving the constrained optimization Eq.(11) is the steepest descent method [12]. However, the components of $\rho$ and $\hat{p}$ may become negative or non-normalized during the descent procedure, which makes the probabilistic interpretation useless. To ensure the nonnegativity and normalization conditions for $\rho$ and $\hat{p}$, we add a projection step after each update. This leads to the following Steepest Descent method with Projection (SDP).
- Set up the initial state $\rho^{(0)}$ as the indicator matrix for each node in the network obtained with the $k$-means algorithm in [7].
- Perform the following iteration until the decrease of the objective function falls below the prescribed tolerance $\epsilon$:

$$\rho^{(t+1)} = \mathcal{P}\Big(\rho^{(t)} - \eta\,\frac{\partial J}{\partial \rho}\Big|_{(\rho^{(t)},\,\hat{p}^{(t)})}\Big), \tag{19a}$$

$$\hat{p}^{(t+1)} = \mathcal{P}\Big(\hat{p}^{(t)} - \eta\,\frac{\partial J}{\partial \hat{p}}\Big|_{(\rho^{(t)},\,\hat{p}^{(t)})}\Big). \tag{19b}$$

Here $\mathcal{P}$ is some type of projection operator which maps a real vector into a probability vector, $\eta$ is the learning rate and $\epsilon$ is a prescribed tolerance.
- The final $\rho$ gives the fuzzy partition for each node.
Two choices of the projection operator are used in our computations, but the final results seem to be insensitive to this choice. Now suppose $u = (u_1, \ldots, u_K)$ is a row produced by a descent step, and some of its components violate the constraints $u_k \ge 0$, $\sum_k u_k = 1$; then we make the projection in either of the following two ways.

P1 Direct projection to the boundary. When $u_k < 0$, we set $u_k = 0$; otherwise we keep $u_k$ unchanged; the resulting vector is then rescaled so that its components sum to one.

P2 Iterative projections. At first make a projection onto the hyperplane $\sum_k u_k = 1$, then check each component of the projected $u$. If some $u_k < 0$, we take $u_k = 0$ and make the projection again onto a dimension-reduced hyperplane restricted to the remaining components. Repeat the projection procedure onto lower and lower dimensional hyperplanes until no component is negative.
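As an illustration, the two projections might be implemented as follows; this is a sketch in NumPy, where the final rescaling in P1 is our assumption and P2 follows the iterative hyperplane projection described above:

```python
import numpy as np

def project_p1(u):
    """P1: set negative components to zero (the simplex boundary) and then
    rescale so that the components sum to one (the rescaling is our assumption)."""
    v = np.clip(np.asarray(u, dtype=float), 0.0, None)
    s = v.sum()
    return v / s if s > 0 else np.full(len(v), 1.0 / len(v))

def project_p2(u):
    """P2: project onto the hyperplane sum = 1, zero out negative components,
    and repeat on the reduced hyperplane until all components are nonnegative."""
    v = np.asarray(u, dtype=float).copy()
    active = np.ones(v.size, dtype=bool)             # components not yet fixed at zero
    while True:
        v[active] += (1.0 - v.sum()) / active.sum()  # equal-shift projection onto the hyperplane
        negative = active & (v < 0)
        if not negative.any():
            return v
        v[negative] = 0.0
        active &= ~negative
```

For example, `project_p2(np.array([0.7, 0.6, -0.1]))` returns `[0.55, 0.45, 0.0]`, which lies on the probability simplex.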
The application of these projection operators to the steepest descent iteration solves the original constrained optimization Eq.(11) only approximately, while solving it exactly may involve much more complexity. Note that with this SDP method we cannot guarantee that $\rho$ and $\hat{p}$ are the exact minimizers of the original problem Eq.(11) with nonnegativity and normalization constraints, but we take them as approximate minimizers. The numerical results show that this strategy works fine for many examples.
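To make the scheme concrete, one projected steepest descent update of $\rho$ might be sketched as below, with a finite-difference gradient standing in for the closed-form expression of Eq.(15a), which we do not reproduce here; all names are illustrative:

```python
import numpy as np

def numerical_grad(J, rho, eps=1e-6):
    """Finite-difference surrogate for the gradient of J with respect to rho."""
    g = np.zeros_like(rho)
    for idx in np.ndindex(*rho.shape):
        e = np.zeros_like(rho)
        e[idx] = eps
        g[idx] = (J(rho + e) - J(rho - e)) / (2.0 * eps)
    return g

def sdp_step(J, rho, eta, project):
    """One steepest-descent-with-projection update of rho in the spirit of Eq.(19a);
    `project` maps a row back onto the probability simplex (e.g. P1 or P2)."""
    rho_new = rho - eta * numerical_grad(J, rho)
    return np.apply_along_axis(project, 1, rho_new)
```

In a full SDP loop the analogous update would be applied to $\hat{p}$, and the iteration stopped once the decrease of $J$ falls below the tolerance $\epsilon$.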
The learning rate $\eta_t$ is usually chosen to start from a reasonable initial value and then decrease to zero with the iteration number $t$, in such a way that $\eta_t \to 0$ and

$$\sum_{t=1}^{\infty} \eta_t = \infty. \tag{20}$$

A typical example of such a schedule is $\eta_t = c/t$, where $c$ is a positive constant [26]. Another choice is to fix the learning rate $\eta$ as a positive constant [24], [25], which we utilize here, since the initial partition is good enough that the objective function Eq.(12) descends much more slowly when the learning rate becomes smaller, while larger values of $\eta$ cause blow-up.
The number of iteration steps is difficult to estimate; it may depend on the structure of the network itself, the choice of the initial values, and so on. The iteration usually converges fast for well-clustered networks and may converge slowly for diffusive networks.
Now let us estimate the computational cost in each iteration. In the iteration step for $\rho$, the matrices appearing in Eq.(15a) are dense, and the dominant work consists of matrix products involving the stochastic matrix $p$. Note that $p$ is sparse with $O(M)$ nonzero entries, where $M$ represents the number of edges, which is usually assumed to be $O(N)$ in realistic networks, so these products are correspondingly cheap. The cost of the step for $\hat{p}$ is of the same order according to Eq.(15b), since the same quantities are involved in the equations.
CGP
Another choice is to minimize the objective function using a simplified formulation of the traditional conjugate gradient method, which is frequently used in machine learning [27]. It can also be regarded as the above steepest descent method with a non-zero momentum term, which leads to the following reduced Conjugate Gradient method with Projection (CGP).
- Set up the initial state $\rho^{(0)}$ as the indicator matrix for each node in the network obtained with the $k$-means algorithm in [13].
- Perform the following iteration until the decrease of the objective function falls below the prescribed tolerance $\epsilon$:

$$\rho^{(t+1)} = \mathcal{P}\Big(\rho^{(t)} - \eta\,\frac{\partial J}{\partial \rho}\Big|_{(\rho^{(t)},\,\hat{p}^{(t)})} + \beta\,\big(\rho^{(t)} - \rho^{(t-1)}\big)\Big), \tag{21a}$$

$$\hat{p}^{(t+1)} = \mathcal{P}\Big(\hat{p}^{(t)} - \eta\,\frac{\partial J}{\partial \hat{p}}\Big|_{(\rho^{(t)},\,\hat{p}^{(t)})} + \beta\,\big(\hat{p}^{(t)} - \hat{p}^{(t-1)}\big)\Big). \tag{21b}$$

Here $\mathcal{P}$ is some type of projection operator which maps a real vector into a probability vector, $\eta$ and $\beta$ are the learning rates and $\epsilon$ is a prescribed tolerance.
- The final $\rho$ gives the fuzzy partition for each node.
We again note that this is just a reduced form of the conjugate gradient method, and the simulation experiments demonstrate that it performs more efficiently than SD, in line with the usual superiority of the traditional conjugate gradient method. The learning rate $\eta$ and the momentum coefficient $\beta$ are still chosen as constants by experience, for the same reason mentioned above. The computational cost in each iteration of CGP is the same as that of SDP for both $\rho$ and $\hat{p}$. Associating the two projections described above with SDP and CGP respectively, we refer to the derived algorithms SDP1, SDP2, CGP1 and CGP2 as the fuzzy partitioning algorithms for networks.
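Reading the reduced conjugate gradient update as a steepest descent step plus a momentum term, one CGP-style update might be sketched as follows (the momentum coefficient $\beta$ and all names are ours, and the gradient is again a stand-in for Eq.(15a)):

```python
import numpy as np

def cgp_step(grad, rho, rho_prev, eta, beta, project):
    """One momentum ("reduced conjugate gradient") update of rho in the spirit of
    Eq.(21a): gradient step plus momentum term, followed by a projection."""
    update = -eta * grad(rho) + beta * (rho - rho_prev)
    rho_new = np.apply_along_axis(project, 1, rho + update)
    return rho_new, rho    # return the new iterate and the previous one for the next step
```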
Results
In this section, simulation experiments on an artificial network, the ad hoc network with 128 nodes, are carried out to demonstrate the performance of the proposed algorithms by comparing the clustering results with some a priori quantities. Moreover, the algorithms are applied to a real-world network, the social interactions between members of a karate club.
Ad hoc network with 128 nodes
The first example is the ad hoc network with 128 nodes. The ad hoc network is a benchmark problem used in many papers [7], [16], [17], [22]. It has a known partition and is constructed as follows. Suppose we choose $N = 128$ nodes and split them into 4 clusters with 32 nodes each. Assume that pairs of nodes belonging to the same cluster are linked with probability $p_{in}$, and pairs belonging to different clusters with probability $p_{out}$. These values are chosen so that the average node degree is fixed at 16. In other words, $p_{in}$ and $p_{out}$ are related as

$$31\, p_{in} + 96\, p_{out} = 16. \tag{22}$$

We will denote $z_{out} = 96\, p_{out}$, the expected number of inter-cluster edges per node.
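A generator for this benchmark might be sketched as follows, assuming, as is standard for this benchmark, that $z_{in} + z_{out} = 16$ with $z_{in} = 31\, p_{in}$ and $z_{out} = 96\, p_{out}$; the function name and the seeding are ours:

```python
import numpy as np

def ad_hoc_network(z_out, n_clusters=4, cluster_size=32, avg_degree=16, seed=0):
    """Random benchmark graph: n_clusters clusters of cluster_size nodes each,
    expected node degree avg_degree, z_out expected inter-cluster edges per node."""
    rng = np.random.default_rng(seed)
    N = n_clusters * cluster_size
    p_out = z_out / (N - cluster_size)                # 96 possible inter-cluster partners
    p_in = (avg_degree - z_out) / (cluster_size - 1)  # 31 intra-cluster partners, cf. Eq.(22)
    labels = np.repeat(np.arange(n_clusters), cluster_size)
    same = labels[:, None] == labels[None, :]
    prob = np.where(same, p_in, p_out)
    upper = np.triu(rng.random((N, N)) < prob, k=1)   # sample each pair of nodes once
    W = (upper | upper.T).astype(float)               # symmetric 0/1 weight matrix
    return W, labels
```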
To test on a less diffusive network, we fix $z_{out}$ and generate the network according to Eq.(22). This network has a fuzzy clustering structure in the sense that some nodes should have intermediate weights of belonging to different clusters. We set the tolerance and the learning rates to fixed values in this model computation, and $\rho^{(0)}$ is obtained after initialization [12]. The numerical results are shown in Table 1. Here we compare $\rho_k(x)$ with an interesting quantity, the degree fraction $f_k(x)$, which is defined as

$$f_k(x) = \frac{n_k(x)}{d(x)}, \tag{23}$$

where $n_k(x)$ is the number of nodes that are connected with $x$ and lie in cluster $S_k$. Thus we have $\sum_k f_k(x) = 1$. With this definition, $f_k(x)$ is the fraction of the edges connected with the node $x$ that lead into the $k$-th cluster. Note that this is not the same as the clustering probability, even though it is an interesting quantity to compare with. We expect that the degree fraction Eq.(23) is close to our result $\rho_k(x)$ for this network, though generally this assertion needs to be justified or disconfirmed theoretically. To verify this fact, we define the mean and maximal $\infty$-errors between $\rho$ and $f$,

$$\mathrm{Err}_{\mathrm{mean}} = \frac{1}{N} \sum_{x \in S} \max_{k}\, \big| \rho_k(x) - f_k(x) \big|, \tag{24a}$$

$$\mathrm{Err}_{\max} = \max_{x \in S}\, \max_{k}\, \big| \rho_k(x) - f_k(x) \big|, \tag{24b}$$

for error comparison.
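The degree fraction of Eq.(23) and the errors of Eqs.(24a)–(24b) are straightforward to compute from the adjacency matrix; a short sketch under the definitions above (names are ours):

```python
import numpy as np

def degree_fraction(W, labels, K):
    """Degree fraction f_k(x) of Eq.(23): the fraction of the edges incident to
    node x whose other endpoint lies in cluster k (labels is the reference partition)."""
    W = np.asarray(W, dtype=float)
    ind = np.eye(K)[labels]                       # N x K indicator matrix
    return (W @ ind) / W.sum(axis=1, keepdims=True)

def membership_errors(rho, f):
    """Mean and maximal infinity-errors of Eqs.(24a)-(24b) between rho and f."""
    per_node = np.max(np.abs(rho - f), axis=1)    # infinity-norm over clusters, per node
    return per_node.mean(), per_node.max()
```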
Table 1 shows that the deviation between these two quantities is about 0.12 on average. Obviously the CG algorithm improves the convergence rate of SD. The projection P1 gives the smallest mean error, while the projection P2 reaches a better minimum of $J$, which indicates a more accurate result. In Figure 2 we plot the probability distribution functions (pdf) of $\rho_k$ and $f_k$. We observe that the shapes of the pdfs of $\rho_k$ and $f_k$ are almost the same. Note that all the $\rho_k$'s have a lower peak centered at about 0.85, which corresponds to the nodes in the given cluster, and a higher peak centered at about 0.05, which corresponds to the other nodes outside of this cluster. The case for $f_k$ is similar but with the lower peak centered at about 0.7 and the higher peak centered at about 0.1; these peak centers correspond to the choice of the parameters $p_{in}$ and $p_{out}$. If we classify the nodes according to the majority rule, i.e. if $\rho_k(x) = \max_l \rho_l(x)$ for a given node $x$ then we assign $x$ to cluster $S_k$, we obtain the 4-cluster partition exactly for this model. This also verifies the accuracy of our algorithms, but the fuzzy algorithms give more detailed information for each node.
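The majority-rule hard partition used in this comparison is simply a row-wise argmax of the membership matrix; for instance:

```python
import numpy as np

def majority_rule(rho):
    """Hard partition from the fuzzy membership matrix: each node is assigned
    to the cluster with the largest membership probability."""
    return np.argmax(rho, axis=1)
```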
Table 1. Iterations, objective function values and errors for the ad hoc network.

| | Iterations | $J$ | $\mathrm{Err}_{\mathrm{mean}}$ | $\mathrm{Err}_{\max}$ |
|---|---|---|---|---|
| SDP1 & CGP1 | 110 & 55 | 5.8794 | 0.1191 | 0.2389 |
| SDP2 & CGP2 | 119 & 54 | 5.8757 | 0.1207 | 0.2374 |

The iterations, the value of the objective function $J$, and the mean and maximum $\infty$-errors between $\rho$ and $f$ defined in Eq.(24) for the algorithms.
Figure 2. The pdf of $\rho_k$ and $f_k$ for the given ad hoc network with 128 nodes.
The solid lines and dashed lines represent the pdfs of $\rho_k$ and $f_k$, respectively. In each panel, the lower peak corresponds to the nodes in the given cluster, and the higher peak corresponds to the other nodes outside of this cluster.
Karate club network
This network was constructed by Wayne Zachary after he observed social interactions between members of a karate club at an American university [28]. Soon after, a dispute arose between the club's administrator and its main teacher, and the club split into two smaller clubs. It has been used in several papers to test algorithms for finding clusters in networks [7], [14]–[18].
There are 34 nodes in the karate club network, where each node represents one member of the club. In Zachary's original partition, each node belongs to only one sub-club after the split. We label each node with red or yellow color in the figures to show its attribute in the graph representation. From the viewpoint of fuzzy clustering, the attribute of each node is no longer an indicator function but rather a discrete probability distribution. In the following, the association probabilities $\rho_1(x)$ and $\rho_2(x)$ mean the probability of node $x$ belonging to the red or the yellow colored cluster, respectively.
We set the tolerance and the learning rates to fixed values, and $\rho^{(0)}$ is obtained after initialization [7]. The numerical results are shown in Table 2. It is demonstrated again that CGP is more efficient, and that P2 can reach a smaller value of $J$. Figure 3 shows the convergence history during iterations 5–30 for the methods. It can be clearly seen that CGP, which decreases the objective function faster than SDP, performs more efficiently. The final association probabilities are presented in Table 3, where $\rho_1$ and $\rho_2$ are the probabilities of belonging to the red or the yellow colored group shown in Figure 4, respectively. Comparing $\rho_1$ or $\rho_2$ between P1 and P2, we find that almost all the differences are quite small, but the association probabilities are quite different from the 0–1 distributions obtained with the $k$-means algorithm. Now let us compare the association probabilities $\rho_1$ and $\rho_2$ obtained by our methods with the original partition result obtained by Zachary. In [28], Zachary gave the partition of the members into two groups, one centered on the administrator and the other on the main teacher. If we classify the nodes according to the majority rule, i.e., if $\rho_1(x) > \rho_2(x)$ then we assign node $x$ to the red colored group, otherwise we assign it to the yellow colored group, we obtain the same partition as Zachary's (see Figure 4(a)). We note that this hard partition deduced from the fuzzy partition is more reasonable than the result of $k$-means (see Figure 4(b)), since node 10 is classified correctly this time.
Table 2. Iterations and objective function values of the karate club network.

| | Iterations | $J$ |
|---|---|---|
| SDP1 & CGP1 | 153 & 55 | 4.0394 |
| SDP2 & CGP2 | 173 & 53 | 4.0389 |

The iterations and minimized objective function values for the algorithms on the karate club network.
Figure 3. The convergence history of the objective function $J$ during iterations 5–30 for the karate club network.
Table 3. The association probability.

| | Nodes | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SDP1 & CGP1 | $\rho_1$ | 0.039 | 0.079 | 0.44 | 0 | 0 | 0 | 0 | 0.004 | 0.675 | 0.764 | 0 | 0 |
| | $\rho_2$ | 0.96 | 0.92 | 0.56 | 1 | 1 | 1 | 1 | 0.996 | 0.3254 | 0.236 | 1 | 1 |
| SDP2 & CGP2 | $\rho_1$ | 0.052 | 0.098 | 0.452 | 0.018 | 0 | 0 | 0 | 0.03 | 0.678 | 0.766 | 0 | 0 |
| | $\rho_2$ | 0.948 | 0.902 | 0.548 | 0.982 | 1 | 1 | 1 | 0.97 | 0.3222 | 0.234 | 1 | 1 |

The association probability that each node belongs to the different clusters (nodes 1–12 shown). $\rho_1$ and $\rho_2$ denote the probability of belonging to the red or the yellow colored cluster in Figure 4, respectively.
Figure 4. The visualization of the partitioning results of the karate club network.
(a) Hard partition obtained by thresholding according to each node's maximal weight. (b) Hard partition by k-means in [13]. (c) Fuzzy partition by CGP2.
However, we in fact have more detailed information. From Table 3, we find that some nodes belong to the yellow colored group with probability one; these lie at the boundary of the yellow colored group. Some other nodes belong to the red colored group with probability one; these mostly lie at the boundary of the red colored group. The remaining members belong to both the red and the yellow colored groups with nonzero probability; in particular, nodes such as 3, 9 and 10 have more diffusive probabilities and play the role of transition nodes between the red and yellow colored groups. We can visualize the data $\rho$ more transparently by assigning a color vector to each node. From Figure 4(c) we can read off with how much probability each of the 34 members stands by each of the two parties. Some members show a clear attitude of following their leader, i.e. the club administrator or the main teacher, while others remain more neutral in the sense that they could support either leader according to their weights.
Discussion
In this paper we address the expression of the lumped Markov transition matrix for networks with two novel methods. This can also be considered as a generalization of the Markov random walk dynamics on networks. We successfully constructed the steepest descent method with projection (SDP) and the reduced conjugate gradient method with projection (CGP). They are derived to search for a local minimum of the objective function in Eq.(12) under the fuzzy clustering framework, which extends the deterministic framework for network partition based on the optimal prediction of a random walker Markovian dynamics [7]. The simulation experiments have shown that the algorithms can efficiently determine the fuzzy partition matrix. Partitioning the network by thresholding according to each node's maximal weight gives a more reasonable clustering result than the previous k-means algorithm [7]. We used two datasets (the ad hoc network with 128 nodes and the karate club network) to validate the algorithms and achieved good results. Numerical results show that our algorithms with the two different projections produce similar results, while the CGP2 algorithm has better efficiency and accuracy. Moreover, the algorithms succeed in real-world learning tasks.
Supporting Information
The ad hoc network is a benchmark problem used in many papers [8] , [9] , [12] , [16] . It has a known partition and is constructed as follows. Suppose we choose N = 128 nodes, split them into 4 clusters with 32 nodes each.
(TXT)
The file karate.txt contains the network of friendships between the 34 members of a karate club at a US university, as described by Wayne Zachary in 1977. If you use these data in your work, please cite W. W. Zachary, An information flow model for conflict and fission in small groups, Journal of Anthropological Research 33, 452–473 (1977).
(TXT)
The Proof of Lemma 1.
(PDF)
Funding Statement
This work was supported by the National Natural Science Foundation of China (grant No. 60963026, No. 61163061), and the Application Basic Research Fund of Yunnan Province (grant No. 2011FZ174). The National Natural Science Foundation of China: http://www.nsfc.gov.cn/. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Lancichinetti A, Kivelä M, Saramäki J, Fortunato S (2010) Characterizing the community structure of complex networks. PLoS ONE 5: e11976.
- 2. Albert R, Barabási AL (2002) Statistical mechanics of complex networks. Reviews of Modern Physics 74: 47.
- 3. Newman M, Barabási AL, Watts DJ (2006) The structure and dynamics of networks. Princeton University Press.
- 4. Börner K, Sanyal S, Vespignani A (2007) Network science. Annual Review of Information Science and Technology 41: 537–607.
- 5. Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22: 888–905.
- 6. Meila M, Shi J (2001) A random walks view of spectral segmentation.
- 7. Weinan E, Li T, Vanden-Eijnden E (2008) Optimal partition and effective dynamics of complex networks. Proceedings of the National Academy of Sciences 105: 7907–7912.
- 8. Chorin AJ, Kast AP, Kupferman R (1999) Unresolved computation and optimal predictions. Communications on Pure and Applied Mathematics 52: 1231–1254.
- 9. Chorin AJ (2003) Conditional expectations and renormalization. Multiscale Modeling & Simulation 1: 105–118.
- 10. Lovász L (1993) Random walks on graphs: a survey. Combinatorics, Paul Erdős is Eighty 2: 1–46.
- 11. Devijver PA, Kittler J (1982) Pattern recognition: a statistical approach. Englewood Cliffs, NJ: Prentice Hall International.
- 12. Hastie T, Tibshirani R, Friedman J, Franklin J (2005) The elements of statistical learning: data mining, inference and prediction. The Mathematical Intelligencer 27: 83–85.
- 13. Lafon S, Lee AB (2006) Diffusion maps and coarse-graining: a unified framework for dimensionality reduction, graph partitioning, and data set parameterization. IEEE Transactions on Pattern Analysis and Machine Intelligence 28: 1393–1403.
- 14. Girvan M, Newman ME (2002) Community structure in social and biological networks. Proceedings of the National Academy of Sciences 99: 7821–7826.
- 15. Newman ME (2004) Fast algorithm for detecting community structure in networks. Physical Review E 69: 066133.
- 16. Newman ME, Girvan M (2004) Finding and evaluating community structure in networks. Physical Review E 69: 026113.
- 17. Newman ME (2004) Detecting community structure in networks. The European Physical Journal B - Condensed Matter and Complex Systems 38: 321–330.
- 18. Newman ME (2006) Modularity and community structure in networks. Proceedings of the National Academy of Sciences 103: 8577–8582.
- 19. Li T, Liu J, Weinan E (2009) Probabilistic framework for network partition. Physical Review E 80: 026106.
- 20. Hofman JM, Wiggins CH (2008) Bayesian approach to network modularity. Physical Review Letters 100: 258701.
- 21. Schilders WH, Van Der Vorst HA, Rommes J (2008) Model order reduction: theory, research aspects and applications, volume 13. Springer.
- 22. Danon L, Diaz-Guilera A, Duch J, Arenas A (2005) Comparing community structure identification. Journal of Statistical Mechanics: Theory and Experiment 2005: P09008.
- 23. Chung FR (1997) Spectral graph theory. Volume 92 of CBMS Regional Conference Series in Mathematics. American Mathematical Society.
- 24. Ma J, Wang T, Xu L (2004) A gradient BYY harmony learning rule on Gaussian mixture with automated model selection. Neurocomputing 56: 481–487.
- 25. Ma J, Gao B, Wang Y, Cheng Q (2005) Conjugate and natural gradient rules for BYY harmony learning on Gaussian mixture with automated model selection. International Journal of Pattern Recognition and Artificial Intelligence 19: 701–713.
- 26. Ma J, Wang L (2006) BYY harmony learning on finite mixture: adaptive gradient implementation and a floating RPCL mechanism. Neural Processing Letters 24: 19–40.
- 27. Qian N (1999) On the momentum term in gradient descent learning algorithms. Neural Networks 12: 145–151.
- 28. Zachary WW (1977) An information flow model for conflict and fission in small groups. Journal of Anthropological Research 33: 452–473.