PLOS ONE. 2023 Jul 21;18(7):e0288457. doi: 10.1371/journal.pone.0288457

Hypergraph partitioning using tensor eigenvalue decomposition

Deepak Maurya 1,*, Balaraman Ravindran 1
Editor: Ilya Safro2
PMCID: PMC10361499  PMID: 37478054

Abstract

Hypergraphs have gained increasing attention in the machine learning community lately due to their superiority over graphs in capturing super-dyadic interactions among entities. In this work, we propose a novel approach for the partitioning of k-uniform hypergraphs. Most of the existing methods work by reducing the hypergraph to a graph and then applying standard graph partitioning algorithms. The reduction step restricts the algorithms to capturing only some weighted pairwise interactions and hence loses essential information about the original hypergraph. We overcome this issue by utilizing the tensor-based representation of hypergraphs, which enables us to capture actual super-dyadic interactions. We extend the notion of minimum ratio-cut and normalized-cut from graphs to hypergraphs and show that the relaxed optimization problem can be solved using the eigenvalue decomposition of the Laplacian tensor. Unlike the existing reduction approaches, this formulation also enables us to remove a hyperedge completely, using the proposed "hyperedge score" metric. We propose a hypergraph partitioning algorithm inspired by spectral graph theory and also derive a tighter upper bound on the minimum positive eigenvalue of the even-order hypergraph Laplacian tensor in terms of its conductance, which is utilized in the partitioning algorithm to approximate the normalized cut. The efficacy of the proposed method is demonstrated numerically on synthetic hypergraphs generated by the stochastic block model. We also show improvement in the min-cut solution on 2-uniform hypergraphs (graphs) over the standard spectral partitioning algorithm.

1 Introduction

In machine learning, interacting systems are often modeled as graphs. In graph modeling, an interacting object is represented as a node, and an edge captures the interaction between a pair of objects. A conventional approach is to quantify the extent of interaction by associating a positive weight with the corresponding edge. This graph formulation is further utilized for various standard machine learning applications in different domains, such as biology [1], VLSI [2], computer vision [3], transport [4], clustering [5], and semi-supervised learning [6]. Learning on graphs has been an active area of research, ranging from spectral graph theory [7] to the recently proposed graph neural networks [8]. A graph representation is limited to capturing only pairwise interactions, whereas many real-world systems involve interactions more complex than the simple pairwise formulation [9]. For instance, a collaboration network may involve agents interacting at a group level (also called super-dyadic interactions), which cannot be captured by modeling the system as a graph.

Recently, hypergraphs have been used to represent and analyze such complex super-dyadic relationships. Hypergraphs are generalizations of graphs where an edge could potentially connect multiple nodes. These edges are commonly referred to as hyperedges. A k-uniform hypergraph refers to the case when all hyperedges are constrained to contain exactly k nodes.

Graph partitioning is an interesting problem that involves partitioning the set of nodes in a graph into multiple subsets such that nodes in one subset are more “similar” to each other as compared to nodes in any other subset. Graph partitioning is utilized in various fields such as biology [1], VLSI [2], and computer vision [3]. One of the widely accepted approaches for graph partitioning is minimizing the ratio-cut or normalized-cut [5] objective function using spectra of the graph [7]. Similarly, hypergraph partitioning has been used in a variety of applications in several domains, such as circuit designing [10], image segmentation [11], object segmentation in videos [12], citation networks [13], and semi-supervised learning [14]. In this work, we define the ratio-cut and normalized-cut on hypergraphs and propose a spectral partitioning algorithm.

Existing hypergraph modeling frameworks can be classified into two paradigms, based on whether they reduce the hypergraph to a graph explicitly [15] or implicitly [13, 16]. These reduction-based approaches are quite popular in the machine learning community due to their scalability to large datasets [17–19] and the provable performance guarantees of graph-based algorithms [20]. Thus, most of the existing approaches make use of hypergraph reduction to utilize standard graph-based algorithms, which defeats the motivation behind using hypergraphs. As graphs are limited to capturing only dyadic interactions, the reduction-based approaches fail to model the desired super-dyadic relationships.

Ihler et al. [21] show that the reduction-based approaches cannot model a hypergraph cut, i.e., the complete removal of a hyperedge from a given hypergraph. After reducing a hypergraph to a graph, partitioning is performed on the graph. During that process, any partitioning algorithm removes some edges from the graph, which is not guaranteed to have any correspondence to the hyperedges in the original hypergraph. Also, note that two or more non-isomorphic hypergraphs may reduce to the same graph; an example of such a case is presented in S1 File. To bridge this gap, we propose a hypergraph partitioning algorithm that removes hyperedges directly, without reduction to a graph. We use the tensor representation of hypergraphs and, further, tensor eigenvalue decomposition for hypergraph partitioning. Note that tensor eigenvalue decomposition is NP-hard for general tensors and cannot be approximated unless P = NP [22].

Tensors have gained increasing attention for modeling hypergraphs, primarily in the mathematics community. For instance, Hu et al. [23] extended the fundamental and well-known theorem in spectral graph theory, relating the multiplicity of the zero eigenvalue of the Laplacian of a graph to the number of connected components, to uniform hypergraphs. Specifically, they proved that the algebraic multiplicity of the zero eigenvalue of a symmetric Laplacian tensor is equal to the sum of the number of even-bipartite connected components and the number of connected components excluding singletons in the given hypergraph. Such insights cannot be revealed by the clique reduction method and its variants [15]. In the machine learning community, the tensor representation of hypergraphs has not gained much attention, except for a few works [24, 25]. In this work, we utilize the tensor representation of hypergraphs for detecting densely connected components by extending the notion of ratio-cut and normalized-cut from graphs to hypergraphs [26]. We propose the novel "hyperedge score", which captures the structural variation of multiple nodes in a hyperedge [27, 28]. The key contributions and outline of this work are presented in the following subsection.

1.1 Our contributions

We make the following contributions in this work:

  • We propose the ratio-cut and normalized cut for k-uniform hypergraphs. Further, we prove that the solution to the relaxed ratio-cut or normalized-cut minimization problem can be obtained from the eigenvector corresponding to the minimum positive eigenvalue of the unnormalized or normalized Laplacian tensor, respectively.

  • We propose a novel metric termed as “hyperedge score”, which is defined over each existing hyperedge and is a function of the eigenvector corresponding to minimum positive eigenvalue. This hyperedge score metric is used by our partitioning algorithm to remove the hyperedge directly without performing any reduction on hypergraphs [15].

  • We also derive a tighter upper bound on the minimum positive eigenvalue of the normalized Laplacian tensor in terms of hypergraph conductance for even order hypergraphs.

  • We demonstrate the efficacy of the proposed algorithm on synthetic hypergraphs (k = 2 and k = 4) generated by stochastic block model (SBM).

  • We compare the performance on synthetic graphs (2-uniform hypergraphs) generated by SBM. We also report an n/8-fold improvement of the ratio-cut over conventional spectral partitioning for the cockroach graph, where n is the number of nodes.

1.2 Outline

The preliminaries of hypergraph notation and tensor representation are covered in Section 2. The proposed hypergraph partitioning algorithm is presented in Section 3. The functioning and efficacy of the proposed algorithm are demonstrated in Section 4 by experiments on synthetic graphs and hypergraphs. The main manuscript ends with concluding remarks in Section 5. The numerical details of the illustrative examples are presented in the S1 File after the references.

2 Preliminaries

In this section, we briefly discuss the prevalent approach to representing hypergraphs and their partitioning. A hypergraph G is defined as a pair G = (V, E), where V = {v1, v2, …, vn} is the set of entities, called vertices or nodes, and E = {e1, e2, …, em} is a set of non-empty subsets of V referred to as hyperedges.

The strength of interaction among nodes in the same hyperedge is quantified by a positive weight; the weights are collected in $w_e = \{w_{e_1}, w_{e_2}, \ldots, w_{e_m}\}$.

The vertex–edge incidence matrix is denoted by H and has dimension |V| × |E|. The entry h(i, j) is defined to be 1 if $v_i \in e_j$ and 0 otherwise.

The degree of node $v_i$ is defined by $d_{v_i} = \sum_{e_j \in E} w_{e_j} h(i, j)$. We can also define two diagonal matrices, W and D, of dimensions m × m and n × n, containing the hyperedge weights and node degrees, respectively. Note that there is no loss of information in this form of representation of hypergraphs up to this point: a unique hypergraph can be constructed from a given incidence matrix.
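For concreteness, the following minimal NumPy sketch constructs H, W, and D from a hyperedge list; the 5-node, 3-hyperedge hypergraph and its weights are illustrative assumptions, not an example from the paper.

```python
import numpy as np

# Hypothetical 3-uniform hypergraph: 5 nodes, 3 weighted hyperedges.
edges = [(0, 1, 2), (1, 2, 3), (2, 3, 4)]
weights = np.array([1.0, 2.0, 1.0])
n, m = 5, len(edges)

H = np.zeros((n, m))
for j, e in enumerate(edges):
    H[list(e), j] = 1.0           # h(i, j) = 1 iff v_i is in e_j

W = np.diag(weights)              # m x m diagonal matrix of hyperedge weights
D = np.diag(H @ weights)          # n x n diagonal matrix of node degrees d_{v_i}
```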

2.1 Reducing a hypergraph to a graph

Now, we discuss the widely accepted approach to hypergraph reduction in the machine learning community. The fundamental idea is to reduce a hypergraph to a graph and subsequently apply standard graph-based algorithms. In this subsection, we briefly discuss the merits and demerits of these approaches and articulate the reasons for choosing the tensor-based representation of hypergraphs.

Definition 1. The clique expansion of a hypergraph G(V, E) builds a graph $G_x(V, E_x \subseteq V \times V)$ by replacing each hyperedge with the corresponding clique, $E_x = \{(v_i, v_j) : v_i, v_j \in e_l,\ e_l \in E\}$ [15]. The edge weight $w_x(u, v)$ is given by $w_x(u, v) = \sum_{u, v \in e_l,\ e_l \in E} w(e_l)$.

The same could be stated in matrix form as

$$A = HWH^T - D \tag{1}$$

where A represents the adjacency matrix of the reduced hypergraph. Another traditional hypergraph reduction approach is star expansion [29]. Most of the other reduction approaches are built on these; please see [15] and the references therein for more details.
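Continuing the sketch above, the clique expansion of Eq (1) is then a single matrix expression:

```python
# Clique expansion (Eq (1)): adjacency matrix of the reduced graph.
# Distinct (H, W) pairs can collapse to the same A, which is where
# information about the original hypergraph is lost.
A = H @ W @ H.T - D
```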

This reduction step is very convenient as we can now employ any graph algorithms that scale well and come with theoretical guarantees. A natural question arises on the need for these different reduction based approaches. We believe that each of these reduction approaches preserves a few but not all hypergraph properties in the reduction step. The preserved hypergraph property may be useful for the end task of learning on hypergraphs. For example, clustering results can be improved on hypergraphs by preserving node degrees during reduction [30].

More often than not, the reduction step loses vital information about the hypergraph, since two different hypergraphs can reduce to the same graph. This can be seen directly from Eq (1): two distinct hypergraphs having different H and W can reduce to the same adjacency matrix A. An illustrative example is presented in S1 File.

2.2 Tensor representation of hypergraphs

In this subsection, we briefly review the tensor-based representation of hypergraphs [31, 32]. A natural representation of a k-uniform hypergraph is a k-order n-dimensional tensor A, which consists of $n^k$ entries and is defined by:

$$a_{i_1 i_2 \ldots i_k} = \begin{cases} \dfrac{w_{e_j}}{(k-1)!} & \text{if } \{i_1, i_2, \ldots, i_k\} = e_j \in E,\ \ 1 \le i_1, \ldots, i_k \le n \\ 0 & \text{otherwise} \end{cases} \tag{2}$$

It should be noted that A is a "super-symmetric" tensor, i.e., $a_{i_1 i_2 \ldots i_k} = a_{\sigma(i_1 i_2 \ldots i_k)}$, where $\sigma(i_1, i_2, \ldots, i_k)$ denotes any permutation of the elements in the set $\{i_1, i_2, \ldots, i_k\}$. The order or mode of the tensor refers to the hyperedge cardinality, which is k for A. The degrees of all the vertices can be represented by a k-order n-dimensional diagonal tensor D. The Laplacian tensor L is defined as follows:

$$L = D - A \tag{3}$$

An example demonstrating the tensor representation of a 4-uniform hypergraph is presented in S1 File. The normalized Laplacian tensor, denoted by $\mathcal{L}$, can be defined in a similar manner:

$$\ell_{i_1 i_2 \ldots i_k} = \begin{cases} -\dfrac{w_{e_j}}{(k-1)!} \displaystyle\prod_{j=1}^{k} \dfrac{1}{\sqrt[k]{d_{i_j}}} & \text{if } \{i_1, i_2, \ldots, i_k\} \in E \\ 1 & \text{if } i_1 = i_2 = \cdots = i_k = i,\ \ i \in \{1, 2, \ldots, n\} \\ 0 & \text{otherwise} \end{cases} \tag{4}$$

For the sake of completeness, we define the tensor eigenvalue decomposition as:

$$L x^{k-1} = \lambda x, \quad \text{such that } x^T x = 1 \tag{5}$$

where $(\lambda, x) \in (\mathbb{R}, \mathbb{R}^n \setminus \{0\}^n)$ is called the Z-eigenpair and $L x^{k-1} \in \mathbb{R}^n$, whose ith component is:

$$[L x^{k-1}]_i = \sum_{i_k=1}^{n} \cdots \sum_{i_3=1}^{n} \sum_{i_2=1}^{n} l_{i i_2 i_3 \ldots i_k}\, x_{i_2} x_{i_3} \cdots x_{i_k} \tag{6}$$

The Laplacian form of a hypergraph, $L x^k$, can be computed from the above via $L x^k = (L x^{k-1})^T x$. This is a kth-order polynomial in n variables, which can be simplified as stated in the following theorem.

Theorem 2. The expression for the tensor Laplacian form of a hypergraph can be simplified as

$$L x^k = \sum_{i_1, i_2, \ldots, i_k = 1}^{n} l_{i_1 i_2 \ldots i_k}\, x_{i_1} x_{i_2} \cdots x_{i_k} = \sum_{e_j \in E} w_{e_j} \left( \sum_{i_t \in e_j} x_{i_t}^k - k \prod_{i_t \in e_j} x_{i_t} \right) = \sum_{e_j \in E} w_{e_j} k \left( \mathrm{AM}\big(x_{i_t}^k\big)_{i_t \in e_j} - \mathrm{GM}\big(|x_{i_t}|^k\big)_{i_t \in e_j} (-1)^{n_{s,j}} \right) \tag{7}$$

where $n_{s,j} = |\{i_t : x_{i_t} < 0,\ i_t \in e_j\}|$, and AM and GM stand for the arithmetic and geometric means, respectively.

Proof.

$$\begin{aligned} L x^k &= \sum_{i_k=1}^{n} \cdots \sum_{i_2=1}^{n} \sum_{i_1=1}^{n} \big( d_{i_1 i_2 \ldots i_k} - a_{i_1 i_2 \ldots i_k} \big)\, x_{i_1} x_{i_2} \cdots x_{i_k} \\ &= \sum_{i=1}^{n} d(v_i)\, x_i^k - \sum_{i_k=1}^{n} \cdots \sum_{i_2=1}^{n} \sum_{i_1=1}^{n} a_{i_1 i_2 \ldots i_k}\, x_{i_1} x_{i_2} \cdots x_{i_k} \\ &= \sum_{i=1}^{n} \left( \sum_{(i_2, i_3, \ldots, i_k) \in e_j} \frac{w_{e_j}}{(k-1)!}\, x_i^k \right) - \sum_{(i_1, i_2, \ldots, i_k) \in e_j} \frac{w_{e_j}}{(k-1)!}\, x_{i_1} x_{i_2} \cdots x_{i_k} \end{aligned} \tag{8}$$

As the first and second terms have (k − 1)! and k! permutations, respectively:

$$\begin{aligned} L x^k &= \sum_{e_j \in E} w_{e_j} \left( \sum_{i_t \in e_j} \frac{(k-1)!}{(k-1)!}\, x_{i_t}^k - \frac{k!}{(k-1)!}\, x_{i_1} x_{i_2} \cdots x_{i_k} \right) = \sum_{e_j \in E} w_{e_j} \left( \sum_{i_t \in e_j} x_{i_t}^k - k \prod_{i_t \in e_j} x_{i_t} \right) \\ &= \sum_{e_j \in E} w_{e_j} \left( k\, \frac{\sum_{i_t \in e_j} x_{i_t}^k}{k} - k \left( \prod_{i_t \in e_j} |x_{i_t}|^k \right)^{1/k} (-1)^{n_{s,j}} \right) = \sum_{e_j \in E} w_{e_j} k \left( \mathrm{AM}\big(x_{i_t}^k\big)_{i_t \in e_j} - \mathrm{GM}\big(|x_{i_t}|^k\big)_{i_t \in e_j} (-1)^{n_{s,j}} \right) \end{aligned} \tag{9}$$

The above polynomial expression can be viewed as a generalization of the graph case: for any edge {a, b}, the objective function is $(x_a - x_b)^2 = x_a^2 + x_b^2 - 2 x_a x_b$.
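The simplification in Theorem 2 is also what makes the tensor form computationally usable: $L x^k$ can be evaluated from the edge list in O(mk) time instead of the $O(n^k)$ cost of the dense tensor. A minimal sketch, assuming the NumPy edge-list representation used earlier (the function name is ours):

```python
import numpy as np

def laplacian_form(edges, weights, x):
    """Evaluate L x^k via Eq (7):
    sum_j w_j * (sum_{i in e_j} x_i^k - k * prod_{i in e_j} x_i)."""
    k = len(edges[0])                 # hyperedge cardinality of a k-uniform hypergraph
    total = 0.0
    for e, w in zip(edges, weights):
        xs = x[list(e)]
        total += w * (np.sum(xs ** k) - k * np.prod(xs))
    return total
```

For k = 2 each summand reduces to $w_{ab}(x_a - x_b)^2$, recovering the familiar graph quadratic form.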

Theorem 3. The expression for the normalized tensor Laplacian form of a hypergraph is

$$\mathcal{L} x^k = \sum_{e_j \in E} w_{e_j} \left( \sum_{i_t \in e_j} \frac{x_{i_t}^k}{d_{i_t}} - k \prod_{i_t \in e_j} \frac{x_{i_t}}{\sqrt[k]{d_{i_t}}} \right) = \sum_{e_j \in E} w_{e_j} k \left( \mathrm{AM}\!\left(\frac{x_{i_t}^k}{d_{i_t}}\right)_{i_t \in e_j} - \mathrm{GM}\!\left(\frac{|x_{i_t}|^k}{d_{i_t}}\right)_{i_t \in e_j} (-1)^{n_{s,j}} \right)$$

where ns,j=|{it:xit<0,itej}|.

Proof. Similar to the proof of Theorem 2.

This theorem will be used for proving other results in later sections. With the basics covered, we focus on the main problem of hypergraph partitioning in the next section.

3 Partitioning of hypergraphs

We start this section with a brief review of spectral graph theory for the partitioning of graphs [26] and then extend these ideas to hypergraphs.

3.1 Partitioning of graphs

Let the p parts of a partition of vertex set V be denoted by sets C1, C2, …, Cp such that

$$C_i \neq \emptyset,\ \ C_i \subseteq V,\ \ \bigcup_{i=1}^{p} C_i = V,\ \ C_i \cap C_j = \emptyset\ \ \forall i, j \in [p],\ i \neq j \tag{10}$$

The two most commonly used objective functions for graph partitioning are the ratio cut [33] and the normalized cut [34]:

$$\mathrm{RatioCut}(C_1, C_2, \ldots, C_p) = \sum_{i=1}^{p} \frac{\mathrm{cut}(C_i, \bar{C_i})}{2|C_i|}, \quad \text{where } \mathrm{cut}(C_i, \bar{C_i}) = \sum_{r \in C_i,\, s \in \bar{C_i}} w_{rs} \tag{11}$$
$$\mathrm{NormalizedCut}(C_1, C_2, \ldots, C_p) = \sum_{i=1}^{p} \frac{\mathrm{cut}(C_i, \bar{C_i})}{2\,\mathrm{vol}(C_i)}, \quad \text{where } \mathrm{vol}(C_i) = \sum_{r \in C_i} d_r \tag{12}$$

where $w_{rs}$ denotes the weight of the edge between nodes r and s, and $d_r$ denotes the degree of the rth node. It is well known that the solution to the relaxed version of minimizing the ratio cut and normalized cut can be obtained from the Fiedler vector of the unnormalized and normalized Laplacians, respectively.

The approximation made in the relaxation step has been theoretically analyzed in [35–37].

3.2 Ratio-cut and normalized-cut for hypergraphs

We start the discussion with a formal description of the problem. Let C1, C2, …, Cp be the p parts of a partition as defined in Eq (10). For a given hypergraph G(V, E, We), we intend to remove a subset of hyperedges $\partial E \subseteq E$ such that G \ ∂E produces a partition with at least p disjoint parts [38, 39]. The hyperedge boundary ∂E can be defined as:

$$\partial E = \{ e_j \in E : e_j \cap C_i \neq \emptyset,\ e_j \cap \bar{C_i} \neq \emptyset \} \tag{13}$$

for some i ∈ [p]. It denotes the set of hyperedges that "cross" the parts of the partition. The next step is to define the objective function to be minimized for obtaining optimal partitions. The measures described in Eqs (11) and (12) are defined for graphs and hence are not well-suited for hypergraphs, as discussed in Example 1 shortly. We propose the following generalization of the ratio-cut and normalized-cut for hypergraphs.

Definition 4. The cut cost for the ith part Ci is denoted by wh(Ci) and the total cut cost denoted by wh,t(V) for all the p parts of a partition is defined as:

$$w_h(C_i) = \sum_{e_j \in \partial E} |C_i \cap e_j|\, w_{e_j}, \qquad w_{h,t}(V) = \frac{1}{k} \sum_{i=1}^{p} w_h(C_i) \tag{14}$$

The cut cost for a part and the total cut cost defined in Eq (14) reduce to the numerator terms in Eqs (11) and (12) for k = 2, because the term $|C_i \cap e_j|$ reduces to unity $\forall e_j \in \partial E$ in graphs. We further demonstrate the merits of this cut cost with the following example.

Example 1. Consider the 3-uniform hypergraph shown in Fig 1. Consider the partitions obtained after removing hyperedges e2 and e3. Let C1 = {v1, v2, v3}, C2 = {v4}, C3 = {v5}. The partition cost is given by

$$w_h(C_1) = 2 w_{e_2} + w_{e_3}, \quad w_h(C_2) = w_{e_2} + w_{e_3}, \quad w_h(C_3) = w_{e_3}, \quad w_{h,t}(V) = w_{e_2} + w_{e_3}$$

It should be noted that $w_{e_1}$ is not reflected in the above cut costs because hyperedge $e_1$ is not cut. It can easily be verified that this cut cost is not equivalent to that of the clique reduction approach. The cut costs derived for the reduced hypergraph are as follows:

$$w_g(C_1) = 2(w_{e_2} + w_{e_3}), \quad w_g(C_2) = 2(w_{e_2} + w_{e_3}), \quad w_g(C_3) = 2 w_{e_3}, \quad w_{g,t}(V) = 2 w_{e_2} + 3 w_{e_3}$$

Note that the cut costs derived from the two approaches are different. On further inspection, we infer $w_g(C_i) = 2 w_h(C_i)$ for i ∈ {2, 3}, which means the cut costs for parts C2 and C3 in the reduced hypergraph are just scaled versions of the costs in the original hypergraph. The same relation does not hold for part C1, due to the presence of the term $|C_i \cap e_j|$ in Eq (14). Please refer to S1 File for the computation of these cut costs.

Fig 1. Hypergraph: H1.


From this illustrative example, it can be inferred that the proposed cut cost for hypergraphs defined in Eq (14) carries more information about the cut as compared to reduced hypergraphs. The term |Ciej| in Eq (14) will lead to a greater penalty for removing hyperedges with more elements from Ci. A hyperedge with higher |Ciej| is likely to have more association with partition Ci, so the corresponding cut should be penalized more.

Directly minimizing the total cut cost defined in Eq (14) may lead to "unbalanced" partitions with minimum cost. To bypass such trivial and undesirable partitions, we propose the following normalization.

Definition 5. The Ratio-cut and Normalized-cut for p partitions are defined as:

$$\mathrm{Ratio\mbox{-}Cut}(C_1, C_2, \ldots, C_p) = \sum_{i=1}^{p} \frac{w_h(C_i)}{k\,|C_i|^{k/2}} \tag{15}$$
$$\mathrm{N\mbox{-}Cut}(C_1, C_2, \ldots, C_p) = \sum_{i=1}^{p} \frac{w_h(C_i)}{k\,(\mathrm{vol}(C_i))^{k/2}} \tag{16}$$

where $w_h(C_i)$ is defined in Eq (14). The above expressions for the ratio-cut and normalized-cut simplify to Eqs (11) and (12), respectively, for k = 2. Compared to similar objective functions proposed in the literature [13, 19], our objective function uses an exponential normalization factor in the denominator. This helps us bypass partitions with singletons or few nodes, as compared to normalization weaker than the exponential factor (such as linear normalization). Another perspective comes from the motivation behind introducing the |Ci| or vol(Ci) term in the denominator of the ratio-cut and normalized cut for graphs (k = 2); an exponential power of this normalization factor helps to produce more balanced partitions, which is very much required for hypergraphs. For example, consider a hypergraph with a single hyperedge e1 = {v1, v2, v3} on 3 nodes v1, v2, and v3: cutting this one hyperedge produces three singletons, which we consider as three parts. A similar definition of normalized associativity can be seen in the literature [20, 40].
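A sketch of evaluating Eqs (13)–(16) for a candidate partition, assuming the edge-list representation above; `parts` is a list of disjoint node sets and `degrees` maps each node to its degree (names are ours):

```python
def cut_objectives(edges, weights, parts, degrees, k):
    """Ratio-Cut (Eq (15)) and N-Cut (Eq (16)) for a given partition."""
    # Hyperedge boundary dE (Eq (13)): hyperedges crossing some part.
    boundary = [(set(e), w) for e, w in zip(edges, weights)
                if any(0 < len(set(e) & C) < len(e) for C in parts)]
    ratio_cut, n_cut = 0.0, 0.0
    for C in parts:
        w_h = sum(w * len(e & C) for e, w in boundary)   # w_h(C_i), Eq (14)
        vol = sum(degrees[i] for i in C)                 # vol(C_i)
        ratio_cut += w_h / (k * len(C) ** (k / 2))
        n_cut += w_h / (k * vol ** (k / 2))
    return ratio_cut, n_cut
```

For k = 2 and unit weights, the two returned values coincide with Eqs (11) and (12).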

3.3 Hypergraph partitioning algorithm

We wish to find the partition C1, …, Cp which minimizes the ratio-cut or normalized-cut. It should also be noted that p is fixed. For further discussion, we focus on the minimization of ratio-cut, and the same approach can be extended for normalized-cut, as shown later. The optimal partitions can be obtained by solving:

$$(C_1^*, C_2^*, \ldots, C_p^*) = \underset{(C_1, C_2, \ldots, C_p)}{\operatorname{arg\,min}}\ \frac{1}{k} \sum_{i=1}^{p} \frac{w_h(C_i)}{|C_i|^{k/2}} \tag{17}$$

Unfortunately, the above problem is NP-hard [26, 41–43]. Inspired by spectral graph theory, we propose to solve a relaxed version of the above optimization problem.

Theorem 6. The minimization of ratio-cut in Eq (15) can be equivalently expressed as

$$\min \left( \sum_{i=1}^{p} L f_i^k \right) = \min \left( \sum_{i=1}^{p} \sum_{e_j \in \partial E} \frac{|C_i \cap e_j|\, w_{e_j}}{|C_i|^{k/2}} \right), \qquad f_{i,j} = \begin{cases} \frac{1}{\sqrt{|C_j|}} & v_i \in C_j \\ 0 & \text{otherwise} \end{cases} \tag{18}$$

where we define p indicator vectors $f_j$ whose ith element, denoted by $f_{i,j}$, indicates whether vertex $v_i$ belongs to the jth part $C_j$. The solution to the above problem, after relaxing $f_i \in \mathbb{R}^n$ rather than requiring an indicator vector, can be derived from the eigenvector corresponding to the minimum positive eigenvalue in Eq (5).

Proof. Given a partition into p disjoint sets {C1, C2, …, Cp}, define the p indicator vectors $f_j = (f_{1,j}, f_{2,j}, \ldots, f_{n,j})$ as

$$f_{i,j} = \begin{cases} \frac{1}{\sqrt{|C_j|}} & v_i \in C_j \\ 0 & \text{otherwise} \end{cases} \tag{19}$$

where i ∈ [n] and j ∈ [p].

For any part $C_i$, we compute $L f_i^k$:

$$L f_i^k = \sum_{i_k=1}^{n} \cdots \sum_{i_2=1}^{n} \sum_{i_1=1}^{n} l_{i_1 i_2 \ldots i_k}\, f_{i_1,i} f_{i_2,i} \cdots f_{i_k,i} \tag{20}$$

We use Theorem 2 to compute the above term

$$L f_i^k = \sum_{e_j \in E} w_{e_j} \left( \sum_{i_t \in e_j} f_{i_t,i}^k - k \prod_{i_t \in e_j} f_{i_t,i} \right) \tag{21}$$

There can be three cases for each hyperedge:

  1. $e_j \subseteq C_i$: All the nodes in hyperedge $e_j$ are assigned $\frac{1}{|C_i|^{1/2}}$. Both terms ($\sum_{i_t \in e_j} f_{i_t,i}^k$ and $k \prod_{i_t \in e_j} f_{i_t,i}$) equal $\frac{k}{|C_i|^{k/2}}$, so the overall term $L f_i^k$ reduces to 0.

  2. $e_j \subseteq \bar{C_i}$: All the nodes in hyperedge $e_j$ are assigned 0. Both terms are zero, and the overall term is zero.

  3. $e_j \in \partial E$: Some of the nodes are assigned $\frac{1}{\sqrt{|C_i|}}$. The second term ($k \prod_{i_t \in e_j} f_{i_t,i}$) is zero, as the product contains a zero factor, and the first term ($\sum_{i_t \in e_j} f_{i_t,i}^k$) reduces to $\frac{|C_i \cap e_j|}{|C_i|^{k/2}}$.

So the overall term reduce to

$$L f_i^k = \sum_{e_j \in \partial E} \frac{|C_i \cap e_j|\, w_{e_j}}{|C_i|^{k/2}} \tag{22}$$

Summing over the parts, we arrive at

$$\sum_{i=1}^{p} L f_i^k = \sum_{i=1}^{p} \sum_{e_j \in \partial E} \frac{|C_i \cap e_j|\, w_{e_j}}{|C_i|^{k/2}} \tag{23}$$

The RHS of Eq (23) is the same as the ratio cut defined for hypergraphs in Eq (15). It should be noted that $f_i^T f_i = 1$. As the objective function and constraint are the same under relaxation, the solution to the relaxed optimization problem can be derived from the tensor eigenvalue decomposition.

We continue the discussion on partitioning with the following example. Note that a certain ratio-cut approximation is involved while utilizing Theorem 6 for proposing the hypergraph partitioning algorithm using tensor EVD. This approximation is theoretically analyzed in Theorem 8 and Theorem 9.

Example 2. Consider the 3-uniform hypergraph shown in Fig 2. The colored numbers indicate the hyperedge weights. It is clear that the optimal partitions are $A_1 = \{2, 3, 4, 5, 6, 7\}$ and $\bar{A_1}$. The Fiedler eigenvector for this hypergraph is

$$f = \begin{bmatrix} 0.33 & 0.16 & 0.17 & 0.13 & -0.05 & 0.05 & 0.12 & 0.39 & 0.38 & 0.43 & 0.39 & 0.38 \end{bmatrix}^T$$

A standard approach in spectral graph theory is to use the signs of the elements in the Fiedler vector for partitioning [26], for example, C1 = {i | f(i) < 0, i ∈ [n]}. Hence, the partitions are C1 = {5} and C2 = V \ C1, which is clearly not optimal.

Fig 2. Hypergraph: H2.


From the above example, it is clear that the traditional approach to partitioning does not yield the desired partitions for hypergraphs. This is primarily because the eigenvectors of the Laplacian tensor of a hypergraph cannot be interpreted in the same way as the eigenvectors of the Laplacian matrix of a graph.

To understand the implication of minimum ratio-cut associated with minimum positive λ, we analyze the computation of Laplacian objective function using the Fiedler vector:

$$l_{e_j}(f) = w_{e_j} \left( \sum_{i_t \in e_j} f_{i_t}^k - k \prod_{i_t \in e_j} f_{i_t} \right), \qquad \lambda = \sum_{e_j \in E} l_{e_j}(f) \tag{24}$$

where $l_{e_j}(f)$ denotes the "score" of hyperedge $e_j$ computed from the eigenvector f. With a slight abuse of terminology, we argue that a higher value of this score indicates that the corresponding hyperedge is "close" to the separator boundary ∂E. The closeness between two nodes is quantified by the minimum number of hyperedges that must be traversed to reach one node from the other.

This can be validated easily by careful inspection of the hyperedge score $l_{e_j}(f)$ when the vector f is treated as the cluster indicator variable shown in Eq (18). For such an ideal choice of f, the hyperedge score is non-zero only for the hyperedges on the separating boundary; equivalently, the score is zero $\forall e_j \in \{E \setminus \partial E\}$. We carry forward the same intuition and prefer to cut the hyperedges with a "higher" score.

The score may not be exactly zero for any hyperedge when the Fiedler vector is used for the score computation, as it is obtained from the relaxed minimization of the ratio-cut (Theorem 6). Applying this approach to Example 2, we find a maximum score of 0.017 for the hyperedge {1, 2, 3} and hence cut it to obtain the optimal partitions. It should be noted that we obtain the optimal partitions directly, without computing the ratio-cut value n − 1 times and taking the minimum, as the existing sweep-cut-based approaches do [33]. The proposed algorithm is summarized in Algorithm 1.

Algorithm 1: Hypergraph Partitioning Algorithm

Result: Partitions

Construct the tensor Laplacian and derive the Fiedler eigenpair (λ, f).

Calculate the hyperedge score, lej(f) by using Eq (24).

while number of parts < p do

 Remove the hyperedge with the maximum cost (hyperedge score).

end
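A compact sketch of Algorithm 1, under the assumption that the Fiedler eigenpair has already been computed by tensor EVD (Section 3.4); connected components of the remaining hypergraph are found by union-find, and all function names are ours:

```python
import numpy as np

def connected_components(n_nodes, edges):
    """Components of a hypergraph: union-find merging nodes that share a hyperedge."""
    parent = list(range(n_nodes))
    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]   # path halving
            i = parent[i]
        return i
    for e in edges:
        for v in e[1:]:
            parent[find(v)] = find(e[0])
    comps = {}
    for v in range(n_nodes):
        comps.setdefault(find(v), set()).add(v)
    return list(comps.values())

def hyperedge_scores(edges, weights, f):
    """Hyperedge scores l_{e_j}(f) from Eq (24) for a Fiedler vector f."""
    k = len(edges[0])
    return [w * (np.sum(f[list(e)] ** k) - k * np.prod(f[list(e)]))
            for e, w in zip(edges, weights)]

def hypergraph_partition(n_nodes, edges, weights, f, p):
    """Sketch of Algorithm 1: cut the highest-scoring hyperedges until
    the remaining hypergraph has at least p connected components."""
    order = list(np.argsort(hyperedge_scores(edges, weights, f)))  # ascending scores
    kept = set(range(len(edges)))
    while len(connected_components(n_nodes, [edges[i] for i in kept])) < p and order:
        kept.discard(order.pop())        # remove the max-score hyperedge next
    return connected_components(n_nodes, [edges[i] for i in kept])
```

On Example 2, the hyperedge {1, 2, 3} attains the maximum score and is removed first, directly yielding the optimal partitions.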

The intuition behind using the hyperedge score for deriving ∂E is motivated by spectral graph theory. It is interesting to note that this novel use of hyperedge scores helps compute a better ratio-cut for the cockroach graph, as presented in Section 4.

A similar analysis can be performed for the minimization of the normalized cut of hypergraphs.

Corollary 7. The solution to the relaxed optimization problem of minimizing the normalized cut defined in Definition 5 can be derived using the eigenvector corresponding to the minimum positive eigenvalue of the normalized Laplacian tensor defined in Eq (4).

The proof of this corollary is very similar to the proof of Theorem 6. In this case, we choose the indicator variable as

$$f_{i,j} = \begin{cases} \frac{1}{\sqrt{\mathrm{vol}(C_j)}} & v_i \in C_j \\ 0 & \text{otherwise} \end{cases} \tag{25}$$

The next step is to compute $\mathcal{L} x^k$, where the normalized Laplacian tensor $\mathcal{L}$ is defined in Eq (4). The rest of the proof follows that of Theorem 6.

We perform the theoretical analysis of the proposed algorithm and derive an interesting bound on the approximation made in normalized cuts.

Theorem 8. The upper bound on the minimum positive eigenvalue of an even order k-uniform hypergraph is

$$\lambda_1 \leq k\, \phi(G), \qquad \phi(G) = \min_{C \subset V} \frac{\sum_{e_j \in \partial E} w_{e_j}}{\min\{\mathrm{vol}(C), \mathrm{vol}(\bar{C})\}}, \qquad \mathrm{vol}(C) = \sum_{i_j \in C} d_{i_j} \tag{26}$$

where λ1 is the smallest eigenvalue satisfying Eq (5) for normalized tensor Laplacian L and ϕ(G) refers to the conductance of hypergraph.

Proof. Let x be an n × 1 vector with $x_{i_t} \in \left\{ \frac{\sqrt[k]{d_{i_t}}}{\omega},\ -\frac{\sqrt[k]{d_{i_t}}}{\omega} \right\}$, where ω is defined as

$$\omega = \left( \sum_{i_t=1}^{n} d_{i_t}^{2/k} \right)^{1/2}$$

It can easily be verified that $x^T x = 1$. Substitute x into the expression for the normalized hypergraph Laplacian form (Theorem 3, with entries defined in Eq (4)). Please note that the signs of $x_{i_t}$ correspond to an arbitrary cut $(C, \bar{C})$.

$$\lambda_1 \leq \mathcal{L} x^k = \sum_{e_j \in E} w_{e_j} \left( \sum_{i_t \in e_j} \frac{x_{i_t}^k}{d_{i_t}} - k \prod_{i_t \in e_j} \frac{x_{i_t}}{\sqrt[k]{d_{i_t}}} \right) = \sum_{e_j \in E} w_{e_j} \left( \frac{k - n_{s,j}}{\omega^k} + \frac{n_{s,j}(-1)^k}{\omega^k} - k(-1)^{n_{s,j}} \frac{1}{\omega^k} \right) \tag{27}$$

where $n_{s,j} = |\{i_t : x_{i_t} < 0,\ i_t \in e_j\}|$. For even-order hypergraphs, $(-1)^k = 1$, and hyperedges that are not cut ($n_{s,j} = 0$ or $n_{s,j} = k$) contribute zero, so the above can be reduced to

$$\begin{aligned} \lambda_1 &\leq \sum_{e_j \in \partial E} w_{e_j} \left( \frac{k}{\omega^k} - k(-1)^{n_{s,j}} \frac{1}{\omega^k} \right) \leq \sum_{e_j \in \partial E} w_{e_j}\, \frac{2k}{\omega^k} = \frac{2k \sum_{e_j \in \partial E} w_{e_j}}{\left( \sum_{i=1}^{n} d_i^{2/k} \right)^{k/2}} \\ &\leq \frac{2k \sum_{e_j \in \partial E} w_{e_j}}{\left( \left( \sum_{i=1}^{n} d_i \right)^{2/k} \right)^{k/2}} \leq \frac{2k \sum_{e_j \in \partial E} w_{e_j}}{2 \min\big(\mathrm{vol}(C), \mathrm{vol}(\bar{C})\big)} = \frac{k \sum_{e_j \in \partial E} w_{e_j}}{\min\big(\mathrm{vol}(C), \mathrm{vol}(\bar{C})\big)} \quad \text{for any } C \subset V \end{aligned} \tag{28}$$

Since the bound holds for every cut $(C, \bar{C})$, minimizing the right-hand side over C yields $\lambda_1 \leq k\, \phi(G)$.

This theorem helps to analyze the order of approximation involved in relaxing the N-min cut problem by deriving the solution through tensor EVD. The tightness of the bound indicates the goodness of the approximation. Several other attempts have been made to derive such approximation bounds for hypergraphs. For example, Chen et al. [27] utilize a different Laplacian tensor and the following hyperedge score to derive similar bound on λ1 of a different tensor:

$$l_{e_j}(x) = \sum_{i_k \in e_j} (x_{i_k} - \bar{x})^k, \qquad \bar{x} = \frac{1}{k} \sum_{i_k \in e_j} x_{i_k}, \qquad \lambda_1 \leq 2^{k/2}\, \phi(G) \tag{29}$$

That is a weaker bound of exponential nature, whereas we have proposed a tighter bound of linear nature in Theorem 8.

Theorem 9. The upper bound on the minimum positive eigenvalue of the unnormalized Laplacian tensor of an even order k-uniform hypergraph is:

$$\lambda_1 \leq k\, \phi_r(G), \qquad \phi_r(G) = \min_{C \subset V} \frac{\sum_{e_j \in \partial E} w_{e_j}}{\min\{|C|, |\bar{C}|\}} \tag{30}$$

Proof. Let x be an n × 1 vector with $x_{i_j} \in \left\{ \frac{1}{\omega},\ -\frac{1}{\omega} \right\}$, where ω is defined as

$$\omega = |V|^{1/2}$$

Please note that xT x = 1. Proceeding in a similar manner to the proof of Theorem 8:

$$\begin{aligned} \lambda_1 \leq L x^k &= \sum_{e_j \in E} w_{e_j} \left( \sum_{i_t \in e_j} x_{i_t}^k - k \prod_{i_t \in e_j} x_{i_t} \right) = \sum_{e_j \in E} w_{e_j} \left( \frac{k - n_{s,j}}{\omega^k} + \frac{n_{s,j}(-1)^k}{\omega^k} - k(-1)^{n_{s,j}} \frac{1}{\omega^k} \right) \\ &\leq \sum_{e_j \in \partial E} w_{e_j}\, \frac{2k}{\omega^k} = \frac{2k \sum_{e_j \in \partial E} w_{e_j}}{|V|^{k/2}} \leq \frac{2k \sum_{e_j \in \partial E} w_{e_j}}{|V|} \leq \frac{2k \sum_{e_j \in \partial E} w_{e_j}}{2 \min\{|C|, |\bar{C}|\}} = \frac{k \sum_{e_j \in \partial E} w_{e_j}}{\min\{|C|, |\bar{C}|\}} \quad \text{for any } C \subset V \end{aligned}$$

where $n_{s,j} = |\{i_t : x_{i_t} < 0,\ i_t \in e_j\}|$ and $\phi_r(G)$ is defined in Eq (30). Minimizing the right-hand side over all $C \subset V$ yields $\lambda_1 \leq k\, \phi_r(G)$.

It should be noted that $\phi_r(G)$, defined in Eq (30), is a slightly modified version of the conductance $\phi(G)$ defined in Eq (26). Also, note that $\phi_r(G) = d \times \phi(G)$ for a d-regular hypergraph, because vol(C) = d × |C| in this particular case. A d-regular hypergraph is a hypergraph in which each node is constrained to have degree exactly d.

3.4 Computation of tensor eigenvectors

The computation of eigenvectors of real super-symmetric tensors is quite challenging and not as straightforward as in the case of real symmetric matrices. It is actually NP-hard for general tensors and cannot be approximated unless P = NP [22]. This is primarily due to the non-orthogonality of tensor eigenvectors. There are several other challenging aspects; for example, real symmetric tensors can have complex eigenpairs, unlike matrices. Also, a real symmetric matrix of size n × n can have at most n eigenvalues, whereas a tensor can have a much larger number of eigenpairs [31, 44]. Most of the existing works on the computation of eigenpairs address tensors with special structure [45] or extreme eigenvalues, such as the maximum or minimum eigenvalue [46, 47]. As discussed in Section 3.3, only the Fiedler vector is required for partitioning a given hypergraph. As the Fiedler vector is not one of the extreme eigenvectors, the above methods are not helpful in our case.

A recently proposed algorithm for computing all the eigenvalues of a tensor utilizes homotopy methods [48]. It poses the problem as finding the roots of a vector of high-order polynomials generated from $P(y) = L x^{k-1} - \lambda x = 0$, where $y = [x\ \lambda]^T \in \mathbb{R}^{n+1}$. As it is tough to compute the zeros of P(y) directly, the core idea of linear homotopy methods is to construct a vector function H(y, t) = (1 − t)Q(y) + tP(y), where t ∈ [0, 1] and Q(y) is a suitable vector polynomial whose roots can be computed easily. The next step is to slowly iterate from the solution of H(y, t = 0) = Q(y) = 0 to H(y, t = 1) = P(y) = 0. Despite the novel formulation, this approach is forced to compute all the complex eigenpairs even if we are interested in real eigenpairs only.

Before proceeding to the main discussion on the computation of the Fiedler vector, it should be noted that one eigenvector for the minimum eigenvalue can be found analytically by exploiting the particular structure of the Laplacian tensor [32]. In fact, the minimum eigenvalue of the Laplacian tensor is known to be 0, and the corresponding eigenvector is $x = \frac{1}{\sqrt{n}}[1\ 1\ \cdots\ 1]^T$. There can be other eigenvectors for the zero eigenvalue, whose graphical implications are discussed in the literature [23]. For our problem, we may use the approach of Cui et al. [49], which computes the real eigenvalues sequentially from maximum to minimum using Jacobian semidefinite relaxations in polynomial optimization; this allows us to stop early rather than computing all eigenpairs.

They formulate the following problem to compute λi+1 assuming λi is known:

$$\max\ f(x) = L x^k \quad \text{such that} \quad f(x) \leq \lambda_i - \delta \ \ \text{and} \ \ h_r(x) = 0\ \ (r = 1, \ldots, 2n-2) \tag{31}$$

where 0 < δ < λi − λi+1 and hr(x) is defined as:

$$h_r(x) = \sum_{i+j=r+2} \frac{\partial f(x)}{\partial x_i} \frac{\partial g(x)}{\partial x_j} - \frac{\partial f(x)}{\partial x_j} \frac{\partial g(x)}{\partial x_i} \tag{32}$$

where $g(x) = x_1^2 + x_2^2 + \cdots + x_n^2 - 1$ is the normalization constraint. They further utilize Lasserre's hierarchy of semidefinite relaxations [50] to solve the above problem.

The computation of the objective function f(x) and the constraints $h_r(x)$ is expensive and takes $O(n^k)$ time for general tensors. Using Theorem 2, the objective function can be computed in linear time, O(m), for Laplacian tensors. The constraints can also be simplified using:

$$\frac{\partial f(x)}{\partial x_i} = \sum_{e_p \in E_i} k\, w_{e_p} \left( x_i^{k-1} - \prod_{t \in e_p \setminus \{i\}} x_t \right) \tag{33}$$

where $E_i = \{e_q : i \in e_q,\ e_q \in E\}$. This approach is very helpful, as not all eigenvalues need to be computed to obtain the Fiedler eigenvalue. Hence, these closed-form expressions for the Laplacian tensor can be utilized to reduce the number of function evaluations in optimization methods as compared to general tensors.
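A sketch of Eq (33), evaluating the gradient directly from the edge list; this costs $O(mk^2)$ overall, again avoiding the $O(n^k)$ cost of a general k-order tensor (the function name is ours):

```python
def grad_f(edges, weights, x):
    """Gradient of f(x) = L x^k via the closed form in Eq (33)."""
    k = len(edges[0])
    g = np.zeros_like(x, dtype=float)
    for e, w in zip(edges, weights):
        for i in e:
            rest = np.prod([x[t] for t in e if t != i])   # product over e_p \ {i}
            g[i] += k * w * (x[i] ** (k - 1) - rest)
    return g
```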

3.5 Related works

As stated earlier, most of the existing methods utilize hypergraph reduction, either implicit [13, 15] or explicit, coarsening [10], or scalable heuristic methods [19]. For example, Ghoshdastidar et al. [51] utilize the tensor-based representation of hypergraphs but construct a matrix by concatenating the slices of the tensor; they then apply the standard spectral partitioning algorithm to the covariance of that matrix. These variants of hypergraph reduction differ in the method of expanding a hyperedge and produce graphs with different edge weights. The Laplacian objective function of any graph is a second-order polynomial, which captures the weighted interaction between two nodes. A second-order polynomial is insufficient for capturing super-dyadic interaction among multiple nodes (≥3) of a hyperedge. Also, note that multiple hypergraphs may reduce to the same graph.

Hein et al. [52] discuss the inability of reduction methods to preserve hyperedge cuts for general hypergraphs. We utilize the Laplacian tensor (Eq (7)) to penalize these multiple cuts differently. A few other recent works try to capture these multiple ways of splitting the nodes of a hyperedge. For example, Li et al. [53] propose a non-uniform clique expansion and provide a quadratic approximation under submodularity constraints on the inhomogeneous cost function. Li et al. [28] extend the notion of the p-Laplacian from graphs to hypergraphs by introducing the following hyperedge score:

$$l_{e_j}(x) = \max_{i_k,\, i_{k'} \in e_j} |x_{i_k} - x_{i_{k'}}|^p \tag{34}$$

Ideally, any definition of the hyperedge score should capture the non-uniformity among the nodes in a hyperedge, but the above equation fails to capture this variation perfectly. For example, consider two hyperedges of cardinality 4 with node values {0, 1, 1, 2} and {0, 1, 2, 2}. Eq (34) computes the maximum difference and hence does not differentiate between these two hyperedges, whereas the AM–GM difference (Eq (24)) captures the variation among all the nodes of the hyperedge. Various other formulations of the hyperedge score function have been considered in the literature [54–56].
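The difference is easy to verify numerically; the sketch below scores the two hypothetical hyperedges above with the max-difference rule of Eq (34) (p = 1) and with the AM–GM gap of Eq (24) (unit weight):

```python
import numpy as np

x1 = np.array([0.0, 1.0, 1.0, 2.0])   # node values for the first hyperedge
x2 = np.array([0.0, 1.0, 2.0, 2.0])   # node values for the second hyperedge
k = 4

def max_diff_score(x, p=1):            # Eq (34)-style score
    return np.max(np.abs(x[:, None] - x[None, :])) ** p

def am_gm_score(x):                    # score of Eq (24) with unit weight
    return np.sum(x ** k) - k * np.prod(x)

print(max_diff_score(x1), max_diff_score(x2))  # 2.0 and 2.0: indistinguishable
print(am_gm_score(x1), am_gm_score(x2))        # 18.0 and 33.0: distinguishes them
```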

4 Experiments

We compare the proposed algorithm and sign-based Fiedler vector partitioning on cockroach graphs. Further, the proposed algorithm is examined on synthetic graphs and hypergraphs generated by the Erdős–Rényi model [57] and the stochastic block model (SBM) [58]. The numerical details of the Fiedler vectors and hyperedge scores are presented in S1 File.

4.1 Proposed algorithm vs sign-based partitioning on cockroach graph

Consider the cockroach graph with 4t nodes, as shown in Fig 3. Von Luxburg [26] shows that conventional sign-based Fiedler vector partitioning does not produce the optimal ratio-cut for the cockroach graph. In this example, we show that the proposed algorithm performs better. To compare the proposed method and sign-based Fiedler vector partitioning, we compute the ratio-cut values produced by both algorithms.

Fig 3. Cockroach graph.


The analysis shown in this example is valid for general t, but we present the numerical details for t = 3. The Fiedler vector for this graph (t = 3) is given by:

$$f = \begin{bmatrix} -0.49 & -0.41 & -0.26 & -0.07 & -0.02 & -0.01 & 0.49 & 0.41 & 0.26 & 0.07 & 0.02 & 0.01 \end{bmatrix}^T \tag{35}$$

So the partitions defined based on the signs of the elements in f are $A_1 = \{v_1, v_2, v_3, v_4, v_5, v_6\}$ and $A_2 = \bar{A_1}$. So, $\mathrm{RatioCut}(A_1, \bar{A_1}) = \frac{3}{6} + \frac{3}{6} = 1$.

The next step is to apply the proposed algorithm. The edge scores computed by the proposed algorithm are presented in Table 1. The edges {v3, v4} and {v9, v10} are removed, as they have the maximum edge score of 0.0371. So the partitions are $B_1 = \{v_1, v_2, v_3, v_7, v_8, v_9\}$ and $B_2 = \bar{B_1}$. Therefore, $\mathrm{RatioCut}(B_1, \bar{B_1}) = \frac{2}{6} + \frac{2}{6} \approx 0.66$. It can clearly be observed that the partitions obtained from the proposed algorithm have a lower ratio-cut value than those from the existing method.

Table 1. Edge scores for the cockroach graph (t = 3) of Section 4.1.

Edge Score
{v3, v4} 0.0371
{v9, v10} 0.0371
{v4, v10} 0.0228
{v2, v3} 0.0228
{v8, v9} 0.0222
{v1, v2} 0.0222
{v7, v8} 0.0066
{v4, v5} 0.0066
{v10, v11} 0.0029
{v5, v11} 0.0029
{v6, v12} 0.0002
{v11, v12} 0.0002
{v5, v6} 0.0002

In general, traditional spectral partitioning makes the red cut shown in the graph, giving the partition $A_1 = \{v_1, \ldots, v_{2t}\}$ with $\mathrm{RatioCut}(A_1, \bar{A_1}) = \frac{t}{2t} + \frac{t}{2t} = 1$. We utilize the edge scores as suggested in the proposed algorithm and find that the edges $\{v_t, v_{t+1}\}$ and $\{v_{3t}, v_{3t+1}\}$ have the maximum scores. On cutting these edges, the obtained partition is $B_1 = \{v_1, v_2, \ldots, v_t, v_{2t+1}, \ldots, v_{3t}\}$ and hence $\mathrm{RatioCut}(B_1, \bar{B_1}) = \frac{2}{2t} + \frac{2}{2t} = \frac{2}{t}$. Therefore, the solution obtained by the proposed algorithm is t/2 times better than the traditional approach. We have verified this numerically for t = {3, 4, …, 50}.

4.2 Proposed algorithm vs sign-based partitioning on synthetic graphs & hypergraphs

In this example, we consider different types of synthetic graphs and compare the ratio-cut values computed by the existing and proposed methods. We define the following metric, termed percentage improvement (PI), to showcase the proposed algorithm's performance:

$$PI = \frac{R_f - R_p}{R_f} \times 100 \tag{36}$$

where $R_f$ and $R_p$ denote the ratio-cut values from sign-based Fiedler partitioning and the proposed algorithm, respectively. A positive value of PI indicates that the proposed algorithm has produced a better ratio-cut value, and the magnitude represents the extent of the improvement.

4.2.1 Proposed algorithm vs sign-based partitioning on graphs generated by ER model

We begin with a study on random graphs generated from the Erdős–Rényi model [57], denoted by G(n, p), where n is the number of nodes and p is the probability of an edge between any two nodes. We compare the ratio-cut values for 2 partitions on 100 different graphs with n = 100 for each value of p ∈ {0.2, 0.4, 0.6}. Fig 4 shows the results as histograms. It can be seen that the proposed algorithm performs better than sign-based Fiedler partitioning in all cases.

Fig 4. Histogram plot for percentage improvement on ratio-cut value by the proposed method for graphs generated by the ER model for different values of p.


It shows that the proposed algorithm performs better for all the generated graphs.

4.2.2 Proposed algorithm vs sign-based partitioning on graphs generated by SBM

We perform a similar analysis for another graph generation model, referred to as the stochastic block model (SBM). This model provides the freedom to control the number of parts, the number of nodes in each part (denoted by n1, n2), the probability of an edge within a part (denoted by p), and across parts (denoted by q). Note that p = q yields the ER model with n = n1 + n2, as discussed previously.

We consider the graphs for multiple combinations of probabilities p, q and 2 partitions with n1 = n2 = 50. It should be noted that we consider the SBM with assortative community structure, which implies p > q. We generate 100 random graphs for each of these settings and compare the ratio-cut values. A histogram plot summarizing the results is presented in Fig 5.
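A sketch of the two-block SBM generator described above; the (p, q) values in the usage line are illustrative assumptions, not the exact grid used for Fig 5:

```python
import numpy as np

def sbm_graph(n1, n2, p, q, seed=0):
    """Two-block SBM: edge probability p within a block, q across blocks
    (assortative community structure when p > q)."""
    rng = np.random.default_rng(seed)
    n = n1 + n2
    block = [0] * n1 + [1] * n2
    edges, weights = [], []
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < (p if block[i] == block[j] else q):
                edges.append((i, j))
                weights.append(1.0)
    return edges, weights

edges, weights = sbm_graph(50, 50, p=0.6, q=0.2)   # illustrative p, q values
```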

Fig 5. Histogram plot for percentage improvement on ratio-cut value by the proposed method for graphs generated by the SBM for different values of intra-cluster probability (p) and inter-cluster probability (q).


It confirms that the proposed algorithm performs better for most of the generated graphs as there are very few cases of negative PI.

It is evident from Fig 5 that the proposed algorithm produces a lower ratio-cut value for most of the graphs generated by SBM.

4.2.3 Proposed algorithm vs sign-based partitioning on hypergraphs generated by SBM

We perform a similar analysis on synthetic hypergraphs generated by SBM [51]. We generate 100 random 4-uniform hypergraphs with 2 partitions, 60 nodes, and relatively small values of the intra-cluster probability (p) and inter-cluster probability (q) compared to the case of graphs. This is primarily because the number of possible hyperedges in a 4-uniform hypergraph is $\binom{n}{4}$, which is much larger than the $\binom{n}{2}$ possible edges of a graph.

The proposed algorithm is compared to conventional sign-based partitioning using the Fiedler vector computed from the Laplacian tensor of the hypergraph. It should be noted that the computation of tensor eigenvalues is NP-hard and cannot be approximated unless P = NP [22]. A histogram plot summarizing the results is shown in Fig 6.

Fig 6. Histogram plot for percentage improvement on ratio cut value by the proposed method for hypergraphs generated by the SBM for different values of q.


It shows that the proposed algorithm performs significantly better as compared to sign-based partitioning for all generated hypergraphs.

We perform a similar analysis comparing the proposed algorithm and sign-based partitioning using the Fiedler vector of the normalized Laplacian tensor of the hypergraph. This is done to study the behaviour of the algorithm for the normalized cut defined in Eq (16). The histogram plot is presented in Fig 7.

Fig 7. Histogram plot for percentage improvement by the proposed method for normalized cut value on hypergraphs generated by the SBM for different values of q.


It shows that the proposed algorithm performs significantly better as compared to sign-based partitioning for all generated hypergraphs.

It can be observed that the proposed algorithm improves the ratio-cut value (defined in Eq (15)) and the normalized cut value (defined in Eq (16)) significantly as compared to traditional sign-based partitioning. This is primarily because cutting a few hyperedges does not necessarily produce only two components, unlike in graphs. For example, cutting the only hyperedge of a hypergraph consisting of a single 3-node hyperedge produces 3 disconnected components; there is no way to obtain exactly two connected components, even if only 2 were desired.

Any partitioning algorithm producing many small connected components (such as singletons) is likely to have a higher ratio-cut value. We observe that the sign-based partitioning approach using the Fiedler vector of the Laplacian tensor is more prone to producing many small connected components than the proposed algorithm. Hence, the ratio-cut or normalized cut value from sign-based partitioning is significantly higher.

5 Conclusions & future work

In this work, we propose a hypergraph partitioning algorithm using the tensor eigenvalue framework, which removes hyperedges directly without performing a reduction to a graph, unlike existing methods. This is achieved using the novel "hyperedge score" metric. To this end, we extended the definitions of the ratio-cut and normalized cut from graphs to hypergraphs and showed the equivalence of the relaxed optimization problem to a tensor eigenvalue problem. Further, we derived a tighter upper bound for the approximation of the normalized-cut problem. Future directions for this work include similar analyses for non-uniform and directed hypergraphs.

Supporting information

S1 File

(ZIP)

Data Availability

We have worked with synthetic data and NOT real-world data, as the work is more theory oriented. We have included the code to generate the synthetic data as Supporting information files.

Funding Statement

This work was partially supported by Intel research grant RB/18-19/CSE/002/INTI/BRAV to Balaraman Ravindran. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Aittokallio T, Schwikowski B. Graph-based methods for analysing networks in cell biology. Briefings in Bioinformatics. 2006;7(3):243–255. doi: 10.1093/bib/bbl022
  • 2. Bhatt SN, Leighton FT. A framework for solving VLSI graph layout problems. Journal of Computer and System Sciences. 1984;28(2):300–343. doi: 10.1016/0022-0000(84)90071-0
  • 3. Veksler O. Star shape prior for graph-cut image segmentation. In: European Conference on Computer Vision. Springer; 2008. p. 454–467.
  • 4. Gao J, Buldyrev SV, Stanley HE, Havlin S. Networks formed from interdependent networks. Nature Physics. 2012;8(1):40.
  • 5. Dhillon IS, Guan Y, Kulis B. Kernel k-means: spectral clustering and normalized cuts. In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM; 2004. p. 551–556.
  • 6. Wang M, Fu W, Hao S, Tao D, Wu X. Scalable semi-supervised learning by efficient anchor graph regularization. IEEE Transactions on Knowledge and Data Engineering. 2016;28(7):1864–1877. doi: 10.1109/TKDE.2016.2535367
  • 7. Chung FR, Graham FC. Spectral graph theory. vol. 92. American Mathematical Soc.; 1997.
  • 8. Defferrard M, Bresson X, Vandergheynst P. Convolutional neural networks on graphs with fast localized spectral filtering. Advances in Neural Information Processing Systems. 2016;29.
  • 9. Zhou D, Huang J, Scholkopf B. Beyond pairwise classification and clustering using hypergraphs. In: Proceedings of the Neural Information Processing Systems; 2005.
  • 10. Karypis G, Aggarwal R, Kumar V, Shekhar S. Multilevel hypergraph partitioning: applications in VLSI domain. IEEE Transactions on Very Large Scale Integration (VLSI) Systems. 1999;7(1):69–79. doi: 10.1109/92.748202
  • 11. Agarwal S, Lim J, Zelnik-Manor L, Perona P, Kriegman D, Belongie S. Beyond pairwise clustering. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05). IEEE; 2005. vol. 2, p. 838–845.
  • 12. Huang Y, Liu Q, Metaxas D. Video object segmentation by hypergraph cut. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE; 2009. p. 1738–1745.
  • 13. Zhou D, Huang J, Schölkopf B. Learning with hypergraphs: Clustering, classification, and embedding. In: Advances in Neural Information Processing Systems; 2007. p. 1601–1608.
  • 14. Yadati N, Nimishakavi M, Yadav P, Nitin V, Louis A, Talukdar P. HyperGCN: A new method of training graph convolutional networks on hypergraphs. arXiv preprint arXiv:1809.02589. 2018.
  • 15. Agarwal S, Branson K, Belongie S. Higher order learning with graphs. In: Proceedings of the 23rd International Conference on Machine Learning. ACM; 2006. p. 17–24.
  • 16. Kumar T, Darwin K, Parthasarathy S, Ravindran B. HPRA: Hyperedge prediction using resource allocation. In: 12th ACM Conference on Web Science; 2020. p. 135–143.
  • 17. Li L, Li T. News recommendation via hypergraph learning: encapsulation of user behavior and news content. In: Proceedings of the sixth ACM international conference on Web search and data mining; 2013. p. 305–314.
  • 18. Gao Y, Wang M, Tao D, Ji R, Dai Q. 3-D object retrieval and recognition with hypergraph analysis. IEEE Transactions on Image Processing. 2012;21(9):4290–4303. doi: 10.1109/TIP.2012.2199502
  • 19. Veldt N, Benson AR, Kleinberg J. Minimizing localized ratio cut objectives in hypergraphs. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining; 2020. p. 1708–1718.
  • 20. Ghoshdastidar D, Dukkipati A. A provable generalized tensor spectral method for uniform hypergraph partitioning. In: International Conference on Machine Learning; 2015. p. 400–409.
  • 21. Ihler E, Wagner D, Wagner F. Modeling hypergraphs by graphs with the same mincut properties. Information Processing Letters. 1993;45(4):171–175. doi: 10.1016/0020-0190(93)90115-P
  • 22. Hillar CJ, Lim LH. Most tensor problems are NP-hard. Journal of the ACM (JACM). 2013;60(6):1–39. doi: 10.1145/2512329
  • 23. Hu S, Qi L. The eigenvectors associated with the zero eigenvalues of the Laplacian and signless Laplacian tensors of a uniform hypergraph. Discrete Applied Mathematics. 2014;169:140–151. doi: 10.1016/j.dam.2013.12.024
  • 24. Shashua A, Zass R, Hazan T. Multi-way clustering using super-symmetric non-negative tensor factorization. In: European Conference on Computer Vision. Springer; 2006. p. 595–608.
  • 25. Benson AR. Three hypergraph eigenvector centralities. SIAM Journal on Mathematics of Data Science. 2019;1(2):293–312. doi: 10.1137/18M1203031
  • 26. Von Luxburg U. A tutorial on spectral clustering. Statistics and Computing. 2007;17(4):395–416. doi: 10.1007/s11222-007-9033-z
  • 27. Chen Y, Qi L, Zhang X. The Fiedler Vector of a Laplacian Tensor for Hypergraph Partitioning. SIAM Journal on Scientific Computing. 2017;39(6):A2508–A2537. doi: 10.1137/16M1094828
  • 28. Li P, Milenkovic O. Submodular Hypergraphs: p-Laplacians, Cheeger Inequalities and Spectral Clustering. In: Dy J, Krause A, editors. Proceedings of the 35th International Conference on Machine Learning. vol. 80 of Proceedings of Machine Learning Research. PMLR; 2018. p. 3014–3023.
  • 29. Zien JY, Schlag MD, Chan PK. Multilevel spectral hypergraph partitioning with arbitrary vertex sizes. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 1999;18(9):1389–1399. doi: 10.1109/43.784130
  • 30. Kumar T, Vaidyanathan S, Ananthapadmanabhan H, Parthasarathy S, Ravindran B. Hypergraph Clustering: A Modularity Maximization Approach. arXiv preprint arXiv:1812.10869. 2018.
  • 31. Qi L, Luo Z. Tensor analysis: spectral theory and special tensors. vol. 151. SIAM; 2017.
  • 32. Banerjee A, Char A, Mondal B. Spectra of general hypergraphs. Linear Algebra and its Applications. 2017;518:14–30. doi: 10.1016/j.laa.2016.12.022
  • 33. Hagen L, Kahng AB. New spectral methods for ratio cut partitioning and clustering. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 1992;11(9):1074–1085. doi: 10.1109/43.159993
  • 34. Shi J, Malik J. Normalized cuts and image segmentation. Departmental Papers (CIS). 2000; p. 107.
  • 35. Chung F. Four proofs for the Cheeger inequality and graph partition algorithms. Proceedings of ICCM. 2007;2:378.
  • 36. Bühler T, Hein M. Spectral clustering based on the graph p-Laplacian. In: Proceedings of the 26th Annual International Conference on Machine Learning; 2009. p. 81–88.
  • 37. Lee JR, Gharan SO, Trevisan L. Multiway spectral partitioning and higher-order Cheeger inequalities. Journal of the ACM (JACM). 2014;61(6):1–30. doi: 10.1145/2665063
  • 38. Chandrasekaran K, Xu C, Yu X. Hypergraph k-cut in randomized polynomial time. In: Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms. Society for Industrial and Applied Mathematics; 2018. p. 1426–1438.
  • 39. Chekuri C, Li S. A note on the hardness of approximating the k-way Hypergraph Cut problem. Manuscript, http://chekuri.cs.illinois.edu/papers/hypergraph-kcut.pdf. 2015.
  • 40. Ghoshdastidar D, Dukkipati A. Spectral Clustering Using Multilinear SVD: Analysis, Approximations and Applications. In: AAAI; 2015. p. 2610–2616.
  • 41. Chekuri C, Li S. On the Hardness of Approximating the k-Way Hypergraph Cut Problem. Theory of Computing. 2020;16(1).
  • 42. Chandrasekaran K, Chekuri C. Min-max partitioning of hypergraphs and symmetric submodular functions. In: Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA). SIAM; 2021. p. 1026–1038.
  • 43. Goldschmidt O, Hochbaum DS. A polynomial algorithm for the k-cut problem for fixed k. Mathematics of Operations Research. 1994;19(1):24–37. doi: 10.1287/moor.19.1.24
  • 44. Qi L. Eigenvalues of a real supersymmetric tensor. Journal of Symbolic Computation. 2005;40(6):1302–1324. doi: 10.1016/j.jsc.2005.05.007
  • 45. Qi L, Wang F, Wang Y. Z-eigenvalue methods for a global polynomial optimization problem. Mathematical Programming. 2009;118(2):301–316. doi: 10.1007/s10107-007-0193-6
  • 46. Kolda TG, Mayo JR. Shifted power method for computing tensor eigenpairs. SIAM Journal on Matrix Analysis and Applications. 2011;32(4):1095–1124. doi: 10.1137/100801482
  • 47. Hu S, Huang ZH, Qi L. Finding the extreme Z-eigenvalues of tensors via a sequential semidefinite programming method. Numerical Linear Algebra with Applications. 2013;20(6):972–984. doi: 10.1002/nla.1884
  • 48. Chen L, Han L, Zhou L. Computing tensor eigenvalues via homotopy methods. SIAM Journal on Matrix Analysis and Applications. 2016;37(1):290–319. doi: 10.1137/15M1010725
  • 49. Cui CF, Dai YH, Nie J. All real eigenvalues of symmetric tensors. SIAM Journal on Matrix Analysis and Applications. 2014;35(4):1582–1601. doi: 10.1137/140962292
  • 50. Lasserre JB. Global optimization with polynomials and the problem of moments. SIAM Journal on Optimization. 2001;11(3):796–817. doi: 10.1137/S1052623400366802
  • 51. Ghoshdastidar D, Dukkipati A. Consistency of spectral partitioning of uniform hypergraphs under planted partition model. In: Advances in Neural Information Processing Systems; 2014. p. 397–405.
  • 52. Hein M, Setzer S, Jost L, Rangapuram SS. The total variation on hypergraphs—learning on hypergraphs revisited. In: Advances in Neural Information Processing Systems; 2013. p. 2427–2435.
  • 53. Li P, Milenkovic O. Inhomogeneous hypergraph clustering with applications. In: Advances in Neural Information Processing Systems; 2017. p. 2308–2318.
  • 54. Zhang C, Hu S, Tang ZG, Chan T. Re-revisiting learning on hypergraphs: confidence interval and subgradient method. In: Proceedings of the 34th International Conference on Machine Learning, Volume 70. JMLR.org; 2017. p. 4026–4034.
  • 55. Chan THH, Louis A, Tang ZG, Zhang C. Spectral properties of hypergraph Laplacian and approximation algorithms. Journal of the ACM (JACM). 2018;65(3):1–48. doi: 10.1145/3178123
  • 56. Chan THH, Liang Z. Generalizing the hypergraph Laplacian via a diffusion process with mediators. Theoretical Computer Science. 2020;806:416–428. doi: 10.1016/j.tcs.2019.07.024
  • 57. Erdős P, Rényi A. On the evolution of random graphs. Publ Math Inst Hung Acad Sci. 1960;5(1):17–60.
  • 58. Holland PW, Laskey KB, Leinhardt S. Stochastic blockmodels: First steps. Social Networks. 1983;5(2):109–137. doi: 10.1016/0378-8733(83)90021-7


