Abstract
The p-persistent q-combinatorial Laplacian defined for a pair of simplicial complexes is a generalization of the q-combinatorial Laplacian. Given a filtration, the spectra of persistent combinatorial Laplacians not only recover the persistent Betti numbers of persistent homology but also provide extra multiscale geometrical information of the data. Paired with machine learning algorithms, the persistent Laplacian has many potential applications in data science. Seeking different ways to find the spectrum of an operator is an active research topic, becoming interesting when ideas are originated from multiple fields. In this work, we explore an alternative approach for the spectrum of persistent Laplacians. As the eigenvalues of a persistent Laplacian matrix are the roots of its characteristic polynomial, one may attempt to find the roots of the characteristic polynomial by homotopy continuation, and thus resolving the spectrum of the corresponding persistent Laplacian. We consider a set of simple polytopes and small molecules to prove the principle that algebraic topology, combinatorial graph, and algebraic geometry can be integrated to understand the shape of data.
Keywords: Persistent Laplacian, homotopy continuation, persistent homology, algebraic topology, combinatorial graph, algebraic geometry
1. Introduction.
Recent years witness the burst of advanced mathematical tools for data science [27, 22, 31, 12]. Persistent homology has been proved to be a powerful tool in analyzing the geometry shape and topological persistence of data [8, 10]. It has had much success in machine learning [7, 23, 26] ([7] is one of the first papers that combine persistent homology with machine learning), computational chemistry [31] and biology [29, 36, 25]. The methods of persistent homology start with constructing a filtration of simplicial complexes from the data so that multiscale geometrical information can be extracted thereafter from the computations of a family of topological invariants. Persistent homology is a quite informative method, since it can detect features that are persistent over a wide range of scales, rather than features that depend on a particular choice of parameters. Inspired by the theory of persistent homology, one recent generalization of a classical operator to its persistent version is the so-called persistent combinatorial Laplacian or persistent Laplacian for simplicity [34].
The applications of Laplace operators are ubiquitous in mathematics, physics, and data science. The graph Laplacian has been a fundamental concept in the study of graphs, partly because the spectrum of the graph Laplacian contains the topological information of the graph [11]. As one can think of a graph as a 1-dimensional simplicial complex and define the graph Laplacian as a sum of compositions of boundary operators, one can readily define a more general Laplacian called the combinatorial Laplacian for a high dimensional simplicial complex. The persistent combinatorial Laplacian, defined for a pair of simplicial complexes K ↪ L, is an extension of the aforementioned combinatorial Laplacian. When a filtration is given, it is natural to compute persistent Laplacians, and one may ask how persistent Laplacian and persistent homology are related. It turns out that the nullity of a persistent Laplacian is equal to the corresponding persistent Betti number (see Theorem 2.11). Moreover, nonzero eigenvalues and eigenvectors of a persistent Laplacian contain extra geometrical information of the filtration, and can be used to resolve a challenge about protein flexibility [34], therefore the theory of persistent Laplacian is a powerful tool for data analysis. Persistent Laplacians can be computed by software package HERMES [35].
Although the spectrum of an operator can be numerically computed in different ways, seeking new ways to calculate the spectrum of an operator is an active research topic [2]. In addition to the traditional methods of numerical linear algebra, one may alternatively resolve the spectrum by finding the roots of the characteristic polynomial associated with the operator. Homotopy continuation is an interesting method for solving a single polynomial or systems of polynomial equations. The essential idea is to build a homotopy between the system to be solved (called the target system) and an easier system with known roots (called the start system) and track down the known roots of the start system to the roots of the target system. As systems of polynomial equations arise in mathematics, science, and engineering, homotopy continuation methods have found applications in various areas, such as algebraic geometry [19, 20], robot kinematics [33], optimal control [3], differential equations [1, 17], and biology [14, 16, 28]. Several software packages implement homotopy continuation methods, such as Bertini [4], HomotopyContinuation.jl [6], Hom4PS-3 [9], and PHCpack [32].
In this work, we propose to solve the spectra of persistent Laplacians using homotopy continuation. In Section 2, we present the theory of persistent homology and persistent Laplacian. In section 3, we briefly explain how homotopy continuation works. In Section 4, we give a proof of principle application of our approach to some simple polytopes and small molecules. Our goal is to stimulate further research at the interface of algebraic geometry, algebraic topology, combinatorial graph, and data science. We use HERMES [35] to calculate the matrix representations of persistent Laplacians and then use Bertini [4] to find the roots of their associated characteristic polynomials. We hope this work can exposit the theory of persistent Laplacian and shed some light on the potential application of homotopy continuation methods in data science.
2. Persistent homology and persistent Laplacian.
In this section, we introduce basics of persistent homology and persistent Laplacian. Readers familiar with persistent homology can skip the subsection on simplicial homology and persistent homology. As to subsections on different versions of Laplacians, Proposition 1, Theorem 2.10, Theorem 2.11, and many examples can be skipped on a first read, as they are not directly related to the main contribution of this paper.
2.1. Simplicial homology and persistent homology.
We present here a very short introduction of persistent homology to establish concepts and notations [10].
Definition 2.1 (Simplex).
A q-simplex denoted as σq = [u0,..., uq] is the convex hull of q+1 affinely independent points {u0,..., uq} in . The orientation of σq is determined by the ordering of the vertices {u0,..., uq} and two orderings define the same orientation if and only if they differ by an even permutation. The dimension of σq = [u0,..., uq] is defined as q. For 0 ≤ i ≤ n, is said to be a face of σq, where the hat indicates the omission of the vertex.
Definition 2.2 (Simplicial complex).
A finite set of simplices, K, is a simplicial complex if the following conditions are satisfied: (1) all faces of any simplex in K are also in K; (2) the non-empty intersection of any two simplices in K is a common face of the two simplices. The dimension of a simplicial complex K is defined as the maximal dimension of its simplices.
Another important notion is the abstract simplicial complex.
Definition 2.3 (Abstract simplicial complex).
Suppose we have a finite collection of sets called K. For any set σ in K, if all subsets of σ are also in K, K is said to be an abstract simplicial complex. Subsets of σ are called faces of σ.
The notions of abstract simplicial complex and simplicial complex are closely related. One may build a simplicial complex from an abstract simplicial complex or vice versa [10]. They contain exactly the same combinatorial information. From now on we will not distinguish abstract simplicial complexes from simplicial complexes.
Definition 2.4 (Chain complex).
Suppose we have a simplicial complex K. For any q, the formal sums of all q-simplices with coefficients in a field, for instance , form an abelian group denoted by Cq(K). Such a group is called a chain group and the elements of it are called q-chains. The boundary operator ∂q : Cq(K) → Cq−1(K) is defined as
For simplicity we usually omit the subscript q and only write ∂. The collection of boundary operators and chain groups forms the so called chain complex of K
As ∂2 = 0, the homology group Hq is defined as ker ∂q/im∂q−1. The rank of Hq is called the qth Betti number and denoted by βq.
Definition 2.5 (Cochain complex).
Given a chain complex
the dual spaces Cq(K) of Cq(K) and dual maps d of ∂ form a cochain complex
where is called a coboundary operator. The cohomology group Hq is defined as . As Cq(K) is formally generated by q-simplices, there is a canonical isomorphism between Cq(K) and Cq(K) such that any simplex σ is identified with σ∗.
Definition 2.6 (Filtration).
A filtration is a series of finite simplicial complexes indexed over real numbers such that Xa ⊂ Xb if a ≤ b. When X is a finite set, we say that it is a finite filtration.
A filtration can be constructed in various ways from a set of points in .
Example 2.1 (Rips complex).
The Rips complex is an abstract simplicial complex. Given a set and a radius r, if a subset T of S has the property that the distance between any pair of points in T is at most 2r, then T is include in the Rips complex (see figure 2).
Example 2.2 (Alpha complex).
Now we build an alpha complex from a finite set . First we define the Voronoi cell. The Voronoi cell of a point u in S is
Let Bu(r) be the closed ball with center u and radius r. Denote the intersection Bu(r) ∩ Vu by Ru(r). Then the alpha complex is defined as
In other words, Alpha(r) is the nerve of cover {Ru(r), u ∈ S}. Let r go from 0 to the diameter of S, then we get a filtration of alpha complexes.
Definition 2.7 (Persistent Betti number).
Suppose a finite filtration X = {Xt | t ∈ T} is given. The inclusion Xt ↪ Xt+p induces a homomorphism
on the simplicial homology groups for each dimension q. The qth persistent homology groups are the images of such homomorphisms, and the qth persistent Betti numbers are the ranks of persistent homology groups.
2.2. Graph Laplacian.
Let G(V, E) be a simple graph with V = {vi} the vertex set and E the edge set.
Definition 2.8.
The Laplacian matrix (also called the graph Laplacian) of G is defined as
Example 2.3.
Take a 2-simplex [a, b, c]. Its graph Laplacian is
The Laplacian matrix is a symmetric positive semi-definite matrix. The topology of a graph can be inferred from the spectrum of the graph Laplacian. For instance, the number of connected components of a graph G is equal to the algebraic multiplicity of 0 in its graph Laplacian. The second smallest eigenvalue of a graph Laplacian is called the Fiedler value and can be regarded as a measure of topological connectivity. See [37] for more results concerning the graph Laplacian.
2.3. Combinatorial Laplacian.
Recall there is a canonical isomorphism between Cq(K) and Cq(K). The q-combinatorial Laplacian Δq : Cq(K) → Cq(K) is defined as
(1) |
and . Alternatively, we may equip Cq(K) with an inner product ⟨·, ·⟩ such that for any two simplices σi and σj, ⟨σi, σj⟩ = δij. Then one may regard as the adjoint map with respect to this inner product. Denote the matrix representation of the boundary operator ∂q by , then the matrix representation of Δq is
(2) |
As the combinatorial Laplacian is symmetric and positive semi-definite, its spectrum consists of non-negative real eigenvalues.
Example 2.4.
coincides with Δ0. Take a 2-simplex [a, b, c]. is
Then
i.e., .
Definition 2.9.
Two q-simplices σi, σj are said to be lower adjacent, denoted by , if they share a common (q − 1)-face. They are said to be upper adjacent, denoted by , if they both are faces of a (q + 1)-simplex. The lower degree degL(σ) of a q-simplex σ is q +1, the number of its (q −1)-faces. The upper degree degU(σ) of a q-simplex σ is defined as the number of (q+1)-simplices in K of which σ is a face. The degree of q-simplex σ is defined as
Now suppose with common upper (q+1)-simplex τ. Let’s examine the signs of the coefficients of σi, σj in ∂τ. We say that σi and σj are similarly oriented if the signs are the same; dissimilarly oriented if the signs are different. Next suppose with common lower (q − 1)-simplex η. Let’s examine the signs of the coefficients of η in ∂σi and ∂σj. We say that η is a similar common lower simplex of σi and σj if the signs are the same; a dissimilar common lower simplex if the signs are different.
The following proposition gives an explicit formula for .
Proposition 1.
[13, Thm. 3.3.4] Suppose we have a finite simplicial complex K and its set of q-simplices is {σ1,..., σn}.
When q = 0,
If q > 0, then
The kernel of a combinatorial Laplacian contains topological information. We have
(3) |
Later we will prove a generalized version of the above equality for persistent Laplacians.
2.4. Persistent Laplacian.
Suppose we have two simplicial complexes Kt ⊂ Kt+p from a filtration and boundary operators and . From now on we use to denote Cq(Kt) for simplicity. Bearing in mind the inclusion map from to , we define
i.e., consists of elements whose images under are in . One can define an inner product ⟨·, ·⟩ on such that for any two simplices σi and σj, ⟨σi, σj⟩ = δij. Denote as . As inherits the inner product from , the adjoint map of is well defined. The p-persistent q-combinatorial Laplacian operator [34] is defined as
(4) |
Theorem 2.10.
[The matrix representation of ] [34, 24] Suppose we pick up a basis {v1,··· ,vm} of and the associated inner product matrix is P. Let be the matrix representation of with respect to {v1,··· , vm} and the canonical basis of , the matrix representation of with respect to the canonical bases of and . Then the matrix representation of is
(5) |
Proof.
Denote the matrix representation of the adjoint map by X. It suffices to determine X. Take two vectors , . We abuse the notation a bit and use v, w to denote their coordinates in the form of column vector as well. We have
As v, w are arbitrarily taken, we conclude that . □
Example 2.5.
Consider the ordered point set
From now on we denote these points as 0, 1, 2, 3, and 4 according to the order and we omit brackets and commas in the notation of any simplex. For example, the simplex [(0, 0, 0.001), (3, −4, −0.001)] will be denoted as 01. Using GUDHI [15] we can build the filtration K of alpha complexes up to 9.77:
where (0, 1, 2, 3, 4 → 0.00) means that K0 = {0, 1, 2, 3, 4} and (14, 02, 23, 04, 34 → 6.25) means that K6.25 = K0 ∪ {14, 02, 23, 04, 34} and so on. Let’s first calculate . is equal to
is equal to
Note that . It is easy to see that . So is equal to
and P is equal to ⟨024 + 234, 024 + 234⟩. So is equal to
Example 2.6.
Consider the graph
and the filtration {(1, 2 → 0), (3, 4, 13, 34, 42 → 1)}. Let’s compute . is equal to
After a few steps of Gauss elimination we get
It is clear that , P = 3 and is equal to
Then is
Its spectrum is {0, 2/3}.
As stated earlier, now we prove the theorem that connects the nullity of persistent Laplacian with persistent Betti number.
Theorem 2.11 (Persistent Betti numbers from persistent Laplacians).
[34, 24] Given a filtration K of simplicial complexes, for any positive integer q, we have
(6) |
To prove Theorem 2.11, we need the following lemma.
Lemma 2.12.
[21] Suppose A : U → V and B : V → W are two linear maps where U, V and W are inner product spaces. We have
ker(A∗) = im(A)⊥;
ker(A∗A) = ker A;
- If BA = 0, then
- If BA = 0, then
If BA = 0, then from 4) and 5) we know
Proof.
-
If x ∈ im(A)⊥, then for any y ∈ V,So A∗x = 0. If x ∈ ker(A∗), then for any y ∈ V,
So x ∈ im(A)⊥.
-
⊃: obvious; ⊂: Let v ∈ ker(A∗A), then
implying Av = 0.
⊃: obvious; ⊂: Let v ∈ ker(AA∗ + B∗B). Then AA∗v = −B∗Bv. Apply B to both sides, we get −BB∗Bv = 0. As B∗Bv ∈ ker(B)∩im(B∗) (recall that ker(B∗B) = ker(B)), Bv must be 0. Applying A∗ to both sides and reasoning similarly, one can show that A∗v = 0.
-
Let π : V → im(A)⊥ be the projection and πB = π|ker(B). ker(πB) ⊂ ker(π) = im(A), and since ker(B) ⊃ im(A), we get ker(πB) = im(A). Next we show that imπB = ker(B)∩im(A)⊥. It suffices to show that im πB ⊂ ker (B). This is true since for any v ∈ ker(B),Now
□
Proof of Theorem 2.11.
Since , ker . Also bear in mind that dim . □
3. Method of homotopy continuation.
Our exposition of homotopy continuation follows [5]. A more theoretical treatment of this topic can be found elsewhere [30].
3.1. Path tracking.
Solving a system of polynomial equations f by homotopy continuation basically consists of three steps: 1) build a start system g such that g can be solved easily; 2) build a homotopy between two systems f and g; 3) track the roots of g to the roots of f.
We first look at a simple example. Let us say we wish to solve the following polynomial in one complex variable
We take a similar and simpler polynomial g(z) and deform the roots of g(z) to f(z). For instance we may take g(z) = z3 + 1 and construct a linear homotopy
where s is a complex variable. Though the second parameter of h is a complex variable, we still call h a homotopy between f and g for convenience. Then we parametrize s by a curve
(this is called the gamma trick [5, Section 6.1] and there are technical reasons behind such choice of parametrization). We substitute s(t) in h(z, s) and clear denominators, then obtain a usual homotopy
For each t0 ∈ [0, 1], H(z, t0) is a polynomial. Once we know how to numerically solve H(z, t0 − Δt) = 0 from the known roots of H(z, t0), we can pick a grid of [0, 1] and track the known roots of g step by step all the way to the solutions of f. This process is called the path tracking. Now suppose H(z(t), t) = 0 for any t ∈ (0, 1] with z(1) a root of g. Differentiate H(z(t), t) with respect to t, we have the Davidenko differential equation
If is nonzero, the Davidenko differential equation can be rewritten as
As we know the value of z(t) at t0, we have indeed transformed our original problem of tracking roots to the classical initial value problem of ordinary differential equation (ODE). One may use any ODE method to predict z(t0 −Δt) (The default ODE solver employed by Bertini is RKF45). For instance we can apply the simplest Euler’s method and get
Since we also know that H(z(t0 − Δt),t0 − Δt) should be zero, we can apply several iterations of Newton’s method to update z(t0 −Δt). Such a combination of an ODE predictor with Newton’s method is called a predictor-corrector method.
Now after the path tracking from t = 1 to t = 0, we get a sequence {z(ti)}. If the limit exists and is finite, we think of as a solution of f.
Example 3.1.
The reader may wonder why we do not just use
Consider the example H(z, t) = t(z2−1)+(1−t)(5−z2). When t = 1/2, H(z, 1/2) = 2 has no roots. When t = 5/6, H(z, 5/6) = 2/3z2 has a singular root 0, and the derivative of it at z = 0 is zero.
Example 3.2.
Though usually we are only interested in real roots of the target system, we should also track complex roots. Consider the homotopy
At t = 1, h(z, 1) has two real roots and two imaginary roots . As t goes from 1 to 0, e2πi(1−t) travels around the unit circle in the complex plane counterclockwise. The two real roots will be deformed to the two imaginary ones and vice versa.
We have discussed how to apply homotopy continuation to solve a single polynomial. The procedure for a single polynomial can be generalized to solve a square system. We introduce some concepts first.
Definition 3.1 (Square system).
A system of polynomial equations
with n polynomials and N variables is said to be square if n = N.
Definition 3.2 (Isolated solution).
For any solution z∗ of f, if there is r > 0 such that there is no other solutions but z∗ contained in
we say that z∗ is isolated.
Definition 3.3.
We say that a solution z∗ is nonsingular if
is non-singular. Otherwise z∗ is said to be a singular solution.
Definition 3.4 (Good homotopy).
Suppose we have a square system of polynomial equations f, a start system g, a set of D distinct solutions of g(z) = H(z, 1) = 0, and a system of infinitely differentiable functions
satisfying the following property:
For any t ∈ [0, 1], H(z, t) is a system of polynomial equations;
For any , there exists a smooth map such that pj(1) = ωj;
- For any t∗ ∈ (0, 1]:
- The associated paths of solutions do not cross each other, i.e., there do not exist two integers j, k with 1 ≤ j < k ≤ D such that pj(t∗) = pk(t∗);
- All pj(t∗) are isolated solutions of H(z, t∗) = 0.
If we denote the two-norm of a vector v by ‖v⃦2, the set of finite limits
contains every isolated solution of f(z) = 0.
Then we say that H is a good homotopy.
Now suppose we take a start system
with known roots and a good homotopy
such that for any solution p of g there is a map satisfying
H(z(t),t) = 0 when t ∈ (0,1] and
The Jacobian ∂H/∂z of H with respect to z is non-singular for points (z(t), t) with t ∈ (0, 1].
In this multidimensional case, the Davidenko differential equation is
where
Let
We rewrite the Davidenko equation as
Since is invertible on the path (z(t), t), we get
As in the one-dimensional case, we use predictor-corrector methods to track the roots of g to the roots of f.
Here we remind the reader that the predictor-corrector method may have trouble in dealing with singular solutions of f. If p∗ is a singular solution of f = H(z, 0) and z(t) is its path, the Jacobian of H can be very small when (z(t), t) → (p*, 0). This would make the path tracking very slow and require high precision. A couple of methods called the endgames are invented to deal with singular solutions of f [4].
3.2. Bertini’s theorem.
We have described in previous sections how to solve a square system. How to tackle a non-square system? We need Bertini’s theorem. We introduce some basic definitions regarding algebraic sets first.
Definition 3.5 (Affine complex algebraic set).
Given a system f of n polynomials and N complex variables, an affine complex algebraic set V (f1,..., fn) is the locus of solutions on .
Definition 3.6 (Manifold point).
A point of is said to be a manifold point if there is a neighborhood U ⊂ X such that for some mapping Φ defined on , Φ|U is a bijection from U to a neighborhood of 0 in . The set of regular points in X is refered to as Xreg. The dimension of p∗ is defined as k. It can be shown that in the usual topology of Xreg is dense in X.
Definition 3.7 (Zariski topology).
The Zariski topology is a weaker topology compared to the usual topology of Euclidean space. Given an affine algebraic set X, its open sets are taken to be intersections of X with sets of the form , where Y is an affine algebraic subset of .
Example 3.3.
The Zariski open sets of are the complements of finite sets.
Definition 3.8 (Irreducible components of an algebraic set).
Given an affine algebraic set X, the irreducible components of X are closures of the connected components of Xreg.
Definition 3.9 (Pure dimension).
For any , the dimension of X at p∗, is defined to be the maximum dimension of the irreducible components that contains p∗. The dimension of X is then defined as the maximum dimension of its points, i.e.,
If for all p∗ ∈ X, we say X is pure-dimensional.
Now we state the Bertini’s theorem.
Theorem 3.10.
[5, Thm. 1.15] Suppose we have n polynomials f1(z),..., fn(z) defined on a nonempty Zariski open subset U of where X is an irreducible affine complex algebraic set. We further assume that for any point x ∈ U, not all fj(x) are zero. Under such conditions, there is a nonempty Zariski open subset of such that for any λ = (λ1,..., λn) in , has the following property:
Zλ = V (fλ(z)) is either empty or of pure dimension N − 1; and
.
One variant of Bertini’s theorem is important to us.
Theorem 3.11 (Bertini’s theorem for systems).
[5, Thm. 9.3] Suppose we have a system of polynomial equations , then there is a Zariski open dense subset such that for any matrix A ∈ U, the following is satisfied
V (A · f)\V (f) is either empty or smooth of pure-dimension N − k; and
The number of irreducible components of V (A · f)\V (f) has nothing to do with A and the irreducible components of V (A · f)\V (f) are disjoint.
Now suppose we are dealing with a system f with n polynomials and N variables and n > N. Such a system is called an overdetermined system. If we take a random matrix and replace the system f with the system A·f. Bertini’s theorem tells us that the set V (A · f)\V (f) is either empty or N −N = 0 dimensional, i.e., Bertini’s theorem guarantees us that extra solutions introduced by A·f are isolated solutions. We can discard these extra solutions by simply verifying if they satisfy f = 0. A more detailed discussion of non-square system can be found in [5].
4. Experiments.
In this section, we study the geometry and topology of several small polytopes and molecules. For any polytope or molecule, a filtration of alpha complexes is generated (for a polytope or molecule lying in a plane we need to perturb its coordinates a bit beforehand). We use HERMES to calculate p-persistent 0-combinatorial Laplacians and use NumPy [18] to calculate the associated characteristic polynomials. Then we use Bertini to solve these polynomials and hence get the spectra of persistent Laplacians. As roots of a polynomial are unchanged under scalar multiplication, the maximal coefficient of any characteristic polynomial is scaled to 1. The configuration of Bertini is as follows: MPTYPE=2, FINALTOL=1e-8, COEFFBOUND=100, DEGREEBOUND=80, AMPSAFETYDIGITS1=1, AMPSAFETYDIGITS2=1, AMPMAXPREC=3328.
First let us look at 2-dimensional regular polygons, such as a pentagon, a heptagon, an octagon and a nonagon. In our experiments, the circumradius of any regular polygon is set to be 1.
The persistent Betti numbers and the smallest nonzero eigenvalues of persistent Laplacians regarding the pentagon are shown in Figure 4. We see abrupt changes near 0.6 and 1. This coincides with the geometrical properties of our pentagon, for its edge length is sin(π/5) ≈ 0.58 and radius is 1. Results for Heptagon, Octagon, and Nonagon are presented in Figures 10, 11, and 12, respectively in the Appendix.
Next let us look at some 3-dimensional objects, such as a cube, and an octahedron. We first study the cube with edge length 1. As the length of its face diagonal is and the length of its main diagonal is , one would expect some changes in the graphs near α = 0.5, α = 0.7 and α = 0.85. This is confirmed in Figure 6.
Another 3-dimensional example is the regular octahedron with edge length . The circumradius of any face is set to . The circumradius of the octahedron itself is set to 1. Such geometrical properties are reflected in Figure 7. Results for a tetrahedron, a triangular prism and a regular pyramid are illustrated in Figures 13, 14, and 15, respectively in the Appendix.
Next, we examine aromatic molecules, such as benzene, anthracene, naphthalene, and pyrene (see Figure 8). Here we only consider the C skeletons of them. First, consider the benzene molecule. The half-length of its edge is approximately 0.7Å, and its radius is approximately 1.4Å. One can see changes near 0.7 and 1.4 in Figure 9. Results for naphthalene, anthracene, and pyrene are presented in Figures 16, 17, and 18, respectively in the Appendix.
5. Conclusions.
Combinatorial Laplacian is a powerful tool for studying the shape of data. However, its performance is restricted partly because it is defined as a single graph or a simplicial complex and does not provide a multiscale analysis. Motivated by the theory of persistent homology, Wang et al. [34] introduced the persistent combinatorial Laplacian defined for a pair of complexes in a given filtration. Since the nullity of the spectrum of the persistent combinatorial Laplacian (aka persistent Laplacian) recovers the persistent Betti number, the persistent combinatorial Laplacian can be regarded as a unified tool to extract topological persistence and geometrical information out of high dimensional datasets.
Although the computation of the spectrum of the persistent Laplacian can be implemented in various ways, it is mathematically interesting to explore alternative ways, which might stimulate new mathematics in the long run. In this work, we explore the possibility of applying homotopy continuation methods. We have verified at least for some simple polytopes and small molecules in the three-dimensional space, the second smallest eigenvalues of persistent Laplacians calculated by homotopy continuation are very close to the result from HERMES. In other words, the geometrical properties of them can be inferred from the changes of the spectra obtained via homotopy continuation. The present work offers a unique example that combines algebraic topology, algebraic geometry, and combinatorial graph to solve problems in data science. For larger systems such as protein molecules, further research is needed to implement homotopy continuation.
Acknowledgments
This work was supported in part by NIH grant GM126189, NSF grants DMS-2052983, DMS-1761320, and IIS-1900473, NASA grant 80NSSC21M0023, Michigan Economic Development Corporation, George Mason University award PD45722, Bristol-Myers Squibb 65109, and Pfizer. The authors thank Dr. Wenrui Hao and Ms. Rui Wang for discussion and/or help.
Appendix A. Supplementary figures.
Appendix B. Coordinates of aromatic molecules. 1
In this section the xyz coordinates of benzene, naphthalene, anthracene, and pyrene are listed. We omit hydrogen atoms since they are irrelevant for the computation.
benzene: | naphthalene: |
1.400 0.000 0.000 | 2.404 0.756 0.000 |
0.700 1.212 0.000 | 2.433 −0.658 0.000 |
−0.700 1.212 0.000 | 1.267 −1.375 0.000 |
−1.400 0.000 0.000 | 0.014 −0.705 0.000 |
−0.700 −1.212 0.000 | −0.014 0.705 0.000 |
0.700 −1.212 0.000 | 1.211 1.425 0.000 |
−1.267 1.375 0.000 | −2.417 1.490 0.000 |
−2.433 0.659 0.000 | pyrene: |
−2.404 −0.756 0.000 | 0.000 1.214 −2.813 |
−1.211 −1.425 0.000 | 0.000 1.224 −1.410 |
anthracene: | 0.000 −0.000 −0.705 |
3.661 0.585 0.000 | 0.000 −1.224 −1.410 |
3.611 −0.840 0.000 | 0.000 0.000 0.705 |
2.417 −1.490 0.000 | 0.000 −1.224 1.410 |
1.187 −0.753 0.000 | 0.000 −2.435 0.700 |
2.515 1.317 0.000 | 0.000 −2.435 −0.700 |
1.237 0.668 0.000 | 0.000 1.224 1.410 |
−0.049 −1.403 0.000 | 0.000 −1.214 −2.813 |
−1.237 −0.668 0.000 | 0.000 −0.000 −3.507 |
0.049 1.403 0.000 | 0.000 −1.214 2.813 |
−1.187 0.753 0.000 | 0.000 1.214 2.813 |
−2.515 −1.317 0.000 | 0.000 0.000 3.507 |
−3.661 −0.585 0.000 | 0.000 2.435 0.700 |
−3.611 0.840 0.000 | 0.000 2.435 −0.700 |
Footnotes
The digits after the third decimal place of original atomic coordinates have been slightly perturbed to ensure the calculation of persistent Laplacians.
Contributor Information
XIAOQI WEI, Department of Mathematics, Michigan State University, MI 48824, USA.
GUO-WEI WEI, Department of Mathematics, Department of Electrical and Computer Engineering, Department of Biochemistry and Molecular Biology, Michigan State University, MI 48824, USA.
REFERENCES
- [1].Allgower EL, Bates DJ, Sommese AJ and Wampler CW, Solution of polynomial systems derived from differential equations, Computing, 76 (2006), 1–10. [Google Scholar]
- [2].Arnold DN, David G, Filoche M, Jerison D and Mayboroda S, Computing spectra without solving eigenvalue problems, SIAM J. Sci. Comput, 41 (2019), B69–B92. [Google Scholar]
- [3].Bates DJ, Fotiou IA and Rostalski P, A numerical algebraic geometry approach to nonlinear constrained optimal control, 46th IEEE Conference on Decision and Control, New Orleans, LA, 2007. [Google Scholar]
- [4].Bates DJ, Hauenstein JD, Sommese AJ and Wampler CW, Bertini: Software for numerical algebraic geometry. Available from: https://bertini.nd.edu. [Google Scholar]
- [5].Bates DJ, Hauenstein JD, Sommese AJ and Wampler CW, Numerically Solving Polynomial Systems with Bertini, Software, Environments, and Tools, 25, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 2013. [Google Scholar]
- [6].Breiding P and Timme S, HomotopyContinuation.jl: A package for homotopy continuation in Julia, in International Congress on Mathematical Software, Lecture Notes in Computer Science, 10931, Springer, 2018, 458–465. [Google Scholar]
- [7].Cang Z, Mu L, Wu K, Opron K, Xia K and Wei G-W, A topological approach for protein classification, Computational and Mathematical Biophysics, 3 (2015), 140–162. [Google Scholar]
- [8].Carlsson G, Topology and data, Bull. Amer. Math. Soc. (N.S.), 46 (2009), 255–308. [Google Scholar]
- [9].Chen T, Lee T-L and Li T-Y, Hom4ps-3: A parallel numerical solver for systems of polynomial equations based on polyhedral homotopy continuation methods, in Mathematical Software – ICMS 2014, Lecture Notes in Comput. Sci, 8592, Springer, Heidelberg, 2014, 183–190. [Google Scholar]
- [10].Edelsbrunner H and Harer JL, Computational Topology: An Introduction, American Mathematical Society, Providence, RI, 2010. [Google Scholar]
- [11].Friedman J, Computing betti numbers via combinatorial Laplacians, Algorithmica, 21 (1998), 331–346. [Google Scholar]
- [12].Gameiro M, Hiraoka Y, Izumi S, Kramar M, Mischaikow K and Nanda V, A topological measurement of protein compressibility, Jpn. J. Ind. Appl. Math, 32 (2015), 1–17. [Google Scholar]
- [13].Goldberg TE, Combinatorial Laplacians of simplicial complexes, Senior project, Bard College, 2002. Available from: https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.156.3354&rep=rep1&type=pdf.
- [14].Gross E, Davis B, Ho KL, Bates DJ and Harrington HA, Numerical algebraic geometry for model selection and its application to the life sciences, J. Roy. Soc. Interface, 13 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].The GUDHI Project, GUDHI User and Reference Manual, 3.4.1 edition, GUDHI Editorial Board, 2021. Available from: https://gudhi.inria.fr/doc/3.4.1/. [Google Scholar]
- [16].Hao W, Hauenstein JD, Hu B, Liu Y, Sommese AJ and Zhang Y-T, Multiple stable steady states of a reaction-diffusion model on zebrafish dorsal-ventral patterning, Discrete Contin. Dyn. Syst. Ser. S, 4 (2011), 1413–1428. [Google Scholar]
- [17].Hao W, Hu B and Sommese AJ, Numerical algebraic geometry and differential equations, in Future Vision and Trends on Shapes, Geometry and Algebra, Springer Proc. Math. Stat, 84, Springer, London, 2014, 39–53. [Google Scholar]
- [18].Harris CR, Millman KJ, van der Walt SJ, Gommers R and Virtanen P, et al. , Array programming with NumPy, Nature, 585 (2020), 357–362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Hauenstein J, Rodriguez JI and Sturmfels B, Maximum likelihood for matrices with rank constraints, J. Algebr. Stat, 5 (2014), 18–38. [Google Scholar]
- [20].Leykin A and Sottile F, Galois groups of Schubert problems via homotopy computation, Math. Comp, 78 (2009), 1749–1765. [Google Scholar]
- [21].Lim L-H, Hodge Laplacians on graphs, SIAM Rev, 62 (2020), 685–715. [Google Scholar]
- [22].Liu X, Wang X, Wu J and Xia K, Hypergraph-based persistent cohomology (HPC) for molecular representations in drug design, Briefings in Bioinformatics, (2021), bbaa411. [DOI] [PubMed] [Google Scholar]
- [23].Love ER, Filippenko B, Maroulas V and Carlsson G, Topological deep learning, preprint, arXiv:2101.05778. [Google Scholar]
- [24].Mémoli F, Wan Z and Wang Y, Persistent Laplacians: Properties, algorithms and implications, preprint, arXiv:2012.02808. [Google Scholar]
- [25].Meng Z, Vijay Anand D, Lu Y, Wu J and Xia K, Weighted persistent homology for biomolecular data analysis, Scientific Reports, 10 (2020), 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Nasrin F, Oballe C, Boothe D and Maroulas V, Bayesian topological learning for brain state classification, 18th IEEE International Conference On Machine Learning And Applications (ICMLA), Boca Raton, FL, 2019. [Google Scholar]
- [27].Nguyen DD, Cang Z and Wei G-W, A review of mathematical representations of biomolecular data, Phys. Chem. Chem. Phys, 22 (2020), 4343–4367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [28].Ren Y, Martini JWR and Torres J, Decoupled molecules with binding polynomials of bidegree (n,2), J. Math. Biol, 78 (2019), 879–898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Sgouralis I, Nebenführ A and Maroulas V, A Bayesian topological framework for the identification and reconstruction of subcellular motion, SIAM J. Imaging Sci, 10 (2017), 871–899. [Google Scholar]
- [30].Sommese AJ and Wampler II CW, The Numerical Solution of Systems of Polynomials. Arising in Engineering and Science, World Scientific Publishing Co. Pte. Ltd., Hackensack, NJ, 2005. [Google Scholar]
- [31].Townsend J, Micucci CP, Hymel JH, Maroulas V and Vogiatzis KD, Representation of molecular structures with persistent homology for machine learning applications in chemistry, Nature Communications, 11 (2020), 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [32].Verschelde J, Algorithm 795: Phcpack: A general-purpose solver for polynomial systems by homotopy continuation, ACM Trans. Math. Softw, 25 (1999), 251–276. [Google Scholar]
- [33].Wampler CW and Sommese AJ, Numerical algebraic geometry and algebraic kinematics, Acta Numer, 20 (2011), 469–567. [Google Scholar]
- [34].Wang R, Nguyen DD and Wei G-W, Persistent spectral graph, Int. J. Numer. Methods Biomed. Eng, 36 (2020), 27pp. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Wang R, Zhao R, Ribando-Gros E, Chen J, Tong Y and Wei G-W, HERMES: Persistent spectral graph software, Foundations of Data Science, 3 (2020), 67–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [36].Xia K and Wei G-W, Persistent homology analysis of protein structure, flexibility, and folding, Int. J. Numer. Methods Biomed. Eng, 30 (2014), 814–844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [37].Zhang X-D, The Laplacian eigenvalues of graphs: A survey, preprint, arXiv:1111.2897. [Google Scholar]