On the Nonnegative Rank of Euclidean Distance Matrices

Matthew M Lin; Moody T Chu

doi:10.1016/j.laa.2010.03.038

. Author manuscript; available in PMC: 2013 Aug 19.

Published in final edited form as: Linear Algebra Appl. 2010 Apr 24;433(3):681–689. doi: 10.1016/j.laa.2010.03.038

On the Nonnegative Rank of Euclidean Distance Matrices

Matthew M Lin ^1,¹, Moody T Chu ^2,^2,^✉

PMCID: PMC3747005 NIHMSID: NIHMS479124 PMID: 23966751

Abstract

The Euclidean distance matrix for n distinct points in ℝ^r is generically of rank r + 2. It is shown in this paper via a geometric argument that its nonnegative rank for the case r = 1 is generically n.

Keywords: Euclidean distance matrix, nonnegative rank factorization, nonnegative rank

1. Introduction

Any given nonnegative matrix A ∈ ℝ^m^×ⁿ can be expressed as the product A = UV for some nonnegative matrices U ∈ ℝ^m^×^k and V ∈ ℝ^k^×ⁿ with k ≤ min{m, n}. The smallest k that makes this factorization possible is called the nonnegative rank of A. For convenience, we denote the nonnegative rank of A by rank₊(A). Trivially the nonnegative rank has bounds such as

rank (A) \leq {rank}_{+} (A) \leq min {m, n} .

(1)

Determining the exact nonnegative rank and computing the corresponding factorization, however, are known to be NP-hard [6, 18]. If the nonnegative matrix A is such that rank₊(A) = rank(A), then we say that A has a nonnegative rank factorization (NRF). Even in this case, there is no known effective algorithm to compute the NRF.

It is shown recently that, if k < min{m, n}, then the probability that a matrix A with rank₊(A) = k should also have rank(A) = k is one. In other words, matrices which have an NRF are generic. To put it more plainly, if A = UV where U ∈ ℝ^m^×^k and V ∈ ℝ^k^×ⁿ are randomly generated nonnegative matrices, then with probability one we have rank(A) = k. The converse is nevertheless not true. Indeed, the question of computing the probability for a 4 × 4 nonnegative matrix of rank 3 to have nonnegative rank 3 is not trivial at all. It is very much analogous to the Sylvester’s four-point problem which, to this date, does not admit a determinate solution [14, 16]. For this reason, there has been considerable interest in the literature to identify nonnegative matrices with or without NRF.

A necessary and sufficient condition qualifying whether a nonnegative matrix has an NRF can be found in [17], but that result appears too theoretical for numerical verification. A few sufficient conditions for constructing nonnegative matrices without NRF have been given in [13, 15]. The simplest example is the 4 × 4 matrix

C = [\begin{matrix} 1 & 1 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 1 \end{matrix}],

with rank( Inline graphic ) = 3 and rank₊( ) = 4. Other known conditions for the existence of an NRF are for more restrictive subclasses of matrices such as the so called weakly monotone nonnegative matrices [12], λ-monotone [11], or matrices with nonnegative 1-inverse [4]. Still, given a nonnegative matrix, finding its (numerical) rank is computationally possible, but ensuring its nonnegative rank is an extremely hard task. Thus far, we know very little in the literature about nonnegative matrices which do not have NRF. This factorization has also be studied in the literature under the notion of prime matrices [3, 15].

The purpose of this short communication is to add the important class of Euclidean distance matrices to the list of matrices having no NRF. This note represents perhaps only a modest advance in the field, but it should be of interest to confirm the precise rank and nonnegative rank of a distance matrix.

2. Rank condition and standard form

Given n points p₁, …, p_n in the space ℝ^r, the corresponding Euclidean distance matrix (EDM) is the n × n symmetric and nonnegative matrix Q(p₁, …, p_n) = [q_ij] whose entry q_ij is defined by

q_{i j} = {‖ p_{i} - p_{j} ‖}^{2}, i, j = 1, \dots, n,

(2)

where ||·|| denotes the Euclidean norm in ℝ^r. As an exhaustive record of relative spacing between any two of the n particles in ℝ^m, the distance matrix Q(p₁, …, p_n) has many important applications in distance geometry. See, for example, the discussions in [7, 8, 9, 10]. Our attention here is solely on the rank condition of Q(p₁, …, p_n).

Theorem 2.1

For any n ≥ r + 2, the rank of Q(p₁, …, p_n) is no greater than r + 2 and is generically r + 2.

Proof

(This is a classical and well known fact. There are many elegant ways to verify this result, but for the sake of comparing the associated factorizations we find the following equality representation is most constructive and straightforward.) Regarding each p_ℓ as a column vector and q_ij = 〈p_i − p_j, p_i − p_j〉 with 〈·, ·〉 denoting the Euclidean inner product, we can write [1]

Q (p_{1}, \dots, p_{n}) = \underset{U}{\underset{︸}{[\begin{matrix} 〈 p_{1}, p_{1} 〉 & 1 & - 2 p_{1}^{⊤} \\ ⋮ & ⋮ & ⋮ \\ 〈 p_{i}, p_{i} 〉 & 1 & - 2 p_{i}^{⊤} \\ ⋮ & ⋮ & ⋮ \\ 〈 p_{n}, p_{n} 〉 & 1 & - 2 p_{n}^{⊤} \end{matrix}]}} \underset{V}{\underset{︸}{[\begin{matrix} 1 & \dots & 1 & \dots & 1 \\ 〈 p_{1}, p_{1} 〉 & \dots & 〈 p_{j}, p_{j} 〉 & \dots & 〈 p_{n}, p_{n} 〉 \\ p_{1} & p_{j} & p_{n} \end{matrix}]}} .

(3)

Note that U ∈ ℝⁿ^×(^r⁺²⁾ and V ∈ ℝ ⁽^r^+2)×ⁿ. Unless the points p₁, …, p_n satisfy some specific algebraic equations, such as ||p_ℓ|| = 1 for all ℓ = 1, … n, the matrices U and V are generically of rank r + 2.

The fact that the rank of an EDM depends on r, but is independent of the size n, is very interesting. The rank deficiency indicates that many entries in the matrix provide redundant information. It is curious to know whether rank₊(Q(p₁, …, p_n)) has similar property. Note that the two factors U and V in (3) cannot be both nonnegative, so the (minimum) nonnegative factorization of Q(p₁, …, p_n) is yet to be determined.

In a recent paper [1], it is estimated via an intriguing algebraic argument that for a nonnegative matrix of rank 3 to have nonnegative rank 10, we would need a matrix of order at least 252. The discussion in the sequel clearly indicates that the actual order can be much lower.

Suppose that a nonnegative matrix A has two factorizations, A = BC and A = FG. We say that these two factorizations are equivalent if there exist a permutation matrix P and a diagonal matrix D with positive diagonal elements such that BDP = F and P^⊤D⁻¹C = G [1]. With this notion in mind, it suffices to consider the nonnegative factorization for an EDM in a special form.

Lemma 2.1

Suppose n ≥ r + 2 ≥ 3. Then any nonnegative factorization of Q(p₁, …, p_n) is equivalent to the form

Q (p_{1}, \dots, p_{n}) = [\begin{matrix} 1 & 0 & * & * & \dots \\ * & 1 & 0 & * & \dots \\ 0 & * & 1 & * & \dots \\ * & * & * & * \\ ⋮ \end{matrix}] [\begin{matrix} 0 & * & + & * & \dots \\ + & 0 & * & * \\ * & + & 0 & * \\ * & * & * & * \\ ⋮ \end{matrix}]

(4)

where * stands for some undetermined nonnegative numbers and + stands for three undetermined positive numbers.

Proof

Suppose Q(p₁, …, p_n) = UV is a nonnegative factorization. Then there must exist an index 1 ≤ k₁ ≤ n such that u_1k₁v_k₁3 > 0. Permuting both the first and the k₁th columns of U and the first and the k₁th rows of V simultaneously will not affect the product and will place u_1k₁ at the (1, 1) position and v_k₁3 at the (1, 3) position. After scaling u_1k₁ to unit, rename without causing ambiguity the permuted matrices as U and V, respectively. The corresponding v₁₁ in the new V must be zero. Consequently, there must exist an index 2 ≤ k₂ ≤ n such that u_2k₂v_k₂1 > 0. Permuting the second and the k₂th columns of U and the second and the k₂th rows of V simultaneously will not affect the product, will not alter the first column of U or the first row of V, and will place u_2k₂ at the (2, 2) position and v_k₂1 at the (2, 1) position. Again, after scaling u_2k₂ to unit and renaming the permuted matrices as U and V, it must be u₃₁ = v₂₂ = 0. It follows that there exist an index 3 ≤ k₃ ≤ n such that u_3k₃v_k₃2 > 0. Permuting the third and the k₃th columns and rows and scaling u_3k₃ to unit will give rise to the structure specified in the lemma.

It is important to note that the procedure described in the above proof cannot be continued to the fourth or other rows or columns. For this reason, we refer to (4) as the standard nonnegative factorization of Q(p₁, …, p_n).

When reference to the points p₁, …, p_n is not critical, we abbreviate a generic Q(p₁, …, p_n) as Q_n. The notion of nonnegative rank has an interesting geometric meaning [5] which will be our main toll for verifying the nonnegative rank of Q_n. Let the columns of a general nonnegative matrix $A \in R_{+}^{m \times n}$ be denoted by A = [a₁, …, a_n]. Define the scaling factor σ(A) by

σ (A) : = diag {{‖ a_{1} ‖}_{1}, \dots, {‖ a_{n} ‖}_{1}},

(5)

where ||·||₁ stands for the 1-norm in ℝ_m, and the pullback map ϑ(A) by

ϑ (A) : = A σ {(A)}^{- 1} .

(6)

Each column of ϑ(A) can be regarded as a point on the (m − 1)-dimensional probability simplex Inline graphic defined by

D_{m} : = {x \in R_{+}^{m} ∣ x_{i} \geq 0, \sum_{i = 1}^{m} x_{i} = 1} .

(7)

Suppose a given nonnegative matrix A can be factorized as A = UV, where $U \in R_{+}^{m \times p}$ and $V \in R_{+}^{p \times n}$ . Because UV = (UD)(D⁻¹V) for any invertible nonnegative matrix D ∈ ℝ^p^×^p, we may assume without loss of generality that U is already a pullback so that σ(U) = I_n. We can write

A = ϑ (A) σ (A) = U V = ϑ (U) ϑ (V) σ (V) .

(8)

Note that the product ϑ(U)ϑ(V) itself is on the simplex Inline graphic . It follows that

ϑ (A) = ϑ (U) ϑ (V),

(9)

σ (A) = σ (V) .

(10)

In particular, if p = rank₊(A), then we see that rank₊(ϑ(A)) = p, and vice versa. The expression (9) means that the columns in the pullback ϑ(A) are convex combinations of columns of ϑ(U). The integer rank₊(A) stands for the minimal number of vertices on Inline graphic so that the resulting convex polytope encloses all columns of the pullback ϑ(A).

3. Nonnegative rank and factorization for linear EDM

Given a permutation σ of the set {1, 2, …, n}, define the permutation matrix P_σ:= [δ_iσ₍_j₎] where δ_st denotes the Kronecker delta function. Then it is easy to see that

P_{σ}^{⊤} Q (p_{1}, \dots, p_{n}) P_{σ} = Q (p_{σ (1)}, \dots p_{σ (n)}) .

(11)

In other words, the conjugation of an EDM by any permutation matrix remains to be an EDM. In the one dimensional case, i.e., r = 1, we may assume without loss of generality that the point are arranged is ascending order, p₁ < … < p_n. Define s_i:= p_i₊₁ − p_i, i = 1, …, n − 1. Entries in the linear EDM has a special ordering pattern that radiates away from the diagonal per column and row, i.e.

Q (p_{1}, \dots, p_{n}) = [\begin{matrix} 0 & s_{1}^{2} & {(s_{1} + s_{2})}^{2} & {(s_{1} + s_{2} + s_{3})}^{2} & \dots \\ s_{1}^{2} & 0 & s_{2}^{2} & {(s_{2} + s_{3})}^{2} & \dots \\ {(s_{1} + s_{2})}^{2} & s_{2}^{2} & 0 & s_{3}^{2} & \dots \\ {(s_{1} + s_{2} + s_{3})}^{2} & {(s_{2} + s_{3})}^{2} & s_{3}^{2} & 0 \\ ⋮ \end{matrix}]

(12)

We shall exploit this particular ordering to help to obtain some initial insight into the nonnegative rank of the EDM. Unless mentioned otherwise, the subsequent discussion is for the case r = 1.

It is illuminating to begin the analysis with the case n = 4. For convenience, we adopt the colon notation as in Matlab to pick out selected rows, columns or elements of vectors. Denote the columns of Q₄ by Q₄ = [q₁, …, q₄]. The probability simplex Inline graphic can easily be visualized via the unit tetrahedron S₃ in the first octant of ℝ³ if we identify the 4-dimensional vector x by the vector [x₁, x₂, x₃]^⊤ of its first three entries. In this way, columns of ϑ(Q₄) can be interpreted as points ϑ(q₁), ϑ(q₂), ϑ(q₃), ϑ(q₄) depicted in Figure 1. Note that the four points ϑ(q₁), ϑ(q₂), ϑ(q₃), ϑ(q₄) are coplanar because rank(Q₄) = 3. The y-intercept of this common plane is $\frac{s_{1} s_{2}}{s_{1} s_{2} - s_{3} (s_{1} + s_{2} + s_{3})}$ which is either negative or positive with value greater than 1. In either case, the plane intersects the tetrahedron as a quadrilateral. The first three points sit on three separate “ridges” of the quadrilateral and hence cannot be enclosed by any triangle within the quadrilateral except the one with vertices at these three points. If rank₊(Q₄) < 4, then ϑ(q₄) must be inside this triangle and hence be a convex combination of ϑ(q₁), ϑ(q₂), ϑ(q₃), which translates to that the vector q₄ must be a nonnegative combination of q₁, q₂, q₃, but this impossible because q₄₄ = 0. Thus rank₊(Q₄) = 4.

A geometric representation of the matrix ϑ(Q₄) when r = 1.

The expression Q₄ = UV in the form (4) represents a polynomial system of 22 equations in 23 unknowns whereas one of the nonzero unknowns can be normalized to unit. This nonlinear system is solvable. Other than the trivial factorization Q₄ = I₄Q₄ where I₄ stands for the identity matrix, we find that there are only three nontrivial nonnegative factorizations which we list in Table 1. While the first set of factorization in the table is equivalent to Q₄I₄, it is important to note that the last two sets of factorizations correspond to the four vertices of the quadrilateral shown in figure 1. This observation also shows that Q₄ is not prime [2, 15].

Table 1.

Standard nonnegative factorizations of Q₄.

[\begin{matrix} 1 & 0 & \frac{{s_{1}}^{2}}{s_{2}^{2}} & {(s_{1} + s_{2} + s_{3})}^{2} \\ \frac{{s_{2}}^{2}}{{(s_{1} + s_{2})}^{2}} & 1 & 0 & {(s_{2} + s_{3})}^{2} \\ 0 & \frac{{(s_{1} + s_{2})}^{2}}{{s_{1}}^{2}} & 1 & {s_{3}}^{2} \\ \frac{{s_{3}}^{2}}{{(s_{1} + s_{2})}^{2}} & \frac{{(s_{1} + s_{2} + s_{3})}^{2}}{{s_{1}}^{2}} & \frac{{(s_{2} + s_{3})}^{2}}{{s_{2}}^{2}} & 0 \end{matrix}]

[\begin{matrix} 0 & 0 & {(s_{1} + s_{2})}^{2} & 0 \\ {s_{1}}^{2} & 0 & 0 & 0 \\ 0 & {s_{2}}^{2} & 0 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]

[\begin{matrix} 1 & 0 & 0 & \frac{(s_{1} + s_{2}) s_{2} (s_{1} + s_{2} + s_{3})}{s_{2} + s_{3}} \\ 0 & 1 & 0 & {s_{2}}^{2} \\ 0 & \frac{s_{3} (s_{1} + s_{2})}{(s_{2} + s_{3}) s_{1}} & 1 & 0 \\ \frac{s_{3} (s_{2} + s_{3})}{(s_{1} + s_{2}) s_{1}} & 0 & \frac{(s_{2} + s_{3}) (s_{1} + s_{2} + s_{3})}{(s_{1} + s_{2}) s_{2}} & 0 \end{matrix}]

[\begin{matrix} 0 & {s_{1}}^{2} & \frac{s_{3} s_{1} (s_{1} + s_{2})}{s_{2} + s_{3}} & 0 \\ {s_{1}}^{2} & 0 & 0 & \frac{s_{3} (s_{2} + s_{3}) s_{1}}{s_{1} + s_{2}} \\ \frac{(s_{1} + s_{2}) s_{2} (s_{1} + s_{2} + s_{3})}{s_{2} + s_{3}} & {s_{2}}^{2} & 0 & 0 \\ 0 & 0 & 1 & \frac{(s_{2} + s_{3}) (s_{1} + s_{2} + s_{3})}{(s_{1} + s_{2}) s_{2}} \end{matrix}]

[\begin{matrix} 1 & 0 & 0 & {s_{1}}^{2} \\ \frac{s_{2} (s_{2} + s_{3})}{(s_{1} + s_{2}) (s_{1} + s_{2} + s_{3})} & 1 & 0 & 0 \\ 0 & \frac{s_{3} (s_{1} + s_{2})}{(s_{2} + s_{3}) s_{1}} & 1 & 0 \\ 0 & 0 & \frac{(s_{2} + s_{3}) (s_{1} + s_{2} + s_{3})}{(s_{1} + s_{2}) s_{2}} & \frac{s_{3} (s_{2} + s_{3}) s_{1}}{s_{1} + s_{2}} \end{matrix}]

[\begin{matrix} 0 & 0 & \frac{s_{2} (s_{1} + s_{2}) (s_{1} + s_{2} + s_{3})}{s_{2} + s_{3}} & {(s_{1} + s_{2} + s_{3})}^{2} \\ {s_{1}}^{2} & 0 & 0 & \frac{s_{3} (s_{2} + s_{3}) s_{1}}{s_{1} + s_{2}} \\ \frac{s_{2} (s_{1} + s_{2}) (s_{1} + s_{2} + s_{3})}{s_{2} + s_{3}} & {s_{2}}^{2} & 0 & 0 \\ 0 & 1 & \frac{s_{3} (s_{1} + s_{2})}{(s_{2} + s_{3}) s_{1}} & 0 \end{matrix}]

Open in a new tab

When n > 4, such a visualization in geometry is not possible, but the idea remains justifiable via an algebraic argument with which we precede as follows.

Theorem 3.1

Suppose that the linear EDM Q_n is of rank 3. Then rank₊(Q_n) = n.

Proof

Because rank(Q_n) = 3, its columns reside on a 3-dimensional subspace of ℝⁿ. The pull-back map ϑ can be considered as the intersection of this subspace and the hyperplane defined by $\sum_{i = 1}^{n} x_{i} = 1$ . Columns of ϑ(Q_n) therefore are “coplanar” whereas by their common plane we refer to a 2-dimensional affine subspace in ℝⁿ. Identifying any n-dimensional vector x ∈ Inline graphic by its first n−1 entries [x₁, …, x_n₋₁]^⊤, we thus are able to “see” columns ϑ(q₁), …, ϑ(q_n) as n points residing within the unit polyhedron in the first orthotant of ℝⁿ⁻¹. These points remain to be coplanar. (Indeed, the 2-dimensional affine subspace can be identified by a fixed point, say, ϑ(q₁), and two coordinate axes, say, v₁:= ϑ(q₂) − ϑ(q₁) and v₂:= ϑ(q₃) − ϑ(q_n), where all points in the 2-dimensional affine subspace can be represented as ϑ(q₁) + α₁v₁ + α₂v₂ with scalars α₁ and α₂. The drawing in Figure 1, therefore, is still relatively instructive.)

For 1 ≤ i ≤ n − 1, it is clear that ϑ(q_i) cannot possibly be a convex combination of any other ϑ(q_j) because of the unique zero at its ith entry. We claim further that ϑ(q_n) cannot possibly be in the convex hull spanned by ϑ(q₁), …, ϑ(q_n₋₁). Assume otherwise, then we would have

ϑ (q_{n}) = \sum_{i = 1}^{n - 1} c_{i} ϑ (q_{i})

for some c_i ≥ 0 with $\sum_{i = 1}^{n - 1} c_{i} = 1$ . Note that ||ϑ(q_n)||₁ = 1. However, ${‖ \sum_{i = 1}^{n - 1} c_{i} ϑ (q_{i}) ‖}_{1} < 1$ because ||ϑ(q_i)||₁ < 1 after chopping away the last row of ϑ(Q_n). This is a contradiction. The smallest number of vertices for a convex hull to enclose ϑ(q₁), …, ϑ(q_n), therefore, has to be n, implying that rank₊(Q_n) = n.

There is a subtle difference between the standard nonnegative factorization of Q₄ and that of Q_n when n ≥ 5. Except for the trivial factorization, both factors U and V in Table 1 for Q₄ are of rank 3. This is not the case in general.

Lemma 3.1

Suppose n ≥ 5 and Q_n = UV is a standard nonnegative factorization for the matrix Q_n. Then it cannot be such that both U and V are of rank 3 simultaneously.

Proof

Observe first that for n ≥ 3, assuming the generic condition rank(Q_n) = 3, we can partition Q_n as

Q_{n} = [\begin{array}{c} Q_{3} & Q_{3} Φ \\ Φ^{⊤} Q_{3} & Φ^{⊤} Q_{3} Φ \end{array}]

(13)

where Φ ∈ ℝ^3×(ⁿ⁻³⁾ is uniquely determined. Indeed, if we write Φ = [φ₄, … φ_n], then it can be shown that

φ_{j} = [\begin{matrix} \frac{(\sum_{ℓ = 2}^{j - 1} s_{ℓ}) (\sum_{ℓ = 3}^{j - 1} s_{ℓ})}{s_{1} (s_{1} + s_{2})} \\ - \frac{(\sum_{ℓ = 1}^{j - 1} s_{ℓ}) (\sum_{ℓ = 3}^{j - 1} s_{ℓ})}{s_{1} s_{2}} \\ \frac{(\sum_{ℓ = 1}^{j - 1} s_{ℓ}) (\sum_{ℓ = 2}^{j - 1} s_{ℓ})}{s_{2} (s_{1} + s_{2})} \end{matrix}], j = 4, \dots, n .

(14)

Note that the second entry in φ_j is always negative.

Assume by contradiction that both U and V of Q_n are of rank 3. As U and V appear in the standard form (4), their 3 × 3 leading principal submatrices U₁₁ and V₁₁ are nonsingular. Thus similar to (13), we can partition the nonnegative factors into blocks

Q_{n} = [\begin{array}{c} U_{11} & U_{11} Θ \\ Λ^{⊤} U_{11} & Λ^{⊤} U_{11} Θ \end{array}] [\begin{array}{c} V_{11} & V_{11} Γ \\ Δ^{⊤} V_{11} & Δ^{⊤} V_{11} Γ \end{array}],

(15)

where Θ, Λ, Γ and Δ are real matrices of compatible sizes. Upon comparison with (13), we see that Λ = Φ = Γ. Taking a closer look at the product Λ^⊤U₁₁, we find that the signs of its entries are given by

Λ^{⊤} U_{11} = [\begin{matrix} + & - & + \\ ⋮ & ⋮ & ⋮ \\ + & - & + \end{matrix}] [\begin{matrix} 1 & 0 & * \\ * & 1 & 0 \\ 0 & * & 1 \end{matrix}] = [\begin{matrix} * & \begin{matrix} \cdot \end{matrix} & + \\ ⋮ & ⋮ & ⋮ \\ * & \begin{matrix} \cdot \end{matrix} & + \end{matrix}],

where, again, * indicates some undetermined nonnegative numbers, + some undetermined positive numbers, and ⊡ some nonnegative numbers which can further be determined. Similarly, the signs for entries of V₁₁Γ are given by

V_{11} Γ = [\begin{matrix} 0 & * & + \\ + & 0 & * \\ * & + & 0 \end{matrix}] [\begin{matrix} + & \dots & + \\ - & \dots & - \\ + & \dots & + \end{matrix}] = [\begin{matrix} * & \dots & * \\ + & \dots & + \\ \begin{matrix} \cdot \end{matrix} & \dots & \begin{matrix} \cdot \end{matrix} \end{matrix}] .

Being nonnegative, U and V are complementary to each other in the sense that u_ijv_ji = 0 for all indices i and j. It follows that the +’s in the middle row of V₁₁Γ must cause the ⊡’s in the middle column of Λ^⊤U₁₁ to become zeros. This implies that the very same u₃₂ would have to satisfy the equalities

- \frac{(\sum_{ℓ = 1}^{j - 1} s_{ℓ}) (\sum_{ℓ = 3}^{j - 1} s_{ℓ})}{s_{1} s_{2}} + \frac{(\sum_{ℓ = 1}^{j - 1} s_{ℓ}) (\sum_{ℓ = 2}^{j - 1} s_{ℓ})}{s_{2} (s_{1} + s_{2})} u_{32} = 0,

for all j = 4, … n, which is not possible if n ≥ 5.

To compute the nonnegative factorization of Q_n for n ≥ 5 is considerably harder. The case n = 5, for example, involves a polynomial system of 39 nonlinear equations in 41 unknowns two of which can be normalized. A short cut from a geometric point of view might be worth mentioning. Write

Q_{5} = [\begin{array}{c} Q_{4} & q_{5} \\ q_{5}^{⊤} & 0 \end{array}],

(16)

with q₅ ∈ ℝ^4×1. Consider the submatrix [Q₄, q₅] only. Clearly, its columns are coplanar and, hence, ϑ(q₅) is a point in the interior of the quadrilateral drawn in Figure 1. In particular, if Q₄ = UV is one of the two nontrivial standard nonnegative factorizations of Q₄, i.e., columns of ϑ(U) (or ϑ(V^⊤)) are the four vertices of the quadrilateral, then q₅ is a nonnegative combination of columns of U (or V^⊤). In this way, two of the nontrivial standard nonnegative factorizations of Q₅ are given by

Q_{5} = [\begin{array}{c} U & 0 \\ 0^{⊤} & 1 \end{array}] [\begin{array}{c} V & w_{5} \\ q_{5}^{⊤} & 0 \end{array}] = [\begin{array}{c} U & q_{5} \\ z^{⊤} & 0 \end{array}] [\begin{array}{c} V & 0 \\ 0^{⊤} & 1 \end{array}],

(17)

respectively, where w₅ and z₅ are some nonnegative vectors satisfying Uw₅ = V^⊤z₅ = q₅. This procedure can be generalized to higher n, but there might be other nonnegative factorizations which are not of this particular form specified in (17).

4. A conjecture for higher dimensional EDM

In higher dimensional vector spaces, points p₁, …, p_n cannot be totally ordered. Thus, for r > 1 and n ≥ r + 2, the EDM will not enjoy the inherent structure indicated in (12). Nevertheless, if we denote p_j = [p_ij], then we can write

Q (p_{1}, \dots, p_{n}) = \sum_{i = 1}^{r} Q (p_{i 1}, \dots, p_{i n}) .

We have shown earlier that generically rank₊(Q(p_i₁, …, p_in)) = n for each 1 ≤ i ≤ r. Representing the distance matrices for respective components, these r linear EDMs in general are not related to each other. For their summation (of nonnegative entries) to cause a reduction of rank, they must satisfy some delicate algebraic constraints. We thus conjecture that rank₊(Q(p₁, …, p_n)) = n generically for all r.

It might be informative to reexamine the geometric representation of the matrix Q₄ when r > 1. In contrast to the setting in Figure 1, columns of Q₄ are not coplanar. Their representation becomes that depicted in Figure 2. The vertex ϑ(q₄) resides on the simplex Inline graphic . How the base plane determined by vertices ϑ(q₁), ϑ(q₂), and ϑ(q₃) intersects the axes characterizes the zero structure of nonnegative factors. Different from the case r = 1, there are several possibilities and there is simply no general rules here. The one shown in Figure 2 implies that the corresponding Q₄ is prime, which is another interesting contrast to the case when r = 1.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Contributor Information

Matthew M. Lin, Email: mlin@ncsu.edu, Department of Mathematics, North Carolina State University, Raleigh, NC 27695-8205, USA.

Moody T. Chu, Email: chu@math.ncsu.edu, Department of Mathematics, North Carolina State University, Raleigh, NC 27695-8205, USA.

References

1.Beasley LB, Laffey TJ. Real rank versus nonnegative rank. Linear Algebra Appl. 2009 doi: 10.1016/j.laa.2009.02.034. [DOI] [Google Scholar]
2.Berman A, Plemmons RJ. Matrix group monotonicity. Proc Amer Math Soc. 1974;46:355–359. [Google Scholar]
3.Berman A, Plemmons RJ. Nonnegative matrices in the mathematical sciences, vol. 9 of Classics in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM); Philadelphia, PA: 1994. Revised reprint of the 1979 original. [Google Scholar]
4.Campbell SL, Poole GD. Computing nonnegative rank factorizations. Linear Algebra Appl. 1981;35:175–182. [Google Scholar]
5.Chu MT, Lin MM. Low-dimensional polytope approximation and its applications to nonnegative matrix factorization. SIAM J Sci Comput. 2008;30:1131–1155. [Google Scholar]
6.Cohen JE, Rothblum UG. Nonnegative ranks, decompositions, and factorizations of nonnegative matrices. Linear Algebra Appl. 1993;190:149–168. [Google Scholar]
7.Crippen GM, Havel TF. Distance geometry and molecular conformation, vol. 15 of Chemometrics Series. Research Studies Press Ltd; Chichester: 1988. [Google Scholar]
8.Dattorro J. Euclidean distance matrix, thesis. Stanford University; 2004. http://www.stanford.edu/~dattorro/EDM.pdf. [Google Scholar]
9.Glunt W, Hayden TL, Raydan M. Molecular conformations from distance matrices. J Comput Chemistry. 1993;14:114–120. [Google Scholar]
10.Gower JC. Euclidean distance geometry. Math Sci. 1982;7:1–14. [Google Scholar]
11.Jain SK, Tynan J. Nonnegative rank factorization of a nonnegative matrix A with A†A ≥ 0. Linear Multilinear Algebra. 2003;51:83–95. [Google Scholar]
12.Jeter MW, Pye WC. A note on nonnegative rank factorizations. Linear Algebra Appl. 1981;38:171–173. [Google Scholar]
13.Jeter MW, Pye WC. Some nonnegative matrices without nonnegative rank factorizations. Indust Math. 1982;32:37–41. [Google Scholar]
14.Klain DA, Rota G-C. Lezioni Lincee. [Lincei Lectures]. Cambridge University Press; Cambridge: 1997. Introduction to geometric probability. [Google Scholar]
15.Richman DJ, Schneider H. Primes in the semigroup of non-negative matrices. Linear and Multilinear Algebra. 1974;2:135–140. [Google Scholar]
16.Sylvester JJ. Birmingham British Assoc Rept. 1865. On a special class of questions on the theory of probabilities; pp. 8–9. [Google Scholar]
17.Thomas LB. Solution to problem 73-14: Rank factorization of nonnegative matrices by a. berman and r. j. plemmons. SIAM Review. 1974;16:393–394. [Google Scholar]
18.Vavasis SA. On the complexity of nonnegative matrix factorization. 2007 Available at http://www.citebase.org/abstract?id=oai:arXiv.org:0708.4149.

[R1] 1.Beasley LB, Laffey TJ. Real rank versus nonnegative rank. Linear Algebra Appl. 2009 doi: 10.1016/j.laa.2009.02.034. [DOI] [Google Scholar]

[R2] 2.Berman A, Plemmons RJ. Matrix group monotonicity. Proc Amer Math Soc. 1974;46:355–359. [Google Scholar]

[R3] 3.Berman A, Plemmons RJ. Nonnegative matrices in the mathematical sciences, vol. 9 of Classics in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM); Philadelphia, PA: 1994. Revised reprint of the 1979 original. [Google Scholar]

[R4] 4.Campbell SL, Poole GD. Computing nonnegative rank factorizations. Linear Algebra Appl. 1981;35:175–182. [Google Scholar]

[R5] 5.Chu MT, Lin MM. Low-dimensional polytope approximation and its applications to nonnegative matrix factorization. SIAM J Sci Comput. 2008;30:1131–1155. [Google Scholar]

[R6] 6.Cohen JE, Rothblum UG. Nonnegative ranks, decompositions, and factorizations of nonnegative matrices. Linear Algebra Appl. 1993;190:149–168. [Google Scholar]

[R7] 7.Crippen GM, Havel TF. Distance geometry and molecular conformation, vol. 15 of Chemometrics Series. Research Studies Press Ltd; Chichester: 1988. [Google Scholar]

[R8] 8.Dattorro J. Euclidean distance matrix, thesis. Stanford University; 2004. http://www.stanford.edu/~dattorro/EDM.pdf. [Google Scholar]

[R9] 9.Glunt W, Hayden TL, Raydan M. Molecular conformations from distance matrices. J Comput Chemistry. 1993;14:114–120. [Google Scholar]

[R10] 10.Gower JC. Euclidean distance geometry. Math Sci. 1982;7:1–14. [Google Scholar]

[R11] 11.Jain SK, Tynan J. Nonnegative rank factorization of a nonnegative matrix A with A†A ≥ 0. Linear Multilinear Algebra. 2003;51:83–95. [Google Scholar]

[R12] 12.Jeter MW, Pye WC. A note on nonnegative rank factorizations. Linear Algebra Appl. 1981;38:171–173. [Google Scholar]

[R13] 13.Jeter MW, Pye WC. Some nonnegative matrices without nonnegative rank factorizations. Indust Math. 1982;32:37–41. [Google Scholar]

[R14] 14.Klain DA, Rota G-C. Lezioni Lincee. [Lincei Lectures]. Cambridge University Press; Cambridge: 1997. Introduction to geometric probability. [Google Scholar]

[R15] 15.Richman DJ, Schneider H. Primes in the semigroup of non-negative matrices. Linear and Multilinear Algebra. 1974;2:135–140. [Google Scholar]

[R16] 16.Sylvester JJ. Birmingham British Assoc Rept. 1865. On a special class of questions on the theory of probabilities; pp. 8–9. [Google Scholar]

[R17] 17.Thomas LB. Solution to problem 73-14: Rank factorization of nonnegative matrices by a. berman and r. j. plemmons. SIAM Review. 1974;16:393–394. [Google Scholar]

[R18] 18.Vavasis SA. On the complexity of nonnegative matrix factorization. 2007 Available at http://www.citebase.org/abstract?id=oai:arXiv.org:0708.4149.

PERMALINK

On the Nonnegative Rank of Euclidean Distance Matrices

Matthew M Lin

Moody T Chu

Abstract

1. Introduction