Some Comments on Shier’s Paper for Inverting Sparse Matrices

J M McNamee

doi:10.6028/jres.083.033

. 1978 Sep-Oct;83(5):485–487. doi: 10.6028/jres.083.033

Some Comments on Shier’s Paper for Inverting Sparse Matrices^*

J M McNamee ^1,^**

PMCID: PMC6764503 PMID: 34566001

Abstract

A paper by Shier (J. Res NBS 80B) shows how to partition the graph of a matrix into a tree so as to minimize the number of operations required to invert the matrix. The present paper shows how to economically solve a sparse system of linear equations after the application of Shier’s method to the coefficient matrix.

Keywords: Sparse equations, tree partitions

1. Introduction

In [1]¹ Shier points out that if: (a) the graph corresponding to a sparse matrix A is partitioned into subgraphs which themselves can be regarded as nodes of a tree, and (b) the nodes of this tree are suitably numbered; then A can be partitioned as (A_ij) where A_ij are submatrices and the ith row of A is

{\underset{˜}{A}}_{i} = (A_{i 1}, \cdot \cdot, A_{i i}, 0, 0, \dots, 0, A_{i, r (i)}, 0, \dots, 0), i = 1, \dots, n

(1)

and where node r(i) is the “father” of i in the tree. Also, A_ik = 0 (k < i) unless r(k) = i and A is block incidence-symmetric. He then describes a relatively efficient way of finding A⁻¹, involving the computation of A_ii⁻¹ and similar sub-matrices by standard methods for dense matrices combined with recursive application of his algorithm. He also describes a method for carrying out the tree partitioning in (a) above.

Unfortunately he does not describe in detail how his method can be applied to the much more common problem of solving sparse equations, although he does mention (p. 252, lines 3–5) that it can be so applied. This will now be done.

2. Solution of Equations

We have to solve:

A \underset{˜}{x} = \underset{˜}{b}

(2)

where A is partitioned as in (1) and x and b are partitioned conformably into sub-vectors

({\underset{˜}{x}}_{i}) and ({\underset{˜}{b}}_{i})

(3)

We may very efficiently solve (2) by Block Gaussian Elimination as follows: (A) (Elimination of sub-diagonal sub-matrices) For i = 1, ⋯, n − 1 do:

(I) Form multipliers m_{r (i), i} = A_{r (i), i} A_{i i}^{- 1}

(4)

Eliminate A_r(i),i by subtracting m_r(i),ix (row i) from row r(i), i.e.

(II) Update A_{r (i), r (i)} \leftarrow A_{r (i), r (i)} - A_{r (i), i} A_{i i}^{- 1} A_{i, r (i)}

(5)

(III) Update {\underset{˜}{b}}_{r (i)} \leftarrow {\underset{˜}{b}}_{r (i)} - A_{r (i), i} A_{i i}^{- 1} {\underset{˜}{b}}_{i}

(6)

(B) (Back-Substitution)

(IV) {\underset{˜}{x}}_{n} = A_{n n}^{- 1} {\underset{˜}{b}}_{n}

(7)

(V) For i = n − 1, ··, 1 do:

{\underset{˜}{x}}_{i} = A_{i i}^{- 1} [{\underset{˜}{b}}_{i} - A_{i, r (i)} {\underset{˜}{x}}_{r (i)}]

(8)

(The above simply constitutes Gaussian Elimination with coefficients consisting of submatrices instead of scalars.) The great advantage of this method is that there is no fill-in except within the blocks, i.e., a zero submatrix always remains zero.

3. A More Economical Method

Further economy can be obtained by omitting the explicit calculation of m in (4). Rather we can perform triangular decomposition

A_{i i} = L_{i i} U_{i i}

(9)

Then (6) can be replaced by:

(I) Solve L_{i i} {\underset{˜}{v}}_{i} = {\underset{˜}{b}}_{i}

(10)

(II) Solve U_{i i} {\underset{˜}{z}}_{i} = {\underset{˜}{v}}_{i}

(11)

Then {\underset{˜}{z}}_{i} = A_{i i}^{- 1} {\underset{˜}{b}}_{i}

(12)

(III) Form {\underset{˜}{w}}_{r (i)} = A_{r (i), i} {\underset{˜}{z}}_{i}

(13)

(IV) {\underset{˜}{b}}_{r (i)} \leftarrow {\underset{˜}{b}}_{r (i)} - {\underset{˜}{w}}_{r (i)}

(14)

(5) Can be replaced by similar calculations with each column of A_i,r(i) taking the place, in turn, of ${\underset{˜}{b}}_{i}$ . (7) Can be replaced by (9), (10), (11) with i = n. (8) Can be replaced by:

(I) {\underset{˜}{b}}_{i} \leftarrow {\underset{˜}{b}}_{i} - A_{i, r (i)} {\underset{˜}{x}}_{r (i)}

(15)

(II) Solve L_{i i} {\underset{˜}{v}}_{i} = {\underset{˜}{b}}_{i}

(16)

(II) Solve U_{i i} {\underset{˜}{x}}_{i} = {(\underset{˜}{v})}_{i}

(17)

4. Operation Count

(1) If the explicit inverse A_ii⁻¹ and m_r(i),i are employed as in §2, using dense matrix techniques, the operation count would be as follows, assuming A_ii is order p_ixp_i, and p_i = p for all i: the formation of A_ii⁻¹, m_r(i),i and equation (5) each require 0(p³) multiplications, for a total of 0(3p³); while eq (6) requires 0(p²), (7) requires 0(p³ + p²) and (8) requires 0(2p²). Thus, the total number of multiplications is approximately

(n - 1) 3 p^{3} + (n - 1) 3 p^{2} + p^{3} + p^{2} = (3 n - 2) (p^{3} + p^{2}) .

(18)

(II) If the method of §3 is used we have: equation (9) requires 0(p³/3) multiplications; equations (10), (11) and (13) together need 0(2p²). The solution of equations (10), (11) and (13) with any column of A_i,r(i) in place of ${\underset{˜}{b}}_{i}$ requires 0(2p²) for each column, i.e. 0(2p³) in all. Equations (15)–(17) require 0(2p²). Equations (9), (10) and (11) for i = n require $0 (\frac{p^{3}}{3} + p^{2})$ . Thus, the total number of multiplications required for this method is approximately

(\frac{7}{2} n - 2) p^{3} + (4 n - 3) p^{2}

(19)

Thus the method of §3 is more efficient for large p_i = p, when we may ignore multiplicative and overhead factors.

(III) If the equations are solved directly without any partitioning, as if they were full, the number of multiplications required is $\approx \frac{1}{3} {(n p)}^{3} + {(n p)}^{2}$ , which for large n is much greater than $\frac{7}{3} n p^{3}$ .

5. Labelling of Tree Nodes

The nodes of the tree must be numbered in such a way that its incidence matrix has the form (1). This can be accomplished for example by a modification of the “Reverse Cuthill-McKee Algorithm” [2]. (This was originally devised as a band-width minimization technique, although that aspect has no relevance in the present context.) Simplified and re-worded for our purposes the algorithm may be described thus:

Suppose there are N nodes in the tree. Choose an arbitrary node and number it N (this is defined as the only member of “level” 1). Set I = 1 and J = N − 1.
Consider all nodes adjacent to nodes in level I but as yet unnumbered (they will be defined as members of level I + 1). Suppose there are K such nodes in all. If K = 0 terminate. Otherwise assign to them the numbers J, J − 1, ··, J − K + 1.

Set J = J − K and I = I + 1.

Repeat step B until K = 0.

It is simple to prove that the incidence matrix of a tree thus numbered has the form (1), i.e. each node (numbered i, say) is adjacent to only one node having a higher number (say r(i)).

Proof: Suppose if possible a node numbered i is adjacent to two nodes numbered r₁ and r₂ such that r₁ > i, r₂ > i. Then the nodes r₁ and r₂ belong to lower levels than node i. Hence they are both connected, via paths not including node i, to node N. Thus we have two separate paths connecting nodes i and N, i.e. we have a loop. But this contradicts the assumption that the graph is a tree. Hence there must be only one node adjacent to i with number > i. Q.E.D.

Footnotes

Figures in brackets indicate the literature references at the end of this paper.

6. References

[1].Shier D. R., Inverting sparse matrices by tree partitioning, J. Res. Nat. Bur. Stand (U.S.), 80B (Math. Sci.), No. 2, pp. 245–257 (April–June 1976). [Google Scholar]
[2].Cuthill E., Several strategies for reducing the bandwidth of a matrix In: Sparse Matrices and their applications, Ed. D. J. Rose and R. A. Willoughby pp. 157–160 (Plenum Press, New York, N. Y., 1972). [Google Scholar]

[R1] [1].Shier D. R., Inverting sparse matrices by tree partitioning, J. Res. Nat. Bur. Stand (U.S.), 80B (Math. Sci.), No. 2, pp. 245–257 (April–June 1976). [Google Scholar]

[R2] [2].Cuthill E., Several strategies for reducing the bandwidth of a matrix In: Sparse Matrices and their applications, Ed. D. J. Rose and R. A. Willoughby pp. 157–160 (Plenum Press, New York, N. Y., 1972). [Google Scholar]

PERMALINK

Some Comments on Shier’s Paper for Inverting Sparse Matrices^*

J M McNamee

Abstract

1. Introduction

2. Solution of Equations

3. A More Economical Method

4. Operation Count

5. Labelling of Tree Nodes

Footnotes

6. References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Some Comments on Shier’s Paper for Inverting Sparse Matrices*

J M McNamee

Abstract

1. Introduction

2. Solution of Equations

3. A More Economical Method

4. Operation Count

5. Labelling of Tree Nodes

Footnotes

6. References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Some Comments on Shier’s Paper for Inverting Sparse Matrices^*