Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2002 Apr 16;99(8):5217–5222. doi: 10.1073/pnas.032677299

The Pythagorean Theorem: II. The infinite discrete case

Richard V Kadison 1,
PMCID: PMC122749  PMID: 16578869

Abstract

The study of the Pythagorean Theorem and variants of it as the basic result of noncommutative, metric, Euclidean Geometry is continued. The emphasis in the present article is the case of infinite discrete dimensionality.

1. Introduction

We continue our study of the Pythagorean Theorem begun in ref. 1. The numbering of results and remarks in ref. 1 will be used in this article in two ways: A reference to Proposition 3 is a reference to that proposition in ref. 1, and the results in this article will be numbered from 13 on (following Proposition 12, the last result in ref. 1).

Our focus in this article is the case of infinite-dimensional Hilbert space and an infinite-dimensional subspace, although we study first the case of a finite-dimensional subspace of infinite-dimensional space. The novelty in that case is that we are projecting the vectors of an infinite orthonormal basis onto the finite-dimensional subspace. Propositions 1 and 2 apply, as they stand, to prove those variants of the Pythagorean Theorem in that situation. The Carpenter's Theorem for that case is quite another matter. Several “proofs” of it were developed, the first of which was invalid: as argued, the “proof” yielded results that did not respect necessary conditions (discovered later). There is a small warning here: the intricacies of the arguments in the case of an infinite-dimensional subspace with infinite-dimensional orthogonal complement are needed to cause certain infinitely repeated processes to produce convergent sums. That convergence is far less automatic than might sometimes seem natural. The shortest of our arguments in the case of the finite-dimensional subspace was not short. Junhao Shen suggested a change in strategy that produces a shortened version, which appears as the proof of Theorem 13. I am happy to express my gratitude for his suggestion. The results involving finite-dimensional subspaces appear in the next section.

In the third section, we study the case of an infinite-dimensional subspace with infinite-dimensional complement. Although the precise formula of Proposition 3 does not apply in that case, the “integrality condition” implicit in that assertion (and mentioned) plays a crucial role in these results.

In the last section, we resume the examination of the relation between the Pythagorean Theorem and doubly stochastic (now, infinite) matrices. We produce such matrices with a block and its complement of finite weight (as promised). Although the formula of Proposition 12 is not applicable here, the integrality condition is and is proved in the concluding section.

2. Finite and Cofinite-Dimensional Subspaces

We prove versions of the Carpenter's Theorem for subspaces of infinite-dimensional Hilbert space ℋ that are finite or cofinite-dimensional in ℋ. From Proposition 2, if we specify numbers in [0, 1], they must have sum m if they are to be the squares of the lengths of the projections of the elements in an orthonormal basis onto some m-dimensional subspace of ℋ. Subject to this condition, is there such a subspace? There is, as we shall show. The proof uses the finite case (Theorem 6) and some additional constructions.

Theorem 13.

If {eb}b∈𝔹is an orthonormal basis for the Hilbert space ℋ and numberstbin [0, 1] are specified, there is an m-dimensional subspace 𝒱 of ℋ such that Feb2 = tbfor each b in 𝔹, whereF is the projection with range 𝒱, if and only ifb∈𝔹tb = m.

Proof:

Let 𝔹0 be {b ∈ 𝔹 : tb = 0} and ℋ0 the closed linear span of {eb}b∈𝔹0. Since ∑b∈𝔹∖𝔹0tb = m and tb > 0 when b ∉ 𝔹0, 𝔹∖𝔹0is a countable set. Note that m = ∑b∈𝔹∖𝔹0tb ≤ ∑b∈𝔹∖𝔹0 1 =dim(ℋ⊖ℋ0). If we find 𝒱 in ℋ⊖ℋ0 such that ∥Feb2 = tb for each b in 𝔹∖𝔹0, we are done, because Feb = 0 when b ∈ 𝔹0. Restricting to ℋ⊖ℋ0, we may assume that ℋ is separable and that each tb > 0. We have dealt with the case where ℋ has finite dimension. Henceforth, we assume that ℋ has dimension ℵ0, that our given orthonormal basis is e1, e2, … , and that the specified numbers are a1, a2, … .

Some further reductions are useful. By restricting our attention to the orthogonal complement of the subspace of ℋ spanned by the basis elements ej for which aj is 0 or 1 and constructing a projection with matrix having diagonal the remaining aj with respect to the remaining ej, we may assume that each aj ∈ (0, 1). Since ∑Inline graphicaj = m, there are at most a finite number of aj greater than a given number, and these can be written in decreasing order. Thus, for some permutation π of ℕ, aπ(1)aπ(2) ≥ ⋯ . Suppose Eπ is a projection with matrix relative to e1, e2, … having diagonal aπ(1), aπ(2), … . Then U*πEπUπ has matrix with diagonal a1, a2, … , where Uπ is the permutation unitary that maps ej to eπ−1(j) for each j in ℕ. It suffices, in general, to construct our matrix with the specified diagonal in any order.

We may assume that m ≥ 2 (from Proposition 1) and that a1a2 ≥ ⋯ with each aj in (0, 1). Now, ∑Inline graphicaj = ma1 > m − 1. Thus there is an s such that ∑Inline graphicaj < m − 1 and ∑Inline graphicajm − 1. If t = ∑Inline graphicaj − (m − 1), then tas+1. From Theorem 6, there is a projection E of rank m − 1 with matrix relative to the basis e2, e3, … , es+1 having a2, a3, … , as+1t as its diagonal. Moreover, ∑Inline graphicaj + a1 + t = 1. Thus, from Proposition 1, there is a projection G of rank 1 with matrix relative to the basis e1, es+2, es+3, … whose diagonal is a1 + t, as+2, as+3, … . Noting that as+1tas+1a1a1 + t, we can splice the two projections E and G together and then “permute” the splice, to form a projection F whose matrix relative to the basis e1, e2, … has diagonal a1, a2, … . ■

Following this fifteenth variation, the situation in which the projections of the elements of our orthonormal basis {en} on the subspace 𝒱 of ℋ have lengths whose squares sum to ∞ remains to be studied. In this case, from Proposition 2, 𝒱 must be infinite dimensional. Once again, the question of whether the squares of the lengths can be assigned in [0, 1] arbitrarily, subject only to the condition that their sum diverges is the more involved aspect of this case. Our experience, to this point, makes it tempting to believe that the question has an affirmative answer. Some further consideration makes it clear that there is more to be said.

Again, the question can be formulated in terms of projections, their matrices relative to {en}, and diagonals of those matrices. If 𝒱 is the (infinite-dimensional) subspace for the specified lengths a1, a2, … and E is the projection with 𝒱 as range, then IE has ℋ⊖𝒱, the orthogonal complement of 𝒱, as its range. The matrix for the projection IE has 1 − a1, 1 − a2, … as its diagonal. If ∑j1 − aj converges, it must converge to an integer, since IE is a projection. If we choose the aj such that ∑j1 − aj converges, but not to an integer (e.g., let aj be 1 − (1/j2)), then aj → 1, whence ∑aj = ∞, and there is no projection with diagonal a1, a2, … relative to {ej}. (If E were such a projection, IE would have diagonal 1 − a1, 1 − a2, … , with finite sum other than an integer, contradicting Proposition 2). However, if ∑j1 − aj = m, with m an integer, then there is a projection IE with diagonal 1 − a1, 1 − a2, … , and hence a projection E with diagonal a1, a2, … . This discussion provides our sixteenth variation.

Theorem 14.

If {ea}a∈𝔸is an orthonormal basis for the Hilbert space ℋ and{ta}a∈𝔸is a family of numbers in [0, 1], there is an infinite-dimensional subspace 𝒱 of ℋ with m-dimensional orthogonal complement such thatFea2 = tafor each a in 𝔸, where F is the projection with range 𝒱, if and only ifa∈𝔸1 − ta = m.

3. Infinite-Dimensional Subspaces with Infinite-Dimensional Complement

In the context of orthonormal bases for Hilbert space, there remains the case where ∑a∈𝔸ta and ∑a∈𝔸1 − ta diverge. This case leads to our seventeenth variation. As we shall see in the course of the proof, and as noted in the statement, there is more to the story than the divergence of the two sums noted. To recognize this in advance, we need only consider the case where the assigned diagonal entries consist of an infinite number of 0s and terms a1, a2, … in [½, 1] such that ∑1 − aj converges to a number a not an integer. If E is a projection whose matrix has that diagonal, then the restriction of E to the space generated by the basis elements corresponding to all the aj is also a projection of the sub-Hilbert space onto some subspace of it. Relative to those basis elements corresponding to the aj, that projection has a matrix whose diagonal has entries a1, a2, … . But ∑1 − aj is a number other than an integer, by assumption. We have seen that such a diagonal is not a possibility for a projection. This restriction appears in more complex form as well. If we replace a finite number of the 0s by numbers r1, … , rk in (0, ½) such that (r1 + ⋯ + rk) − ∑Inline graphic(1 − aj) is not an integer, then the resulting assignment of numbers is not the diagonal of a projection. This reduces to the case just considered by noting that, with the present assumption, ∑Inline graphic(1 − aj) + (1 − r1) + ⋯ + (1 − rk) is not an integer. Of course, the restriction on the diagonal in this case is foreshadowed by Proposition 3 (and the remarks following it).

To simplify the discussion and focus on essentials, we deal with the case where ℋ has dimension ℵ0.

Theorem 15.

Let {e1, e2, …} be an orthonormal basis for the Hilbert space ℋ and numbersa1, a2, … in [0, 1]be specified. Let a1, a2, … be the ajin (½, 1], a"1, a"2, … those in[0, ½], a the sum of thea"j, and b the sumInline graphic1 − aj.There is an infinite-dimensional subspace 𝒱 of ℋ with infinite-dimensional complement such thatFej2 = ajfor each j, where F is the projection with range 𝒱, if and only ifInline graphicajand Inline graphic1 − ajdiverge and either of a or b is infinite or both are finite and ab is an integer.

Proof:

As in the proof of Theorem 13, by restricting to the orthogonal complement of the subspace of ℋ generated by the basis elements ej for which aj is either 0 or 1, we may assume that each aj ∈ (0, 1). Pursuing this same idea, we note that if {ℕj}j=1,2,…is a set of mutually disjoint subsets of ℕ with union ℕ such that we can find a projection Ej with range contained in the closed subspace generated by all en for which n ∈ ℕj, with ∥Ejen2 = an, for each such en, and Ejem = 0 for each other em, then ∑Inline graphicEj (= F) is a projection such that ∥Fej2 = aj for each j.

Let ℕ′ be the set of aj-indices of aj and ℕ" those of a"j. We suppose, first, that ∑Inline graphica"j = ∞. Let n(1) be the least integer n such that a1 + a"1 + ⋯ + a"n ≥ 3. Since a1 < 1 and each a"j ≤ ½, n(1) ≥ 5. Let b2, … , bn(1) be a"2, … , a"n(1) rearranged in decreasing order (so, b2b3 ≥ ⋯ ≥ bn(1)). Let m(1) be the least integer m such that a1 + b1 + b2 + ⋯ + bm ≥ 3, where b1 = a"1. Then 5 ≤ m(1) ≤ n(1) and a1 + ∑Inline graphicbj < 3. Let ã be 3 − a1 − ∑Inline graphicbj, bm(1)−1 be bm(1)−1 + ã, and bm(1) be bm(1) − ã. Then a1 + ∑Inline graphicbj + bm(1)−1 = 3, and

graphic file with name M17.gif

(∗)

[For the first inequality of (∗), note that a1 + ∑Inline graphicbj ≥ 3, whence bm(1) ≥ 3 − a1 − ∑Inline graphicbj = ã; for the last inequality, note that bm(1) is some a"j (≤ ½) as is bm(1)−1. Thus bm(1)−1 = bm(1)−1 + ã ≤ bm(1)−1 + bm(1) ≤ 1.]

Let ℕ1 be the set of indices of the aj in {a1, a"1, … , a"n(1)} corresponding to a1, b1, … , bm(1)−1. [Recall that b1, … , bn(1) is a rearrangement of a"1, … , a"n(1).] Let j(1), j(2), … be the numbers in ℕ"∖ℕ1 in ascending order, except that j(1) is the index of the aj in {a"1, … , a"n(1)} that bm(1) represents, and j(2) is the index of the aj that a2 represents. Let n(2) be the least integer n such that a2 + bm(1) + ∑Inline graphicaj(k) ≥ 3. Let c1 be bm(1), c2 be a2, and c3, … , cn(2) be aj(3), … , aj(n(2)) rearranged in decreasing order except that c3 is aj(3). [Note that the smallest number in ℕ"∖ℕ1 is one of j(1) or j(3).] Let m(2) be the least integer m such that ∑Inline graphiccj ≥ 3. Then ∑Inline graphiccj < 3, m(2) ≤ n(2), and m(2) ≥ 6. Let b̃ be 3 − ∑Inline graphiccj, cm(2)−1 be cm(2)−1 + b̃, and cm(2) be cm(2) − b̃. Note that 0 < b̃ ≤ cm(2) ≤ ½, whence

graphic file with name M24.gif

We repeat this process, letting ℕ2 be j(1), j(2) and the set of indices of the aj in {aj(3), … , aj(n(2))} corresponding to c3, … , cm(2)−1. Let k(1), k(2), … be the numbers in ℕ"∖(ℕ1 ∪ ℕ2) in ascending order, except that k(1) is the index of the aj in {aj(1), … , aj(n(2))} that cm(2) represents, and k(2) is the index of the aj that a3 represents. Let n(3) be the least integer n such that a3 + cm(2) + ∑Inline graphicak(r) ≥ 3. Let d1 be cm(2), d2 be a3, and d3, … , dn(3) be ak(3), … , ak(n(3)) rearranged in decreasing order except that d3 is ak(3). Again, the smallest number in ℕ"∖(ℕ1 ∪ ℕ2) is one of k(1) or k(3). Let m(3) be the least integer m such that ∑Inline graphicdj ≥ 3. Then ∑Inline graphicdj < 3, m(3) ≤ n(3), and m(3) ≥ 6. Let c̃ be 3 − ∑Inline graphicdj, dm(3)−1 be dm(3)−1 + c̃, and dm(3) be dm(3) − c̃. Note that 0 < c̃ ≤ dm(3) ≤ ½ so that

graphic file with name M29.gif

Continuing in this way, we construct disjoint subsets ℕ1, ℕ2, … of ℕ with union ℕ. In addition, if r(1), … , r(m(j) − 1) are the elements of ℕj, with r(3), … , r(m(j) − 1) in ascending order, there are alterations ãr(1) and ãr(m(j)−1) of ar(1) and ar(m(j)−1), as described, such that ãr(1) + aj + ∑Inline graphicar(k) + ãr(m(j)−1) = 3. At the same time, with s(1), … , s(m(j + 1) − 1), the elements of ℕj+1 and ãs(1), as(2), as(3), … , as(m(j+1)−2), ãs(m(j+1)−1) summing to 3, with ãs(1) and ãs(m(j+1)−1) the elements as(1) and as(m(j+1)−1) altered as described, we have ãr(m(j)−1) + ãs(1) = ar(m(j)−1) + as(1) and

graphic file with name M31.gif

We include the possibility that ℕ′ is finite (or null) in the foregoing argument. If ℕ′ is {a1, … , aj−1}, we eliminate the references to “ak” in the construction of ℕk when kj.

From Theorem 6, there are three-dimensional projections Ej and Ej+1 whose matrices relative to the bases er(1), … , er(m(j)−1) and es(1), … , es(m(j+1)−1) have diagonals ãr(1), ar(2), ar(3), … , ar(m(j)−2), ãr(m(j)−1) and ãs(1), as(2), as(3), … , as(m(j+1)−2), ãs(m(j+1)−1), respectively. We extend each of the projections Ej to a projection (denoted, again, by “Ej”) defined on all of ℋ by letting it annihilate all other basis elements. Then, because the sets ℕj are disjoint, EjEk = 0 when jk. Let E be ∑Inline graphicEj. The next part of this argument is devoted to describing the splicing used to transform E into the projection with the specified diagonal.

We transform E by means of a sequence of unitary operators Wnn) of the form appearing in the proof of Theorem 7. In the present case, with h(1) the index of the aj represented by bm(1)−1 [recall that j(1) is the index of the aj represented by bm(1)], W11)eh(1) =sinθ1eh(1) +cosθ1ej(1), W11)ej(1) = −cosθ1eh(1) +sinθ1ej(1), W11)ej = ej for all other ej, whence

graphic file with name M33.gif

and

graphic file with name M34.gif

Here, θ1 is chosen [as it may be, by virtue of (∗)] so that the convex combination bm(1)−1sin2θ1 + bm(1)cos2θ1 is bm(1)−1, from which bm(1)−1cos2θ1 + bm(1)sin2θ1 = bm(1), since bm(1)−1 + bm(1) = bm(1)−1 + bm(1).

In the same way, we define W22). If g(1) is the index of the aj represented by cm(2)−1, then W22)eg(1) =sinθ2eg(1) +cosθ2ek(1), W22)ek(1) =−cosθ2eg(1) +sinθ2ek(1). [Recall that k(1) is the index of the aj represented by cm(2).] Again, W22)EW22)* “splices” cm(2)−1 and cm(2) to cm(2)−1 and cm(2), when θ2 is suitably chosen, and leaves other “diagonal entries” of E unaltered. Note, too, that W11)W22)ej = W22)W11)ej for each j. Here, we use the fact that ℕ1 and ℕ2 each contain four or more elements so that no two of j(1), h(1), g(1), and k(1) are equal. Thus W11) and W22) commute.

Note that Wkk)ej = ej unless j ∈ ℕk ∪ ℕk+1. Thus Wkk)En = En if n is neither k nor k + 1, since En has range in the space generated by ej with j in ℕn. Suppose j ∈ ℕn. Then

graphic file with name M35.gif
graphic file with name M36.gif
graphic file with name M37.gif
graphic file with name M38.gif
graphic file with name M39.gif

In any event, Fr(ej) = Fs(ej) when r and s exceed n + 1. Thus {Fr(ej)} converges as r → ∞ for each fixed j. As {Fr} is a sequence of projections, it is uniformly bounded by 1. The basis {ej} generates a dense linear manifold on each element of which {Fr} acts to produce a convergent sequence of vectors in ℋ. It follows that {Frx} is Cauchy convergent, hence convergent for each x in ℋ. If Fx is its limit, then F is linear and ∥F∥ ≤ 1. Thus the sequence of projections {Fr} is strong-operator convergent to F, and F is a projection whose matrix relative to {ej} has diagonal {aj} in some order.

The foregoing argument establishes our result when a = ∞. If b = ∞, the argument shows that there is a projection G with b1, b2, … as its diagonal, where bj = 1 − aj. The diagonal of IG is a1, a2, … (= 1 − b1, 1 − b2, …). It remains to treat the case where a and b are finite and ab is an integer. With this assumption, and the added hypothesis that both the sums of the aj and of the bj are infinite, ℕ′ and ℕ" must be infinite sets. There are at most a finite number of a"j exceeding 1/n for a given positive integer n. By arranging those a"j in decreasing order and letting n take on the values 1, 2, … , successively, we may re-label the a"j so that a"1a"2 ≥ ⋯ . Similarly, we may assume that b"1b"2 ≥ ⋯ , where b"j = 1 − aj, whence a1a2 ≤ ⋯ . As ∑a"j and ∑b"j are finite, we have that a"j → 0 and b"j → 0 as j → ∞. It follows that aj → 1 as j → ∞. We may assume that ab (otherwise, we work with b1, b2, …).

We discuss a procedure for “distributing the a"j among a1, a2, … .” Let n be the smallest integer k such that a"1 ≤ ∑Inline graphicb"j. Then ∑Inline graphicb"j < a"1. Replace a1, … , an−1 by 1, … , 1 and an by an + a"1 − ∑Inline graphicb"j(= ã′n). Then 0 < an < ã′nand a"1 − ∑Inline graphicb"jb"n, soã′nan + b"n = 1. In addition,

graphic file with name M44.gif

It follows from Lemma 5 that (an, … , a1, a"1) is contained in the permutation polytope generated by (1, … , 1, ã′n, 0). As at the end of the proof of Theorem 6, there is a unitary operator U1 on ℋ such that U1ek = ek for each k not in ℕ1, the set of indices {j(1), … , j(n + 1)}, of the aj in {an, … , a1, a"1}, which restricts to 𝔜1, the subspace of ℋ generated by ej(1), … , ej(n+1), as a unitary operator U1 such that U1AU1* has matrix with diagonal a"1, a1, … , an, when A, on 𝔜1, has matrix with diagonal 0, 1, … , 1, ã′n relative to the basis ej(1), … , ej(n+1).

We now distribute a"2 among an+1, an+2, … by the procedure just described, forming a finite subset ℕ2 (= {k(1), … , k(m + 1)}) of ℕ, disjoint from ℕ1, a unitary operator U2 on ℋ, such that U2ek = ek for each k not in ℕ2, whose restriction U2 to 𝔜2, the space generated by ek(1), … , ek(m+1), transforms each operator A, on 𝔜2, whose matrix has diagonal 0, 1, … , 1, ã′n+m into one whose matrix has diagonal a"2, an+1, … , an+m relative to the basis ek(1), … , ek(m+1).

Continuing in this way, we construct disjoint subsets ℕ1, ℕ2, … of ℕ with union ℕ, commuting unitaries U1, U2, … on ℋ, and (finite-dimensional) subspaces 𝔜1, 𝔜2, … with span dense in ℋ, as described before. Distributing all the a"j among the aj yields an infinite sequence of 0s in place of the a"j and an infinite sequence ã1, ã2, … of numbers in (½, 1] such that ∑Inline graphic1 − ãj = ba, an integer, by assumption. (There is “room” for the distribution of all the a"j among the aj from the assumption that ab.) From Theorem 14, there is a projection E0 with diagonal ã1, ã2, … relative to the basis {ej}j∈ℕ′, and of course, a projection E with diagonal 0, 0, … , ã1, ã2, … relative to the basis {ej}. We organize the basis {ej} according to the sets ℕ1, ℕ2, … and the diagonal of the matrix for E such that the entries at the diagonal positions corresponding to numbers in ℕjare the numbers obtained from the matching step of the distribution procedure. If we now form U1EU*1, U2U1EU*1U*2, … , successively, we construct a sequence of projections that converges, in the strong-operator topology, to a projection F (by the same argument used for the first part of the proof, where we assumed that a is infinite). The diagonal of F relative to {ej} is a"1, a"2, … , a1, a2, … , that is, a1, a2, … , by choice of U1, U2, … .

Having completed the proof of the “Carpenter's Theorem” in this case, we are left with the task of verifying the curious “integrality” condition imposed on ab. In more detail, we let our orthonormal basis for ℋ be {ej}j∈ℤ0, where ℤ0 is the set of nonzero integers, and F be a projection on ℋ with matrix (ajk) relative to {ej}. Let ℤ and ℤ+ be the negative and positive integers in ℤ0, respectively. Our assumption now is that ∑Inline graphicajj (= a) and ∑Inline graphic1 − ajj (= b) are finite. We wish to prove that ab is an integer.

Let E be the projection whose range is spanned by {ej}j∈ℤ. In effect, we have assumed that the positive operators EFE and (IE)(IF)(IE) are of trace class (L1). It follows that FE, EF, (IF)(IE), and (IE)(IF) are operators of Hilbert–Schmidt Class (L2). Thus the sum of all |ajk|2 with j or k in ℤ converges as does the sum of all |bjk|2 with j or k in ℤ+, where (bjk) is the matrix of IF. If T is a Hilbert–Schmidt operator on ℋ, we denote by ∥T2 the Hilbert–Schmidt (L2) norm of T. That is, ∥TInline graphic is tr(T*T) (=tr(TT*)), where “tr(T*T)” denotes the sum of the diagonal entries of the matrix for T*T relative to an arbitrary orthonormal basis for ℋ, in particular, relative to {ej}j∈ℤ0. Thus ∥TInline graphic is the sum of the squares of the absolute values of all the entries of the matrix for T (or of T*). Since

graphic file with name M50.gif

with B a bounded operator on ℋ,

graphic file with name M51.gif

and ∥BT2 ≤ ∥B∥∥T2. In this notation, we have ∥EFInline graphic = ∥FEInline graphic =tr(EFE) = a and ∥(IF)(IE)∥Inline graphic = b.

Given a positive ɛ (<(2 + 2a1/2 + 2b1/2)−1), choose n0 in ℤ+ such that ∑j or k<−n0|ajk|2 and ∑j or k>n0|bjk|2 are each less than ɛ′2, where ɛ′ = ɛ[28(1 + a1/2 + b1/2)]−1. Let A be the matrix that has ajk as its j,k entry when |j| and |k| do not exceed n0, 1 at the j,j entry when j > n0, and 0 at all other entries. Then, by choice of n0 and A, ∥FA2 < 2ɛ′. Hence |∥F∥2 − ∥A∥2| < 2ɛ′.

Let E0 be the projection with range spanned by {e−1, … , en0}. Again, by choice of n0 and A, ∥EFE0A2 < 2ɛ′. Hence |∥EF2 − ∥E0A2| < 2ɛ′. Let E0 be the projection with range spanned by {e1, … , en0} and I0 be the projection E0 + E0. Then ∥(IE)A − (IE)F2 < 2ɛ′. At the same time, (IE)(IA) = E0(IA), whence

graphic file with name M55.gif
graphic file with name M56.gif
graphic file with name M57.gif
graphic file with name M58.gif

Note, too, that

graphic file with name M59.gif
graphic file with name M60.gif

Let A0 be I0FI0. Then A0AInline graphic = AA2, whence ∥A0AInline graphic2 < 6ɛ′. Of course, we may treat A0 as the 2n0 × 2n0 matrix (ajk)|j|,|k|≤n0. Since A0 is I0FI0, ∥A0∥ ≤ 1 and A0 is positive. Let λn0, … , λn0 be the 2n0 eigenvalues of A0 (with repetitions). Then λn0 − λInline graphic, … , λn0 − λInline graphic are the eigenvalues of A0AInline graphic. Let ɛj be λj − λInline graphic (= λj(1 − λj)). If λj ≥ ½, then (1 − λj)2 ≤ 4ɛInline graphic. If λj ≤ ½, then λInline graphic ≤ 4ɛInline graphic. We suppose that m0 of the eigenvalues λj are in [½, 1]. Let G0 be the projection whose matrix relative to the basis that diagonalizes A0 is diagonal with 1 in place of each of the λj in [½, 1] and 0 in place of the other λj. Then

graphic file with name M70.gif

and ∥A0G02 ≤ 2∥A0AInline graphic2 < 12ɛ′. Thus ∥E0A0E0G02 < 12ɛ′. Since E0A = E0A0,

graphic file with name M72.gif

At the same time, (IE)(IA) = E0(I0A0), whence

graphic file with name M73.gif
graphic file with name M74.gif

It follows that |∥EF2 − ∥E0G02| < 14ɛ′ < 1, and

graphic file with name M75.gif

From the foregoing, we have that

graphic file with name M76.gif
graphic file with name M77.gif
graphic file with name M78.gif

For the last inequality, we note that ∥E0G02 ≤ 1 + ∥EF2. In the same way,

graphic file with name M79.gif
graphic file with name M80.gif
graphic file with name M81.gif

Thus, as I0, E0, and G0 have ranks 2n0, n0, and m0, respectively, from Proposition 3,

graphic file with name M82.gif
graphic file with name M83.gif
graphic file with name M84.gif

Now, m0 and n0 vary with the choice of ɛ, but they are always integers. As ab is arbitrarily close to an integer, ab is an integer.

We have established that ab is an integer for an arbitrary subset 𝒮 of diagonal elements of F with convergent sum a whose complementary set of diagonal elements 𝒮′, subtracted from 1, also has a convergent sum b. To conclude our proof, we note that the existence of any such 𝒮 implies that the set 𝒮0 of ajj in [0, ½] is such a set. As 𝒮 has a convergent sum, it contains at most a finite number of ajj exceeding ½. Similarly, 𝒮′ contains at most a finite number of ajj less than or equal to ½. Shifting one finite set from 𝒮 to 𝒮′ and the other from 𝒮′ to 𝒮 produces the sets 𝒮0 and 𝒮′0 with convergent sums a0 and b0, respectively. Moreover, a0b0 and ab differ by an integer, as described in the comment following the proof of Proposition 3. Thus a0b0 is an integer if and only if ab is. ■

4. Pythagorean Matrices

In Remark 11, we discussed doubly stochastic matrices. We used Proposition 12 to provide another proof of Proposition 3, referring to Proposition 12 as a “Pythagorean Theorem” for finite doubly stochastic matrices. In this section, we consider infinite doubly stochastic matrices. We prove a Pythagorean Theorem for a class of them, the Pythagorean matrices (Proposition 16).

We say that a doubly stochastic matrix A (= (ajk)) is Pythagorean when there is a Hilbert space ℋ and two orthonormal bases {ej} and {fk} for ℋ such that ajk = |〈ej, fk〉|2. We show that a Pythagorean Theorem holds for Pythagorean matrices: The difference of the weights of complementary blocks is an integer when those weights are finite. As promised in Remark 11, we establish the existence of doubly stochastic matrices with infinite complementary blocks, each of which has finite weight and no zero diagonal entries. We use the “Carpenter” result in Theorem 15 and the notation of the proof of “integrality” for this.

When j ∈ ℤ, let aj be 2j. When j ∈ ℤ+, let aj be 1 − 2j. Since 1 = ∑j∈ℤ aj (= a) and 1 = ∑j∈ℤ+ 1 − aj (= b) and ab = 0, an integer, we may apply Theorem 15 to conclude that there is an infinite-dimensional subspace ℋ0 of ℋ with infinite-dimensional complement ℋ⊖ℋ0 (= ℋ′0) such that ∥Fej2 = aj, for each j in ℤ0, where F is the projection of ℋ on ℋ0. If j ∈ ℤ+, ∥(IF)ej2 = 1 − ∥Fej2 = 1 − aj = 2j. Let {fk}k∈ℤ+ and {fk}k∈ℤ be orthonormal bases for ℋ0 and ℋ′0, respectively. Let ajk be |〈ej, fk〉|2, so that (ajk) is an infinite doubly stochastic matrix, as noted before. Since

graphic file with name M85.gif

for each j in ℤ0, and ∑k∈ℤ ajk = ∥(IF)ej2 = 1 − aj, for each such j, we have that ∑j∈ℤk∈ℤ+ajk = ∑j∈ℤ2j = 1 and ∑j∈ℤ+k∈ℤ ajk = ∑j∈ℤ+ 2j = 1. Thus the weight of each of the ℤ+, ℤ and ℤ, ℤ+ complementary blocks is 1. Of course, we can construct other examples of infinite complementary blocks, each having finite weight, by using Theorem 15 in the manner just described.

We prove the analogue of Proposition 12 for Pythagorean matrices.

Proposition 16.

The difference of the weights of a block and its complementary block in an infinite Pythagorean matrix is an integer when those weights are finite.

Proof:

Since the complement of a finite block has infinite weight, we may assume that the block A0 and its complement A0 in the matrix A are infinite with finite weights. Assume that A is Pythagorean. We may also assume that A = (ajk)j,k∈ℤ0 and that A0 = (ajk)j,k∈ℤ (so that A0 = (ajk)j,k∈ℤ+). By assumption, there are orthonormal bases {ej}j∈ℤ0 and {fk}k∈ℤ0, for a Hilbert space ℋ, such that ajk = |〈ej, fk〉|2. Let ℋ0 be the closure of the space spanned by {fk}k∈ℤ and F the projection of ℋ on ℋ0. As in the proof of Proposition 12, ∥Fej2 is the sum of the entries of the jth row of A0, when j ∈ ℤ, while ∥(IF)ej2 is the sum of the entries in the jth row of A0, when j ∈ ℤ+. At the same time, ∥Fej2 is the j diagonal entry of the matrix for F and ∥(IF)ej2 is the j diagonal entry for the matrix of IF (matrices formed relative to the basis {ej}j∈ℤ0). Thus w(A0) is the sum of the j diagonal entries in the matrix for F with j in ℤ, and w(A0) is the sum of the j diagonal entries in the matrix for IF (that is, 1 minus the j diagonal entry for F), with j in ℤ+. These sums are finite, by assumption [because w(A0) and w(A0) are finite]. But Theorem 15 assures us that the difference of these sums is an integer. Thus w(A0) − w(A0) is an integer when A is Pythagorean. ■

There is a great deal more to be said about doubly stochastic matrices in this context, and there are a number of questions that have not been answered (for example: Are there non-Pythagorean doubly stochastic matrices?) That discussion must await another occasion.

References


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES