Abstract
The study of the Pythagorean Theorem and variants of it as the basic result of noncommutative, metric, Euclidean Geometry is continued. The emphasis in the present article is the case of infinite discrete dimensionality.
1. Introduction
We continue our study of the Pythagorean Theorem begun in ref. 1. The numbering of results and remarks in ref. 1 will be used in this article in two ways: A reference to Proposition 3 is a reference to that proposition in ref. 1, and the results in this article will be numbered from 13 on (following Proposition 12, the last result in ref. 1).
Our focus in this article is the case of infinite-dimensional Hilbert space and an infinite-dimensional subspace, although we study first the case of a finite-dimensional subspace of infinite-dimensional space. The novelty in that case is that we are projecting the vectors of an infinite orthonormal basis onto the finite-dimensional subspace. Propositions 1 and 2 apply, as they stand, to prove those variants of the Pythagorean Theorem in that situation. The Carpenter's Theorem for that case is quite another matter. Several “proofs” of it were developed, the first of which was invalid: as argued, the “proof” yielded results that did not respect necessary conditions (discovered later). There is a small warning here: the intricacies of the arguments in the case of an infinite-dimensional subspace with infinite-dimensional orthogonal complement are needed to cause certain infinitely repeated processes to produce convergent sums. That convergence is far less automatic than might sometimes seem natural. The shortest of our arguments in the case of the finite-dimensional subspace was not short. Junhao Shen suggested a change in strategy that produces a shortened version, which appears as the proof of Theorem 13. I am happy to express my gratitude for his suggestion. The results involving finite-dimensional subspaces appear in the next section.
In the third section, we study the case of an infinite-dimensional subspace with infinite-dimensional complement. Although the precise formula of Proposition 3 does not apply in that case, the “integrality condition” implicit in that assertion (and mentioned) plays a crucial role in these results.
In the last section, we resume the examination of the relation between the Pythagorean Theorem and doubly stochastic (now, infinite) matrices. We produce such matrices with a block and its complement of finite weight (as promised). Although the formula of Proposition 12 is not applicable here, the integrality condition is and is proved in the concluding section.
2. Finite and Cofinite-Dimensional Subspaces
We prove versions of the Carpenter's Theorem for subspaces of infinite-dimensional Hilbert space ℋ that are finite or cofinite-dimensional in ℋ. From Proposition 2, if we specify numbers in [0, 1], they must have sum m if they are to be the squares of the lengths of the projections of the elements in an orthonormal basis onto some m-dimensional subspace of ℋ. Subject to this condition, is there such a subspace? There is, as we shall show. The proof uses the finite case (Theorem 6) and some additional constructions.
Theorem 13.
If {eb}b∈𝔹is an orthonormal basis for the Hilbert space ℋ and numberstbin [0, 1] are specified, there is an m-dimensional subspace 𝒱 of ℋ such that ∥Feb∥2 = tbfor each b in 𝔹, whereF is the projection with range 𝒱, if and only if∑b∈𝔹tb = m.
Proof:
Let 𝔹0 be {b ∈ 𝔹 : tb = 0} and ℋ0 the closed linear span of {eb}b∈𝔹0. Since ∑b∈𝔹∖𝔹0tb = m and tb > 0 when b ∉ 𝔹0, 𝔹∖𝔹0is a countable set. Note that m = ∑b∈𝔹∖𝔹0tb ≤ ∑b∈𝔹∖𝔹0 1 =dim(ℋ⊖ℋ0). If we find 𝒱 in ℋ⊖ℋ0 such that ∥Feb∥2 = tb for each b in 𝔹∖𝔹0, we are done, because Feb = 0 when b ∈ 𝔹0. Restricting to ℋ⊖ℋ0, we may assume that ℋ is separable and that each tb > 0. We have dealt with the case where ℋ has finite dimension. Henceforth, we assume that ℋ has dimension ℵ0, that our given orthonormal basis is e1, e2, … , and that the specified numbers are a1, a2, … .
Some further reductions are useful. By restricting our attention to the
orthogonal complement of the subspace of ℋ spanned by the basis
elements ej for which aj
is 0 or 1 and constructing a projection with matrix having diagonal the
remaining aj with respect to the remaining
ej, we may assume that each
aj ∈ (0, 1). Since
∑
aj = m, there are
at most a finite number of aj greater than a
given number, and these can be written in decreasing order. Thus, for
some permutation π of ℕ, aπ(1) ≥
aπ(2) ≥ ⋯ . Suppose
Eπ is a projection with matrix relative to
e1, e2, … having diagonal
aπ(1), aπ(2), … . Then
U*πEπUπ
has matrix with diagonal a1,
a2, … , where Uπ is
the permutation unitary that maps ej to
eπ−1(j) for each
j in ℕ. It suffices, in general, to construct our matrix
with the specified diagonal in any order.
We may assume that m ≥ 2 (from Proposition
1) and that a1 ≥ a2
≥ ⋯ with each aj in (0, 1). Now,
∑
aj = m −
a1 > m − 1. Thus there is an s
such that ∑
aj < m −
1 and ∑
aj ≥ m
− 1. If t =
∑
aj − (m − 1),
then t ≤ as+1. From Theorem 6,
there is a projection E of rank m − 1 with
matrix relative to the basis e2, e3,
… , es+1 having a2,
a3, … , as+1 − t as its
diagonal. Moreover,
∑
aj +
a1 + t = 1. Thus, from Proposition
1, there is a projection G of rank 1 with matrix
relative to the basis e1, es+2,
es+3, … whose diagonal is
a1 + t, as+2,
as+3, … . Noting that
as+1 − t ≤ as+1 ≤
a1 ≤ a1 + t, we can splice the
two projections E and G together and then
“permute” the splice, to form a projection F whose
matrix relative to the basis e1,
e2, … has diagonal a1,
a2, … . ■
Following this fifteenth variation, the situation in which the projections of the elements of our orthonormal basis {en} on the subspace 𝒱 of ℋ have lengths whose squares sum to ∞ remains to be studied. In this case, from Proposition 2, 𝒱 must be infinite dimensional. Once again, the question of whether the squares of the lengths can be assigned in [0, 1] arbitrarily, subject only to the condition that their sum diverges is the more involved aspect of this case. Our experience, to this point, makes it tempting to believe that the question has an affirmative answer. Some further consideration makes it clear that there is more to be said.
Again, the question can be formulated in terms of projections, their matrices relative to {en}, and diagonals of those matrices. If 𝒱 is the (infinite-dimensional) subspace for the specified lengths a1, a2, … and E is the projection with 𝒱 as range, then I − E has ℋ⊖𝒱, the orthogonal complement of 𝒱, as its range. The matrix for the projection I − E has 1 − a1, 1 − a2, … as its diagonal. If ∑j1 − aj converges, it must converge to an integer, since I − E is a projection. If we choose the aj such that ∑j1 − aj converges, but not to an integer (e.g., let aj be 1 − (1/j2)), then aj → 1, whence ∑aj = ∞, and there is no projection with diagonal a1, a2, … relative to {ej}. (If E were such a projection, I − E would have diagonal 1 − a1, 1 − a2, … , with finite sum other than an integer, contradicting Proposition 2). However, if ∑j1 − aj = m, with m an integer, then there is a projection I − E with diagonal 1 − a1, 1 − a2, … , and hence a projection E with diagonal a1, a2, … . This discussion provides our sixteenth variation.
Theorem 14.
If {ea}a∈𝔸is an orthonormal basis for the Hilbert space ℋ and{ta}a∈𝔸is a family of numbers in [0, 1], there is an infinite-dimensional subspace 𝒱 of ℋ with m-dimensional orthogonal complement such that∥Fea∥2 = tafor each a in 𝔸, where F is the projection with range 𝒱, if and only if∑a∈𝔸1 − ta = m.
3. Infinite-Dimensional Subspaces with Infinite-Dimensional Complement
In the context of orthonormal bases for Hilbert space, there
remains the case where ∑a∈𝔸ta
and ∑a∈𝔸1 − ta diverge.
This case leads to our seventeenth variation. As we shall see in the
course of the proof, and as noted in the statement, there is
more to the story than the divergence of the two sums noted. To
recognize this in advance, we need only consider the case where
the assigned diagonal entries consist of an infinite number of 0s and
terms a1, a2, … in
[½, 1] such that ∑1 − aj
converges to a number a not an integer. If E is a
projection whose matrix has that diagonal, then the restriction of
E to the space generated by the basis elements corresponding
to all the aj is also a projection of the
sub-Hilbert space onto some subspace of it. Relative to those basis
elements corresponding to the aj, that
projection has a matrix whose diagonal has entries
a1, a2, … . But
∑1 − aj is a number other than an
integer, by assumption. We have seen that such a diagonal is not a
possibility for a projection. This restriction appears in more complex
form as well. If we replace a finite number of the 0s by numbers
r1, … , rk in (0, ½)
such that (r1 + ⋯ + rk) −
∑
(1 − aj) is not an
integer, then the resulting assignment of numbers is not the diagonal
of a projection. This reduces to the case just considered by
noting that, with the present assumption,
∑
(1 − aj) +
(1 − r1) + ⋯ + (1 −
rk) is not an integer. Of course, the restriction on
the diagonal in this case is foreshadowed by Proposition 3
(and the remarks following it).
To simplify the discussion and focus on essentials, we deal with the case where ℋ has dimension ℵ0.
Theorem 15.
Let {e1, e2, …} be
an orthonormal basis for the Hilbert space ℋ and numbersa1, a2, … in [0, 1]be specified. Let a′1,
a′2, … be the ajin (½, 1], a"1,
a"2, … those in[0, ½], a
the sum of thea"j, and b
the sum∑
1 − a′j.There is an infinite-dimensional subspace 𝒱 of ℋ
with infinite-dimensional complement such that∥Fej∥2 = ajfor each j, where
F
is
the
projection
with
range
𝒱, if
and
only
if ∑
ajand ∑
1 − ajdiverge and either of a
or b
is infinite
or both are finite and a − b
is an integer.
Proof:
As in the proof of Theorem 13, by restricting to the
orthogonal complement of the subspace of ℋ generated by the
basis elements ej for which
aj is either 0 or 1, we may assume that each
aj ∈ (0, 1). Pursuing this same idea, we
note that if {ℕj}j=1,2,…is a set of mutually disjoint subsets of ℕ with union ℕ
such that we can find a projection Ej with range
contained in the closed subspace generated by all
en for which n ∈
ℕj, with
∥Ejen∥2 =
an, for each such en, and
Ejem = 0 for each other
em, then
∑
Ej (= F) is a
projection such that ∥Fej∥2 =
aj for each j.
Let ℕ′ be the set of aj-indices of
a′j and ℕ" those of
a"j. We suppose, first, that
∑
a"j = ∞.
Let n(1) be the least integer n such that
a′1 + a"1 +
⋯ + a"n ≥ 3. Since
a′1 < 1 and each
a"j ≤ ½, n(1) ≥ 5.
Let b2, … , bn(1) be
a"2, … ,
a"n(1) rearranged in decreasing order (so,
b2 ≥ b3 ≥ ⋯ ≥
bn(1)). Let m(1) be the least integer
m such that a′1 +
b1 + b2 + ⋯ + bm
≥ 3, where b1 =
a"1. Then 5 ≤ m(1) ≤
n(1) and a′1 +
∑
bj < 3. Let
ã be 3 − a′1 −
∑
bj,
b′m(1)−1 be bm(1)−1 +
ã, and b′m(1) be
bm(1) − ã. Then
a′1 +
∑
bj +
b′m(1)−1 = 3, and
![]() |
(∗)
[For the first inequality of (∗), note that
a′1 +
∑
bj ≥ 3, whence
bm(1) ≥ 3 − a′1
− ∑
bj = ã; for
the last inequality, note that bm(1) is some
a"j (≤ ½) as is
bm(1)−1. Thus
b′m(1)−1 = bm(1)−1 +
ã ≤ bm(1)−1 + bm(1) ≤
1.]
Let ℕ1 be the set of indices of the
aj in {a′1,
a"1, … , a"n(1)}
corresponding to a′1, b1,
… , bm(1)−1. [Recall that
b1, … , bn(1) is a
rearrangement of a"1, … ,
a"n(1).] Let j(1), j(2), …
be the numbers in ℕ"∖ℕ1 in ascending order, except
that j(1) is the index of the aj in
{a"1, … ,
a"n(1)} that bm(1)
represents, and j(2) is the index of the
aj that a′2
represents. Let n(2) be the least integer n such
that a′2 +
b′m(1) +
∑
aj(k) ≥ 3. Let
c1 be b′m(1),
c2 be a′2, and
c3, … , cn(2) be
aj(3), … , aj(n(2)) rearranged
in decreasing order except that c3 is
aj(3). [Note that the smallest number in
ℕ"∖ℕ1 is one of j(1) or j(3).]
Let m(2) be the least integer m such that
∑
cj ≥ 3. Then
∑
cj < 3, m(2)
≤ n(2), and m(2) ≥ 6. Let b̃ be
3 − ∑
cj,
c′m(2)−1 be cm(2)−1 +
b̃, and c′m(2) be
cm(2) − b̃. Note that 0 <
b̃ ≤ cm(2) ≤ ½, whence
![]() |
We repeat this process, letting ℕ2 be j(1),
j(2) and the set of indices of the aj in
{aj(3), … , aj(n(2))}
corresponding to c3, … ,
cm(2)−1. Let k(1), k(2), … be the
numbers in ℕ"∖(ℕ1 ∪ ℕ2) in ascending
order, except that k(1) is the index of the
aj in {aj(1), … ,
aj(n(2))} that cm(2)
represents, and k(2) is the index of the
aj that a′3
represents. Let n(3) be the least integer n such
that a′3 +
c′m(2) +
∑
ak(r) ≥ 3. Let
d1 be c′m(2),
d2 be a′3, and
d3, … , dn(3) be
ak(3), … , ak(n(3)) rearranged
in decreasing order except that d3 is
ak(3). Again, the smallest number in
ℕ"∖(ℕ1 ∪ ℕ2) is one of
k(1) or k(3). Let m(3) be the least
integer m such that
∑
dj ≥ 3. Then
∑
dj < 3, m(3)
≤ n(3), and m(3) ≥ 6. Let c̃ be
3 − ∑
dj,
d′m(3)−1 be dm(3)−1 +
c̃, and d′m(3) be
dm(3) − c̃. Note that 0 <
c̃ ≤ dm(3) ≤ ½ so that
![]() |
Continuing in this way, we construct disjoint subsets
ℕ1, ℕ2, … of ℕ with union ℕ. In
addition, if r(1), … , r(m(j) − 1) are the
elements of ℕj, with r(3), … ,
r(m(j) − 1) in ascending order, there are alterations
ãr(1) and
ãr(m(j)−1) of
ar(1) and ar(m(j)−1), as
described, such that ãr(1) +
a′j +
∑
ar(k) +
ãr(m(j)−1) = 3. At the same time, with
s(1), … , s(m(j + 1) − 1), the elements of
ℕj+1 and ãs(1),
as(2), as(3), … ,
as(m(j+1)−2), ãs(m(j+1)−1) summing
to 3, with ãs(1) and
ãs(m(j+1)−1) the elements
as(1) and as(m(j+1)−1)
altered as described, we have ãr(m(j)−1)
+ ãs(1) = ar(m(j)−1) +
as(1) and
![]() |
We include the possibility that ℕ′ is finite (or null) in the foregoing argument. If ℕ′ is {a′1, … , a′j−1}, we eliminate the references to “a′k” in the construction of ℕk when k ≥ j.
From Theorem 6, there are three-dimensional projections
Ej and Ej+1 whose
matrices relative to the bases er(1), … ,
er(m(j)−1) and es(1), … ,
es(m(j+1)−1) have diagonals
ãr(1), ar(2), ar(3),
… , ar(m(j)−2), ãr(m(j)−1)
and ãs(1), as(2),
as(3), … , as(m(j+1)−2),
ãs(m(j+1)−1), respectively. We extend each of
the projections Ej to a projection (denoted,
again, by “Ej”) defined on all of
ℋ by letting it annihilate all other basis elements. Then,
because the sets ℕj are disjoint,
EjEk = 0 when j ≠
k. Let E be
∑
Ej. The next part of
this argument is devoted to describing the splicing used to transform
E into the projection with the specified diagonal.
We transform E by means of a sequence of unitary operators Wn(θn) of the form appearing in the proof of Theorem 7. In the present case, with h(1) the index of the aj represented by bm(1)−1 [recall that j(1) is the index of the aj represented by bm(1)], W1(θ1)eh(1) =sinθ1eh(1) +cosθ1ej(1), W1(θ1)ej(1) = −cosθ1eh(1) +sinθ1ej(1), W1(θ1)ej = ej for all other ej, whence
![]() |
and
![]() |
Here, θ1 is chosen [as it may be, by virtue of (∗)] so that the convex combination b′m(1)−1sin2θ1 + b′m(1)cos2θ1 is bm(1)−1, from which b′m(1)−1cos2θ1 + b′m(1)sin2θ1 = bm(1), since b′m(1)−1 + b′m(1) = bm(1)−1 + bm(1).
In the same way, we define W2(θ2). If g(1) is the index of the aj represented by cm(2)−1, then W2(θ2)eg(1) =sinθ2eg(1) +cosθ2ek(1), W2(θ2)ek(1) =−cosθ2eg(1) +sinθ2ek(1). [Recall that k(1) is the index of the aj represented by cm(2).] Again, W2(θ2)EW2(θ2)* “splices” c′m(2)−1 and c′m(2) to cm(2)−1 and cm(2), when θ2 is suitably chosen, and leaves other “diagonal entries” of E unaltered. Note, too, that W1(θ1)W2(θ2)ej = W2(θ2)W1(θ1)ej for each j. Here, we use the fact that ℕ1 and ℕ2 each contain four or more elements so that no two of j(1), h(1), g(1), and k(1) are equal. Thus W1(θ1) and W2(θ2) commute.
Note that Wk(θk)ej = ej unless j ∈ ℕk ∪ ℕk+1. Thus Wk(θk)En = En if n is neither k nor k + 1, since En has range in the space generated by ej with j in ℕn. Suppose j ∈ ℕn. Then
![]() |
![]() |
![]() |
![]() |
![]() |
In any event, Fr(ej) = Fs(ej) when r and s exceed n + 1. Thus {Fr(ej)} converges as r → ∞ for each fixed j. As {Fr} is a sequence of projections, it is uniformly bounded by 1. The basis {ej} generates a dense linear manifold on each element of which {Fr} acts to produce a convergent sequence of vectors in ℋ. It follows that {Frx} is Cauchy convergent, hence convergent for each x in ℋ. If Fx is its limit, then F is linear and ∥F∥ ≤ 1. Thus the sequence of projections {Fr} is strong-operator convergent to F, and F is a projection whose matrix relative to {ej} has diagonal {aj} in some order.
The foregoing argument establishes our result when a = ∞. If b = ∞, the argument shows that there is a projection G with b1, b2, … as its diagonal, where bj = 1 − aj. The diagonal of I − G is a1, a2, … (= 1 − b1, 1 − b2, …). It remains to treat the case where a and b are finite and a − b is an integer. With this assumption, and the added hypothesis that both the sums of the aj and of the bj are infinite, ℕ′ and ℕ" must be infinite sets. There are at most a finite number of a"j exceeding 1/n for a given positive integer n. By arranging those a"j in decreasing order and letting n take on the values 1, 2, … , successively, we may re-label the a"j so that a"1 ≥ a"2 ≥ ⋯ . Similarly, we may assume that b"1 ≥ b"2 ≥ ⋯ , where b"j = 1 − a′j, whence a′1 ≤ a′2 ≤ ⋯ . As ∑a"j and ∑b"j are finite, we have that a"j → 0 and b"j → 0 as j → ∞. It follows that a′j → 1 as j → ∞. We may assume that a ≤ b (otherwise, we work with b1, b2, …).
We discuss a procedure for “distributing the
a"j among a′1,
a′2, … .” Let n be the
smallest integer k such that
a"1 ≤
∑
b"j. Then
∑
b"j <
a"1. Replace a′1,
… , a′n−1 by 1, … , 1 and
a′n by
a′n + a"1 −
∑
b"j(=
ã′n). Then 0 <
a′n < ã′nand a"1 −
∑
b"j ≤
b"n, soã′n ≤
a′n + b"n = 1. In
addition,
![]() |
It follows from Lemma 5 that (a′n, … , a′1, a"1) is contained in the permutation polytope generated by (1, … , 1, ã′n, 0). As at the end of the proof of Theorem 6, there is a unitary operator U1 on ℋ such that U1ek = ek for each k not in ℕ1, the set of indices {j(1), … , j(n + 1)}, of the aj in {a′n, … , a′1, a"1}, which restricts to 𝔜1, the subspace of ℋ generated by ej(1), … , ej(n+1), as a unitary operator U′1 such that U′1AU′1* has matrix with diagonal a"1, a′1, … , a′n, when A, on 𝔜1, has matrix with diagonal 0, 1, … , 1, ã′n relative to the basis ej(1), … , ej(n+1).
We now distribute a"2 among a′n+1, a′n+2, … by the procedure just described, forming a finite subset ℕ2 (= {k(1), … , k(m + 1)}) of ℕ, disjoint from ℕ1, a unitary operator U2 on ℋ, such that U2ek = ek for each k not in ℕ2, whose restriction U′2 to 𝔜2, the space generated by ek(1), … , ek(m+1), transforms each operator A, on 𝔜2, whose matrix has diagonal 0, 1, … , 1, ã′n+m into one whose matrix has diagonal a"2, a′n+1, … , a′n+m relative to the basis ek(1), … , ek(m+1).
Continuing in this way, we construct disjoint subsets ℕ1,
ℕ2, … of ℕ with union ℕ, commuting unitaries
U1, U2, … on ℋ, and
(finite-dimensional) subspaces 𝔜1,
𝔜2, … with span dense in ℋ, as described
before. Distributing all the a"j among
the a′j yields an infinite
sequence of 0s in place of the a"j and
an infinite sequence ã1,
ã2, … of numbers in (½, 1] such
that ∑
1 −
ãj = b − a, an integer, by
assumption. (There is “room” for the distribution of
all the a"j among the
a′j from the assumption that
a ≤ b.) From Theorem 14, there is a
projection E0 with diagonal
ã1, ã2, … relative
to the basis {ej}j∈ℕ′, and
of course, a projection E with diagonal 0, 0,
… , ã1, ã2, …
relative to the basis {ej}. We organize the
basis {ej} according to the sets
ℕ1, ℕ2, … and the diagonal of the
matrix for E such that the entries at the diagonal positions
corresponding to numbers in ℕjare the
numbers obtained from the matching step of the distribution procedure.
If we now form U1EU*1,
U2U1EU*1U*2,
… , successively, we construct a sequence of projections
that converges, in the strong-operator topology, to a projection
F (by the same argument used for the first part of the
proof, where we assumed that a is infinite). The diagonal of
F relative to {ej} is
a"1, a"2, … ,
a′1, a′2, … , that
is, a1, a2, … , by choice of
U1, U2, … .
Having completed the proof of the “Carpenter's Theorem” in this
case, we are left with the task of verifying the curious
“integrality” condition imposed on a − b. In
more detail, we let our orthonormal basis for ℋ be
{ej}j∈ℤ0, where
ℤ0 is the set of nonzero integers, and F be a
projection on ℋ with matrix (ajk)
relative to {ej}. Let ℤ− and
ℤ+ be the negative and positive integers in
ℤ0, respectively. Our assumption now is that
∑
ajj (= a) and
∑
1 − ajj (= b) are
finite. We wish to prove that a − b is an integer.
Let E be the projection whose range is spanned by
{ej}j∈ℤ−. In effect,
we have assumed that the positive operators EFE and
(I − E)(I − F)(I − E) are of trace class
(L1). It follows that FE, EF, (I −
F)(I − E), and (I − E)(I − F) are
operators of Hilbert–Schmidt Class (L2). Thus
the sum of all |ajk|2 with
j or k in ℤ− converges as does the
sum of all |bjk|2 with
j or k in ℤ+, where
(bjk) is the matrix of I − F.
If T is a Hilbert–Schmidt operator on ℋ, we denote
by ∥T∥2 the Hilbert–Schmidt
(L2−) norm of T. That is,
∥T∥
is tr(T*T) (=tr(TT*)), where “tr(T*T)” denotes the
sum of the diagonal entries of the matrix for T*T relative
to an arbitrary orthonormal basis for ℋ, in particular,
relative to {ej}j∈ℤ0.
Thus ∥T∥
is the sum of the squares of
the absolute values of all the entries of the matrix for T
(or of T*). Since
![]() |
with B a bounded operator on ℋ,
![]() |
and ∥BT∥2 ≤
∥B∥∥T∥2. In this notation, we have
∥EF∥
= ∥FE∥
=tr(EFE) = a and ∥(I − F)(I −
E)∥
= b.
Given a positive ɛ (<(2 + 2a1/2 + 2b1/2)−1), choose n0 in ℤ+ such that ∑j or k<−n0|ajk|2 and ∑j or k>n0|bjk|2 are each less than ɛ′2, where ɛ′ = ɛ[28(1 + a1/2 + b1/2)]−1. Let A be the matrix that has ajk as its j,k entry when |j| and |k| do not exceed n0, 1 at the j,j entry when j > n0, and 0 at all other entries. Then, by choice of n0 and A, ∥F − A∥2 < 2ɛ′. Hence |∥F∥2 − ∥A∥2| < 2ɛ′.
Let E0 be the projection with range spanned by {e−1, … , e−n0}. Again, by choice of n0 and A, ∥EF − E0A∥2 < 2ɛ′. Hence |∥EF∥2 − ∥E0A∥2| < 2ɛ′. Let E′0 be the projection with range spanned by {e1, … , en0} and I0 be the projection E0 + E′0. Then ∥(I − E)A − (I − E)F∥2 < 2ɛ′. At the same time, (I − E)(I − A) = E′0(I − A), whence
![]() |
![]() |
![]() |
![]() |
Note, too, that
![]() |
![]() |
Let A0 be
I0FI0. Then
A0 − A
= A −
A2, whence ∥A0 −
A
∥2 < 6ɛ′. Of course, we
may treat A0 as the 2n0
× 2n0 matrix
(ajk)|j|,|k|≤n0. Since
A0 is I0FI0,
∥A0∥ ≤ 1 and A0 is
positive. Let λ−n0, … ,
λn0 be the 2n0
eigenvalues of A0 (with repetitions). Then
λ−n0 −
λ
, … ,
λn0 − λ
are the eigenvalues of A0 −
A
. Let ɛj be
λj − λ
(=
λj(1 − λj)). If
λj ≥ ½, then (1 −
λj)2 ≤ 4ɛ
. If
λj ≤ ½, then
λ
≤ 4ɛ
. We suppose
that m0 of the eigenvalues
λj are in [½, 1]. Let
G0 be the projection whose matrix relative to
the basis that diagonalizes A0 is diagonal with
1 in place of each of the λj in
[½, 1] and 0 in place of the other
λj. Then
![]() |
and ∥A0 −
G0∥2 ≤ 2∥A0 −
A
∥2 < 12ɛ′. Thus
∥E0A0 −
E0G0∥2 < 12ɛ′. Since
E0A = E0A0,
![]() |
At the same time, (I − E)(I − A) = E′0(I0 − A0), whence
![]() |
![]() |
It follows that |∥EF∥2 − ∥E0G0∥2| < 14ɛ′ < 1, and
![]() |
From the foregoing, we have that
![]() |
![]() |
![]() |
For the last inequality, we note that ∥E0G0∥2 ≤ 1 + ∥EF∥2. In the same way,
![]() |
![]() |
![]() |
Thus, as I0, E0, and G0 have ranks 2n0, n0, and m0, respectively, from Proposition 3,
![]() |
![]() |
![]() |
Now, m0 and n0 vary with the choice of ɛ, but they are always integers. As a − b is arbitrarily close to an integer, a − b is an integer.
We have established that a − b is an integer for an arbitrary subset 𝒮 of diagonal elements of F with convergent sum a whose complementary set of diagonal elements 𝒮′, subtracted from 1, also has a convergent sum b. To conclude our proof, we note that the existence of any such 𝒮 implies that the set 𝒮0 of ajj in [0, ½] is such a set. As 𝒮 has a convergent sum, it contains at most a finite number of ajj exceeding ½. Similarly, 𝒮′ contains at most a finite number of ajj less than or equal to ½. Shifting one finite set from 𝒮 to 𝒮′ and the other from 𝒮′ to 𝒮 produces the sets 𝒮0 and 𝒮′0 with convergent sums a0 and b0, respectively. Moreover, a0 − b0 and a − b differ by an integer, as described in the comment following the proof of Proposition 3. Thus a0 − b0 is an integer if and only if a − b is. ■
4. Pythagorean Matrices
In Remark 11, we discussed doubly stochastic matrices. We used Proposition 12 to provide another proof of Proposition 3, referring to Proposition 12 as a “Pythagorean Theorem” for finite doubly stochastic matrices. In this section, we consider infinite doubly stochastic matrices. We prove a Pythagorean Theorem for a class of them, the Pythagorean matrices (Proposition 16).
We say that a doubly stochastic matrix A (= (ajk)) is Pythagorean when there is a Hilbert space ℋ and two orthonormal bases {ej} and {fk} for ℋ such that ajk = |〈ej, fk〉|2. We show that a Pythagorean Theorem holds for Pythagorean matrices: The difference of the weights of complementary blocks is an integer when those weights are finite. As promised in Remark 11, we establish the existence of doubly stochastic matrices with infinite complementary blocks, each of which has finite weight and no zero diagonal entries. We use the “Carpenter” result in Theorem 15 and the notation of the proof of “integrality” for this.
When j ∈ ℤ−, let aj be 2j. When j ∈ ℤ+, let aj be 1 − 2−j. Since 1 = ∑j∈ℤ− aj (= a) and 1 = ∑j∈ℤ+ 1 − aj (= b) and a − b = 0, an integer, we may apply Theorem 15 to conclude that there is an infinite-dimensional subspace ℋ0 of ℋ with infinite-dimensional complement ℋ⊖ℋ0 (= ℋ′0) such that ∥Fej∥2 = aj, for each j in ℤ0, where F is the projection of ℋ on ℋ0. If j ∈ ℤ+, ∥(I − F)ej∥2 = 1 − ∥Fej∥2 = 1 − aj = 2−j. Let {fk}k∈ℤ+ and {fk}k∈ℤ− be orthonormal bases for ℋ0 and ℋ′0, respectively. Let ajk be |〈ej, fk〉|2, so that (ajk) is an infinite doubly stochastic matrix, as noted before. Since
![]() |
for each j in ℤ0, and ∑k∈ℤ− ajk = ∥(I − F)ej∥2 = 1 − aj, for each such j, we have that ∑j∈ℤ− ∑k∈ℤ+ajk = ∑j∈ℤ−2j = 1 and ∑j∈ℤ+∑k∈ℤ− ajk = ∑j∈ℤ+ 2−j = 1. Thus the weight of each of the ℤ+, ℤ− and ℤ−, ℤ+ complementary blocks is 1. Of course, we can construct other examples of infinite complementary blocks, each having finite weight, by using Theorem 15 in the manner just described.
We prove the analogue of Proposition 12 for Pythagorean matrices.
Proposition 16.
The difference of the weights of a block and its complementary block in an infinite Pythagorean matrix is an integer when those weights are finite.
Proof:
Since the complement of a finite block has infinite weight, we may assume that the block A0 and its complement A′0 in the matrix A are infinite with finite weights. Assume that A is Pythagorean. We may also assume that A = (ajk)j,k∈ℤ0 and that A0 = (ajk)j,k∈ℤ− (so that A′0 = (ajk)j,k∈ℤ+). By assumption, there are orthonormal bases {ej}j∈ℤ0 and {fk}k∈ℤ0, for a Hilbert space ℋ, such that ajk = |〈ej, fk〉|2. Let ℋ0 be the closure of the space spanned by {fk}k∈ℤ− and F the projection of ℋ on ℋ0. As in the proof of Proposition 12, ∥Fej∥2 is the sum of the entries of the jth row of A0, when j ∈ ℤ−, while ∥(I − F)ej∥2 is the sum of the entries in the jth row of A′0, when j ∈ ℤ+. At the same time, ∥Fej∥2 is the j diagonal entry of the matrix for F and ∥(I − F)ej∥2 is the j diagonal entry for the matrix of I − F (matrices formed relative to the basis {ej}j∈ℤ0). Thus w(A0) is the sum of the j diagonal entries in the matrix for F with j in ℤ−, and w(A′0) is the sum of the j diagonal entries in the matrix for I − F (that is, 1 minus the j diagonal entry for F), with j in ℤ+. These sums are finite, by assumption [because w(A0) and w(A′0) are finite]. But Theorem 15 assures us that the difference of these sums is an integer. Thus w(A0) − w(A′0) is an integer when A is Pythagorean. ■
There is a great deal more to be said about doubly stochastic matrices in this context, and there are a number of questions that have not been answered (for example: Are there non-Pythagorean doubly stochastic matrices?) That discussion must await another occasion.
References
- 1.Kadison R. Proc Natl Acad Sci USA. 2002;99:4178–4184. doi: 10.1073/pnas.032677199. [DOI] [PMC free article] [PubMed] [Google Scholar]



































