Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2003 Jun 24;100(14):8138–8141. doi: 10.1073/pnas.1337650100

The matrix-valued hypergeometric equation

Juan A Tirao 1,*
PMCID: PMC166195  PMID: 12824462

Abstract

The hypergeometric differential equation was found by Euler [Euler, L. (1769) Opera Omnia Ser. 1, 11–13] and was extensively studied by Gauss [Gauss, C. F. (1812) Comm. Soc. Reg. Sci. II 3, 123–162], Kummer [Kummer, E. J. (1836) Riene Ang. Math. 15, 39–83; Kummer, E. J. (1836) Riene Ang. Math. 15, 127–172], and Riemann [Riemann, B. (1857) K. Gess. Wiss. 7, 1–24]. The hypergeometric function known also as Gauss' function is the unique solution of the hypergeometric equation analytic at z = 0 and with value 1 at z = 0. This function, because of its remarkable properties, has been used for centuries in the whole subject of special functions. In this article we give a matrix-valued analog of the hypergeometric differential equation and of Gauss' function. One can only speculate that many of the connections that made Gauss' function a vital part of mathematics at the end of the 20th century will be shared by its matrix-valued version, discussed here.


The hypergeometric equation is a second-order differential equation with three regular singular points. This equation was found by Euler (1) and was studied extensively by Gauss (2), Kummer (3, 4), and Riemann (5). Using a linear fractional transformation, we can place the three singularities at 0, 1, and ∞. Accordingly, the equation becomes

graphic file with name M1.gif [1]

This is Euler's hypergeometric differential equation. The hypergeometric function known also as Gauss' function is defined by the hypergeometric series

graphic file with name M2.gif

for |z| < 1 and by analytic continuation elsewhere. If c is not an integer, then 2F1(c, a, b; z) is the only solution of Eq. 1 analytic at z = 0 and with value 1 at z = 0. See ref. 6 for a complete account on the subject.

This function, because of its remarkable properties, has been used for centuries in the whole subject of special functions. In the past 30 years, the discoveries of new special functions and of applications of special functions to new areas of mathematics have initiated a resurgence of interest in this field.

Representation theory provides a very important approach to the study of special functions. Historically this approach dates to the classical papers of Cartan (7) and Weyl (8) in which they showed that spherical harmonics arise in a natural way from a study of functions on G/K, where G is the orthogonal group in n space and K denotes the orthogonal group in (n- 1) space. To get a theory applying to larger classes of special functions, it is necessary to drop the assumption that G is compact and also to consider functions not just on G/K but also on G with values in End(V), where V is any finite dimensional complex vector space. Let denote the set of all equivalence classes of complex finite dimensional irreducible representations of K; for each δ ∈ , let ξδ denote the character of δ, d(δ) the degree of δ, and χδ = d(δ)ξδ. A spherical function Φ on G (see ref. 9) of type δ ∈ is a continuous function on G with values in End(V) such that Φ(e) = 1 and

graphic file with name M3.gif

The first general results were obtained by Gelfand in 1950 (10), who considered spherical functions of trivial type for Riemannian symmetric pairs (G, K); a short time thereafter the fundamental papers of Godement (11) and Harish-Chandra (12, 13) appeared. It turns out that the spherical functions of trivial type for a rank-one Riemannian symmetric pair, when G is suitably parametrized, can be identified with hypergeometric functions. In ref. 14 one finds a detailed elaboration of this theory for any K type when the symmetric space G/K is the complex projective plane.

One can only speculate that many of the connections that made Gauss' function a vital part of mathematics at the end of the 20th century will be shared by its End(V) valued version discussed here. It is natural to wonder whether the spherical functions of any type, associated to a rank-one Riemannian symmetric pair, can be expressed in terms of these matrix-valued hypergeometric functions and to study their relation with the relatively new theory of matrix-valued orthogonal polynomials.

There are two other important programs to generalize the classical hypergeometric equation. One is due to Gelfand (15) and his school and the other to Gross (16). In the first, the generalization involves scalar valued functions of several variables, whereas in the second, one is dealing with scalar valued functions of a matrix argument.

Moreover, it is worth observing that the abstract hypergeometric equation considered by Hille (17),

graphic file with name M4.gif

can be written in the form of Eq. 2.

Let V be a d-dimensional complex vector space. Given A, B, C ∈ End(V), let us consider the following hypergeometric equation

graphic file with name M5.gif [2]

where F denotes a function on ℂ with values in V.

Let us look for solutions of the form F = zαG, with G analytic at z = 0 and G(0) ≠ 0. We have

graphic file with name M6.gif

From Eq. 2 the following differential equation for G follows,

graphic file with name M7.gif

If we put G(z) = ∑n0 znGn, we obtain the following recursion relation for the coefficients Gn. For all k ≥ -2 we have

graphic file with name M8.gif

For k = -2 we have α(C + α- 1)G0 = 0 from which we derive the following indicial equation:

graphic file with name M9.gif [3]

Let β1,..., βd be the eigenvalues of C; then the roots of the indicial equation are α = 0, 1- β1,..., 1- βd.

For α = 0 we obtain the solutions F of Eq. 2, analytic at z = 0, the Taylor coefficients of which are given by the recursion relation

graphic file with name M10.gif [4]

If the eigenvalues of C are not 0, -1, -2,..., then the function F is characterized by its value at 0, because the matrix (k + 2)(C + k + 1) is nonsingular for all k ≥ -1.

Let us introduce the notation

graphic file with name M11.gif

for all m ≥ 0 and (C, A, B)0 = 1. We observe that

graphic file with name M12.gif

Theorem 1. If {Fn}n0 is a sequence in V that satisfies the recursion relation (Eq. 4) and Inline graphic, then

  1. Fn+1 = [1/(n + 1)](C + n)-1(A + n)(B + n)Fn, for all n ≥ 0, and

  2. Fn = (1/n!)(C, A, B)nF0, for all n ≥ 0.

Proof: For (i), if we put k = -1 in Eq. 4 we get F1 = C-1ABF0. Now let n ≥ 1 and set k = n- 1 in Eq. 4; then

graphic file with name M14.gif

For k = n- 2 we obtain

graphic file with name M15.gif

Therefore,

graphic file with name M16.gif

which proves (i). The statement in (ii) follows directly from (i) by induction on n ≥ 0.

Definition 1: If A, B, C ∈ End(V) and no eigenvalues of C are in the set {0, -1, -2,...}, we define

graphic file with name M17.gif

In the next theorem we summarize our results on the analytic solutions at z = 0 of differential Eq. 2.

Theorem 2. If Inline graphic then

  1. the function Inline graphic ia analytic on |z| < 1 with values in End(V), and

  2. if F0V, then F(z) = 2F1(A, B C; z)F0 is a solution of the hypergeometric equation
    graphic file with name M20.gif
    such that F(0) = F0. Conversely any solution F analytic at z = 0 is of this form.

Let β ≠ 1 be an eigenvalue of C. Then α = 1- β is a root of indicial Eq. 3. Now we want to study a sequence {Gk}k0 satisfying

graphic file with name M21.gif [5]

For k = -2, α(C + α- 1)G0 = 0 if and only if G0 is an eigenvector of C of eigenvalue β.

If Inline graphic, then {Gk}k0 is determined by G0.

Theorem 3. If {Gn}n0 is a sequence in V that satisfies the recursion relation (Eq. 5), Inline graphic and G0V is an eigenvector of C of eigenvalue β, then

  1. Gn+1 = [1/(α + n + 1)](C + α + n)-1(A + α + n) × (B + α + n)Gn, for all n ≥ 0, and

  2. Gn = [1/(α + 1)n](C + α, A + α, B + α)nG0, for all n ≥ 0.

Proof: For (i), if we put k = -1 in Eq. 5 we obtain

graphic file with name M24.gif

which proves (i) for n = 0. Now for n ≥ 1 the proof continues along the same line as in Theorem 2. The statement in (ii) follows directly from (i) by induction on n ≥ 0.

Definition 2: If A, B, C ∈ End(V) and -α ∉ Inline graphic, then we define the function

graphic file with name M26.gif

Notice that for α = 0 we have

graphic file with name M27.gif

Theorem 4. If Inline graphic, then Inline graphic is analytic on |z| < 1 with values in End(V).

If Inline graphic is an eigenvalue of C and G0V is an eigenvector of C of eigenvalue β, then

graphic file with name M31.gif

is a solution of the hypergeometric equation

graphic file with name M32.gif

Observe that when V = ℂ and C = c, A = a, B = b are complex numbers, then β = c and the above function coincides with z1-c2F1(a-c+1, b-c+1 2-c; z) for G0 = 1.

Corollary 1. Let C be diagonalizable and let Vi) be the eigenspace of C of eigenvalue βi. Let Inline graphic be a basis of V and let {Gi,j}j be a basis of Vi) for each βi. If Inline graphic and O is a simply connected region in Inline graphic with 0 ∈ O, then

graphic file with name M36.gif

is a basis of the space of all solutions of the hypergeometric equation

graphic file with name M37.gif

analytic at z = 0. Moreover,

graphic file with name M38.gif

is a basis of the space of all analytic solutions on O.

When V = ℂ, a differential equation of the form

graphic file with name M39.gif [6]

with u, v, c ∈ ℂ, after solving a quadratic equation, becomes

graphic file with name M40.gif [7]

This is not necessarily the case when dim(V) > 1. In other words, a differential equation of the form

graphic file with name M41.gif [8]

with U, V, C ∈ End(V), cannot always be reduced to one of the form of Eq. 2, because a quadratic equation in a noncommutative setting as End(V) may have no solutions. Thus, it is important to notice how to obtain the solutions of Eq. 8.

If the eigenvalues of C are not 0, -1, -2,... let us introduce the sequence [C, U, V]n ∈ End(V) by defining inductively

graphic file with name M42.gif

for all n ≥ 0.

Definition 3: If U, V, C ∈ End(V) and no eigenvalues of C are in the set {0, -1, -2,...}, we define

graphic file with name M43.gif

If Inline graphic, then we define the function

graphic file with name M45.gif

Notice that for α = 0 we have

graphic file with name M46.gif

and that if U = 1 + A + B, V = AB, then

graphic file with name M47.gif

Now Theorem 4 and Corollary 1 generalize, mutatis mutandis, in the following way.

Theorem 5.

  1. If Inline graphic, the function Inline graphic is analytic on |z| < 1 with values in End(V).

  2. If Inline graphic is an eigenvalue of C and G0V is an eigenvector of C of eigenvalue β, then
    graphic file with name M51.gif
    is a solution of the differential equation
    graphic file with name M52.gif

Corollary 2. Let C be diagonalizable, and let Vi) be the eigenspace of C of eigenvalue βi. Let Inline graphic be a basis of V, and let {Gi,j}j be a basis of Vi) for each βi. If Inline graphic and O is a simply connected region in Inline graphic with 0 ∈ Ō, then

graphic file with name M56.gif

is a basis of the space of all solutions of the differential equation

graphic file with name M57.gif

analytic at z = 0. Moreover,

graphic file with name M58.gif

is a basis of the space of all analytic solutions on O.

Example: Following Grünbaum (see ref. 18), let us consider the differential equation

graphic file with name M59.gif [9]

where F denotes a function on ℂ with values in M(2, ℂ), and X, U, V, and W are the matrices

graphic file with name M60.gif
graphic file with name M61.gif

where α, β ∈ ℂ and j = 0, 1,....

The term FW in Eq. 9 forces us to consider this equation as a differential equation on functions that take values on ℂ4 and to consider the left and right multiplication by matrices in Inline graphic as linear maps in ℂ4. Thus, instead of Eq. 9, we shall consider the following equivalent differential equation

graphic file with name M63.gif [10]

where

graphic file with name M64.gif

where I in the matrices above denotes the 2 × 2 identity matrix, and

graphic file with name M65.gif

It is easy to verify that the matrices A and B given below satisfy Ũ = 1 + A + B, = AB.

graphic file with name M66.gif

The parameters x1, x3, y1, and y3 are only subject to the following conditions:

graphic file with name M67.gif

It is important to notice that spec(C) = {α + 1, α + 2}, where each eigenvalue has multiplicity 2. We also remark that (A + k)(B + k) is, generically, nonsingular for kj, and that the kernel of (A + j)(B + j) is two-dimensional. Thus, Inline graphic is not a polynomial function, as in the classical case, but nevertheless we have the following result.

Corollary 3. Differential Eq. 9 is equivalent to a hypergeometric equation of the form of Eq. 2. Therefore, the Jacobi polynomials Inline graphic introduced by Grünbaum are given in terms of hypergeometric functions, namely

graphic file with name M70.gif

This gives an explicit example within the theory of matrix-valued orthogonal polynomials initiated by Krein (19).

This paper was submitted directly (Track II) to the PNAS office.

References

  • 1.Euler, L. (1769) Opera Omnia Ser. 1, 11-13. [Google Scholar]
  • 2.Gauss, C. F. (1812) Comm. Soc. Reg. Sci. II 3 123-162. [Google Scholar]
  • 3.Kummer, E. J. (1836) Riene Ang. Math. 15 39-83. [Google Scholar]
  • 4.Kummer, E. J. (1836) Riene Ang. Math. 15 127-172. [Google Scholar]
  • 5.Riemann, B. (1857) K. Gess. Wiss. Goett. 7 1-24. [Google Scholar]
  • 6.Andrews, G., Askey, R. & Roy, R. (1999) Special Functions: Encyclopedia of Mathematics and Its Applications (Cambridge Univ. Press, Cambridge, U.K.).
  • 7.Cartan, É. (1929) Rend. Circ. Mat. Palermo 53 217-252. [Google Scholar]
  • 8.Weyl, H. (1931) Gruppen Theorie und Quantenmechanik (Hirzel, Leipzig), 2nd Ed.
  • 9.Tirao, J. (1977) Rev. Union Mat. Argent. 28 75-98. [Google Scholar]
  • 10.Gelfand, I. (1950) Dokl. Akad. Nauk. SSSR 70 5-8. [Google Scholar]
  • 11.Godement, R. (1952) Trans. Am. Math. Soc. 73 496-556. [Google Scholar]
  • 12.Harish-Chandra (1958) Am. J. Math. 80 241-310. [Google Scholar]
  • 13.Harish-Chandra (1958) Am. J. Math. 80 553-613. [Google Scholar]
  • 14.Grünbaum, F. A., Pacharoni, I. & Tirao, J. A. (2002) J. Funct. Anal. 188 350-441. [Google Scholar]
  • 15.Gelfand, I. (1986) Dokl. Akad. Nauk. SSSR 288 14-18. [Google Scholar]
  • 16.Gross, K. & Richards, D. (1991) Bull. Am. Math. Soc. New Ser. 24 349-355. [Google Scholar]
  • 17.Hille, E. (1969) Lectures on Ordinary Differential Equations (Addison–Wesley, Reading, MA).
  • 18.Grünbaum, F. A. (2003) Bull. Soc. Mat. France 127 207-214. [Google Scholar]
  • 19.Krein, M. G. (1949) Dokl. Akad. Nauk. SSSR 69 125-128. [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES