The Scientific World Journal. 2014 Aug 19;2014:560931. doi: 10.1155/2014/560931

An Inversion-Free Method for Finding Positive Definite Solution of a Rational Matrix Equation

Fazlollah Soleymani 1, Mahdi Sharifi 1, Solat Karimi Vanani 1, Farhad Khaksar Haghani 1, Adem Kılıçman 2,*
PMCID: PMC4151580  PMID: 25215323

Abstract

A new iterative scheme is constructed for finding the minimal solution of a rational matrix equation of the form X + A^{*}X^{-1}A = I. The new method is inversion-free per computing step. The convergence of the method is studied and tested via numerical experiments.

1. Introduction

In this paper, we will discuss the following nonlinear matrix equation:

X + A^{*}X^{-1}A = I, (1)

where A is an n × n nonsingular complex matrix, I is the unit matrix of the appropriate size, and X ∈ C^{n×n} is an unknown Hermitian positive definite (HPD) matrix that should be found. It was proved in [1] that if (1) has an HPD solution, then all its Hermitian solutions are positive definite and, moreover, it has a maximal solution X_L and a minimal solution X_S in the sense that X_S ≤ X ≤ X_L for any HPD solution X.

Many papers have been published on iterative methods for the HPD solutions of such nonlinear rational matrix equations, due to their importance in practical problems arising in control theory, dynamical systems, and so forth (see [2, 3]).

The most common iterative method for finding the maximal solution of (1) is the following fixed-point iteration [4]:

X_0 = I,  X_{k+1} = I − A^{*}X_k^{-1}A. (2)

The maximal solution of (1) can be obtained through X_L = I − Y_S, where Y_S is the minimal solution of the dual equation Y + AY^{-1}A^{*} = I.
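For concreteness, a minimal NumPy sketch of iteration (2) could read as follows; the function and parameter names are ours, not from [4], and X_k^{-1}A is formed through a linear solve rather than an explicit inverse:

```python
import numpy as np

def fixed_point_maximal(A, tol=1e-12, max_iter=1000):
    """Iteration (2): X_0 = I, X_{k+1} = I - A^* X_k^{-1} A,
    converging to the maximal HPD solution X_L when (1) is solvable."""
    n = A.shape[0]
    I = np.eye(n)
    X = I.copy()
    for _ in range(max_iter):
        # solve(X, A) computes X^{-1} A without explicitly inverting X
        X_new = I - A.conj().T @ np.linalg.solve(X, A)
        if np.linalg.norm(X_new - X, np.inf) < tol:
            return X_new
        X = X_new
    return X
```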

In 2010, Monsalve and Raydan [5] proposed the following iterative method (also known as Newton's method) for finding the minimal solution:

X_0 = AA^{*},  X_{k+1} = X_k(2I − A^{-*}(I − X_k)A^{-1}X_k), (3)

which is an inversion-free scheme, since A^{-1} needs to be computed only once, in contrast to the matrix iteration (2). Note that A^{-*} = (A^{-1})^{*}, and similar notation is used throughout.
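A corresponding sketch of (3) under the same conventions (illustrative names; A^{-1} formed once up front):

```python
import numpy as np

def newton_minimal(A, tol=1e-12, max_iter=1000):
    """Iteration (3): X_0 = A A^*, X_{k+1} = X_k (2I - H_k X_k),
    where H_k = A^{-*}(I - X_k)A^{-1} reuses the single inverse of A."""
    n = A.shape[0]
    I = np.eye(n)
    Ainv = np.linalg.inv(A)       # the only inversion in the whole run
    Ainv_star = Ainv.conj().T     # A^{-*} = (A^{-1})^*
    X = A @ A.conj().T
    for _ in range(max_iter):
        H = Ainv_star @ (I - X) @ Ainv
        X_new = X @ (2 * I - H @ X)
        if np.linalg.norm(X_new - X, np.inf) < tol:
            return X_new
        X = X_new
    return X
```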

Remark 1. —

We remark that there are several other well-known iterative methods for solving (1) besides Newton's method (3). To the best of our knowledge, the procedure of extending higher-order iterative methods to finding the solution of (1) has not been explored up to now. Hence, we hope that this link between the fields of root-finding and solving (1) may lead to the discovery of novel and innovative techniques.

The rest of this paper is organized as follows. In Section 2, we develop and analyze a new inversion-free method for finding roots of a special map F. In Section 3, we provide numerical comparisons through experiments in machine precision. Concluding remarks are drawn in Section 4.

2. A New Iterative Method

An equivalent formulation of (1) is to find an HPD matrix X such that F(X) = 0. Toward this goal, we write F(X) ≔ X + A^{*}X^{-1}A − I = 0. Furthermore, we have

F(X) = A^{*}X^{-1}A + X − I
     = A^{*}X^{-1}A − (I − X)
     = A^{*}X^{-1}A − A(A^{-1}A^{-*} − A^{-1}XA^{-*})A^{*}
     = (A^{-1}XA^{-*})^{-1} − A(A^{-1}A^{-*} − A^{-1}XA^{-*})A^{*}. (4)

Now, using the change of variable Z = A^{-1}XA^{-*}, we can simplify (4) as follows:

G(Z) = Z^{-1} − A(A^{-1}A^{-*} − Z)A^{*} = 0. (5)

In order to obtain an iterative method for finding the minimal HPD solution of (1), it is now enough to solve the well-known matrix equation Z^{-1} − B = 0, where B = A(A^{-1}A^{-*} − Z)A^{*}. One way to tackle this matrix inversion problem is to apply Schulz-type iterative methods (see, e.g., [6–10]). Applying Chebyshev's method [11] yields

X_0 = AA^{*},  X_{k+1} = X_k[3I − A^{-*}(I − X_k)A^{-1}X_k(3I − A^{-*}(I − X_k)A^{-1}X_k)]. (6)

We remark that there is a tight relationship between iterative methods for nonlinear systems and the construction of higher-order methods for matrix equations ([12, 13]).

The matrix iteration (6) requires A^{-1} to be computed only once, at the beginning of the iteration, which places the method in the category of inversion-free algorithms for solving (1).
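A minimal sketch of (6) along the same lines (again with illustrative names, not from the paper):

```python
import numpy as np

def chebyshev_minimal(A, tol=1e-12, max_iter=1000):
    """Proposed iteration (6): X_0 = A A^*,
    X_{k+1} = X_k [3I - H_k X_k (3I - H_k X_k)],
    with H_k = A^{-*}(I - X_k)A^{-1}; A is inverted only once."""
    n = A.shape[0]
    I = np.eye(n)
    Ainv = np.linalg.inv(A)
    Ainv_star = Ainv.conj().T
    X = A @ A.conj().T
    for _ in range(max_iter):
        HX = Ainv_star @ (I - X) @ Ainv @ X   # H_k X_k
        X_new = X @ (3 * I - HX @ (3 * I - HX))
        if np.linalg.norm(X_new - X, np.inf) < tol:
            return X_new
        X = X_new
    return X
```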

In the meantime, it is easy to show that the zeros of the map G(Z) = Z^{-1} − B coincide with the zeros of the map F(X) = X^{-1} − H, wherein H = A^{-*}(I − X)A^{-1}. To be more precise, we are computing the inverse of the matrix H, which, at the solution, coincides with the minimal HPD solution of (1).

Remark 2. —

Following Remark 1, in this work we apply Chebyshev's method to (1) and study its theoretical behavior. The extension of other well-known root-finding schemes to finding the minimal solution of (1) is left for future studies.

Lemma 3. —

The proposed method (6) produces a sequence of Hermitian matrices from the Hermitian initial matrix X_0 = AA^{*}.

Proof —

The initial matrix AA^{*} is Hermitian, and H_k = A^{-*}(I − X_k)A^{-1}. Thus, H_0 = A^{-*}A^{-1} − A^{-*}AA^{*}A^{-1} is also Hermitian; that is, H_0^{*} = H_0. Now, using an inductive argument, we have

(X_1)^{*} = (X_0[3I − H_0X_0(3I − H_0X_0)])^{*} = (3X_0 − 3X_0H_0X_0 + X_0H_0X_0H_0X_0)^{*} = 3X_0 − 3X_0H_0X_0 + X_0H_0X_0H_0X_0 = X_1. (7)

By considering (X_l)^{*} = X_l for l ≤ k, we now show that

(X_{l+1})^{*} = (X_l[3I − H_lX_l(3I − H_lX_l)])^{*} = (3X_l − 3X_lH_lX_l + X_lH_lX_lH_lX_l)^{*} = 3X_l − 3X_lH_lX_l + X_lH_lX_lH_lX_l = X_{l+1}. (8)

Note that H_l = (H_l)^{*} has been used in (8); this holds since X_l is Hermitian. The conclusion therefore holds for l + 1, and the proof is complete.

Theorem 4. —

Assuming that A and the iterates X_k are nonsingular, the sequence {X_k} generated by (6) from the initial matrix X_0 = AA^{*} converges to the minimal solution.

Proof —

Let us consider H_k = A^{-*}(I − X_k)A^{-1}. We therefore have

I − H_kX_{k+1} = I − [A^{-*}(I − X_k)A^{-1}](X_k[3I − A^{-*}(I − X_k)A^{-1}X_k(3I − A^{-*}(I − X_k)A^{-1}X_k)]) = (I − H_kX_k)^3. (9)

Taking a generic matrix operator norm from both sides of (9), we obtain

||I − H_kX_{k+1}|| ≤ ||I − H_kX_k||^3. (10)

On the other hand, Chebyshev's method for the matrix inversion problem converges if the initial approximation satisfies ||I − HX_0|| < 1, that is, ||I − [A^{-*}(I − X)A^{-1}]X_0|| < 1. Together with the initial matrix X_0 = AA^{*}, this gives

||A^{-*}XA^{*}|| < 1, (11)

which is true when X is the minimal HPD solution of (1).

Note that since

0 < X_S = I − A^{*}X_S^{-1}A = A^{*}[A^{-*}A^{-1} − X_S^{-1}]A = A^{*}[X_0^{-1} − X_S^{-1}]A, (12)

we obtain X_0^{-1} > X_S^{-1} and thus X_0 < X_S. Subsequently, using mathematical induction, it can be shown that {X_k} tends to X_S.

The only drawback of this process is that the convergence order is q-linear. In fact, although Chebyshev's method for matrix inversion has local third-order convergence, this rate is not preserved when finding the minimal HPD solution of (1).

The reason is that the matrix H, whose inverse we must compute by Chebyshev's method, depends on X itself. That is to say, the unknown lies inside the matrix H = A^{-*}(I − X)A^{-1}.
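Identity (9) can also be checked numerically. The following sketch, using an arbitrary illustrative matrix rather than one from the paper, confirms that one step of (6) cubes the residual I − H_kX_k when H_k is frozen at the current iterate:

```python
import numpy as np

rng = np.random.default_rng(1)
A = 0.2 * rng.standard_normal((4, 4))     # arbitrary small test matrix
I = np.eye(4)
Ainv = np.linalg.inv(A)
X = A @ A.T                               # X_0 = A A^* (A is real here)
H = Ainv.T @ (I - X) @ Ainv               # H_k = A^{-*}(I - X_k)A^{-1}
HX = H @ X
X_next = X @ (3 * I - HX @ (3 * I - HX))  # one step of (6)
lhs = I - H @ X_next                      # left-hand side of (9)
rhs = np.linalg.matrix_power(I - HX, 3)   # right-hand side of (9)
print(np.linalg.norm(lhs - rhs))          # zero up to round-off
```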

Theorem 5. —

The sequence of matrices produced by (6) satisfies the following error inequality:

||X_{k+1} − X_S|| ≤ ||λ_k|| ||X_k − X_S||, (13)

where λ_k = −X_S^{-1}δ_k + X_kX_S^{-2}δ_k + 3Y_k − X_kX_S^{-1}Y_k − Y_kX_S^{-1}X_k + Y_kδ_kA^{-*}A^{-1}X_k, with Y_k = X_kA^{-*}A^{-1}X_k and δ_k = X_k − X_S.

Proof —

First, since lim_{k→∞} X_k = X_S, by using (6) we have

X_{k+1} − X_S = X_k[3I − A^{-*}(I − X_k)A^{-1}X_k(3I − A^{-*}(I − X_k)A^{-1}X_k)] − X_S
             = [3X_k − 3X_kX_S^{-1}X_k + X_k(X_S^{-1}X_k)^2 − X_S] + 3X_kA^{-*}δ_kA^{-1}X_k − X_kX_S^{-1}X_kA^{-*}δ_kA^{-1}X_k − X_kA^{-*}δ_kA^{-1}X_kX_S^{-1}X_k + X_kA^{-*}δ_kA^{-1}X_kA^{-*}δ_kA^{-1}X_k, (14)

where δ_k = X_k − X_S and we have written (I − X_k) = (I − X_S) − (X_k − X_S), so that A^{-*}(I − X_k)A^{-1} = X_S^{-1} − A^{-*}δ_kA^{-1}.

Note that we have used the fact that I − X_S = A^{*}X_S^{-1}A. Relation (14) yields

δ_{k+1} = −X_S^{-1}δ_k^2 + X_kX_S^{-2}δ_k^2 + 3X_kA^{-*}δ_kA^{-1}X_k − X_kX_S^{-1}X_kA^{-*}δ_kA^{-1}X_k − X_kA^{-*}δ_kA^{-1}X_kX_S^{-1}X_k + X_kA^{-*}δ_kA^{-1}X_kA^{-*}δ_kA^{-1}X_k = λ_kδ_k, (15)

wherein δ_k = X_k − X_S and λ_k = −X_S^{-1}δ_k + X_kX_S^{-2}δ_k + 3X_kA^{-*}A^{-1}X_k − X_kX_S^{-1}X_kA^{-*}A^{-1}X_k − X_kA^{-*}A^{-1}X_kX_S^{-1}X_k + X_kA^{-*}δ_kA^{-1}X_kA^{-*}A^{-1}X_k. We remark that the commutation relation δ_kA^{-1}X_k = A^{-1}X_kδ_k has been used to factor δ_k out to the right.

Consequently, one has the error inequality (13). This shows the q-linear order of convergence for finding the minimal HPD solution of (1). We thus have

0 < limsup_{k→∞} ||X_{k+1} − X_S|| / ||X_k − X_S|| < 1, (16)

which is guaranteed since

0 < limsup_{k→∞} ||λ_k|| < 1. (17)

3. Numerical Comparisons

In this section, we investigate the performance of the new method (6) for the matrix equation (1). All experiments were run on a Pentium IV computer using Mathematica 8 [14]. We report the number of iterations (Iter) required for convergence. In our implementations, all considered methods are stopped when the infinity norm of the difference of two successive iterates is less than the given tolerance.

Note that recently Zhang [15] studied a way to accelerate the initial phase of such iterative methods for finding the minimal solution of (1) by applying Newton's method for multiple roots to the matrix inversion problem. This technique can be written as

X_{k+1} = X_k((t+1)I − tH_kX_k), (18)

for any 1 ≤ t ≤ 2. Subsequently, we can improve the behavior of the new method (6) using (18), as provided in Algorithm 1.

Algorithm 1: A hybrid method for computing the minimal HPD solution of (1).
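Since Algorithm 1 is displayed above only schematically, the following sketch reconstructs our reading of it from the text: l accelerated steps of (18) with parameter t, followed by Chebyshev steps of (6) until the stopping criterion holds. The two-phase control flow is an assumption based on the description; the two iterations and the parameters t and l come from the text.

```python
import numpy as np

def hybrid_pm(A, t=2.0, l=19, tol=1e-8, max_iter=1000):
    """PM (our reading of Algorithm 1): l steps of the accelerated
    iteration (18), X_{k+1} = X_k((t+1)I - t H_k X_k), then
    iteration (6) until ||X_{k+1} - X_k||_inf < tol."""
    n = A.shape[0]
    I = np.eye(n)
    Ainv = np.linalg.inv(A)                # the single inversion
    Ainv_star = Ainv.conj().T
    X = A @ A.conj().T                     # X_0 = A A^*
    for k in range(max_iter):
        HX = Ainv_star @ (I - X) @ Ainv @ X
        if k < l:                          # accelerated starting phase
            X_new = X @ ((t + 1) * I - t * HX)
        else:                              # third-order Chebyshev phase
            X_new = X @ (3 * I - HX @ (3 * I - HX))
        if np.linalg.norm(X_new - X, np.inf) < tol:
            return X_new
        X = X_new
    return X
```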

We compare Algorithm 1, denoted by PM, with (2), denoted by M1, (3), denoted by M2, and the method proposed by El-Sayed and Al-Dbiban [16], denoted by M3, which is a modification of the method presented by Zhan [17]:

X_0 = Y_0 = I,  Y_{k+1} = (I − X_k)Y_k + I,  X_{k+1} = I − A^{*}Y_{k+1}A. (19)
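Scheme (19) admits an equally short sketch (illustrative naming; note that it requires no matrix inversion at all):

```python
import numpy as np

def m3(A, tol=1e-12, max_iter=1000):
    """Scheme (19) of El-Sayed and Al-Dbiban: X_0 = Y_0 = I,
    Y_{k+1} = (I - X_k)Y_k + I, X_{k+1} = I - A^* Y_{k+1} A."""
    n = A.shape[0]
    I = np.eye(n)
    X, Y = I.copy(), I.copy()
    for _ in range(max_iter):
        Y = (I - X) @ Y + I                 # Y_{k+1} from X_k
        X_new = I - A.conj().T @ Y @ A      # X_{k+1} from Y_{k+1}
        if np.linalg.norm(X_new - X, np.inf) < tol:
            return X_new
        X = X_new
    return X
```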

Example 1 (see [18]). —

In this experiment, we compare the results of different methods for finding the minimal solution of (1) when the matrix A is defined by

A = [  0.37   0.13   0.12
      -0.30   0.34   0.12
       0.11  -0.17   0.29 ], (20)

and the solution is

X_S = [ 0.215981   0.0960406  0.101305
        0.0960406  0.331082   0.154487
        0.101305   0.154487   0.241782 ]. (21)

The results are given in Figure 1(a) in terms of the number of iterations when the stopping criterion is ||X_{k+1} − X_k|| ≤ 10^{-8}.

Figure 1: Number of iterations against accuracies for Experiment 1 (a) and Experiment 2 (b).

Note that PM and M2 converge to X_S, whereas all the other schemes converge to X_L; thus, to have fair comparisons, in our codes we use the other schemes to find the maximal solution of the dual equation D(X) = X + AX^{-1}A^{*} − I. Here t = 2 has been chosen for PM, with l = 19 iterations in the inner finite loop of Algorithm 1.

Furthermore, this number was chosen empirically; in fact, varying l shows that one can obtain better or worse results than the reported ones in different examples.

In Example 1 we used t = 2. This value of t was chosen because we are, in essence, solving an operator equation and we treat its solution as a zero of multiplicity 2. This choice makes the algorithm converge faster in the initial phase of the process; once we are close enough to the solution, we switch back to the ordinary method, that is, we treat the solution as a simple zero of the operator equation.
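As a usage illustration, the hybrid sketch given after Algorithm 1 can be run on the test matrix A of (20), checking the residual of (1) directly (this assumes hybrid_pm as defined earlier):

```python
import numpy as np

# Test matrix A of (20); assumes hybrid_pm from the earlier sketch.
A = np.array([[ 0.37,  0.13, 0.12],
              [-0.30,  0.34, 0.12],
              [ 0.11, -0.17, 0.29]])

X = hybrid_pm(A, t=2.0, l=19, tol=1e-8)
# Residual of (1): X + A^* X^{-1} A - I should be near zero.
residual = X + A.conj().T @ np.linalg.solve(X, A) - np.eye(3)
print(np.linalg.norm(residual, np.inf))
```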

Example 2 (see [19]). —

Applying the stopping criterion ||X_{k+1} − X_k|| ≤ 10^{-12}, we compare the behavior of various methods for the following test matrix:

A = [ 0.1        0.15       0.2598076
      0.15       0.2125     0.0649519
      0.2598076  0.0649519  0.137     ], (22)

with the solution as

X_S = [ 0.112684      0.0000130161  0.000142799
        0.0000130161  0.078409      0.0198307
        0.000142799   0.0198307     0.101129    ]. (23)

The results are illustrated in Figure 1(b), wherein t = 1.2 has been chosen for PM (with l = 1).

4. Conclusions

We have shown that finding the minimal HPD solution of (1) is equivalent to finding the root of a special nonlinear map. This map has been handled as a matrix inversion problem and solved by the well-known Chebyshev method.

The developed method requires the computation of only one matrix inverse, at the beginning of the process, and it is hence an inversion-free method. The convergence and the rate of convergence of this scheme have been studied. Furthermore, using a proper acceleration technique from the literature, we have further sped up the process of finding the HPD solution of (1).

Acknowledgments

The authors express their sincere thanks to the referees for the careful and detailed reading of the manuscript and for very helpful suggestions that improved it substantially. The authors also gratefully acknowledge that this research was partially supported by Universiti Putra Malaysia under the GP-IBT Grant Scheme with project number GP-IBT/2013/9420100.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

References

  • 1. Engwerda JC, Ran ACM, Rijkeboer AL. Necessary and sufficient conditions for the existence of a positive definite solution of the matrix equation X + A^{*}X^{-1}A = Q. Linear Algebra and its Applications. 1993;186:255–275.
  • 2. El-Sayed SM, Ran ACM. On an iteration method for solving a class of nonlinear matrix equations. SIAM Journal on Matrix Analysis and Applications. 2001/02;23(3):632–645.
  • 3. Li J. Solutions and improved perturbation analysis for the matrix equation X − A^{*}X^{-p}A = Q (p > 0). Abstract and Applied Analysis. 2013;2013: Article ID 575964, 12 pages.
  • 4. Engwerda JC. On the existence of a positive definite solution of the matrix equation X + A^{*}X^{-1}A = I. Linear Algebra and its Applications. 1993;194:91–108.
  • 5. Monsalve M, Raydan M. A new inversion-free method for a rational matrix equation. Linear Algebra and its Applications. 2010;433(1):64–71.
  • 6. Ullah MZ, Soleymani F, Al-Fhaid AS. An efficient matrix iteration for computing weighted Moore-Penrose inverse. Applied Mathematics and Computation. 2014;226:441–454.
  • 7. Soleymani F, Tohidi E, Shateyi S, Haghani FK. Some matrix iterations for computing matrix sign function. Journal of Applied Mathematics. 2014;2014: Article ID 425654, 9 pages.
  • 8. Soleymani F, Sharifi M, Shateyi S, Haghani F. A class of Steffensen-type iterative methods for nonlinear systems. Journal of Applied Mathematics. 2014;2014: Article ID 705375, 9 pages.
  • 9. Soleymani F, Stanimirović PS, Shateyi S, Haghani FK. Approximating the matrix sign function using a novel iterative method. Abstract and Applied Analysis. 2014;2014: Article ID 105301, 9 pages.
  • 10. Soleymani F, Sharifi M, Shateyi S, Khaksar Haghani F. An algorithm for computing geometric mean of two Hermitian positive definite matrices via matrix sign. Abstract and Applied Analysis. 2014;2014: Article ID 978629, 6 pages.
  • 11. Traub JF. Iterative Methods for the Solution of Equations. Englewood Cliffs, NJ, USA: Prentice Hall; 1964.
  • 12. Iliev A, Kyurkchiev N. Nontrivial Methods in Numerical Analysis: Selected Topics in Numerical Analysis. LAP LAMBERT Academic Publishing; 2010.
  • 13. Soleymani F, Lotfi T, Bakhtiari P. A multi-step class of iterative methods for nonlinear systems. Optimization Letters. 2014;8(3):1001–1015.
  • 14. Trott M. The Mathematica GuideBook for Numerics. New York, NY, USA: Springer; 2006.
  • 15. Zhang L. An improved inversion-free method for solving the matrix equation X + A^{*}X^{-α}A = Q. Journal of Computational and Applied Mathematics. 2013;253:200–203.
  • 16. El-Sayed SM, Al-Dbiban AM. A new inversion free iteration for solving the equation X + A^{*}X^{-1}A = Q. Journal of Computational and Applied Mathematics. 2005;181(1):148–156.
  • 17. Zhan X. Computing the extremal positive definite solutions of a matrix equation. SIAM Journal on Scientific Computing. 1996;17(5):1167–1174.
  • 18. Guo CH, Lancaster P. Iterative solution of two matrix equations. Mathematics of Computation. 1999;68(228):1589–1603.
  • 19. El-Sayed SM. An algorithm for computing positive definite solutions of the nonlinear matrix equation X + A^{*}X^{-1}A = I. International Journal of Computer Mathematics. 2003;80(12):1527–1534.
