Skip to main content
Springer logoLink to Springer
. 2017 Jun 23;2017(1):145. doi: 10.1186/s13660-017-1415-y

A globally convergent QP-free algorithm for nonlinear semidefinite programming

Jian-Ling Li 1, Zhen-Ping Yang 1, Jin-Bao Jian 2,
PMCID: PMC5488296  PMID: 28680248

Abstract

In this paper, we present a QP-free algorithm for nonlinear semidefinite programming. At each iteration, the search direction is yielded by solving two systems of linear equations with the same coefficient matrix; l1 penalty function is used as merit function for line search, the step size is determined by Armijo type inexact line search. The global convergence of the proposed algorithm is shown under suitable conditions. Preliminary numerical results are reported.

Keywords: nonlinear semidefinite programmming, KKT conditions, QP-free algorithm, global convergence

Introduction

Consider the following nonlinear semidefinite programming (NLSDP for short):

minf(x)s.t. A(x)0;s.t.hj(x)=0,jE={1,2,,l}, 1.1

where f:RnR, hj(jE):RnRl and A:RnSm are continuously differentiable functions, not necessarily convex. Sm is a space whose elements are real symmetric matrices of size m×m. ⪯ denotes the negative semidefinite order, that is, AB if and only if AB is a negative semidefinite matrix.

NLSDP (1.1) has a broad range of applications such as eigenvalue problems, control problems, optimal structural design, truss design problems (see [13]). So it is desired to develop numerical methods for solving NLSDP (1.1).

In recent years, NLSDPs have been attracting a great deal of research attention [1, 325]. As is well known, NLSDP (1.1) is an extension of nonlinear programming, some efficient numerical methods for the latter are generalized to solve NLSDP. For example, Correa and Ramirez [26] proposed an algorithm which used the sequential linear SDP method. Fares et al. [27] applied the sequential linear SDP method to robust control problems. Freund et al. [4] also studied a sequential SDP method. Kanzow et al. [9] presented a successive linearization method with a trust region-type globalization strategy.

In addition, Kovara and Stingl [10] developed a computer code PENNON for solving NLSDP (1.1), where the augmented Lagrangian function method was used. Sun et al. [20] and Luo et al. [11, 22] proposed an augmented Lagrangian method for NLSDP (1.1), respectively. Sun et al. [19] analyzed the rate of local convergence of the augmented Lagrangian method for NLSDPs. Yamashita et al. recently proposed a primal-dual interior point method for NLSDP (1.1) (see [23]). The algorithm is globally convergent and locally superlinearly convergent under suitable conditions. Very recently Aroztegui [28] proposed a feasible direction interior point algorithm for NLSDP (1.1) with only semidefinite matrix constraint.

As we know, QP-free (also called SSLE) method is a kind of efficient methods for standard nonlinear programs (see [16]-[13]). In this paper, motivated from QP-free method for standard nonlinear programs, based on techniques of perturbation and penalty function, we propose a globally convergent QP-free algorithm for NLSDP (1.1). The construction of systems of linear equations (SLE for short) is a key point. Based on KKT conditions of NLSDP (1.1) and techniques of perturbation, we construct two SLEs skillfully. At each iteration, the search direction is yielded by solving two SLEs with the same coefficient matrix; An exact penalty function is used as the merit function for line search and the step size is determined by suitable inexact line search. The global convergence of the proposed algorithm is shown under some mild conditions.

The paper is organized as follows. In Section 2 we restate some definitions and results on NLSDP and matrix analysis. In Section 3 the algorithm is presented and its feasibility is discussed. The global convergence is analyzed in Section 4. Some preliminary numerical results are reported in Section 5 and some concluding remarks are given in the final section.

Preliminaries

For the sake of convenience, some results on matrix analysis and NLSDP are restated in this section, which will be employed in the following analysis of the proposed algorithm. More introduction for theory of matrices should be seen in [21] and [6]. Denote by Rm×n the space of m×n real matrices, denote by S+m and S++m the sets of m-order symmetric positive semidefinite and positive definite matrices, respectively. The sets Sm and Sm are defined similarly.

Definition 2.1

For any A=(aij),B=(bij)Rm×n, the inner product of A and B is defined by

A,B=Tr(BTA)=i=1mj=1naijbij, 2.1

where Tr(P) means the trace of the matrix P.

Definition 2.2

[6]

For any MRm×m, let

sym(M)=12(M+MT),skw(M)=12(MMT), 2.2

sym(M) and skw(M) are called the symmetric part and the skew part of M, respectively.

Given a matrix ASm, let m=12m(m+1), define a map svec: SmRm:

svec(A)=(a11,2a21,,2am1,a22,2a32,,2am2,,amm)T,

and the map smat:RmSm is defined to be the inverse of svec. Then the inner product of matrices is indicated by

A,B=svec(A)Tsvec(B),for A,BSm. 2.3

Definition 2.3

[21]

For any A,BRm×m, the symmetric Kronecker product, denoted by AsB, is a mapping on a vector u=svec(U) where U is an m×m symmetric matrix and is defined as

(AsB)u=12svec(BUAT+AUBT). 2.4

For any matrix USm, it is verified that the following equality is true:

(AsB)svec(U)=svec(sym(BUA)). 2.5

Note that the linear operator AsB is defined implicitly in (2.4). In Appendix of [21] a matrix representation of AsB is given as follows:

AsB=12Q(AB+BA)QT, 2.6

where AB=[aijB] (i,j=1,2,,m) is the Kronecker product of A and B, Q is an orthogonal m×m2 matrix (i.e. QQT=Im), with the following property:

Qvec(U)=svec(U),QTsvec(U)=vec(U),USm, 2.7

where vec(U)=(u11,u21,,um1,u12,u22,,um2,,umm)T.

Remark 2.1

One choice for the matrix Q is given in the appendix of [21].

Lemma 2.1

[21]

For any A,BSm, the following results are true:

  1. AsB=BsA;

  2. (AsB)T=ATsBT;

  3. (AsB)(CsD)=12(ACsBD+ADsBC);

  4. If A and B are symmetric positive definite, then AsB is positive definite.

Lemma 2.2

[28]

If A,BSm, A0 and AB+BA0, then B0.

Lemma 2.3

If AS++m, BSm, then all eigenvalues of AB are less than zero.

The proof is elementary and omitted here.

Lemma 2.4

[28]

If AS++m, BSm, and they commute, then (AsIm)1(BsIm)Sm.

Lemma 2.5

Suppose AS++m, BSm, and they commute, then (AsIm)1(BsIm)Sm.

Proof

Since AS++m, BSm, and they commute, there exists an orthogonal matrix PRm×m such that

A=PDAP1,B=PDBP1,

where DA is a diagonal and positive definite matrix, and DB is a diagonal and negative definite matrix. It follows from Lemma 2.1(3) that

AsIm=TDAT1,BsIm=TDBT1,

where T=PsP, DA=DAsIm and DB=DBsIm. We know from Lemma 2.1(2), (3) that T is orthogonal, from Lemma 2.1(4) that DA is a diagonal and positive definite matrix, and DB is a diagonal and negative definite matrix. Hence,

(AsIm)1(BsIm)=TDADBT1Sm.

 □

In the rest of this section we state the first order optimality conditions for NLSDP (1.1). For the sake of convenience, we first introduce some notations. Given a matrix valued function A(), we use the notation

DA(x)=(A(x)x1,,A(x)xn)T

for its differential operator evaluated at x, where A(x)xi denotes the partial derivative of A(x) with respect to xi with components apq(x)xi (p,q=1,,m). Then the derivative of A() in the direction d=(d1,,dn)TRn at x denoted by DA(x)d is defined by

DA(x)d=i=1ndiA(x)xi. 2.8

If we denote

A(x):=(svec(A(x)x1),,svec(A(x)xn))m×n, 2.9

then by (2.8), the following equality is true:

svec(DA(x)d)=A(x)d. 2.10

The Lagrangian function of NLSDP (1.1) L:Rn×Sm×RlR is defined by

L(x,Λ,μ)=f(x)+A(x),Λ+h(x)Tμ, 2.11

where h(x)=(h1(x),h2(x),,hl(x))T. In view of (2.3), the above equality can be rewritten as follows:

L(x,λ,μ)=f(x)+svec(A(x))Tλ+h(x)Tμ,

where λ:=svec(Λ). The gradient of L(x,λ,μ) with respect to x is given as follows:

xL(x,λ,μ)=f(x)+A(x)Tλ+h(x)μ, 2.12

where h(x)=(h1(x),h2(x),,hl(x)).

We are now in a position to restate the definition of the first order optimality conditions for NLSDP (1.1).

Definition 2.4

[18]

For xRn, if there exist a matrix ΛSm and a vector μ (Rl) such that

xL(x,Λ,μ)=0, 2.13a
ΛA(x)=0,Λ0, 2.13b
h(x)=0,A(x)0, 2.13c

then x is called a KKT point of NLSDP (1.1).

Remark 2.2

According to the Von Neumann-Theobald inequality, the complementarity condition ΛA(x)=0 has the following two useful equivalent forms:

Tr(ΛA(x))=0,λj(Λ)λj(A(x))=0,j{1,2,,m}.

The algorithm

In this section, we present our algorithm and show it is well defined. For the sake of simplicity, we introduce some notations:

Ω={xRn:A(x)0,h(x)=0},F={xRn:A(x)0},F0={xRn:A(x)0},

that is, Ω is the feasible set of NLSDP (1.1).

In general, ΛA(x) is not guaranteed to be symmetric, so we consider sym(ΛA(x))=0 instead of ΛA(x)=0. Then the three equalities of KKT condition (2.13a)-(2.13c) can be rewritten in the following form:

f(x)+A(x)Tλ+h(x)μ=0,svec(sym(ΛA(x)))=0,h(x)=0. 3.1

In order to solve (3.1) at each Newton iteration, we define a vector-value function φ:Rn+m+lRn+m+l as follows:

φ(x,λ,μ)=(φLg(x,λ,μ)φC(x,λ,μ)φh(x,λ,μ))=(f(x)+A(x)Tλ+h(x)μsvec(sym(ΛA(x)))h(x)).

It follows from (2.5) and Lemma 2.1 that

φC(x,λ,μ)=svec(sym(IΛA(x)))=(IsA(x))svec(Λ)=(ΛsI)svec(A(x)),

thus, the Jacobian of φ is

φ(x,λ,μ)=(xx2L(x,λ,μ)A(x)Th(x)(ΛsI)A(x)IsA(x)0h(x)T00).

Instead of the Hessian xx2L(x,λ,μ), we employ a positive definite matrix denoted by H which can be a quasi-Newton approximation or the identity matrix. A Newton-like iteration to solve (3.1) is given by the linear systems as follows:

(HA(x)Th(x)(ΛsI)A(x)IsA(x)0h(x)T00)(x0xλ0λμ0μ)=(f(x)+A(x)Tλ+h(x)μsvec(sym(ΛA(x)))h(x)), 3.2

where (x,Λ,μ)F0×S++m×Rl is the current point, (x0,Λ0,μ0)F×S++m×Rl is the new estimates given by the Newton-like iteration, λ:=svec(Λ) and λ0:=svec(Λ0). Let d0=xx0, we obtain from (3.2)

Hd0+A(x)Tλ0+h(x)μ0=f(x), 3.3a
(ΛsI)A(x)d0+(IsA(x))λ0=0, 3.3b
h(x)Td0=h(x). 3.3c

If d0=0, then we have

f(x)+A(x)Tλ0+h(x)μ0=0,(IsA(x))λ0=0,h(x)=0.

Since A(x)0, IsA(x) is nonsingular and we have Λ0:=smat(λ0)=0, which implies that Λ0A(x)=0. Therefore, x is a KKT point. If d00, then d0 is not guaranteed to be a feasible direction. To obtain a better search direction, we modify (3.3b) by introducing an appropriate right hand side, so we obtain another linear equations as follows:

Hd1+A(x)Tλ1+h(x)μ1=f(x),(ΛsI)A(x)d1+(IsA(x))λ1=λd0,h(x)Td1=h(x). 3.4

In order to ensure that SLEs (3.3a)-(3.3c) and (3.4) have a unique solution, respectively, the following assumption is required.

  1. For any xF, the matrix
    B(x)=(A(x)Th(x)A(x)sIm0)
    is full of column rank.

The following lemma gives a sufficient condition of the assumption A1.

Lemma 3.1

For any xF, if A(x)0 and {h1(x),,hl(x)} is linearly independent, then B(x) is full of column rank, i.e., the assumption A1 holds.

Lemma 3.2

Let H be a positive definite matrix. If the assumption A1 holds, then the coefficient matrix of the SLEs (3.3a)-(3.3c) and (3.4)

W(x,H,Λ)=def(HA(x)Th(x)(ΛsIm)A(x)A(x)sI0h(x)T00) 3.5

is nonsingular, hence, SLEs (3.3a)-(3.3c) and (3.4) have a unique solution, respectively.

The proof is elementary and it is omitted here.

In our algorithm the following exact penalty function is used as a merit function for line search:

P(x;σ)=f(x)+σjE|hj(x)|, 3.6

where σ>0 is a penalty parameter. Further, we define a function P(;d;σ):Rn×Rn×[0,+)R associated with P(x;σ) by

P(x;d;σ)=f(x)+f(x)Td+σjE|hj(x)+hj(x)Td|. 3.7

Now the algorithm is described in detail.

Algorithm A

Parameters. α(0,12), β,ξ(0,1), λI>0, σ1>0, ρ1,ρ2>0.

Initialization. Select an initial iteration point x0F0, H0S++n, Λ0 (S++m) satisfying λIImΛ0 such that Λ0 and A(x0) commute. Let λ0=svec(Λ0), k:=0.

Step 1.
Let (dk0,λk0,μk0) be the solution of the SLE (3.3a)-(3.3c) in (d,λ,μ), i.e.,
{Hkd+A(xk)Tλ+jEμjhj(xk)=f(xk),(ΛksIm)A(xk)d+(A(xk)sIm)λ=0,hj(xk)Td=hj(xk),jE. 3.8
If dk0=0, then stop, xk is a KKT point of NLSDP (1.1); else, go to Step 2.
Step 2.
Let (dk1,λk1,μk1) be the solution of the SLE (3.4) in (d,λ,μ), i.e.,
{Hkd+A(xk)Tλ+jEμjhj(xk)=f(xk),(ΛksIm)A(xk)d+(A(xk)sIm)λ=λkdk0,hj(xk)Td=hj(xk),jE. 3.9
Step 3.
Compute the search direction dk and the approximate multiplier vector (λk,μk):
dk=(1δk)dk0+δkdk1, 3.10
λk=(1δk)λk0+δkλk1, 3.11
μk=(1δk)μk0+δkμk1, 3.12
where
δk={1ξ,if f(xk)Tdk10;1,if f(xk)Tdk1>0 and f(xk)Tdk1f(xk)Tdk0;min{ξ,|(1ξ)f(xk)Tdk0+(μk0)Th(xk)f(xk)T(dk0dk1)|},otherwise. 3.13
Step 4.
(Update the penalty parameter) Set σk=(3ξ)maxjE|μjk0|+ρ1. The updating rule of σk is as follows:
σk={max{σk,σk1+ρ2},if σk>σk1,σk1,otherwise. 3.14
Step 5.
(Line search) Set the step size tk to be the first number of the sequence {1,β,β2,} satisfying the following two inequalities:
P(xk+tdk;σk)P(xk;σk)+tα(P(xk;dk;σk)P(xk;0;σk)), 3.15
A(xk+tdk)0. 3.16
Step 6.
Set xk+1=xk+tkdk. Using the following methods to generate Λk+1 commuting with A(xk+1):
Step 6.1.
If the search direction dk does not descend or is not feasible, set Λk+1=Im and go to Step 7.
Step 6.2.
Compute the least eigenvalue λmin(Λk) of the matrix Λ¯k. If λmin(Λk)λI, then let Λk+1=Λk; otherwise, let Λk+1=Λk+(λIλmin(Λk))Im.
Step 7.

Set λk+1=svec(Λk+1), and update Hk by some method to Hk+1 such that Hk+1 is symmetric positive definite. Let k:=k+1, return to Step 1.

By (3.8), the following lemma is obvious.

Lemma 3.3

Suppose that the assumption A1 holds. If dk0=0, then xk is a KKT point of NLSDP (1.1).

Lemma 3.4

Suppose that the assumption A1 holds. Then the search direction dk of Algorithm A satisfies the following inequality:

f(xk)Tdkξ(dk0)THkdk0+(3ξ)jE|μjk0hj(xk)|. 3.17

Proof

First we show that the inequality

f(xk)Tdk0(dk0)THkdk0+jE|μjk0hj(xk)| 3.18

holds. Premultiplying the first equation of (3.8) by (dk0)T, we obtain

(dk0)THkdk0+jEμjk0(dk0)Thj(xk)+(dk0)TA(xk)Tλk0=(dk0)Tf(xk). 3.19

According to the second equation of (3.8), we get

(dk0)TA(xk)Tλk0=(λk0)T((ΛksIm)1(A(xk)sIm))Tλk0.

Substituting the above equality and the third equality of (3.8) into (3.19), we have

(dk0)Tf(xk)=(dk0)THkdk0+(λk0)T((ΛksIm)1(A(xk)sIm))Tλk0+jEμjk0hj(xk).

In view of Lemma 2.4, the matrix (ΛksIm)1(A(xk)sIm) is negative semidefinite, so it follows from the above equality that

(dk0)Tf(xk)(dk0)THkdk0+jE|μjk0hj(xk)|,

i.e., the inequality (3.18) holds.

Next, we will prove the inequality (3.17) is true. The rest of the proof is divided into three cases.

Case A. f(xk)Tdk10. From (3.13) we have δk=1ξ. It follows from (3.10), (3.13), (3.18) and ξ(0,1) that

f(xk)Tdkξ(dk0)THkdk0+ξjE|μjk0hj(xk)|ξ(dk0)THkdk0+(3ξ)jE|μjk0hj(xk)|, 3.20

that is, (3.17) holds.

Case B. f(xk)Tdk1>0 and f(xk)Tdk1f(xk)Tdk0. From (3.13), one has δk=1. It follows from (3.10), (3.19) and ξ(0,1) that

f(xk)Tdk=f(xk)Tdk1f(xk)Tdk0(dk0)THkdk0+jE|μjk0hj(xk)|,

which implies (3.17) holds.

Case C. f(xk)Tdk1>0 and f(xk)Tdk1>f(xk)Tdk0. It follows from (3.13) and ξ(0,1) that

δk=|(1ξ)f(xk)Tdk0+(μk0)Th(xk)f(xk)T(dk1dk0)||(ξ1)f(xk)Tdk0|+|(μk0)Th(xk)|f(xk)T(dk1dk0). 3.21

If f(xk)Tdk00, then we obtain from the above inequality

(1δk)f(xk)Tdk0+δkf(xk)Tdk1ξf(xk)Tdk0+|(μk0)Th(xk)|,

which together with (3.10) and (3.18) gives

f(xk)Tdkξ(dk0)THkdk0+(1+ξ)jE|μjk0hj(xk)|ξ(dk0)THkdk0+(3ξ)jE|μjk0hj(xk)|. 3.22

If f(xk)Tdk0>0, then the inequality (3.21) gives rise to

δkf(xk)Tdk1δkf(xk)Tdk0(1ξ)f(xk)Tdk0+|(μk0)Th(xk)|,

which together with (3.10) and (3.18) shows

f(xk)Tdk(2ξ)(dk0)THkdk0+(3ξ)jE|μjk0hj(xk)|ξ(dk0)THkdk0+(3ξ)jE|μjk0hj(xk)|. 3.23

The inequalities (3.22) and (3.23) indicate that the inequality (3.17) is true. □

Lemma 3.5

Suppose that the assumption A1 holds. If xk (F) is not a KKT point of NLSDP (1.1), then

P(xk;dk;σk)P(xk;0;σk)<0. 3.24

Proof

From (3.8) and (3.9) we know that (dk,λk,μk) is the solution of the following SLE:

Hkd+A(xk)Tλ+jEμjhj(xk)=f(xk), 3.25a
(ΛksIm)A(xk)d+(A(xk)sIm)λ=δkλkdk0, 3.25b
hj(xk)Td=hj(xk),jE. 3.25c

From the definition (3.6) of the function P(xk;dk;σk) and (3.25c), we have

P(xk;dk;σk)P(xk;0;σk)=f(xk)TdkσkjE|hj(xk)|ξ(dk0)THkdk0+(3ξ)jE|μjk0hj(xk)|σkjE|hj(xk)|ξ(dk0)THkdk0+((3ξ)maxjE|μjk0|σk)jE|hj(xk)|, 3.26

the first inequality above is due to (3.17).

Since xk is not a KKT point of NLSDP (1.1), it implies from Step 1 of Algorithm A that dk00, so (dk0)THkdk0>0. On the other hand, it follows from the updating rule of σk that σk>(3ξ)maxjE|μjk0|, therefore, (3.26) gives rise to

P(xk;dk;σk)P(xk;0;σk)<0,

that is, the inequality (3.24) holds. □

Lemma 3.6

Suppose that the assumption A1 holds. If Algorithm A does not stop at the current iterate xk, then (3.15) and (3.16) are satisfied for t>0 small enough, so Algorithm  A is well defined.

Proof

It follows from the Taylor expansion and (3.6) that

P(xk+tdk;σk)P(xk;σk)=tf(xk)Tdk+σkjE(|hj(xk)+thj(xk)Tdk||hj(xk)|)+o(t)=P(xk;tdk;σk)P(xk;0;σk)+o(t). 3.27

The second equality above is due to (3.7). From the convexity of P(xk;d;σk) for d, we obtain

P(xk;tdk;σk)P(xk;0;σk)t(P(xk;dk;σk)P(xk;0;σk)), 3.28

which together with (3.27) and Lemma 3.4 gives for t small enough

P(xk+tdk;σk)P(xk;σk)tα(P(xk;dk;σk)P(xk;0;σk)),

where α(0,1). Hence, (3.15) holds for sufficiently small t>0.

In what follows, we prove (3.16) holds for sufficiently small t>0. Since A(x) is twice continuously differentiable function, it follows from Taylor expansion that

A(xk+tdk)=A(xk)+tDA(xk)dk+o(t)=A(xk)+O(t). 3.29

Note that the largest eigenvalue function λmax(A)=maxv=1vTAv, we deduce from (3.29) and A(xk)0 that

λmax(A(xk+tdk))=maxv=1{vTA(xk)v+vTO(t)v}<0

for 0<t<1 small enough, which implies (3.16) holds for 0<t<1 small enough.

By summarizing the above discussions, we conclude that Algorithm A is well defined. □

Global convergence

If Algorithm A terminates at xk after a finite number of iterations, we know from Lemma 3.3 that xk is a KKT point of NLSDP (1.1). In this section, without loss of generality, we assume that the sequence {xk} generated by Algorithm A is infinite. We will prove any accumulation point of {xk} is a stationary point or a KKT point of NLSDP (1.1), i.e., Algorithm A is globally convergent. We first generalize the definition of stationary point for nonlinear programming defined in [16] to nonlinear semidefinite programming.

Definition 4.1

Let xRn, if there exist a matrix Λ (Sm) and a vector μ (Rl) such that

xL(x,Λ,μ)=0, 4.1
ΛA(x)=0,A(x)0,h(x)=0, 4.2

then x is called a stationary point of NLSDP (1.1).

In order to analyze the global convergence, some additional assumptions are required:

  • A2

    The sequence {xk} yielded by Algorithm A lies in a nonempty closed and bounded set X.

  • A3

    The functions f(x), h(x) and A(x) are twice continuously differentiable on an open set containing X.

  • A4

    There exists a positive constant λs such that λs>λI and λIImΛkλsIm for all k.

  • A5

    The matrix Hk is uniformly positive definite, i.e., there exist two positive constants a and b such that ay2yTHkyby2 for all yRn .

Let x be an accumulation point of {xk}, then there exists a subset K{1,2,} such that limkKxk=x. Without loss of generality, we suppose

HkKH,h(xk)Kh(x),ΛkKΛ,W(xk,Hk,Λk)KW(x,H,Λ),

where W(xk,Hk,Λk) is defined by (3.5) and

W(x,H,Λ)=def(HA(x)Th(x)(ΛsIm)A(x)A(x)sIm0h(x)T00).

From the assumptions A2-A3, we obtain the following conclusions immediately.

Lemma 4.1

Suppose the assumptions A2-A3 hold. Then there exists a constant M>1 such that |f(yk)|M, f(yk)M, 2f(yk)M, h(yk)M, h(yk)M, A(yk)FM, DA(yk)FM and D2A(yk)FM, for any ykN(xk), where N(xk) is a neighborhood of xk.

Lemma 4.2

Suppose the assumptions A1-A5 hold. Then

  1. there exists a constant c1>0 such that W(xk,Hk,Λk)1c1 for any kK;

  2. there exists a constant Mˆ>1 such that λk0Mˆ, λk1Mˆ, μk0Mˆ, μk1Mˆ, dk0Mˆ and dk1Mˆ for any kK.

The following result is an important property of the penalty parameter σk, which is obtained by the updating rule (3.14).

Lemma 4.3

Suppose the assumptions A1-A5 hold. Then the penalty parameter σk is updated only in a finite number of steps.

Based on Lemma 4.3, in the rest of the paper, we assume, without loss of generality, that σkσ˜ for all k, where

σ˜>supk{(3ξ)maxjE|μjk0|}.

By using of Lemma 4.2, we obtain the following result.

Lemma 4.4

Suppose the assumptions A1-A5 hold. Then there exists a constant c2>0 such that

dkdk0c2dk0. 4.3

For the sake of simplicity, in the rest of this section, let (d0,μ0,λ0) be the solution of the following SLE in (d,μ,λ):

{Hd+A(x)Tλ+jEμjhj(x)=f(x),(ΛsIm)A(x)d+(A(x)sIm)λ=0,hj(x)Td=hj(x),jE. 4.4

Let (d1,μ1,λ1) be the solution of the following SLE in (d,μ,λ):

{Hd+A(x)Tλ+jEμjhj(x)=f(x),(ΛsIm)A(x)d+(A(x)sIm)λ=λd0,hj(x)Td=hj(x),jE. 4.5

From the above equalities and Lemma 4.2, we obtain the following conclusion.

Lemma 4.5

Suppose the assumptions A1-A5 hold, and δkKδ. Then

  • (i)

    dk0Kd0, μk0Kμ0, λk0Kλ0,

  • (ii)

    dk1Kd1, μk1Kμ1, λk1Kλ1,

  • (iii)

    d0=0 if and only if d=0 where d=(1δ)d0+δd1.

Remark 4.1

By (3.13), we know that {δk} is bounded, so in the rest of the paper, we assume, without loss of generality, that δkKδ.

Lemma 4.6

Suppose the assumptions A1-A5 hold. Let x be an accumulation point of the sequence {xk} and xkKx. If dkK0, then x is a KKT point or a stationary point of NLSDP (1.1), and λkKsvec(Λ), μkKμ, where (Λ,μ) is the Lagrangian multiplier corresponding to x.

Proof

It is clear from Lemma 4.2 that {λk} and {μk} are bounded. Assume that λ̂, μ̂ are accumulation points of {λk} and {μk}, respectively. Without loss of generality, we assume that λkKλˆ and μkKμˆ.

Obviously, (dk,λk,μk) satisfies the SLE (3.25a)-(3.25c). By taking the limit on K in (3.25a)-(3.25c), we obtain

A(x)λˆ+jEμˆjhj(x)=f(x), 4.6a
(A(x)sI)λˆ=0, 4.6b
hj(x)=0,jE. 4.6c

If xF0, i.e., A(x)0, then we know from Lemma 2.1(4) that A(x)sI is nonsingular, so the equation (4.6b) has a unique solution λˆ=0. Let Λˆ:=smat(λˆ)=0, so ΛˆA(x)=0. Together with (4.6a) and (4.6c), we conclude that x is a KKT point of NLSDP (1.1).

If xΩF0, let Λˆ:=smat(λˆ). It follows from (4.6b) that sym(ΛˆA(x))=0, which means that ΛˆA(x) is a skw-symmetric matrix. Hence Tr(ΛˆA(x))=0. According to Remark 2.2, we obtain ΛˆA(x)=0. Combining with (4.6a) and (4.6c), x is a stationary point of NLSDP (1.1). (λ,μ) is the Lagrangian multiplier corresponding to x, that is,

A(x)Tλ+jEμjhj(x)=f(x),ΛA(x)=0,

where Λ=smat(λ). It is not difficult to verify that (λ,μ) is the solution of the following SLE:

A(x)Tλ+jEμjhj(x)=f(x), 4.7a
(A(x)sI)λ=0. 4.7b

From (4.6a)-(4.6c), we know that (λˆ,μˆ) is also the solution of (4.7a)-(4.7b). It is clear from the assumption A1 that the solution of (4.7a)-(4.7b) is unique, therefore, λˆ=λ, μˆ=μ. The proof is completed. □

Based on Lemma 4.6, the following conclusion is obvious.

Lemma 4.7

Suppose the assumptions A1-A5 hold. Let xkKx. If dk1K0, then x is a KKT point or a stationary point of NLSDP (1.1).

Lemma 4.8

Suppose the assumptions A1-A5 hold, xkKx. If infK{dk1}>0, then dkK0.

Proof

By contradiction, we assume that there exist a subset KK and a constant d¯>0 such that dkd¯, k(K) large enough. From the assumptions A1-A5, (3.13) and the updating rule of Λk, we assume without loss of generality that HkKH, δkKδ, ΛkKΛ. On the other hand, it follows from the updating rule of Λk and the assumption A4 that Λ is positive definite. According to Lemma 4.5(iii), there exists d_>0 such that dk0d_ for all kK.

Firstly, we show that there exists t_>0 independent of k such that (3.15) and (3.16) are satisfied for all tt_. For any kK, it is clear from the assumptions A1 and A5 and Lemmas 3.3-3.4 and Lemmas 4.1-4.2 that

P(xk;dk;σ˜)P(xk;0;σ˜)ξad_2. 4.8

Together with (3.27)-(3.28), there exists tf>0 independent of k such that

P(xk+tdk;σ˜)P(xk;σ˜)tα[P(xk;dk;σ˜)P(xk;0;σ˜)] 4.9

for all kK and t(0,tf], where α(0,1). The above inequality shows the inequality (3.15) holds.

We next prove the inequality (3.16) holds. It follows from (3.8) and Lemma 2.1(4) and Lemma 2.4 that

|f(xk)Tdk0+(μk0)Th(xk)|=|(dk0)THkdk0+(λk0)T((ΛksIm)1(A(xk)sIm))Tλk0|adk02.

Combining with Lemmas 4.1-4.2 and (3.13), there exists a constant 0<δ˜1 such that δkδ˜ for kK. By the mean-value theorem and Lemmas 4.1-4.2, we obtain

A(xk+tdk)=A(xk)+tDA(xk)dk+t2(D2A(x+tϑdk)(dk,dk))A(xk)+tDA(xk)dk+t2M3Im 4.10

for any kK, where ϑ(0,1), M=max{Mˆ,M}. Let N(t;xk)=A(xk)+tDA(xk)dk+t2M3Im, the above inequality is rewritten as

A(xk+tdk)N(t;xk), 4.11

thus, in order to prove that A(xk+tdk) is negative definite, it is sufficient to prove that N(t;xk) is negative definite. In view of Λk0, the definition (2.2) of sym and Lemma 2.2, it is sufficient to show that there exists tA>0 independent of k such that

sym(ΛkN(t;xk))0,t(0,tA]. 4.12

In view of (2.10), (2.5) and Lemma 2.1(1), we obtain

(ΛksIm)A(xk)dk=svec(sym(ΛkDA(xk)dk)). 4.13

Let Λk=smat(λk), i.e., λk=svec(Λk), it is obvious from (2.5) that

(A(xk)sIm)λk=(A(xk)sIm)svec(Λk)=svec(sym(ΛkA(xk))). 4.14

Hence, (4.13), (4.14) and (3.25b) give rise to

sym(ΛkDA(xk)dk+ΛkA(xk))=smat(svec(sym(ΛkDA(xk)dk))+svec(sym(ΛkA(xk))))=smat(δkλkdk0)=δkdk0Λk.

Based on the above equality, we have

sym(ΛkN(t;xk))=sym(Λk(A(xk)+tDA(xk)dk+t2M3Im))=sym((ΛktΛk)A(xk))+(t2M3Λktδkdk0Λk)sym((ΛktΛk)A(xk))+(2t2M3tδ˜d_)Λk; 4.15

note the positive definiteness of Λk, hence, if

max{vT((ΛktΛk)A(xk))v:vRm,v=1}0,for any kK, 4.16

then (4.12) holds for tδ˜d_2M3.

Since Λk and A(xk) are symmetric and commuting, there exists an orthogonal matrix Qk such that

Λk=QkTDλkQk,A(xk)=QkTDAkQk, 4.17

where Dλk and DAk are diagonal matrices. Then (ΛktΛk)A(xk)=QkT(DλktQkΛkQkT)DAkQk. Let Λ˜k=QkΛkQkT, so in order to prove (4.16), it is enough to show that there exists a constant tA>0 such that

vT((DλktΛ˜k)DAk)v0,vv=1, 4.18

for any t(0,tA) and kK. By Lemma 4.6 and Λk=smat(λk), we know {Λk} is bounded, furthermore, {Λ˜k} is also bounded. Let Λ˜ be an accumulation point of {Λ˜k}. Without loss of generality, we assume that Λ˜kKΛ˜. Let Bk=Λ˜kΛ˜, obviously, BkK0, thus there exists γ>0 such that

|vT(BkDAk)v|<γ 4.19

for any kK. Note that

vT(DλktΛ˜k)DAkv=vT(DλktΛ˜)DAkvtvT(BkDAk)v. 4.20

It follows from the assumption A4 that all eigenvalues of Dλk are between λI and λs for all k. According to Weyl’s theorem (see [6]), there exists t1>0 such that all eigenvalues of (DλktΛ˜) are positive for any t(0,t1]. We also know from A(xk)0 and the second equality in (4.17) that DA is negative definite. Therefore, for any v with v=1 and t(0,t1], it follows from Lemma 2.3 that (DλktΛ˜)DAk is also negative definite. Combining with (4.19), for any v with v=1 and any t(0,t1), we obtain

vT((DλktΛ˜)DAk)vtvT(BkDAk)v0, 4.21

together with (4.20) shows that (4.18) is satisfied, further, (4.16) and (4.12) hold.

Let tA=min{t1,md_2M3}, thus (4.12) holds for any t(0,tA]. Hence, we see that A(xk+tdk)0 holds for t(0,tA] and any kK. Let t¯=min{tf,tA}, for any t_(0,t¯], (3.15) and (3.16) are satisfied for all tt_. Combining with (4.8) and (4.9), we obtain for any kK

P(xk+1;σ˜)P(xk;σ˜)t_αξad_2. 4.22

On the other hand, the sequence {P(xk;σ˜)} decreases monotonically and P(xk;σ˜)KP(x;σ˜), so {P(xk;σ˜)}k=1 is convergent. Let limkP(xk;σ˜)=ϱ and taking the limit in the above inequality, we have t_ξαad_20, which is a contradiction. Hence, dkK0. □

Based on Lemmas 4.6-4.8, the following global convergence of Algorithm A is immediate.

Theorem 4.1

Suppose the assumptions A1-A5 hold. Then Algorithm A either terminates in a finite number of iterations at a KKT point of the NLSDP (1.1), or it generates a sequence {xk} whose every accumulation point is a KKT point or a stationary point of the NLSDP (1.1).

Numerical experiments

Algorithm A has been implemented in Matlab 2011b and the codes have been run on a 3.40 GHz Intel(R) Core(TM)i3-3240 machine with a Windows 7 system. We choose H0 as n-order identical matrix and at each iteration, Hk is updated by the damped BFGS formula in [15] and Λ0 as m-order identical matrix. In the numerical experiments, we choose the parameters as follows:

α=0.25,β=0.5,ξ=0.5,λI=0.5,σ1=0.5,ρ1=1,ρ2=2.

The stop criterion is dk0104.

The test problems are described as follows:

I. The first test problem is Rosen-Suzuki problem [29] combined with a negative semidefinite constraint and denoted by CM:

minf0(x)=x12+x22+2x32+x425x15x221x3+7x4s.t. x12+x22+x32+x42+x1x2+x3x48=0,s.t.x12+2x22+x32+2x42x1x49=0,s.t.2x12+x22+x32+2x1x2x45=0,s.t.(x2x300002x4x100x12x40000x2x3)0.

II. We select some test problems from [7] only with equality constraints and we add a negative semidefinite matrix constraint.

  1. We select the problems HS6, HS7, HS8, HS9 combined with the following 2×2 order symmetric matrix which comes from [14] and rename them MHS6, MHS7, MHS8 and MHS9, respectively:
    (x12x12x12x22)0.
  2. Choose the problems HS26, HS27, HS28 and HS61 combined with the following 3×3 order symmetric matrix and rename them MHS26, MHS27, MHS28 and MHS61, respectively:
    (x12x120x12x22000x34)0.
  3. Choose the problems HS40, HS42, HS47, HS48, HS50, HS51, HS77 and HS79, adding the negative semidefinite matrix constraint in the problem CM and renaming them MHS40, MHS42, MHS47, MHS48, MHS50, MHS51, MHS77 and MHS79.

III. Nearest correlation matrix problem (NCM for short) (see [23]):

minf(X)=12XAFs.t. XϵI,s.t.Xii=1,i=1,2,,m,

where ASm is given. In NCM problem, eigenvalues of X should not be less than ϵ, and the diagonal elements of X are equal to 1. Elements of the matrix A are uniform random numbers in [1,1] with Aii=1, i=1,2,,m. Set ϵ=103. In addition, we compare with the results of [23] (Algo. SDPIP for short) and [24] (Algo. YYNY for short), and their results from [24].

The numerical results are listed in Table 1 and Table 2. The meanings of the notations in Table 1 and Table 2 are as follows:

  • n: the number of variables;

  • l: the number of equality constraints;

  • m: the dimensionality of the negative semidefinite matrix;

  • Iter.: the number of iterations;

  • NF: the number of evaluations for f(x);

  • NC: the number of evaluations for all constraint functions;

  • ffinal: the optimal value;

  • Time (s): the time of calculation;

  • -: means that the result is not given.

Table 1.

The numerical results of test problems I and II

Problem n l m x0 Iter. NF NC ffinal Time (s)
CM 4 3 4 (2.5,2.5,2.5,2.5)T 19 72 72 −4.400000e + 001 4.097408e − 001
PHS6 2 1 2 (2,2)T 99 128 128 1.226381e − 006 3.541575e − 001
PHS7 2 1 2 (1,5)T 43 169 169 −1.732051e + 000 3.551911e − 001
PHS8 2 2 2 (1,4)T 4 4 4 −1 2.195229e − 001
PHS9 2 1 2 (4,4)T 2 2 2 −4.999996e − 001 2.025914e − 001
PHS26 3 1 3 (1.5,1.5,1.5)T 28 28 28 3.726010e − 005 2.514937e − 001
PHS27 3 1 3 (1,1,1)T 17 17 17 5.426241e − 002 2.354974e − 001
PHS28 3 1 3 (1,1,1)T 6 6 6 6.756098e − 001 1.708627e − 001
PHS40 4 3 4 (0.5,0.5,0.5,0.5)T 8 10 10 −2.500001e − 001 2.773717e − 001
PHS42 4 2 4 (1,1,1,1)T 17 28 28 1.385766e + 001 2.415490e − 001
PHS47 5 3 4 (1,1,1,1,1)T 31 80 80 2.910505e − 001 2.642828e − 001
PHS48 5 2 4 (3,3,3,3,3)T 49 140 140 3.060758e − 008 2.962501e − 001
PHS50 5 3 4 (3,3,3,3,3)T 23 84 84 2.390072e − 009 3.139633e − 001
PHS51 5 3 4 (1,1,1,1,1)T 13 14 14 4.687353e − 008 2.302719e − 001
PHS61 3 2 3 (2.5,2.5,2.5)T 59 59 59 −8.191909e + 001 3.401501e − 001
PHS77 5 2 4 (1,1,1,1,1)T 23 25 25 2.415051e − 001 2.393263e − 001
PHS79 5 3 4 (1,1,1,1,1)T 44 50 50 7.877716e − 002 3.415668e − 001

Table 2.

The numerical results for NCM problem

n l m Algorithm Iter. NF NC
10 5 5 Algo. A 8 15 15
Algo. YYNY 8 - -
Algo. SDPIP 9 - -
45 10 10 Algo. A 10 19 19
Algo. YYNY 8 - -
Algo. SDPIP 10 - -
105 15 15 Algo. A 10 20 20
Algo. YYNY 10 - -
Algo. SDPIP 11 - -
190 20 20 Algo. A 10 18 18
Algo. YYNY 11 - -
Algo. SDPIP 12 - -
300 25 25 Algo. A 10 25 25
Algo. YYNY 10 - -
Algo. SDPIP 11 - -
435 30 30 Algo. A 10 19 19
Algo. YYNY 9 - -
Algo. SDPIP 10 - -
595 35 35 Algo. A 11 25 25
Algo. YYNY 11 - -
Algo. SDPIP 11 - -
780 40 40 Algo. A 11 24 24
Algo. YYNY 11 - -
Algo. SDPIP 11 - -
1,225 50 50 Algo. A 12 34 34
Algo. YYNY - -
Algo. SDPIP - -

Concluding remarks

We have presented a globally convergent QP-free algorithm for nonlinear SDP problems. Based on KKT conditions of nonlinear SDP problems and techniques of perturbation, we construct two SLEs skillfully. Under some linear independence condition, the SLEs have unique solution. At each iteration, the search direction is yielded by solving two SLEs with the same coefficient matrix; some penalty function is used as the merit function for line search and the penalty parameter is updated automatically in the algorithm. The preliminary numerical results show that the proposed algorithm is effective and comparable.

Acknowledgements

Project supported by the Natural Science Foundation of China (No. 11561005), the Natural Science Foundation of Guangxi Province (Nos. 2016GXNSFAA380248, 2014GXNSFFA118001).

Footnotes

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

All authors read and approved the final manuscript.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Jarre F. An interior point method for semidefinite programming. Optim. Eng. 2000;1:347–372. doi: 10.1023/A:1011562523132. [DOI] [Google Scholar]
  • 2.Ben TA, Jarre F, Kocvara M, Nemirovski A, Zowe J. Optimization design of trusses under a nonconvex global buckling constraint. Optim. Eng. 2000;1:189–213. doi: 10.1023/A:1010091831812. [DOI] [Google Scholar]
  • 3.Wolkowicz H, Saigal R, Vandenberghe L, editors. Handbook of Semidefinite Programming. Boston: Kluwer Academic; 2000. [Google Scholar]
  • 4.Freund RW, Jarre F, Vogelbusch CH. Nonlinear semidefinite programming: sensitivity, convergence, and an application in passive reduced-order modeling. Math. Program. 2007;109:581–611. doi: 10.1007/s10107-006-0028-x. [DOI] [Google Scholar]
  • 5.Gao ZY, He GP, Wu F. Sequential systems of linear equation algorithm with arbitrary initial point. Sci. China Ser. A. 1997;27:24–33. [Google Scholar]
  • 6.Horn RA, Johnson CR. Matrix Analysis. Cambridge: Cambridge University Press; 1985. [Google Scholar]
  • 7.Hock W, Schittkowski K. Test Examples for Nonlinear Programming Codes. Berlin: Springer; 1981. [Google Scholar]
  • 8.Jian JB, Quan R, Cheng WX. A feasible QP-free algorithm combining the interior point method with active set for constrained optimization. Comput. Math. Appl. 2009;58:1520–1533. doi: 10.1016/j.camwa.2009.07.018. [DOI] [Google Scholar]
  • 9.Kanzow C, Nagel C, Kato H, Fukushima M. Successive linearization methods for nonlinear semidefinite programs. Comput. Optim. Appl. 2005;31:251–273. doi: 10.1007/s10589-005-3231-4. [DOI] [Google Scholar]
  • 10.Kovara M, Stingl M. PENNON: a code for convex nonlinear and semidefinite programming. Optim. Methods Softw. 2003;18:317–333. doi: 10.1080/1055678031000098773. [DOI] [Google Scholar]
  • 11.Luo HZ, Wu HX, Chen GT. On the convergence of augmented Lagrangian methods for nonlinear semidefinite programming. J. Glob. Optim. 2012;54:599–618. doi: 10.1007/s10898-011-9779-x. [DOI] [Google Scholar]
  • 12.Li JL, Lv J, Jian JB. A globally and superlinearly convergent primal-dual interior point method for general constrained optimization. Numer. Math., Theory Methods Appl. 2015;8:313–335. doi: 10.4208/nmtma.2015.m1338. [DOI] [Google Scholar]
  • 13.Li JL, Huang RS, Jian JB. A superlinearly convergent QP-free algorithm for mathematical programs with equilibrium constraints. Appl. Math. Comput. 2015;269:885–903. [Google Scholar]
  • 14.Noll D. Local convergence of an augmented Lagrangian method for matrix inequality constrained programming. Optim. Methods Softw. 2007;22:777–802. doi: 10.1080/10556780701223970. [DOI] [Google Scholar]
  • 15.Powell MJD. Numerical Analysis. Berlin: Springer; 1978. A fast algorithm for nonlinearly constrained optimization calculations; pp. 144–157. [Google Scholar]
  • 16.Panier ER, Tits RL, Herskovits N. A QP-free globally convergent, locally superlinear convergent algorithm for inequality constrainted optimization. SIAM J. Optim. 1988;26:788–811. doi: 10.1137/0326046. [DOI] [Google Scholar]
  • 17.Qi HD, Qi LQ. A new QP-free, globally convergent, locally superlinearly convergent algorithm for inequality constrained optimization. SIAM J. Optim. 2000;11:113–132. doi: 10.1137/S1052623499353935. [DOI] [Google Scholar]
  • 18.Shapiro A. First and second order analysis of nonlinear semidefinite programs. Math. Program. 1997;77:301–320. [Google Scholar]
  • 19.Sun DF, Sun J, Zhang LW. The rate of convergence of the augmented Lagrangian method for nonlinear semidefinite programming. Math. Program. 2008;114:349–391. doi: 10.1007/s10107-007-0105-9. [DOI] [Google Scholar]
  • 20.Sun J, Zhang LW, Wu Y. Properties of the augmented Lagrangian in nonlinear semidefinite optimization. J. Optim. Theory Appl. 2006;129:437–456. doi: 10.1007/s10957-006-9078-8. [DOI] [Google Scholar]
  • 21.Todd MJ, Toh KC, Tütüncü RH. On the Nesterov-Todd direction in semidefinite programming. SIAM J. Optim. 1998;8:769–796. doi: 10.1137/S105262349630060X. [DOI] [Google Scholar]
  • 22.Wu HX, Luo HZ, Ding XD, Chen GT. Global convergence of modified augmented Lagrangian methods for nonlinear semidefinite programming. Comput. Optim. Appl. 2013;56:531–558. doi: 10.1007/s10589-013-9568-1. [DOI] [Google Scholar]
  • 23.Yamashita H, Yabe H, Harada K. A primal-dual interior point method for nonlinear semidefinite programming. Math. Program., Ser. A. 2012;135:89–121. doi: 10.1007/s10107-011-0449-z. [DOI] [Google Scholar]
  • 24.Yamakawa Y, Yamashita N, Yabe H. A differentiable merit function for the shifted perturbed Karush-Kuhn-Tucker conditions of the nonlinear semidefinite programming. Pac. J. Optim. 2015;11:557–579. [Google Scholar]
  • 25.Zhu ZB, Zhu HL. A filter method for nonlinear semidefinite programming with global convergence. Acta Math. Sin. 2014;30:1810–1826. doi: 10.1007/s10114-014-3241-1. [DOI] [Google Scholar]
  • 26.Correa R, Ramirez H. A global algorithm for nonlinear semidefinite programming. SIAM J. Optim. 2004;15:303–318. doi: 10.1137/S1052623402417298. [DOI] [Google Scholar]
  • 27.Fares B, Noll D, Apkarian P. Robust control via sequetial semidefinite programming. SIAM J. Control Optim. 2002;40:1791–1820. doi: 10.1137/S0363012900373483. [DOI] [Google Scholar]
  • 28.Aroztegui M, Herskovits J, Roche JR, Baźan E. A feasible direction interior point algorithm for nonlinear semidefinite programming. Struct. Multidiscip. Optim. 2014;50:1019–1035. doi: 10.1007/s00158-014-1090-2. [DOI] [Google Scholar]
  • 29.Chen ZW, Miao SC. A penalty-free method with trust region for nonlinear semidefinite programming. Asia-Pac. J. Oper. Res. 2015;32:1–24. [Google Scholar]

Articles from Journal of Inequalities and Applications are provided here courtesy of Springer

RESOURCES