Skip to main content
Taylor & Francis Open Select logoLink to Taylor & Francis Open Select
. 2017 Oct 12;67(1):1–23. doi: 10.1080/02331934.2017.1387547

New verifiable stationarity concepts for a class of mathematical programs with disjunctive constraints

Matúš Benko a,*, Helmut Gfrerer
PMCID: PMC5761710  PMID: 29375237

Abstract

In this paper, we consider a sufficiently broad class of non-linear mathematical programs with disjunctive constraints, which, e.g. include mathematical programs with complemetarity/vanishing constraints. We present an extension of the concept of Q-stationarity which can be easily combined with the well-known notion of M-stationarity to obtain the stronger property of so-called QM-stationarity. We show how the property of QM-stationarity (and thus also of M-stationarity) can be efficiently verified for the considered problem class by computing Q-stationary solutions of a certain quadratic program. We consider further the situation that the point which is to be tested for QM-stationarity, is not known exactly, but is approximated by some convergent sequence, as it is usually the case when applying some numerical method.

Keywords: Mathematical programs with disjunctive constraints, B-stationarity, M-stationarity, QM-stationarity

1. Introduction

In this paper, we consider the following mathematical program with disjunctive constraints (MPDC)

minxRnf(x)subject toFi(x)Di:=j=1KiDij,i=1,,mD, (1)

where the mappings f:RnR and Fi:RnRli, i=1,,mD are assumed to be continuously differentiable and DijRli, j=1,,Ki, i=1,,mD are convex polyhedral sets.

Denoting m:=i=1mDli,

F:=(F1,,FmD):RnRm,D:=i=1mDDi (2)

we can rewrite the MPDC (1) in the form

minxRnf(x)subject toF(x)D. (3)

It is easy to see that D can also be written as the union of i=1mDKi convex polyhedral sets by

D=νJD(ν)withJ:=i=1mD{1,,Ki},D(ν):=i=1mDDiνi. (4)

As an example for MPDC consider a mathematical program with complementarity constraints (MPCC) given by

minxRnf(x)subject togi(x)0,i=1,mI,hi(x)=0,i=1,mE,Gi(x)0,Hi(x)0,Gi(x)Hi(x)=0,i=1,mC (5)

with f:RnR, gi:RnR, i=1,,mI, hi:RnR, i=1,,mE, Gi,Hi:RnR, i=1,,mC. This problem fits into our setting (1) with mD=mC+1,

F1=(g1,,gmI,h1,hmE)T,D11=R-mI×{0}mE,l1=mI+mE,K1=1Fi+1=(-Gi,-Hi)T,Di+11={0}×R-,Di+12=R-×{0},li+1=Ki+1=2,i=1,,mC.

MPCC is known to be a difficult optimization problem, because, due to the complementarity constraints Gi(x)0, Hi(x)0, Gi(x)Hi(x)=0, many of the standard constraint qualifications of nonlinear programming are violated at any feasible point. Hence, it is likely that the usual Karush–Kuhn–Tucker conditions fail to hold at a local minimizer and various first-order optimality conditions such as Abadie (A-), Bouligand (B-), Clarke (C-), Mordukhovich (M-) and Strong (S-) stationarity conditions have been studied in the literature [19].

Another prominent example is the mathematical program with vanishing constraints (MPVC)

minxRnf(x)subject togi(x)0,i=1,mI,hi(x)=0,i=1,mE,Hi(x)0,Gi(x)Hi(x)0,i=1,mV (6)

with f:RnR, gi:RnR, i=1,,mI, hi:RnR, i=1,,mE, Gi,Hi:RnR, i=1,,mV. Again, the problem MPVC can be written in the form (1) with mD=mV+1, F1, D11 as in the case of MPCC and

Fi+1=(-Hi,Gi)T,Di+11={0}×R,Di+12=R-2,li+1=Ki+1=2,i=1,,mV.

Similar as in the case of MPCC, many of the standard constraint qualifications of non-linear programming can be violated at a local solution of (6) and a lot of stationarity concepts have been introduced. For a comprehensive overview for MPVC we refer to [10] and the references therein.

However, when we do not formulate MPCC or MPVC as a non-linear program but as a disjunctive program MPDC, then first-order optimality conditions can be formulated which are valid under weak constraint qualifications. We know that a local minimizer is always B-stationary, which geometrically means that no feasible descent direction exists, or, in a dual formulation, that the negative gradient of the objective belongs to the regular normal cone of the feasible region, cf. [11, Theorem 6.12]. The difficult task is now to estimate this regular normal cone. For this regular normal cone always a lower inclusion is available, which yields so-called S-stationarity conditions. For an upper estimate, one can use the limiting normal cone which results in the so-called M-stationarity conditions. The notions of S-stationarity and M-stationarity have been introduced in [12] for general programs (3). S-stationarity always implies B-stationarity, but it requires some strong qualification condition on the constraints which is too restrictive. On the other hand, M-stationarity requires only some weak constraint qualification but it does not preclude the existence of feasible descent directions. Further, it is not known in general how to efficiently verify the M-stationarity conditions, since the description of the limiting normal cone involves some combinatorial structure which is not known to be resolved without enumeration techniques. These difficulties in verifying M-stationarity have also some impact for numerical solution procedures. E.g. for many algorithms for MPCC it cannot be guaranteed that a limit point is M-stationary, cf. [13].

In the recent paper [14], we derived another upper estimate for the regular normal cone yielding so-called Q-stationarity conditions. Q-stationarity has the advantage over S-stationarity that it does not require such unnecessarily strong constraint qualification conditions. Q-stationarity can be easily combined with M-stationarity to obtain so-called QM-stationarity which is stronger than M-stationarity. This is one of the advantages of QM-stationarity: there are several stationarity notions, in particular in the MPCC literature, like M-, C-, A- and weak stationarity, which are valid under weak constraint qualification conditions. M-stationarity is known to be the strongest stationarity concept and we even improve M-stationarity by QM-stationarity.

For the disjunctive formulations of the problems MPCC and MPVC the Q- and QM-stationarity conditions have been worked out in detail in [14]. In this paper, we extend this approach to the general problem MPDC. We show that under a qualification condition which ensures S-stationarity of local minimizers, Q-stationarity and S-stationarity are equivalent. Further, we prove that under some weak constraint qualification every local minimizer of MPDC is a QM-stationary solution and we provide an efficient algorithm for verifying QM-stationarity of some feasible point. More exactly, this algorithm either proves the existence of some feasible descent direction, i.e. the point is not B-stationary, or it computes multipliers fulfilling the QM-stationarity condition. To this end, we consider quadratic programs with disjunctive constraints (QPDC), i.e. the objective function f in MPDC is a convex quadratic function and the mappings Fi, i=1,,mD are linear. We propose a basic algorithm for QPDC, which either returns a Q-stationary point or proves that the problem is unbounded. Further, we show that M-stationarity for MPDC is related with Q-stationarity of some QPDC and the combination of the two parts yields the algorithm for verifying QM-stationarity. This algorithm does not rely on enumeration techniques and this is another big advantage of the concepts of Q- and QM-stationarity.

Our approach is well suited to the MPDC (1) when all the numbers Ki, i=1,,mD are small or of moderate size. Our disjunctive structure is not induced by integral variables like, e.g. in [15]. It is also not related to the approach of considering the convex hull of a family of convex sets like in [16,17].

The outline of the paper is as follows. In Section 2, we recall some basic definitions from variational analysis and discuss various stationarity concepts. In Section 3, we introduce the concepts of Q- and QM-stationarity for general optimization problems. These concepts are worked out in more detail for MPDC in Section 4. In Section 5, we consider quadratic programs with disjunctive linear constraints. We present a basic algorithm for solving such problems, which either return a Q-stationary solution or prove that the problem is not bounded below. In the next section, we demonstrate how this basic algorithm can be applied to a certain quadratic program with disjunctive linear constraints in order to verify M-stationarity or QM-staionarity of a point or to compute a descent direction. In the last Section 7, we present some results for numerical methods for solving MPDC which prevent convergence to non M-stationary and non-QM-stationary points.

Our notation is fairly standard. In Euclidean space Rn we denote by · and ·,· the Euclidean norm and scalar product, respectively, whereas we denote by u:=max{|ui||i=1,,n} the maximum norm. The closed ball around some point x with radius r is denoted by B(x,r). Given some cone QRn, we denote by Q:={qRn|q,q0qQ} its polar cone. By d(x,A):=inf{x--y|yA} we refer to the usual distance of some point x to a set A. We denote by 0+C the recession cone of a convex set C.

2. Preliminaries

For the reader’s convenience, we start with several notions from variational analysis. Given a set ΩRn and a point z¯Ω, the cone

TΩ(z¯)={w|wkw,tk0withz¯+tkwkΩ}

is called the (Bouligand/Severi) tangent/contingent cone to Ω at z¯. The (Fréchet) regular normal cone to Ω at z¯Ω can be equivalently defined either by

N^Ω(z¯):={vRd|lim supzΩz¯v,z-z¯z-z¯0},

where zΩz¯ means that zz¯ with zΩ, or as the dual/polar to the contingent cone, i.e. by

N^Ω(z¯):=TΩ(z¯).

For convenience, we put NΩ(z¯):= for z¯Ω. Further, the (Mordukhovich) limiting/basic normal cone to Ω at z¯Ω is given by

NΩ(z¯):={wRd|zkz¯,wkwwithwkN^Ω(zk)for allk}.

If Ω is convex, then both the regular and the limiting normal cones coincide with the normal cone in the sense of convex analysis. Therefore, we will use in this case the notation NΩ.

Consider now the general mathematical program

minxRnf(x)subject toF(x)D (7)

where f:RnR, F:RnRm are continuously differentiable and DRm is a closed set. Let

Ω:={xRn|F(x)D} (8)

denote the feasible region of the program (7). Then a necessary condition for a point x¯Ω being locally optimal is

f(x¯),u0uTΩ(x¯), (9)

which is the same as

-f(x¯)N^Ω(x¯), (10)

cf. [11, Theorem 6.12]. The main task of applying this first-order optimality condition now is the computation of the regular normal cone N^Ω(x¯) which is very difficult for nonconvex D.

We always have the inclusion

F(x¯)TN^D(F(x¯))N^Ω(x¯), (11)

but equality will hold in (11) for nonconvex sets D only under comparatively strong conditions, e.g. when F(x¯) is surjective, see [11, Exercise 6.7]. The following weaker sufficient condition for equality in (11) uses the notion of metric subregularity.

Definition 1:

A multifunction Ψ:RnRm is called metrically subregular at a point (x¯,y¯) of its graph gphΨ with modulus κ>0, if there is a neighborhood U of x¯ such that

d(x,Ψ-1(y¯))κd(y¯,Ψ(x))xU.

Theorem 1:

[18, Theorem 4]]Let Ω be given by (8) and x¯Ω. If the multifunction xF(x)-D is metrically subregular at (x¯,0) and if there exists a subspace LRm such that

TD(F(x¯))+LTD(F(x¯)) (12)

and

F(x¯)Rn+L=Rm, (13)

then

N^Ω(x¯)=F(x¯)TN^D(F(x¯)).

In order to state an upper estimate for the regular normal cone N^Ω(x¯) we need some constraint qualification.

Definition 2:

[12, Definition 6]] Let Ω be given by (8) and let x¯Ω.

  • (1)
    We say that the generalized Abadie constraint qualification (GACQ) holds at x¯ if
    TΩ(x¯)=TΩlin(x¯), (14)
    where TΩlin(x¯):={uRn|F(x¯)uTD(F(x¯))} denotes the linearized cone.
  • (2)
    We say that the generalized Guignard constraint qualification (GGCQ) holds at x¯ if
    (TΩ(x¯))=(TΩlin(x¯)). (15)

Obviously GGCQ is weaker than GACQ, but GACQ is easier to verify by using some advanced tools of variational analysis. E.g. if the mapping xF(x)-D is metrically subregular at (x¯,0) then GACQ is fulfilled at x¯, cf. [19, Proposition 1]. Tools for verifying metric subregularity of constraint systems can be found e.g. in [20].

Proposition 1:

[14, Proposition 3]]Let Ω be given by (8), let x¯Ω and assume that GGCQ is fulfilled, while the mapping uF(x¯)u-TD(F(x¯)) is metrically subregular at (0, 0). Then

N^Ω(x¯)F(x¯)TNTD(F(x¯))(0)F(x¯)TND(F(x¯)). (16)

Note that we always have NTD(F(x¯))(0)ND(F(x¯)), see [11, Proposition 6.27]. However, if D is the union of finitely many convex polyhedral sets, then equality

NTD(F(x¯))(0)=ND(F(x¯)) (17)

holds. This is due to the fact that by the assumption on D there is some neighborhood V of 0 such that (D-F(x¯))V=TD(F(x¯))V.

Let us mention that metric subregularity of the constraint mapping xF(x)-D at (x¯,0) does not only imply GACQ and consequently GGCQ, but also metric subregularity of the mapping uF(x¯)u-TD(F(x¯)) at (0, 0) with the same modulus, see [21, Proposition 2.1].

The concept of metric subregularity has the drawback that, in general, it is not stable under small perturbations. It is well known that the stronger property of metric regularity is robust.

Definition 3:

A multifunction Ψ:RnRm is called metrically regular near a point (x¯,y¯) of its graph gphΨ with modulus κ>0, if there are neighborhoods U of x¯ and V of y¯ such that

d(x,Ψ-1(y))κd(y,Ψ(x))(x,y)U×V.

The infimum of the moduli κ for which the property of metric regularity holds is denoted by

regΨ(x¯,y¯).

In the following proposition, we gather some well-known properties of metric regularity:

Proposition 2:

Let x¯F-1(D) where F:RnRm is continuously differentiable and D is the union of finitely many convex polyhedral sets and consider the multifunctions xΨ(x):=F(x)-D and uDΨ(x¯)(u):=F(x¯)u-TD(F(x¯)). Then

regΨ(x¯,y¯)=regDΨ(x¯)(0,0)=max{1F(x¯)Tλ|λND(F(x¯))=NTD(F(x¯))(0),λ=1}.

Moreover for every κ>regΨ(x¯,y¯) there is a neighborhood W of x¯ such that for all xW the mapping uF(x)u-TD(F(x¯)) is metrically regular near (0, 0) with modulus κ,

λκF(x)TλλND(F(x¯))=NTD(F(x¯))(0) (18)

and

d(u,F(x)-1TD(F(x¯)))κd(F(x)u,TD(F(x¯)))uRn.

Proof The statement follows from [11, Exercise 9.44] together with the facts that by our assumption on D condition (17) holds and that TD(F(x¯)) is a cone.

We now recall some well known stationarity concepts based on the considerations above.

Definition 4:

Let x¯ be feasible for the program (7).

  • (i)

    We say that x¯ is B-stationary, if (9) or, equivalently, (10) hold.

  • (ii)
    We say that x¯ is S-stationary, if
    -f(x¯)F(x¯)TN^D(F(x¯)).
  • (iii)
    We say that x¯ is M-stationary, if
    -f(x¯)F(x¯)TND(F(x¯)).

Every local minimizer of (7) is B-stationary and this stationarity concept is considered to be the most preferable one. S- and M-stationarity have been introduced in [12] as a generalization of these notions for MPCC. Using the inclusion (5) it immediately follows, that S-stationarity implies B-stationarity. However the reverse implication only holds true under some additional condition on the constraints, e.g. under the assumptions of Theorem 1. Note that there always hold the inclusions

F(x¯)TN^D(F(x¯))(TΩlin(x¯))(TΩ(x¯))=N^Ω(x¯).

In order that a B-stationary point is also S-stationarity, both inclusions must be fulfilled with equality, i.e. besides the GGCQ (TΩlin(x¯))=(TΩ(x¯)) which allows to replace the tangent cone by the linearized tangent cone, we need another constraint qualification condition ensuring F(x¯)TN^D(F(x¯))=(TΩlin(x¯)) like the conditions (12) and (13). It is well known that this additional condition is much more restrictive than the usual constraint qualifications allowing the linearization of the problem like metric (sub)regularity of the constraint mapping F(·)-D. Thus in the general case one cannot expect that a local minimizer is also S-stationary. This is the reason why other stationarity concepts like M-stationarity have also to be considered.

A B-stationary point is M-stationary under the very weak assumptions of Proposition 1. However, the inclusion N^Ω(x¯)F(x¯)TND(F(x¯)) can be strict, implying that a M-stationary point x¯ needs not to be B-stationary. Hence, M-stationarity does eventually not preclude the existence of feasible descent directions, i.e. directions uTΩ(x¯) with f(x¯),u<0.

3. On Q- and QM-stationarity

In this section, we consider an extension of the concept of Q-stationarity as introduced in the recent paper [14]. Q-stationarity is based on the following simple observation.

Consider the general program (7), assume that GGCQ holds at the point x¯Ω and assume that we are given K convex cones QiTD(F(x¯)), i=1,,K. Then for each i=1,,K we obviously have TΩlin(x¯)=F(x¯)-1TD(F(x¯))F(x¯)-1Qi implying

N^Ω(x¯)=(TΩlin(x¯))(F(x¯)-1Qi).

If we further assume that (F(x¯)-1Qi)=F(x¯)TQi and by taking into account, that by [14, Lemma 1] we have

(F(x¯)TS1)(F(x¯)TS2)=F(x¯)T(S1(kerF(x¯)T+S2))

for arbitrary sets S1,S2Rm, we obtain

N^Ω(x¯)i=1KF(x¯)TQi=F(x¯)T(Q1(kerF(x¯)T+Q2))i=3KF(x¯)TQi=F(x¯)T(Q1(kerF(x¯)T+Q2)(kerF(x¯)T+Q3))i=4KF(x¯)TQi==F(x¯)TQ1i=2K(kerF(x¯)T+Qi).

Here, we use the convention that for sets S1,,SKRm we set i=lKSi=Rm for l>K. It is an easy consequence of (11), that equality holds in this inclusion, provided F(x¯)T(Q1i=2K(kerF(x¯)T+Qi))F(x¯)TN^D(F(x¯). Hence, we have shown the following theorem.

Theorem 2:

Assume that GGCQ holds at x¯Ω and assume that Q1,,QK are convex cones contained in TD(F(x¯)). If

(F(x¯)-1Qi)=F(x¯)TQi,i=1,,K, (19)

then

N^Ω(x¯)F(x¯)TQ1i=2K(kerF(x¯)T+Qi)=i=1KF(x¯)TQi. (20)

Further, if

F(x¯)TQ1i=2K(kerF(x¯)T+Qi)F(x¯)TN^D(F(x¯)), (21)

then equality holds in (20) and N^Ω(x¯)=F(x¯)TN^D(F(x¯)).

Remark 1:

Condition (19) is e.g. fulfilled, if for each i=1,,K either there is a direction ui with F(x¯)uiriQi or Qi is a convex polyhedral set, cf. [14, Proposition 1].

The proper choice of Q1,,QK is crucial in order that (20) provides a good estimate for the regular normal cone. It is obvious that we want to choose the cones Qi, i=1,,K as large as possible in order that the inclusion (20) is tight. Further, it is reasonable that a good choice of Q1,,QK fulfills

i=1KQi=N^D(F(x¯)) (22)

because then equation (21) holds whenever F(x¯) has full rank. We now show that (21) holds not only under this full rank condition but also under some weaker assumption.

Theorem 3:

Assume that GGCQ holds at x¯Ω and assume that we are given convex cones Q1,,QKTD(F(x¯)) fulfilling (19), (22) and

kerF(x¯)T(Q1-Qi)={0},i=2,,K. (23)

Then

N^Ω(x¯)=F(x¯)TN^D(F(x¯))=F(x¯)TQ1i=2K(kerF(x¯)T+Qi).

In particular, (23) holds if there is a subspace

Li=1K(Qi(-Qi)) (24)

such that (13) holds.

Proof The statement follows from Theorem 2 if we can show that (21) holds. Consider xF(x¯)T(Q1i=2K(kerF(x¯)T+Qi)). Then there are elements λiQi, i=1,,K and μikerF(x¯)T such that λ1=μi+λi, i=2,,K and x=F(x¯)Tλ1. We conclude μi=λ1-λiQ1-Qi, implying μikerF(x¯)T(Q1-Qi)={0} and thus

λ1=λ2==λKi=1KQi=N^D(F(x¯)).

Hence, xF(x¯)TN^D(F(x¯)) and (21) is verified. In order to show the last assertion note that from (24), we conclude LQi and consequently QiL=L. Thus Q1-QiL-L=L, i=2,,K. Since

kerF(x¯)TL=((kerF(x¯)T)+L)=(F(x¯)Rn+L)={0},

it follows that (23) holds.

Corollary 1:

Assume that GGCQ holds at x¯Ω and assume that we are given convex cones Q1,,QKTD(F(x¯)) fulfilling (19) and (22). Further assume that there is some subspace L fulfilling (12) and (13). Then, the sets

Q~i:=Qi+L,i=1,,K

are convex cones contained in TD(F(x¯)),

(F(x¯)-1Q~i)=F(x¯)TQ~i,i=1,,K (25)

and

N^Ω(x¯)=F(x¯)TN^D(F(x¯))=F(x¯)TQ~1i=2K(kerF(x¯)T+Q~i).

Proof Firstly observe that Q~i=Qi+LTD(F(x¯))+LTD(F(x¯)) by (12). Next, consider zriQ~i. By (13) there exists uRn and lL such that F(x¯)u+l=z. Because of -lLQ~i we have z-2lQ~i and thus F(x¯)u=z-l=12z+12(z-2l)riQ~i by [22, Theorem 6.1] implying (25) by taking into account Remark 1. Further, from QiQ~iTD(F(x¯)) it follows that

N^D(F(x¯))=(TD(F(x¯)))i=1KQ~ii=1KQi=N^D(F(x¯)).

Finally, note that LQ~i(-Q~i), i=1,,K and the assertion follows from Theorem 3.

The following definition is motivated by Theorem 2.

Definition 5:

Let x¯ be feasible for the program (7) and let Q1,,QK be convex cones contained in TD(F(x¯)) fulfilling (19).

  • (i)
    We say that x¯ is Q-stationary with respect to Q1,,QK, if
    -f(x¯)F(x¯)TQ1i=2K(kerF(x¯)T+Qi).
  • (ii)
    We say that x¯ is QM-stationary with respect to Q1,,QK, if
    -f(x¯)F(x¯)TND(F(x¯))Q1i=2K(kerF(x¯)T+Qi).

Note that this definition is an extension of the definition of Q- and QM-stationarity in [14], where only the case K=2 was considered.

The following corollary is an immediate consequence of the definitions and Theorem 2.

Corollary 2:

Assume that GGCQ is fulfilled at the point x¯ feasible for (7). Further assume that we are given convex cones Q1,,QKTD(F(x¯)) fulfilling (19). If x¯ is B-stationary, then x¯ is Q-stationary with respect to Q1,,QK. Conversely, if x¯ is Q-stationary with respect to Q1,,QK and (21) is fulfilled, then x¯ is S-stationary and consequently B-stationary.

We know that under the assumptions of Proposition 1 every B-stationary point x¯ for the problem (7) is both M-stationary and Q-stationary with respect to every collection of cones Q1,,QKTD(F(x¯)) fulfilling (19), i.e.

-f(x¯)F(x¯)TND(F(x¯))F(x¯)TQ1i=2K(kerF(x¯)T+Qi)=F(x¯)T(kerF(x¯)T+ND(F(x¯)))Q1i=2K(kerF(x¯)T+Qi).

Comparing this relation with the definition of QM-stationarity we see that QM-stationarity with respect to Q1,,QK is stronger than the simultaneous fulfilment of M-stationarity and Q-stationarity with respect to Q1,,QK. We refer to [14, Example 2] for an example which shows that QM-stationarity is strictly stronger than M-stationarity. This is one of the advantages of QM-stationarity:

However, to ensure QM-stationarity of a B-stationary point x¯, some additional assumption has to be fulfilled.

Lemma 1:

Let x¯ be B-stationary for the program (7) and assume that the assumptions of Proposition 1 are fulfilled at x¯. Further assume that for every λNTD(F(x¯))(0) there exists a convex cone QλTD(F(x¯)) containing λ and satisfying (F(x¯)-1Qλ)=F(x¯)TQλ. Then there exists a convex cone Q1TD(F(x¯)) fulfilling (F(x¯)-1Q1)=F(x¯)TQ1 such that for every collection Q2,,QKTD(F(x¯)) fulfilling (19) the point x¯ is QM-stationary with respect to Q1,,QK.

Proof From the definition of B-stationarity and (16) we deduce the existence of λNTD(F(x¯))(0) fulfilling -f(x¯)=F(x¯)Tλ. By taking Q1=Qλ we obviously have λNTD(F(x¯))(0)Q1ND(F(x¯))Q1 implying -f(x¯)F(x¯)T(ND(F(x¯))Q1). Now consider cones Q2,,QKTD(F(x¯)) fulfilling (19). Similar to the derivation of Theorem 2 we obtain

-f(x¯)F(x¯)T(ND(F(x¯))Q1)i=2KF(x¯)TQi=F(x¯)TND(F(x¯))Q1i=2K(kerF(x¯)T+Qi)

and the lemma is proved.

Lemma 2:

Let x¯ be feasible for (7) and assume that TD(F(x¯)) is the union of finitely many closed convex cones C1,,Cp. Then for every λNTD(F(x¯))(0) there is some i¯{1,,p} satisfying λCi¯.

Proof Consider λNTD(F(x¯))(0). By the definition of the limiting normal cone there are sequences tkTD(F(x¯))0 and λkλ with

λkN^TD(F(x¯))(tk)=i:tkCiTCi(tk)=i:tkCi(TCi(tk))=i:tkCiNCi(tk).

By passing to a subsequence if necessary we can assume that there is an index i¯ such that tkCi¯ for all k and we obtain λkNCi¯(tk)={cCi¯|c,tk=0}Ci¯. Since the polar cone Ci¯ is closed, we deduce λCi¯.

If TD(F(x¯)) is the union of finitely many convex polyhedral cones C1,,Cp, then the mapping uF(x¯)u-TD(F(x¯)) is a polyhedral multifunction and thus metrically subregular at (0, 0) by Robinson’s result [23]. Further we know that for any convex polyhedral cone Q we have (F(x¯)-1Q)=F(x¯)TQ. Hence, we obtain the following corollary.

Corollary 3:

Assume that x¯ is B-stationary for the program (7), that GGCQ is fulfilled at x¯ and that TD(F(x¯)) is the union of finitely many convex polyhedral cones. Then there is a convex polyhedral cone Q1TD(F(x¯)) such that for every collection Q2,,QK of convex polyhedral cones contained in TD(F(x¯)) the point x¯ is QM-stationary with respect to Q1,,QK.

Let us notice that in contrast to S-, M- and many other types of stationarity the properties of Q- and QM-stationarity cannot be characterized by some single multiplier. In fact, Q- and QM-stationarity with respect to Q1,,QK implies the existence of K multipliers λ1,,λK satisfying

λiQi,f(x¯)+F(x¯)Tλi=0,i=1,,K.

In case of QM-stationarity the multiplier λ1 also fulfills the M-stationarity conditions.

Further, let us note that QM-stationarity, although it is stronger that M-stationarity, does not imply B-stationarity in general. Thus, in general QM-stationarity is not a sufficient condition for a local minimizer as well.

4. Application to MPDC

It is clear that Q-stationarity is not a very strong optimality condition for every choice of Q1,,QKTD(F(x¯)). As mentioned above the fulfillment of (22) is desirable. For the general problem (7), it can be impossible to choose the cones Q1,,QK such that (22) holds. If TD(F(x¯)) is the union of finitely many convex cones C1,,Cp then we obviously have

N^D(F(x¯))=i=1pCi.

However, to consider Q-stationarity with respect to C1,,Cp is in general not a feasible approach because p is often very large. We will now work out that the concepts of Q- and QM-stationarity are tailored for the MPDC (1). In what follows let D and F be given by (2).

Given a point y=(y1,,ymD)D, we denote by

Ai(y):={j{1,,Ki}|yiDij},i=1,,mD

the indices of sets Dij which contain yi. Further we choose for each i=1,,mD some index set Ji(y)Ai(y) such that

TDi(yi)=jJi(y)TDij(yi). (26)

Obviously the choice Ji(y)=Ai(y) is feasible but for practical reasons it is better to choose Ji(y) smaller if possible. E.g. if TDij2(yi)TDij1(yi) holds for some indices j1,j2Ai(y), then we will not include j2 in Ji(y). Such a situation can occur e.g. in case of MPVC when (-Hi(x¯),Gi(x¯))=(0,a) with a<0.

Now consider

νJ(y):=i=1mDJi(y).

Since for every i=1,,mD the set Di is the union of finitely many convex polyhedral sets, for every tangent direction tTDi(yi) we have yi+αtDi for all α>0 sufficiently small. Hence, we can apply [24, Proposition 1] to obtain

TD(ν)(y)=i=1mDTDiνi(yi),νJ(y)

with D(ν) given by (4), and

TD(y)=i=1mDTDi(yi)=i=1mD(jJi(y)TDij(yi))=νJ(y)TD(ν)(y). (27)

We will apply this setting in particular to points y=F(x¯) with x¯ feasible for MPDC.

Lemma 3:

Let x¯ be feasible for the MPDC (1) and assume that we are given K elements ν1,,νKJ(F(x¯)) such that

{νi1,,νiK}=Ji(F(x¯)),i=1,,mD. (28)

Then for each l=1,,K the cone Ql:=TD(νl)(F(x¯)) is a convex polyhedral cone contained in TD(F(x¯)), (F(x¯)-1Ql)=F(x¯)TQl, and

l=1KQl=N^D(F(x¯)).

Proof Obviously, for every l=1,,K the cone Ql is convex and polyhedral because it is the product of convex polyhedral cones. This implies (F(x¯)-1Ql)=F(x¯)TQl and QlTD(F(x¯)) follows from (27). By taking into account (27) the last assertion follows from

N^D(F(x¯))=(TD(F(x¯)))=i=1mD(jJi(F(x¯))TDij(Fi(x¯)))=i=1mD(l=1KTDiνil(Fi(x¯)))=i=1mD(l=1K(TDiνil(Fi(x¯))))=l=1K(i=1mD(TDiνil(Fi(x¯))))=l=1K(i=1mDTDiνil(Fi(x¯)))=l=1KQl.

Definition 6:

Let x¯ be feasible for the MPDC (1) and let index sets Ji(F(x¯))Ai(x¯), i=1,,mD fulfilling (26) be given. Further we denote by Q(x¯) the collection of all elements (ν1,,νK) with νlJ(F(x¯))=i=1mDJi(F(x¯)), l=1,,K such that (28) holds.

  • (1)

    We say that x¯ is Q-stationary (QM-stationary) for (1) with respect to (ν1,,νK)Q(x¯), if x¯ is Q-stationary (QM-stationary) with respect to Q1,,QK in the sense of Definition 5 with Ql:=TD(νl)(F(x¯)), l=1,,K.

  • (2)

    We say that x¯ is Q-stationary (QM-stationary) for (1) if x¯ is Q-stationary (QM-stationary) for (1) with respect to some (ν1,,νK)Q(x¯).

Definition 6 is an extension of the definition of Q- and QM-stationarity made for MPCC and MPVC in [14]. Note that the number K appearing in the definition of Q(x¯) is not fixed. Denoting Kmin(x¯) the minimal number K such that (ν1,,νK)Q(x¯), we obviously have

Kmin(x¯)=maxi=1,,mD|Ji(F(x¯))|maxi=1,,mDKi. (29)

We see from (27) that the tangent cone TD(F(x¯)) is the union of the |J(F(x¯))|=i=1mD|Ji(F(x¯))| convex polyhedral cones TD(ν)(y). Hence, the minimal number Kmin(x¯) is much smaller than the number of components of the tangent cone, except when all or nearly all sets Ji(F(x¯)) have cardinality 1. E.g. when Ki2 holds for all i=1,,mD as it is the case of the MPCC (5), then we have Kmin(x¯)2 whereas the number of convex cones building the tangent cone TD(F(x¯)) grows exponentially with the number of biactive constraints, i.e. complementarity constraints satisfying Gi(x¯)=Hi(x¯)=0. The concepts of Q- and QM-stationarity for MPDC take advantage of the fact that although the tangent cone is the union of a huge number of cones, its polar cone can be written as the intersection of a small number of polars. Further, it is clear that for every ν1J(F(x¯)) and every KKmin(x¯) we can find ν2,,νKJ(F(x¯)) such that (ν1,,νK)Q(x¯).

We allow K to be greater than Kmin(x¯) for numerical reasons primarily. Recall that for testing Q-stationarity with respect to (ν1,,νK), we have to check for all l=1,,K whether -f(x¯)F(x¯)TQl, or equivalently, that u=0 is a solution of the linear optimization program

minf(x¯),usubject toF(x¯)uQl

with Ql=TD(νl)(F(x¯)). Theoretically the treatment of degenerated linear constraints is not a big problem but the numerical practice tells us the contrary. In [25] we have implemented an algorithm for solving MPVC based on Q-stationarity and the degeneracy of the linear constraints was the reason when the algorithm crashed. The possibility of choosing K>Kmin(x¯) gives us more flexibility to avoid linear programs with degenerated constraints.

The following theorem follows from Corollaries 2, 3, Theorem 3 and the considerations above.

Theorem 4:

Let x¯ be feasible for the MPDC (1) and assume that GGCQ is fulfilled at x¯.

  • (i)

    If x¯ is B-stationary then x¯ is Q-stationary with respect to every element (ν1,,νK)Q(x¯) and there exists some ν¯1J(F(x¯)) such that x¯ is QM-stationary with respect to every (ν¯1,ν2,,νK)Q(x¯).

  • (ii)
    Conversely, if x¯ is Q-stationary with respect to some (ν1,,νK)Q(x¯) and
    F(x¯)T(Q1l=2K(kerF(x¯)T+Ql))F(x¯)TN^D(F(x¯)), (30)
    where Ql:=TD(νl)F(x¯), l=1,,K, then x¯ is S-stationary and consequently B-stationary. In particular, (30) is fulfilled if
    kerF(x¯)T(Q1-Ql)={0},l=2,,K. (31)

5. On quadratic programs with disjunctive constraints

In this section, we consider the special case of quadratic programs with disjunctive constraints (QPDC)

minxRnq(x):=12xTBx+dTxsubject toAixDi:=j=1KiDij,i=1,,mD, (32)

where B is a positive semidefinite n×n matrix, dRn, Ai, i=1,,mD are li×n matrices and DijRli, i=1,,mD, j=1,,Kj are convex polyhedral sets, i.e. QPDC is a special case of MPDC with f(x)=q(x) and Fi(x)=Aix, i=1,,mD. In what follows, we denote by A the m×n matrix

A=A1AmD,

where m:=i=1mDli.

We start our analysis with the following preparatory lemma.

Lemma 4:

Assume that the convex quadratic program

minxRn12xTBx+dTxsubject toAxC (33)

is feasible, where B is some symmetric positive semidefinite n×n matrix, dRn, A is a m×n matrix and CRm is a convex polyhedral set. Then either there exists a direction w satisfying

Bw=0,Aw0+C,dTw<0, (34)

or the program (33) has a global solution x¯.

Proof Assume that for every w with Bw=0, Aw0+C we have dTw0, i.e. 0 is a global solution of the program

mindTwsubject towS:=w|BAw{0}n×0+C.

Since C is a convex polyhedral set, its recession cone 0+C is a convex polyhedral cone and so is {0}n×0+C as well. Hence,

N^S(0)=S=(BTAT)({0}n×0+C)=BTRn+AT(0+C)

and from the first-order optimality condition -dN^S(0) we derive the existence of multipliers μBRn and μC(0+C) such that

-d=BTμB+ATμC.

The convex polyhedral set C is the sum of the convex hull Σ of its extreme points and its recession cone. Hence, for every x feasible for (33) there is some c1Σ and some c20+C such that Ax=c1+c2 and, by taking into account μCTc20, we obtain

12xTBx+dTx=12xTBx-μBTBx-μCTAx=12(x-μB)TB(x-μB)-12μBTBμB-μCTc1-μCTc2-12μBTBμB-μCTc1. (35)

The set Σ is compact and we conclude that the objective of (33) is bounded below on the feasible domain A-1C by -12μBTBμB-maxc1ΣμCTc1. Thus

α:=inf12xTBx+dTx|AxC

is finite and there remains to show that the infimum is attained. Consider some sequence xkA-1C with limk12xkTBxk+dTxk=α. We conclude from (35) that (xk-μB)TB(xk-μB) is bounded which in turn implies that the sequence B1/2xk is bounded. Hence, the sequence xkTBxk=B1/2xk2 is bounded as well and we can conclude also the boundedness of dTxk. By passing to a subsequence we can assume that the sequence (B1/2xk,dTxk) converges to some (z,β) and it follows that α=12z2+β. Since C is a convex polyhedral set, it follows by applying [22, Theorem 19.3] twice, that the sets A-1C and {(B1/2u,dTu)|uA-1C} are convex and polyhedral. Since convex polyhedral sets are closed, it follows that (z,β){(B1/2u,dTu)|uA-1C}. Thus there is some x¯A-1C with (z,β)=(B1/2x¯,dTx¯) and 12x¯TBx¯+dTx¯=α follows. This shows that x¯ is a global minimizer for (33).

In what follows, we assume that we have at hand an algorithm for solving (33), which either computes a global solution x¯ or a descent direction w fulfilling (34). Such an algorithm is, e.g. the active set method as described in [26], where we have to rewrite the constraints equivalently in the form ATai,xbi, i=1,,p using the representation of C as the intersection of finitely many half-spaces, C={c|ai,cbi,i=1,,p}.

Consider now the following algorithm. Inline graphic

Algorithm 1 can be considered as a kind of active index set strategy. The set (νk,1,,νk,K)Q(xk) chosen in step (2) acts as a working set and is a subset of the active pieces of the disjunctive constraints. The number K will also depend on xk and for practical reasons it is desirable to keep K small to have small numerical effort in each iteration. Recall that we can always choose K equal to the number Kmin(xk) given by (29) which is bounded by maxi=1,,mDKi. The working set is used for testing for unboundedness of the problem and Q-stationarity, respectively, by investigating the quadratic subproblems (QPk,l). If one of these subproblem’s problem appears unbounded, we stop the algorithm because of unboundedness of the whole program. If xk is a solution for every quadratic subproblem, we stop the algorithm because xk is Q-stationary. On the other hand, if xk is not a solution of one of these subproblems, then we take the point xk+1 as the solution of this subproblem, yielding a smaller objective function value. Then, we repeat the whole procedure by generating a new working set and testing for termination.

In the next theorem, we show that Algorithm 1 is finite. However, we do not know any nontrivial bound on the number of iterations needed, as usual for active set strategies.

Theorem 5:

Algorithm 1 terminates after a finite number of iterations either with some feasible point and some descent direction w indicating that QPDC is unbounded below or with some Q-stationary solution.

Proof If Algorithm 1 terminates in step (2) the output is a feasible point together with some descent direction showing that QPDC is unbounded below. If the algorithm does not terminate in step (2), the computed sequence of function values q(xk) is strictly decreasing. Moreover, denoting νk:=νk-1,l where l is the index chosen in step (4), we see that for each k2 the point xk is global minimizer of the problem

minq(x)subject toAxD(νk).

This shows that all the vectors νk must be pairwise different and since there is only a finite number of possible choices for νk, the algorithm must stop in step (3). We will now show that the final iterate xk is Q-stationary with respect to (νk,1,,νk,K). Since for each l=1,,K the point xk is a global minimizer of the subproblem (Qk,l), it also satisfies the first order optimality condition

q(xk),u0for everyuRnsatisfyingAuTD(νk,l)(Axk)).

This shows Q-stationarity of xk and the theorem is proved.

6. On verifying QM-stationarity for MPDC

The following theorem is crucial for the verification of M-stationarity.

Theorem 6:

  • (i)
    Let x¯ be feasible for the general program (7). If there exists a B-stationary solution of the program
    min(u,v)Rn×Rmf(x¯),u+12v2subject toF(x¯)u+vTD(F(x¯)), (36)
    then x¯ is M-stationary.
  • (ii)

    Let x¯ be B-stationary for the MPDC (1) and assume that GGCQ holds at x¯. Then the program (36) has a global solution.

Proof

  • (i)

    Let (u¯,v¯) denote a B-stationary solution, i.e. -(f(x¯),v¯)N^Γ(u¯,v¯), where Γ=(F(x¯)I)-1TD(F(x¯)). Since the matrix (F(x¯)I) obviously has full rank, we have N^Γ(u¯,v¯)=(F(x¯)I)TN^TD(F(x¯))(F(x¯)u¯+v¯) by [11, Exercise 6.7]. Thus there exists a multiplier λN^TD(F(x¯))(F(x¯)u¯+v¯) such that -f(x¯)=F(x¯)Tλ and -v¯=λ. Using [11, Proposition 6.27] we have N^TD(F(x¯))(F(x¯)u¯+v¯)NTD(F(x¯))(F(x¯)u¯+v¯)NTD(F(x¯))(0)ND(F(x¯)) establishing M-stationarity of x¯.

  • (ii)
    Consider for arbitrarily fixed νJ(F(x¯)) the convex quadratic program
    min(u,v)Rn×Rmf(x¯),u+12v2subject toF(x¯)u+vTD(ν)(F(x¯)). (37)
    Assuming that this quadratic program does not have a solution, by Lemma 4 we could find a direction (wu,wv) satisfying
    000Iwuwv=0,F(x¯)wu+wv0+TD(ν)(F(x¯)),f(x¯),wu+0,wv<0.
    This implies wv=0, F(x¯)wu0+TD(ν)(F(x¯))=TD(ν)(F(x¯))TD(F(x¯)) and f(x¯),wu<0 and thus, together with GGCQ, -f(x¯)(TΩlin(x¯))=N^D(F(x¯)) contradicting our assumption that x¯ is B-stationary for (1). Hence, the quadratic program (37) must possess some global solution (uν,vν). By choosing ν¯J(F(x¯)) such that f(x¯),uν¯=min{f(x¯),uν|νJ(F(x¯))} it follows from (27) that (uν¯,vν¯) is a global solution of (36).

We now want to apply Algorithm 1 to the problem (36). Note that the point (0, 0) is feasible for (36) and therefore we can start Algorithm 1 with (u1,v1)=(0,0).

Corollary 4:

Let x¯ be feasible for the MPDC (1) and apply Algorithm 1 to the QPDC (36). If the algorithm returns an iterate together with some descent direction indicating that (36) is unbounded below and if GGCQ is fulfilled at x¯, then x¯ is not B-stationary. On the other hand, if the algorithm returns a Q-stationary solution, then x¯ is M-stationary.

Proof Observe that in case when Algorithm 1 returns a Q-stationary solution, by Theorem 4(ii) this solution is B-stationary because the Jocobian of the constraints (F(x¯)I) obviously has full rank. Now the statement follows from Theorem 6.

We now want to analyse how the output of Algorithm 1 can be further utilized. Recalling that TD(F(x¯)) has the disjunctive structure

TD(F(x¯))=i=1mD(jJi(F(x¯))TDij(Fi(x¯))),

we define for y=(y1,,ymD)TD(F(x¯)) the index sets

AiTD(y):={jJi(F(x¯))|yiTDij(Fi(x¯))},i=1,,mD.

Further we choose for each i=1,,mD some index set JiTD(y)AiTD(y) such that

TTDi(Fi(x¯))(yi)=jJiTD(y)TTDij(Fi(x¯))(yi) (38)

and set

JTD(y):=i=1mDJiTD(y).

Note that we always have

JTD(y)J(F(x¯)).

In order to verify Q-stationarity for the problem (36) at some feasible point (uv), we have to consider the set QTD(u,v) consisting of all (ν1,,νK) with νlJTD(F(x¯)u+v), l=1,,K such that

{νi1,,νiK}=JiTD(F(x¯)u+v),i=1,,mD.

At the k-th iterate (uk,vk) we have to choose (νk,1,,νk,K)QTD(uk,vk) and then for each l=1,,K we must analyse the convex quadratic program

(QPk,l)minu,vf(x¯),u+12v2subject toF(x¯)u+vTD(νk,l)(F(x¯)).

If for some l¯{1,,K} this quadratic program is unbounded below then Algorithm 1 returns the index ν¯:=νk,l¯ together with a descent direction (wu,wv) fulfilling, as argued in the proof of Theorem 6(ii),

wv=0,F(x¯)wu0+TD(ν¯)(F(x¯))=TD(ν¯)(F(x¯)),f(x¯),wu<0.

Therefore, wu constitutes a feasible descent direction, provided GACQ holds at x¯, i.e. for every α>0 sufficiently small the projection of x¯+αwu on the feasible set F-1(D) yields a point with a smaller objective function value than x¯. If GACQ also holds for the constraint F(x)D(ν¯) at x¯, then we can also project the point x¯+αwu on F-1(D(ν¯)) in order to reduce the objective function.

Now, assume that the final iterate (uk,vk) of Algorithm 1 is Q-stationary for (36) and consequently x¯ is M-stationary for the MPDC (1). Setting λ:=-vk, the first order optimality conditions for the quadratic programs (QPk,l) result in

-f(x¯)=F(x¯)Tλ,λl=1KNTD(νk,l)(F(x¯))(F(x¯)uk+vk)=N^TD(F(x¯))(F(x¯)uk+vk)ND(F(x¯)).

From this we conclude -f(x¯)F(x¯)T(Q1ND(F(x¯)) with Q1:=TD(ν¯)(F(x¯))TTD(ν¯)(F(x¯))(F(x¯)uk+vk) where ν¯=νk,1 is the index vector returned from Algorithm 1. Now choosing ν2,,νK such that (ν¯,ν2,,νK)Q(x¯) we can simply check by testing -f(x¯)ND(νl)(F(x¯)), l=2,,K, whether x¯ is QM stationary or x¯ is not B-stationary.

Further, we have the following corollary.

Corollary 5:

Let x¯ be B-stationary for the MPDC (1) and assume that GGCQ is fulfilled at x¯. Let ν¯ be the index vector returned by Algorithm 1 applied to (36). Then ν¯J(F(x¯)) and for every ν2,,νK with (ν¯,ν2,,νK)Q(x¯) the point x¯ is QM-stationary with respect to (ν¯,ν2,,νK).

7. Numerical aspects

In practice, the point x¯ which should be checked for M-stationarity and QM-stationarity, respectively, often is not known exactly. E.g. x¯ can be the limit point of a sequence generated by some numerical method for solving MPDC. Hence, let us assume that we are given some point x~ close to x¯ and we want to state some rules when we can consider x~ as approximately M-stationary or QM-stationary. Let us assume that the convex polyhedral sets Dij have the representation

Dij={y|ali,j,ybli,j,l=1,,pi,j},i=1,,mD,j=1,,Ki,

where without loss of generality we assume ali,j=1.

We use here the following approach.

7.

In the first step of Algorithm 2, we want to estimate the tangent cone TD(F(x¯)). In fact, to calculate TD(F(x¯)) we need not to know the point F(x¯), we only need the index sets of constraints active at x¯ and these index sets are approximated by ϵ-active constraints. Note that whenever A~i(x~,ϵ)=A~i(x¯,0)=Ai(F(x¯)) and I~ij(x~,ϵ)=I~ij(x¯,0), i=1,,mD, jAi(F(x¯)) this approach yields the exact tangent cones TDij(F(x¯))=Tij(x~,ϵ) for all i=1,,mD, jAi(F(x¯)). To be consistent with the notation of Section 4 we make the convention that in this case the index vector ν¯ computed in step (2) belongs to J(x¯) and also, whenever we determine ν2,νK is step (4), we have (ν¯,ν2,,νK)Q(x¯). The regularization term σ2u2 in QPDC(x~,ϵ,σ) forces the objective to be strictly convex and therefore Algorithm 1 will always terminate with a Q-stationary solution. Further note that the verification of (39) requires the solution of K-1 linear optimization problems.

The following theorem justifies Algorithm 2. In the sequel, we denote by M(x¯) (Msub(x¯)) the set of all νJ(x¯) such that the mapping F(·)-D(ν) is metrically regular near (x¯,0) (metrically subregular at (x¯,0)).

Theorem 7:

Let x¯ be feasible for the MPDC (1) and assume that f and F are Lipschitz near x¯. Consider sequences xtx¯, ϵt0, σt0 and ηt0 with

limtxt-x¯ϵt=limtσtηt+xt-x¯ηt=0

and let (u~t,v~t), ν¯t and eventually νt,2,νt,Kt and l¯t denote the output of Algorithm 2 with input data (xt,ϵt,σt,ηt).

  • (i)
    For all t sufficiently large and for all i{1,,mD} we have
    A~i(xt,ϵt)=Ai(F(x¯)),I~ij(xt,ϵt)=I~ij(x¯,0),jAi(F(x¯)). (39)
  • (ii)
    Assume that the mapping xF(x)-D is metrically regular near (x¯,0).
    • (a)
      If x¯ is B-stationary then for all t sufficiently large the point xt is accepted as approximately M-stationary and approximately QM-stationary.
    • (b)
      If for infinitely many t the point xt is accepted as approximately M-stationary then x¯ is M-stationary.
    • (c)
      If for infinitely many t the point xt is accepted as approximately QM-stationary and {ν¯t,νt,2,,νt,Kt}M(x¯) then x¯ is QM-stationary.
    • (d)
      For every t sufficiently large such that the point xt is not accepted as approximately M-stationary and ν¯tMsub(x¯) we have min{f(x)|F(x)D(ν¯t)}<f(x¯).
    • (e)
      For every t sufficiently large such that the point xt is not accepted as approximately QM-stationary and νt,l¯tMsub(x¯) we have min{f(x)|F(x)D(νt,l¯t)}<f(x¯).

Proof (i) Let R>0 be chosen such that f, F and their derivatives are Lipschitz on B(x¯,R) with constant L. It is easy to see that we can choose ϵ>0 such that for all i{1,,mD} we have A~i(x¯,ϵ)=A~i(x¯,0)=Ai(F(x¯)) and such that for every jAi(F(x¯)) we have I~ij(x¯,ϵ)=I~ij(x¯,0). Consider t with xt-x¯<R, Lxt-x¯<ϵt<ϵ/2 and fix i{1,,mD}. For every jAi(F(x¯)) we have

d(Fi(xt),Dij)Fi(xt)-Fi(x¯)Lxt-x¯<ϵt,

whereas for jAi(F(x¯)) we have

d(Fi(xt),Dij)d(Fi(x¯),Dij)-Fi(xt)-Fi(x¯)ϵ-Lxt-x¯>ϵt

showing A~i(xt,ϵt)=Ai(F(x¯)). Now fix jAi(F(x¯)) and let lI~ij(x¯,0), i.e. ali,j,Fi(x¯)=bli,j. By taking into account ali,j=1 we obtain

ali,j,Fi(xt)bli,j-Fi(xt)-Fi(x¯)>bli,j-ϵt

implying lI~ij(xt,ϵt), whereas for lI~ij(x¯,0)=I~ij(x¯,ϵ) we have

ali,j,Fi(xt)ali,j,Fi(x¯)+Fi(xt)-Fi(x¯)<bli,j-ϵ+ϵt<bli,j-ϵt

showing lI~ij(xt,ϵt). Hence, I~ij(xt,ϵt)=I~(x¯,0). Because of our assumptions we have xt-x¯<R and Lxt-x¯<ϵt<ϵ/2 for all t sufficiently large and this proves (40).

(ii) In view of Proposition 2, we can choose κ large enough such that the mappings F(·)-D, uF(x¯)u-TD(F(x¯)) and F(·)-D(ν), uF(x¯)u-TD(ν)(F(x¯)), νM(x¯) are metrically regular near (x¯,0) with modulus κ. By eventually shrinking R we can assume that for every xB(x¯,R) the mappings uF(x)u-TD(F(x¯)), uF(x)u-TD(ν)(F(x¯)), νM(x¯) are metrically regular near (0, 0) with modulus κ+1.

Without loss of generality we can assume that xtB(x¯,R) and (40) holds for all t implying that TDij(F(x¯))=Tij(x~,ϵt) holds for all i=1,,mD, jAi(F(x¯)). In fact, then the problem QPDC(xt,ϵt,σt) is the same as

minu,vf(xt),u+σt2u2+12v2subject toF(xt)u+vTD(F(x¯)).

The point (u~t,v~t) is Q-stationary for this program and thus also S-stationary by Theorem 4(ii) and the full rank property of the matrix (F(xt)I). Hence, there is a multiplier λtN^TD(F(x¯))(F(xt)u~t+v~t)NTD(F(x¯))(0) fulfilling v~t+λt=0, f(xt)+σtu~t+F(xt)Tλt=0 and we conclude

v~t=λt(κ+1)f(xt)+σtu~t (40)

from (18).

By Q-stationarity of (u~t,v~t) we know that (u~t,v~t) is the unique solution of the strictly convex quadratic program

minf(xt),u+σt2u2+12v2subject toF(xt)u+vTD(ν¯t)(F(x¯)). (41)

For every α0, the point α(u~t,v~t) is feasible for this quadratic program and thus α=1 is solution of

minα0αf(xt),u~t+α2σt2u~t2+12v~t2

implying

-f(xt),u~t=σtu~t2+v~t2.

Hence,

σtu~t-f(xt),u~tu~tf(xt) (42)

and from (41) we obtain

v~t=λt2(κ+1)f(xt). (43)

(a) Assume on the contrary that x¯ is B-stationary but for infinitely many t the point xt is not accepted as approximately M-stationary and hence u~tηt/σt. This implies

d(F(x¯)u~tu~t,TD(F(x¯)))d(F(xt)u~tu~t,TD(F(x¯)))+Lxt-x¯v~tu~t+Lxt-x¯2(κ+1)f(xt)σtηt+Lxt-x¯

and by the metric regularity of uF(x¯)u-TD(F(x¯)) near (0, 0) we can find u^tF(x¯)-1TD(F(x¯)) with

u^t-u~tu~tκ2(κ+1)f(xt)σtηt+Lxt-x¯.

Our choice of the parameters σt, ηt together with (43) ensures that for t sufficiently large we have

f(x¯),u^tf(x¯),u~tu~t+f(x¯)u^t-u~tu~tf(xt),u~tu~t+Lxt-x¯+f(x¯)u^t-u~tu~t-σtu~t+Lxt-x¯+f(x¯)u^t-u~tu~t-ηt+Lxt-x¯+f(x¯)κ2(κ+1)f(xt)σtηt+Lxt-x¯<0

which contradicts B-stationarity of x¯. Hence, for all t sufficiently large the point xt must be accepted as approximately M-stationary.

To prove the statement that xt is also accepted as approximately QM-stationary for all t sufficiently large we can proceed in a similar way. Assume on the contrary that x¯ is B-stationary but for infinitely many t the point xt is not accepted as approximately QM-stationary. For those t, let wt denote some element fulfilling F(xt)wtTD(νt,l¯t)TD(F(x¯)), wt1 and f(xt),wt-ηt. Then, similar as before we can find w^tF(x¯)-1TD(F(x¯)) such that

w^t-wtκF(x¯)-F(xt)wtκLnxt-x¯

and for large t we obtain

f(x¯),w^tf(xt),wt+f(x¯)-f(xt)wt+f(x¯)w^t-wt-ηt+Ln(1+κf(x¯))xt-x¯<0

contradicting B-stationarity of x¯.

(b) By passing to a subsequence we can assume that for all t the point xt is accepted as approximately M-stationary and hence σtutηt0. By (44) we have that the sequence λtNTD(F(x¯))(0) is uniformly bounded and by passing to a subsequence once more we can assume that it converges to some λ¯NTD(F(x¯))(0). By [11, Proposition 6.27] we have λ¯ND(F(x¯)) and together with

0=limt(f(xt)+F(xt)Tλt)=f(x¯)+F(x¯)Tλ¯

M-stationarity of x¯ is established.

(c) By passing to a subsequence we can assume that for all t the point xt is accepted as approximately QM-stationary and {ν¯t,νt,2,,νt,Kt}M(x¯). Hence, for all t the point xt is also accepted as M-stationary and by passing to a subsequence and arguing as in (b) we can assume that λt converges to some λ¯ND(F(x¯)) fulfilling f(x¯)+F(x¯)Tλ¯=0. Since the set M(x¯) is finite, by passing to a subsequence once more we can assume that there is a number K and elements ν¯,ν2,,νK such that Kt=K , ν¯t=ν¯ and νt,l=νl, l=2,,K holds for all t. Since we assume that (40) holds we have (ν¯,ν2,,νK)Q(x¯) and we will now show that x¯ is QM-stationary with respect to (ν¯,ν2,,νK). Since (u~t,v~t) also solves (42), it follows that λt=-vtNTD(ν¯)(F(x¯))(F(xt)u~t+v~t)ND(ν¯)(F(x¯)) and thus λ¯ND(F(x¯))ND(ν¯)(F(x¯)) implying -f(x¯)F(x¯)T(ND(F(x¯))(TD(ν¯)(F(x¯)))). There remains to show -f(x¯)(TD(νl)(F(x¯)))=ND(νl)(F(x¯)), l=2,,K. Assume on the contrary that -f(x¯)(TD(νl¯)(F(x¯))) for some index l¯{2,,K}. Then there is some uF(x¯)-1TD(νl¯)(F(x¯)), u=12 such that f(x¯),u=:-γ<0 and since νl¯M(x¯), for each t there is some u^tF(xt)-1TD(νl¯)(F(x¯)) with

u-u^t(κ+1)F(x¯)-F(xt)un2(κ+1)Lxt-x¯.

It follows that for all t sufficiently large we have u^t1 and

f(xt),u^tf(x¯),u+f(xt)-f(x¯)u^t+f(x¯)u-u^t-γ+Ln1+κ+12xt-x¯<-ηt

contradicting our assumption that xt is accepted as approximately QM-stationary.

(d), (e) We assume that κ is chosen large enough such that the mappings F(·)-D(ν), νMsub(x¯) are metrically subregular at (x¯,0) with modulus κ. Then by [21, Proposition 2.1] the mappings uF(x¯)u-TD(ν)(F(x¯)), νMsub(x¯) are metrically subregular at (0, 0) with modulus κ as well. Taking into account that (u~t,v~t) solves (42), we can copy the arguments from part (a) with TD(F(x¯)) replaced by TD(ν¯t)(F(x¯)) to show the existence of u^tF(x¯)-1TD(ν¯t)(F(x¯)) with f(x¯),u^t<0 whenever xt is not accepted as approximately M-stationary and t is sufficiently large. In doing so, we also have to recognize that metric regularity of uF(x¯)u-TD(ν¯t)(F(x¯)) can be replaced by the weaker property of metric subregularity. Since ν¯tMsub(x¯), u^t is a feasible descent direction and for sufficiently small α>0 the projection of x¯+αu^t on F-1(D(ν¯t)) yields a point with a smaller objective function value than x¯. This proves (d). In order to show (e), we can proceed in a similar way. Using the same arguments as in part (a), we can prove the existence of a feasible direction w^tTD(νt,l¯t) with f(x¯),w^t<0, whenever t is sufficiently large and xt is not accepted as approximately QM-stationary. Together with νt,l¯tMsub(x¯) the assertion follows.

Funding Statement

The research was supported by the Austrian Science Fund (FWF) [grant number P26132-N25], [grant number P29190-N32].

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  • [1]. Flegel ML, Kanzow C. A Fritz John approach to first order optimality conditions for mathematical programs with equilibrium constraints. Optimization. 2003;52:277–286. [Google Scholar]
  • [2]. Fukushima M, Pang JS. Complementarity constraint qualifications and simplified B-stationary conditions for mathematical programs with equilibrium constraints. Comput Optim Appl. 1999;13:111–136. [Google Scholar]
  • [3]. Kanzow C, Schwartz A. Mathematical programs with equilibrium constraints: enhanced Fritz John conditions, new constraint qualifications and improved exact penalty results. SIAM J Optim. 2010;20:2730–2753. [Google Scholar]
  • [4]. Outrata JV. Optimality conditions for a class of mathematical programs with equilibrium constraints. Math Oper Res. 1999;24:627–644. [Google Scholar]
  • [5]. Outrata JV. A generalized mathematical program with equilibrium constraints. SIAM J Control Optim. 2000;38:1623–1638. [Google Scholar]
  • [6]. Scheel H, Scholtes S. Mathematical programs with complementarity constraints: stationarity, optimality, and sensitivity. Math Oper Res. 2000;25:1–22. [Google Scholar]
  • [7]. Ye JJ. Constraint qualifications and necessary optimality conditions for optimization problems with variational inequality constraints. SIAM J Optim. 2000;10:943–962. [Google Scholar]
  • [8]. Ye JJ. Necessary and sufficient optimality conditions for mathematical programs with equilibrium constraints. J Math Anal Appl. 2005;307:350–369. [Google Scholar]
  • [9]. Ye JJ, Ye XY. Necessary optimality conditions for optimization problems with variational inequality constraints. Math Oper Res. 1997;22:977–997. [Google Scholar]
  • [10]. Hoheisel T. Mathematical programs with vanishing constraints [PhD thesis], Julius-Maximilians-Universität Würzburg; 2009. [Google Scholar]
  • [11]. Rockafellar RT, Wets RJ-B. Variational analysis. Berlin: Springer; 1998. [Google Scholar]
  • [12]. Flegel ML, Kanzow C, Outrata JV. Optimality conditions for disjunctive programs with application to mathematical programs with equilibrium constraints. Set-Valued Anal. 2007;15:139–162. [Google Scholar]
  • [13]. Kanzow C, Schwartz A. The price of inexactness: convergence properties of relaxation methods for mathematical programs with equilibrium constraints revisited. Math Oper Res. 2015;40:253–275. [Google Scholar]
  • [14]. Benko M, Gfrerer H. On estimating the regular normal cone to constraint systems and stationary conditions. Optimization. 2017;66:61–92. [Google Scholar]
  • [15]. Jeroslow R. Representability in mixed integer programming, I: characterization results. Discrete Appl Math. 1977;17:223–243. [Google Scholar]
  • [16]. Balas E. Disjunctive programming and a hierarchy of relaxations for discrete optimization problems. SIAM J Algebraic Discrete Meth. 1985;6:466–486. [Google Scholar]
  • [17]. Ceria S, Soares J. Convex programming for disjunctive convex optimization. Math Program Ser A. 1999;86:595–614. [Google Scholar]
  • [18]. Gfrerer H, Outrata JV. On computation of generalized derivatives of the normal-cone mapping and their applications. Math Oper Res. 2016;41:1535–1556. [Google Scholar]
  • [19]. Henrion R, Outrata JV. Calmness of constraint systems with applications. Math Program Ser B. 2005;104:437–464. [Google Scholar]
  • [20]. Gfrerer H, Klatte D. Lipschitz and Hölder stability of optimization problems and generalized equations. Math Program Ser A. 2016;158:35–75. [Google Scholar]
  • [21]. Gfrerer H. First order and second order characterizations of metric subregularity and calmness of constraint set mappings. SIAM J Optim. 2011;21:1439–1474. [Google Scholar]
  • [22]. Rockafellar RT. Convex analysis. Princeton (NJ): Princeton University Press; 1970. [Google Scholar]
  • [23]. Robinson SM. Some continuity properties of polyhedral multifunctions. Math Program Stud. 1981;14:206–214. [Google Scholar]
  • [24]. Gfrerer H, Ye JJ. New constraint qualifications for mathematical programs with equilibrium constraints via variational analysis. SIAM J Optim. 2017;27:842–865. [Google Scholar]
  • [25]. Benko M, Gfrerer H. An SQP method for mathematical programs with vanishing constraints with strong convergence properties. Comput Optim Appl. 2017;67:361–399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26]. Fletcher R. Practical methods of optimization. Vol. 2, Constrained optimization. Chichester: Wiley; 1981. [Google Scholar]

Articles from Optimization are provided here courtesy of Taylor & Francis

RESOURCES