Skip to main content
Journal of Research of the National Institute of Standards and Technology logoLink to Journal of Research of the National Institute of Standards and Technology
. 2006 Apr 1;111(2):135–141. doi: 10.6028/jres.111.013

Analysis of Interior-Point Paths

J Stoer 1
PMCID: PMC4662503  PMID: 27274925

Abstract

Infeasible-interior-point paths are the main tools in interior-point methods for solving many kinds of optimization problems. These paths are usually parametrized by a penalty-parameter r ↓ 0 and further parameters describing their off-centrality and infeasiblilty. Starting with an early result of C. Witzgall et al. [12] in linear programming, this paper gives an overview on results concerning the existence of these paths, their analyticity and the limiting behavior of their derivatives as r ↓ 0, and this also for degenerate problems in the areas of linear programming, linear complementarity problems, and semi-definte programming.

Keywords: infeasible interior-point paths, linear complementarity problems, linear programs, semidefinite linear complementarity problems, semidefinite linear programs

1. Introduction

This paper deals with the analysis of certain paths that are the main tool in sequential unconstrained optimization techniques (SUMT) and interior point methods. These were first studied systematically by Fiacco and McCormick in [3] as a means to solve constrained minimization problems of the form

inf{f(u)|gi(u)0,i=1,2,m}. (1)

In particular, they used Frisch’s logarithmic penalty function (see Frisch [4], [5]) and studied paths u(r), r > 0, of local minimizers of the penalized problem

inf{f(u)rilog(gi(u))|gi(u)<0,i=1,2,,m}

as r ↓ 0. They investigated the local existence and uniqueness of these paths, their differentiability proper ties for r > 0 and the convergence of both u(r) and u′(r) for r ↓ 0, if r > 0 is small and u(r) is close to a local optimal solution u* of problem (1). They used the crucial assumption that u* satisfies the standard regularity conditions, as the existence of Lagrange multipliers, the first order necessary and the second order sufficiency condition for a local minimum, and the strict complementarity condition. These assumptions are natural and they allow the use of the implicit function theorem to analyze the path. However they also exclude degenerate problems of practical importance: These occur e.g. already with convex and even linear or quadratic problems (1) that have more than one optimal solution u* or which, together with their optimal Lagrange multipliers, violate strict complementarity.

Another limitation of paths like u(r) is that they exist only for strictly feasible optimization problems (1). More general are infeasible-interior-point paths that use strictly feasible solutions of a family of suitably perturbed optimization problems that shrink to the unperturbed problem (1) as the path parameter r tends to zero. Again this poses the question of the existence of such paths and their limiting behavior, also for degenerate unperturbed problems.

It is the purpose of this paper to give an overview over some results that are relevant in this context.

In a joint paper [12] with P. T. Boggs and P. D. Domich, C. Witzgall was one of the first to deal also with degenerate problems in the context of linear programs. Their pioneering results will be outlined in Sec. 2. Similar and even sharper results have been found for much more general classes of programs. We describe some of them that refer to linear complementarity problems in Sec. 3. Still more difficult are semidefinite linear programs and semidefinite linear complementarity problems. Results for them are described in the final Sec. 4.

2. Linear Programs

Consider the dual linear programs

(P)mincTx(D)maxbTux:Ax=by,u:ATu+y=c,x0,y0,

where A = (a1, …, an) is a real m × n-matrix of rank m with columns ai. Let SP and SD be the optimal solution sets of (P) and (D).

Consider for (D) the Frisch logarithmic penalty function and its minimum u(r) for r > 0,

u(r):=argminu{bTuri=1nlog(ciaiTu)|ATu<c}.

The path {u(r)}r>0, if it exists, is called the dual central path; then u = u(r) solves the equation

b=ri=1naiciaiTu (2)

A central result of [12] adapted to (P) and (D) is

Theorem 1 Assume

  1. rank A = m,

  2. (D) is strictly feasible,

  3. SD is nonempty and bounded.

Then the path u(r) is well-defined for r > 0 and the following limits exist:

u*:=limr0u(r)SD,u(0):=limr0u(r)u*r.

The limit point u* is the analytic center of SD, and the function u(r) is analytic in r for r > 0.

Let

y(r):=cATu(r),xi(r):=ryi(r),i=1,,n.

Then (x, y, u)(r), r > 0, the primal-dual central path, solves

(PD)rAx=b,x>0,ATu+y=c,y>0,xy=re,

where e is the vector

e:=(1,1,,1)TRn,

and xy is the componentwise product

xy:=(x1y1,,xnyn)T.

By Eq. 2 the primal-dual path exists in the situation of Theorem 1. In general, the primal-dual path is well-defined for r > 0, provided (P) and (D) are strictly feasible; primal-dual interior point methods try to follow this path for r ↓ 0. But there are some difficulties with this concept:

  • – (PD)r is a nonlinear equation for each r, so one can at best approximate points on it.

  • – The path does not exist, if (P) and (D) are not strictly feasible, even when they are solvable.

These difficulties are avoided if one uses infeasible (weighted) interior-point paths instead, defined as the solution z: = (x, y, u) = z(r, η) = (x, y, u)(r, η) of the nonlinear system

(PD)r,ηAx=b+rb¯,x>0ATu+y=c+rc¯,y>0,xy=rη,

where r > 0, 0 < ηRn is a weight vector, and b¯, c¯ are suitable perturbation vectors. The infeasible interior-point central path belongs to the weight vector e.

As initialization one may choose r0: = 1, any x0 > 0, y0 > 0, u0, and put

b¯:=Ax0b,c¯:=ATu0+y0c,η0:=x0y0.

Then (x0, y0, u0) = (x, y, u)(r0, η0) solves at least (PD)r,η for r = r0 and η = η0. The choice x0 = y0: = e leads to η0 = e, that is to a point on the central path.

The practical significance of these paths lies in their use in infeasible-interior-point methods (see e.g. Mizuno [10] for an early analysis of such methods in the context of linear programming). Typically, these methods start with an initial point

z0=z(r0,η0),r0>0,η0>0,

on the initial trajectory at (r0, η0) defined by the initialization. Then these methods construct further iterates

zk=z(rk,ηk),rk>0,0<ηkRn,k=1,2,,

where the rk > rk+1 > ⋯ > 0 converge to 0 and the weights ηke stay bounded and close to e. The construction of zk+1 from zk use, as a rule, the tangent of z(., ηk) at rk, that is the derivative zk:=z(r,ηk)/r|r=rk (usually called the affine scaling direction). The quality of these methods depends on how quickly the rk converge to 0. With most methods, the superlinear convergence of the rk relies on the existence (and boundedness) of the derivatives zk′ = z′ (rk, ηk) as rk ↓ 0.

Also with the paths z(r, η) the same questions arise as before:

  • – their existence and uniqueness for 0 < rr0 = 1, η > 0,

  • – their analytical properties,

  • – and their limiting properties for r ↓ 0.

We will answer these questions in the context of more general complementarity problems in Sec. 3. Here, we only formulate linear programs as particular complementarity problems, which is well-known.

By the duality theorem of linear programming, feasible solutions x, (y, u) of (P) and (D) satisfy

0cTxbTu=(ATu+y)Tx(Ax)Tu=yTx,

and they are optimal solutions of (P) and (D), if they are feasible and xTy = 0.

By elimination of u, define the linear manifold ℳ ⊂ R2n,

:={(x,y)|u:Ax=b,ATu+y=c},

and the associated linear subspace

Φ:={(x,y)|u:Ax=0,ATu+y=0}.

Then

(x,y)ΦxTy=0,

and by the assumption rank (A) = m,

dimΦ=dim=n.

The linear programs (P) and (D) are then equivalent to the associated complementarity problem of finding a solution (x, y) of

(LCPLP)(x,y)x0,y0,xy=0(xTy=0).

Under the assumptions of Theorem 1, this particular complementarity problem satisfies

(a)dimΦ=η(b)xTy=0forall(x,y)Φ(c)(LCPLP)issolvable. (3)

3. General Linear Complementarity Problems

A general (LCP) (in horizontal form) is to solve

(LCP)Px+Qy=q,x0,y0,xy=(xTy=0),

where qRn and P, Q are real n × n-matrices with rank [P,Q] = n. With respect to (LCP) we define:

  1. z = (x, y) is a (strictly) feasible point of (LCP), iff Px + Qy = q and z ≥ 0 (resp. z > 0).

  2. The set of solutions of (LCP) is denoted by S. A solution (x, y) ∈ S is called strictly complementary, if x + y > 0.

  3. Φ ⊂ R2n is the linear space
    Φ:=N[P,Q]={(x,y)|Px+Qy=0}.
  4. (LCP), resp. Φ, is called monotone, if
    (x,y)ΦxTy0.

The LCPs associated with linear programs are monotone; by Eq. 3 they even satisfy xTy = 0 for all (x, y) ∈ Φ.

For the presentation of the following results, we make the following rather weak standard assumption

(V)1)rank[P,Q]=n,i.e.dimΦ=n,2)(LCP)ismonotone3)S,(LCP)issolvable.

Note that (V) 3) does not require the unique solvability of (LCP). Degenerate problems with more than one solution are not excluded.

Still more general than monotone complementarity problems are sufficient linear complementarity problems. They were introduced by Cottle, Pang, and Venkateswaran in [2] and further characterized by Väliaho [11]. They are the most general class of linear complementarity problems of the form (LCP) with the following properties:

  1. For all q, the solution set S = S(q) of (LCP) is a convex (perhaps empty) set.

  2. Each stationary solution of the (nonconvex) quadratic optimization problem
    inf{xTy|(x,y)isafeasiblepointof(LCP)}associatedwith(LCP)isasolutionof(LCP).

Again, infeasible (weighted) interior-point paths play an important role in methods for solving (LCP). They are defined as the solutions (x, y) = (x, y)(r, η) of

(LCP)r,ηPx+Qy=q+rq¯,(x,y)>0,xy=rη,

where r > 0, 0 < ηRn is a weight, and q¯Rn is a perturbation vector.

For initialization, choose any x0 > 0, y0 > 0, r0 > 0, and put

q¯:=(Px0+Qy0q)/r0,η0:=x0y0/r0.

Then at least (x0, y0) = (x, y)(r0, η0), and

q¯Mr0:={(Px+Qyq)/r0|x>0,y>0}.

Interior-point methods for solving (LCP) try to follow such paths for r ↓ 0 in much the same way as described for linear programs in Sec. 2.

As before, the same questions with respect to these paths (x, y)(r, η) arise,

  • – their existence and uniqueness for 0 < rr0 = 1, η > 0,

  • – their analytical properties as functions of r, η,

  • – and their limiting properties for r ↓ 0.

The following theorem (see Stoer and Wechs [9]) can be viewed as a generalization of Theorem 1, the result of Witzgall et al.:

Theorem 2 Assume (V), r0 > 0, and q¯Mr0. Then the following holds:

  1. The path z(r, η): = (x, y)(r, η) is a well-defined analytic function of (r, η) for all (r,η)(0,r0]×R++n, which converges for r ↓ 0 to a solution of (LCP):
    z*(η)=(x,y)(η):=limr0(x,y)(r,η)S.
  2. If, in addition, (LCP) has a strictly complementary solution, then there is an ε > 0, so that the function z(r, η) can be extended to an analytic function z^(r,η) on the enlarged domain of all (r,η)[ε,r0]×R++n

  3. If (LCP) is solvable but has no strictly complementary solution, then there is an ε > 0, and an analytic function z^(ρ,η) on the domain (ρ,η)[ε,r0]×R++n so that z(ρ2,η)=z^(ρ,η) for all
    (ρ,η)(0,r0]×R++n.
    (“(x, y)(r, η) is an analytic function of ρ=r even in r = 0.”)

3.1 Remarks

  1. By a result of Goldman and Tucker [6], any solvable complementarity problem associated with linear programs has also a strictly complementary solution. So, part b) of the theorem applies in this case, which generalizes

    Theorem 1: the limits
    z*(k)(η):=limr0krkz(r,η),k=0,1,,
    of all derivatives exist, and are in fact the derivatives of an analytic function z^(r,η) at r = 0 extending z(r, η).
  2. It is easy to find convex quadratic programs for which the corresponding complementarity problem is only solvable but has no strictly complementary solution, so case c) of the theorem can occur.

  3. Theorem 2 remains true for sufficient complementarity problems (i.e. if “monotone” is replace by “sufficient” in Assumption (V) (see [8]).

The proof of Theorem 2 is not easy in the case of degeneracies, when (LCP) has more than one solution or no strictly complementary solution. The mathematical reason is that the solutions (x, y, r, η) of (LCP)r,η with x ≥ 0, y ≥ 0, r ≥ 0 and η > 0 then form a semialgebraic variety with a singularity in r = 0, so that the implicit function theorem cannot be applied directly (though indirectly, see [9], [8] for details).

4. Semidefinite Linear Programs and Complementarity Problems

In this section we show that the results of Theorems 1 and 2 even generalize to semidefinite linear programs and to semidefinite linear complementarity problems.

We use the following notation, which is standard:

  • Sn: = {ARn×n | A = AT} is the space of all real symmetric n × n-matrices.

  • Sn is provided with the scalar product 〈A,B〉: = tr (BTA) = tr (BA).

  • A ≥ 0 (resp. A ≻ 0) means that ASn is positive semidefinite (resp. positive definite), and S++n is the set of all positive definite matrices in Sn.

Then the standard dual pair of semidefinite linear programs has the form

(P)infC,XX:Ai,X=bi,i=1,2,,m,X0.
(D)supbTuu,Y:i=1muiAi+YY0.=C,

Here,

C,AiSnfori=1,,m,b(b1,,bm)TRm.

The following results are known from the duality theory of semidefinite linear programs:

If (P) and (D) have strictly feasible solutions then they also have optimal solutions, and X and (u, Y) are optimal for (P) and (D), respectively, iff

Ai,X=bi,i=1,2,,m,i=1muiAi+Y=C,XY=0,X0,Y0.

By elimination of u one can formulate this as a particular semidefinite linear complementarity problem, which in general has the following form: Solve

(SDLCP)P(X)+Q(Y)=q,XY=0,X,Y0.

Here, qRn¯,n¯:=n(n+1)/2=dimSn, and P,Q:SnRn¯ are linear operators of the form

P(X)=(P1,X,,Pn¯,X)T,PiSn,Q(X)=(Q1,X,,Qn¯,X)T,QiSn.

As in the previous section we define:

  1. The nullspace N([P,Q]) of [P,Q] is denoted by
    Φ:={(X,Y)|P(X)+Q(Y)=0,X,YSn}.
  2. (SDLCP), resp. Φ, is called monotone, if
    (X,Y)ΦX,Y0.
  3. The set of feasible points of (SDLCP) is defined by
    :={(X,Y)|P(X)+Q(Y)=q,X,Y0}.
  4. The set of solutions of SDLCP is denoted by
    S:={(X,Y)|XY=0}.
  5. A solution (X, Y) ∈ S is called strictly complementary if X + Y ≻ 0.

In the case of the dual semidefinite linear programs (P), (D), Φ is given by

Φ={(X,Y)|Ai,X=0,fori=1,,m,Y=i=1muiAiforsomeu},

and Φ even satisfies

(X,Y)ΦX,Y=0,

so the corresponding (SDLCP) is monotone. It is not known whether in this case the corresponding (SDLCP) always has a strictly complementary solution if it is solvable.

The following standard hypothesis, which corresponds to the standard assumption of the previous section, is very weak

(V)1){P(X)+Q(Y)|X,YSn}=Rn¯,2)(SDLCP)ismontone3)S,(SDLCP)issolvable.

The definition of infeasible weighted interior-point paths differs slightly from the previous section: For 0 < rr0 and weight matrices M ≻ 0, infeasible weighted interior point paths are now defined as solutions (X, Y) = (X, Y)(r, M) of

P(X)+Q(Y)=q+rq¯,X0,Y0(XY+YX)/2=rM.

Here, the condition XY = 0 of (SDLCP) is weakened to (XY + YX)/2 = rM, which corresponds to the “AHO” approach proposed by Alizadeh, Haeberly and Overton in [1].

The perturbation vector q¯ is chosen such that q¯r0 for some r0 > 0, where

r:={P(X)+Q(Y)qr|X,Y,XY+YX0}.

A possible initialization is to choose r0: = 1, X0 = Y0: = I and M0: = I and q¯:=P(X0)+Q(Y0)q.

Again the same questions arise for these paths (X, Y)(r, M) as in the previous sections, namely their

  • – existence and uniqueness for 0 < rr0, M ≻ 0,

  • – analytical properties as functions of r and M,

  • – and their limiting properties for r ↓ 0.

The following is the main result on (SDLCP). It corresponds to Theorems 1 and 2, and shows the fruitfulness of the early results of C. Witzgall:

Theorem 3 Assume (V), r0 > 0 and q¯r0. Then:

  1. The path function (X,Y)(r, M) is a well-defined analytic function for all (r,M)(0,r0]×S++n, which converges to a solution of (SDLCP) as r ↓ 0,
    (X*,Y*)(M):=limr0(X,Y)(r,M)S.
  2. If (SDLCP) even has a strictly complementary solution, then there is an ε > 0 so that the path function (X, Y)(r, M) can be extended to an analytic function (X^,Y^)(r,M) defined on the enlarged domain (r,M)(ε,r0]×S++n.

    “(X, Y)(r, M) is analytic in (r, M) even at r = 0.”

The proof of this theorem is even more involved than that of Theorem 2, it can be found in Preiss and Stoer [7].

It is an open problem, whether (and how) the path function (X, Y)(r, M) can be extended, if (SDLCP) satisfies only (V), but has no strictly complementary solutions.

Biography

About the author: Josef Stoer is a Professor Emeritus at the University of Würzburg, Institute of Applied Mathematics and Statistics.

5. References

  • 1.Alizadeh F, Haeberly J-P, Overton M. Primal-Dual Interior-Point Methods for Semidefinite Programming: Convergence Rates, Stability and Numerical results. SIAM J Optim. 1998;8:746–768. [Google Scholar]
  • 2.Cottle RW, Pang J-S, Venkateswaran V. Sufficient Matrices and the Linear Complementarity Problem. Linear Algebra Appl. 1989;114/115:231–249. [Google Scholar]
  • 3.Fiacco AV, McCormick GP. Nonlinear Programming: Sequential Unconstrained Minimization Techniques. JohnWiley and Sons, Inc; 1968. [Google Scholar]
  • 4.Frisch KR. Principles of Linear Programming—With Particular References to the Double Gradient Form of the Logarithmic Potential Method. University Institute of Economics; Oslo: Oct 18, 1954. Memorandum of. [Google Scholar]
  • 5.Frisch KR. The Logarithmic Potential Method of Convex Programming. University Institute of Economics; Oslo: May 13, 1955. Memorandum of. [Google Scholar]
  • 6.Goldman AJ, Tucker AW. Theory of Linear Programming, in Annals of Mathematical Studies 38. In: Kuhn HW, Tucker AW, editors. Linear Inequalities and Related Systems. North Holland; Amsterdam: 1956. pp. 53–97. [Google Scholar]
  • 7.Preiß M, Stoer J. Analysis of Infeasible-Interior-Point Paths Arising with Semidefinite Linear Complementarity Problems. Math Programming Ser A. 2004;99:499–520. [Google Scholar]
  • 8.Stoer J, Wechs M. Infeasible-Interior-Point Paths for sufficient Linear Complementarity Problems and Analyticity. Math Programming. 1998;83:407–423. [Google Scholar]
  • 9.Stoer J, Wechs M. On the Analyticity Properties of Infeasible-Interior-Point Paths for Monotone Linear Complementarity Problems. Numer Math. 1999;81:631–645. [Google Scholar]
  • 10.Mizuno S. Polynomiality of Infeasible-Interior-Point Algorithms for Linear Programming. Mathematical Programming. 1994;67:109–119. [Google Scholar]
  • 11.Väliaho H. P*-Matrices are just Sufficient. Linear Algebra Appl. 1996;239:103–108. [Google Scholar]
  • 12.Witzgall C, Boggs PT, Domich PD. On the Convergence Behavior of Trajectories for Linear Programming. In: Lagarias JC, Todd MJ, editors. Contemporary Mathematics 114, Mathematical Developments Arising from Linear Programming Algorithms. American Mathematical Society; Providence, RI: 1991. pp. 161–187. [Google Scholar]

Articles from Journal of Research of the National Institute of Standards and Technology are provided here courtesy of National Institute of Standards and Technology

RESOURCES