On the directional asymptotic approach in optimization theory

Matúš Benko; Patrick Mehlitz

doi:10.1007/s10107-024-02089-w

. 2024 Jul 5;209(1-2):859–937. doi: 10.1007/s10107-024-02089-w

On the directional asymptotic approach in optimization theory

Matúš Benko ^1,², Patrick Mehlitz ^3,^✉

PMCID: PMC11735583 PMID: 39830447

Abstract

As a starting point of our research, we show that, for a fixed order $γ \geq 1$ , each local minimizer of a rather general nonsmooth optimization problem in Euclidean spaces is either M-stationary in the classical sense (corresponding to stationarity of order 1), satisfies stationarity conditions in terms of a coderivative construction of order $γ$ , or is asymptotically stationary with respect to a critical direction as well as order $γ$ in a certain sense. By ruling out the latter case with a constraint qualification not stronger than directional metric subregularity, we end up with new necessary optimality conditions comprising a mixture of limiting variational tools of orders 1 and $γ$ . These abstract findings are carved out for the broad class of geometric constraints and $γ : = 2$ , and visualized by examples from complementarity-constrained and nonlinear semidefinite optimization. As a byproduct of the particular setting $γ : = 1$ , our general approach yields new so-called directional asymptotic regularity conditions which serve as constraint qualifications guaranteeing M-stationarity of local minimizers. We compare these new regularity conditions with standard constraint qualifications from nonsmooth optimization. Further, we extend directional concepts of pseudo- and quasi-normality to arbitrary set-valued mappings. It is shown that these properties provide sufficient conditions for the validity of directional asymptotic regularity. Finally, a novel coderivative-like variational tool is used to construct sufficient conditions for the presence of directional asymptotic regularity. For geometric constraints, it is illustrated that all appearing objects can be calculated in terms of initial problem data.

Keywords: Asymptotic stationarity and regularity, Constraint qualifications, Directional limiting variational calculus, M-stationarity, Pseudo- and super-coderivatives, Pseudo- and quasi-normality

Introduction

In order to identify local minimizers of optimization problems analytically or numerically, it is desirable that such points satisfy applicable necessary optimality conditions. Typically, under validity of a constraint qualification, first-order necessary optimality conditions of abstract Karush–Kuhn–Tucker (KKT)-type hold at local minimizers. Here, first-order refers to the fact that first-order tools of (generalized) differentiation are used to describe the variation of all involved data functions. In the case where the celebrated tools of limiting variational analysis are exploited, one speaks of so-called Mordukhovich (or, briefly, M-) stationarity, see [66]. In the absence of constraint qualifications, i.e., in a degenerate situation, local minimizers still satisfy a Fritz–John (FJ)-type first-order necessary optimality condition which allows for a potentially vanishing multiplier associated with the generalized derivative of the objective function. Since such a condition allows to discard the objective function, it might be too weak in practically relevant scenarios.

In recent years, asymptotic (approximate or sequential are also common) concepts of stationarity and regularity received much attention not only in standard nonlinear optimization, see [3, 5–7], but also in complementarity-, cardinality-, and switching-constrained programming, see [4, 53, 61, 70], conic optimization, see [2], nonsmooth optimization, see [45, 63, 64], or even infinite-dimensional optimization, see [26, 55, 58]. The interest in asymptotic stationarity conditions is based on the observation that they hold at local minimizers in the absence of constraint qualifications while being more restrictive than the corresponding FJ-type conditions, and that different types of solution algorithms like multiplier-penalty- and some SQP-methods naturally compute such points. Asymptotic constraint qualifications provide conditions which guarantee that an asymptotically stationary point is already stationary in classical sense. It has been reported, e.g., in [5, 61, 63, 70] that asymptotic constraint qualifications are comparatively mild. Inherently from their construction, asymptotic constraint qualifications simplify the convergence analysis of some numerical solution algorithms.

The aim of this paper is to apply the directional approach to limiting variational analysis, see e.g. [18], in order to enrich the asymptotic stationarity and regularity conditions from [58, 63] with the aid of directional information. Noting that the directional tools of variational analysis were successfully applied to find refined M-stationarity-type optimality conditions and mild constraint qualifications for diverse problems in optimization theory, see e.g. [14–16, 36, 38, 39, 42] and the references therein, this seems to be a desirable goal.

Section 4 contains the core of our research. As a starting point, we show in Sect. 4.2 (see, particularly, Theorem 4.1) that local minimizers of rather general optimization problems in Euclidean spaces, which we formally introduce in Sect. 4, are either M-stationary, satisfy a stationarity condition combining the limiting subdifferential of the objective function and a coderivative-like tool associated with the constraints of some arbitrary order $γ \geq 1$ , a so-called pseudo-coderivative, see [37], or come along with an asymptotic stationarity condition depending on a critical direction as well as the order $γ$ where the involved sequence of multipliers is diverging. Even for $γ : = 1$ , this enhances the findings from [58, 63]. Furthermore, this result opens a new way on how to come up with applicable necessary optimality conditions for the original problem, namely, by ruling out the irregular situation of asymptotic stationarity which can be done in the presence of so-called metric pseudo-subregularity of order $γ$ , see [37] again. In the case $γ : = 1$ , we end up with M-stationarity, and metric pseudo-subregularity reduces to metric subregularity, i.e., we obtain results related to [36]. For $γ > 1$ , this procedure leads to a mixed-order stationarity condition involving the pseudo-coderivative of order $γ$ , and metric pseudo-subregularity is weaker than metric subregularity. If $γ : = 2$ and so-called geometric constraints, induced by a twice continuously differentiable mapping g as well as a closed set D, are investigated, this pseudo-coderivative can be estimated from above in terms of initial problem data, i.e., in terms of (first- and second-order) derivatives associated with g as well as tangent and normal cones to D, under mild conditions. These estimates of the pseudo-coderivative of order 2 are interesting on their own and presented in Sect. 3, which is the essence to all applications of our general findings. The associated mixed-order necessary optimality conditions and qualification conditions are worked out in Sect. 4.3, and in Sect. 4.4, they are applied to complementarity-constrained and nonlinear semidefinite optimization problems in order to illustrate our findings. Let us note that related necessary optimality conditions for optimization problems which comprise first- and second-order tools at the same time can be found e.g. in [9, 11–13, 35, 37, 51]. These results are based on the concept of 2–regularity and its extensions, see [11, 74] for its origins. Indeed, even Gfrerer’s metric pseudo-subregularity from [37], utilized in this paper, can be seen as an extension of 2–regularity to arbitrary set-valued mappings. For us, however, these mixed-order conditions are only a by-product - we focus on how they can be used to find new constraint qualifications guaranteeing M-stationarity of local minimizers.

Section 5 is dedicated to the investigation of directional asymptotic regularity conditions, which are motivated by the asymptotic stationarity conditions from Theorem 4.1 (for $γ : = 1$ ) and whose validity directly yields M-stationarity of local minimizers. Roughly speaking, these conditions demand certain control of unbounded input sequences (multipliers) associated with the regular coderivative of the underlying set-valued mapping in a neighborhood of the reference point. We enrich and refine the asymptotic regularity conditions from [63] in two ways. First, the directional approach reveals that asymptotic regularity is only necessary in critical directions. Second, we observe an additional restriction the problematic multipliers satisfy: while their norm tends to infinity, their direction is tightly controlled. These insights enable us to relate our new constraint qualifications with already existing ones from the literature. Similarly as standard asymptotic regularity, the directional counterpart is also independent of (directional) metric subregularity. However, several sufficient conditions for metric subregularity, which are independent of asymptotic regularity, imply directional asymptotic regularity. For instance, this is true for the First-Order Sufficient Condition for Metric Subregularity from [39], see Sect. 5.1. Moreover, in Sect. 5.2, we extend the (directional) concepts of pseudo- and quasi-normality from [15, 16] to abstract set-valued mappings and show that these conditions are sufficient for directional metric subregularity as well as directional asymptotic regularity. Notably, even standard (nondirectional) versions of pseudo- and quasi-normality do not imply asymptotic regularity since the latter does not restrict the direction of the problematic multipliers. Finally, a new directional coderivative-like tool, the directional super-coderivative, see Sect. 2.3, is used in Sect. 5.3 to construct sufficient conditions for the validity of directional asymptotic regularity. In the presence of so-called metric pseudo-regularity, see [37] again, this leads to conditions in terms of the aforementioned pseudo-coderivatives. Noting that these generalized derivatives can be computed in terms of initial problem data for geometric constraint systems, we can specify our findings in this situation. As it turns out, the approach is closely related to our findings from Sect. 4.3. Furthermore, we show that the explicit sufficient conditions for directional asymptotic regularity provide constraint qualifications for M-stationarity which are not stronger than the First- and Second-Order Sufficient Condition for Metric Subregularity from [39].

Notation and preliminaries

We rely on standard notation taken from [10, 25, 66, 71].

Basic notation

Let $R$ , $R_{+}$ , and $R_{-}$ denote the real, the nonnegative real, and the nonpositive real numbers, respectively. The sign function $sgn : R \to {- 1, 0, 1}$ is defined by $sgn (t) : = - 1$ for all $t < 0$ , $sgn (t) : = 1$ for all $t > 0$ , and $sgn (0) : = 0$ . Throughout the paper, $X$ and $Y$ denote Euclidean spaces, i.e., finite-dimensional Hilbert spaces. For simplicity, the associated inner product will be represented by $⟨ \cdot, \cdot ⟩$ since the underlying space will be clear from the context. The norm induced by the inner product is denoted by $∥\cdot∥$ . The unit sphere in $X$ will be represented by $S_{X}$ . Furthermore, for $ε > 0$ and $\bar{x} \in X$ , $B_{ε} (\bar{x}) : = {x \in X | ∥x - \bar{x}∥ \leq ε}$ is the closed $ε$ -ball around $\bar{x}$ . We are also concerned with so-called (closed) directional neighborhoods of given directions. These are sets of type

\begin{matrix} B_{ε, δ} (u) : = {w \in X | ∥∥w∥ u - ∥u∥ w∥ \leq δ ∥u∥ ∥w∥, ∥w∥ \leq ε}, \end{matrix}

where $u \in X$ is a reference direction and $ε, δ > 0$ . Clearly, $B_{ε, δ} (0) = B_{ε} (0)$ . For a nonempty set $Q \subset X$ , the closed convex cone $Q^{\circ} : = {η \in X | \forall x \in Q : ⟨ η, x ⟩ \leq 0}$ is referred to as the polar cone of Q. Furthermore, for some $\bar{x} \in X$ , ${[\bar{x}]}^{⊥} : = {η \in X | ⟨ η, \bar{x} ⟩ = 0}$ and $span (\bar{x})$ are the annihilator of $\bar{x}$ and the smallest subspace of $X$ containing $\bar{x}$ , respectively. By $dist (\bar{x}, Q) : = {inf}_{x} {∥x - \bar{x}∥ | x \in Q}$ , we denote the distance of $\bar{x}$ to Q. For simplicity, we use $\bar{x} + Q : = Q + \bar{x} : = {x + \bar{x} \in X | x \in Q}$ . The closure and the horizon cone of Q are represented by $cl (Q)$ and $Q^{\infty}$ , respectively. For a given linear operator $A : X \to Y$ , $A^{*} : Y \to X$ is used to denote its adjoint while $Im A : = {A x \in Y | x \in X}$ is the image of A.

Let $g : X \to Y$ be a continuously differentiable mapping. We use $\nabla g (\bar{x}) : X \to Y$ to denote the derivative of g at $\bar{x} \in X$ . Note that $\nabla g (\bar{x})$ is a linear operator. Let us emphasize that, in the special case $Y : = R$ , $\nabla g (\bar{x})$ does not coincide with the standard gradient which would correspond to $\nabla g {(\bar{x})}^{*} 1$ . For twice continuously differentiable g and a vector $λ \in Y$ , we set $⟨ λ, g ⟩ (x) : = ⟨ λ, g (x) ⟩$ for each $x \in X$ in order to denote the associated scalarization mapping $⟨ λ, g ⟩ : X \to R$ . By $\nabla ⟨ λ, g ⟩ (\bar{x})$ and $\nabla^{2} ⟨ λ, g ⟩ (\bar{x})$ we represent the first- and second-order derivatives of this map at $\bar{x} \in X$ (w.r.t. the variable which enters g). Furthermore, for $u, u^{'} \in X$ , we make use of

\begin{matrix} \nabla^{2} g (\bar{x}) [u, u^{'}] : = \sum_{i = 1}^{m} ⟨ u, \nabla^{2} ⟨ e_{i}^{c}, g ⟩ (\bar{x}) (u^{'}) ⟩ e_{i}^{c} \end{matrix}

for brevity where $m \in N$ is the dimension of $Y$ and $e_{1}^{c}, \dots, e_{m}^{c} \in Y$ denote the m canonical unit vectors of $Y$ . In the case $Y : = R$ , the second-order derivative $\nabla^{2} g (\bar{x}) : X \times X \to R$ is a bilinear mapping, and for each $u \in X$ , we identify $\nabla^{2} g (\bar{x}) u$ with an element of $X$ .

Fundamentals of variational analysis

Let us fix a closed set $Q \subset X$ and some point $x \in Q$ . We use

\begin{matrix} T_{Q} (x) : = \{u \in X |\begin{matrix} \exists {u_{k}}_{k \in N} \subset X, \exists {t_{k}}_{k \in N} \subset R_{+} : \\ u_{k} \to u, t_{k} ↓ 0, x + t_{k} u_{k} \in Q \forall k \in N \end{matrix})\} \end{matrix}

to denote the (Bouligand) tangent cone to Q at x. Furthermore, we make use of

the regular (or Fréchet) and limiting (or Mordukhovich) normal cone to Q at x. Observe that both of these normal cones coincide with the standard normal cone of convex analysis as soon as Q is convex. For $\tilde{x} \notin Q$ , we set $T_{Q} (\tilde{x}) : = \emptyset$ and ${\hat{N}}_{Q} (\tilde{x}) : = N_{Q} (\tilde{x}) : = \emptyset$ . Finally, for some $u \in X$ , we use

\begin{matrix} N_{Q} (x ; u) : = \{η \in X |\begin{matrix} \exists {u_{k}}_{k \in N} \subset X, \exists {t_{k}}_{k \in N} \subset R_{+}, \exists {η_{k}}_{k \in N} \subset X : \\ u_{k} \to u, t_{k} ↓ 0, η_{k} \to η, η_{k} \in {\hat{N}}_{Q} (x + t_{k} u_{k}) \forall k \in N \end{matrix})\} \end{matrix}

in order to represent the directional limiting normal cone to Q at x in direction u. Note that this set is empty if u does not belong to $T_{Q} (x)$ . If Q is convex, we have $N_{Q} (x ; u) = N_{Q} (x) \cap {[u]}^{⊥}$ .

The limiting normal cone to a set is well known for its robustness, i.e., it is outer semicontinuous as a set-valued mapping. In the course of the paper, we exploit an analogous property of the directional limiting normal cone which has been validated in [42, Proposition 2].

Lemma 2.1

Let $Q \subset X$ be closed and fix $x \in Q$ . Then, for each $u \in X$ , we have

\begin{matrix} N_{Q} (x ; u) = \{η \in X |\begin{matrix} \exists {u_{k}}_{k \in N} \subset X, \exists {t_{k}}_{k \in N} \subset R_{+}, \exists {η_{k}}_{k \in N} \subset X : \\ u_{k} \to u, t_{k} ↓ 0, η_{k} \to η, η_{k} \in N_{Q} (x + t_{k} u_{k}) \forall k \in N \end{matrix})\} . \end{matrix}

In this paper, the concept of polyhedrality will be of essential importance. Let us recall that a set $Q \subset R^{m}$ will be called polyhedral if it is the union of finitely many convex polyhedral sets. Similarly, it is referred to as locally polyhedral around $x \in Q$ whenever $Q \cap {z \in R^{m} | \forall i \in {1, \dots, m} : | z_{i} - x_{i} | \leq ε}$ is polyhedral for some $ε > 0$ . The following lemma provides some basic properties of polyhedral sets. Statement (a) is proven in [48, Proposition 8.24]. The equality in statement (b) follows from [19, Proposition 2.11] and the rest is straightforward, see [38, Lemma 2.1] as well.

Lemma 2.2

Let $Q \subset R^{m}$ be a closed set which is locally polyhedral around some fixed point $x \in Q$ . Then the following statements hold.

There exists a neighborhood $U \subset R^{m}$ of x such that $(x + T_{Q} (x)) \cap U = Q \cap U$ .
For arbitrary $u \in R^{m}$ , we have
$\begin{matrix} N_{Q} (x ; u) = N_{T_{Q} (x)} (u) \subset N_{Q} (x) \cap {[u]}^{⊥} . \end{matrix}$ 2.1
If Q is, additionally, convex, and $u \in T_{Q} (x)$ , then the final inclusion holds as an equality.

It is well known that the regular and limiting normal cone enjoy an exact product rule which is not true for the tangent cone in general. However, the following lemma shows that such a product rule also holds for tangents as soon as polyhedral sets are under consideration. Its proof is straightforward and, hence, omitted.

Lemma 2.3

For closed sets $P \subset X$ and $Q \subset Y$ as well as $x \in P$ and $y \in Q$ , we have $T_{P \times Q} (x, y) \subset T_{P} (x) \times T_{Q} (y)$ .
For closed sets $P \subset R^{n}$ and $Q \subset R^{m}$ as well as $x \in P$ and $y \in Q$ , such that P and Q are locally polyhedral around x and y, respectively, we have $T_{P \times Q} (x, y) = T_{P} (x) \times T_{Q} (y)$ .

Let us mention that a slightly more general version of the above lemma can be found in [41, Proposition 1].

For a set-valued mapping $Φ : X ⇉ Y$ , we use $dom Φ : = {x \in X | Φ (x) \neq \emptyset}$ , $gph Φ : = {(x, y) \in X \times Y | y \in Φ (x)}$ , $ker Φ : = {x \in X | 0 \in Φ (x)}$ , and $Im Φ : = ⋃_{x \in X} Φ (x)$ in order to represent the domain, graph, kernel, and image of $Φ$ , respectively. Furthermore, the so-called inverse mapping $Φ^{- 1} : Y ⇉ X$ is defined via $gph Φ^{- 1} : = {(y, x) \in Y \times X | (x, y) \in gph Φ}$ .

There exist numerous concepts of local regularity or Lipschitzness associated with set-valued mappings. In this paper, we are mostly concerned with so-called directional metric pseudo-(sub)regularity which originates from [37, Definition 1].

Definition 2.1

Fix a set-valued mapping $Φ : X ⇉ Y$ which has a closed graph locally around $(\bar{x}, \bar{y}) \in gph Φ$ , a pair of directions $(u, v) \in X \times Y$ , and a constant $γ \geq 1$ .

We say that $Φ$ is metrically pseudo-regular of order $γ$ in direction (u, v) at $(\bar{x}, \bar{y})$ if there are constants $ε > 0$ , $δ > 0$ , and $κ > 0$ such that the estimate
$\begin{matrix} {∥x - \bar{x}∥}^{γ - 1} dist (x, Φ^{- 1} (y)) \leq κ dist (y, Φ (x)) \end{matrix}$ 2.2
holds for all $(x, y) \in (\bar{x}, \bar{y}) + B_{ε, δ} (u, v)$ with $dist (y, Φ (x)) \leq δ {∥x - \bar{x}∥}^{γ}$ . In the case where this is fulfilled for $(u, v) : = (0, 0)$ , we say that $Φ$ is metrically pseudo-regular of order $γ$ at $(\bar{x}, \bar{y})$ .
We say that $Φ$ is metrically pseudo-subregular of order $γ$ in direction u at $(\bar{x}, \bar{y})$ if there are constants $ε > 0$ , $δ > 0$ , and $κ > 0$ such that (2.2) holds for $y : = \bar{y}$ and all $x \in \bar{x} + B_{ε, δ} (u)$ . In the case where this is fulfilled for $u : = 0$ , we say that $Φ$ is metrically pseudo-subregular of order $γ$ at $(\bar{x}, \bar{y})$ .

Metric pseudo-regularity of order $γ \geq 1$ in direction (u, 0) at $(\bar{x}, \bar{y})$ is a sufficient condition for metric pseudo-subregularity of order $γ$ in direction u at the same point, see [37, Lemma 3]. Observe that metric pseudo-subregularity in a specified direction of some order $γ \geq 1$ implies metric pseudo-subregularity of arbitrary order larger than $γ$ in the same direction. For $γ : = 1$ , the above definition of (directional) metric pseudo-subregularity recovers the one of (directional) metric subregularity, see [36, Definition 1.2]. On the contrary, for $γ : = 1$ , the above definition of directional metric pseudo-regularity does not recover the one of directional metric regularity which demands that (2.2) holds for all $(x, y) \in (\bar{x}, \bar{y}) + B_{ε, δ} (u, v)$ such that $∥(u, v)∥ dist ((x, y), gph Φ) \leq δ ∥(u, v)∥ ∥(x, y) - (\bar{x}, \bar{y})∥$ , see [36, Definition 1.1]. Particularly, for $(u, v) : = (0, 0)$ , the notion of directional metric regularity reduces to the classical one of metric regularity, while directional metric pseudo-regularity does not. This was shown in [37, Example 1.1], which is a very natural example, and we will use it to illustrate some novel concepts.

Example 2.1

For every $γ \geq 1$ , the mapping $Φ : R ⇉ R$ , given by $Φ (x) : = {| x |^{γ}}$ , $x \in R$ , is metrically pseudo-regular of order $γ$ at (0, 0). The case $γ : = 1$ provides an example of a mapping which is metrically pseudo-regular of order 1 at (0, 0) but not metrically regular there. The violation of metric regularity is clear as any points $y < 0$ approaching 0 come along with $Φ^{- 1} (y) = \emptyset$ , blowing up the left-hand side of (2.2). These problematic elements y are, however, ruled out by the condition $dist (y, Φ (x)) \leq δ {∥x - \bar{x}∥}^{γ}$ in the definition of metric pseudo-regularity, which reads $| y - | x | | \leq δ | x |$ in the present situation.

Another important case, which we will explore in detail, corresponds to $γ : = 2$ . In this case, the notions from Definition 2.1 provide an extension of so-called 2-regularity from [11, 74] to set-valued mappings. In Sect. 3.2, we compare our approach with an extension of 2-regularity to constraint mappings from [8, 9].

Recall that a single-valued function $g : X \to Y$ is called calm in direction $u \in X$ at $x \in X$ whenever there are constants $ε > 0$ , $δ > 0$ , and $L > 0$ such that

\begin{matrix} \forall x^{'} \in x + B_{ε, δ} (u) : ‖ g (x^{'}) - g (x) ‖ \leq L ‖ x^{'} - x ‖ . \end{matrix}

If this holds for $u : = 0$ , we simply say that g is calm at x. Clearly, the latter property is weaker than Lipschitzness of g at x.

Generalized differentiation

In this section, we recall some notions from generalized differentiation and introduce some novel derivatives for set-valued mappings.

Subdifferentials

Let us start with a lower semicontinuous function $φ : X \to R \cup {\infty}$ and some point $\bar{x} \in dom φ : = {x \in X | φ (x) < \infty}$ . The lower semicontinuous function $d φ (\bar{x}) : X \to R \cup {- \infty, \infty}$ given by

\begin{matrix} \forall u \in X : d φ (\bar{x}) (u) : = \underset{t ↓ 0, u^{'} \to u}{lim inf} \frac{φ (\bar{x} + t u^{'}) - φ (\bar{x})}{t} \end{matrix}

is referred to as the subderivative of $φ$ at $\bar{x}$ . The regular (or Fréchet) and limiting (or Mordukhovich) subdifferential of $φ$ at $\bar{x}$ are given by

\begin{matrix} \hat{\partial} φ (\bar{x}) & : = {η \in X | (η, - 1) \in {\hat{N}}_{epi φ} (\bar{x}, φ (\bar{x}))}, \\ \partial φ (\bar{x}) & : = {η \in X | (η, - 1) \in N_{epi φ} (\bar{x}, φ (\bar{x}))}, \end{matrix}

respectively, where $epi φ : = {(x, α) \in X \times R | φ (x) \leq α}$ is the epigraph of $φ$ . In the case where $φ$ is continuously differentiable at $\bar{x}$ , both sets reduce to the singleton containing only the gradient $\nabla φ {(\bar{x})}^{*} 1$ . We note that for any sequences ${x_{k}}_{k \in N} \subset dom φ$ and ${x_{k}^{*}}_{k \in N} \subset X$ such that $x_{k} \to \bar{x}$ , $φ (x_{k}) \to φ (\bar{x})$ , $x_{k}^{*} \to x^{*}$ for some $x^{*} \in X$ , and $x_{k}^{*} \in \partial φ (x_{k})$ for each $k \in N$ , we also have $x^{*} \in \partial φ (\bar{x})$ , see [71, Proposition 8.7]. This property is referred to as robustness of the limiting subdifferential.

In the case where $φ$ is locally Lipschitzian around $\bar{x}$ , and for some direction $u \in X$ ,

\begin{matrix} \partial φ (\bar{x} ; u) : = \{η \in X |\begin{matrix} \exists {u_{k}}_{k \in N} \subset X, \exists {t_{k}}_{k \in N} \subset R_{+}, \exists {η_{k}}_{k \in N} \subset X : \\ u_{k} \to u, t_{k} ↓ 0, η_{k} \to η, η_{k} \in \hat{\partial} φ (\bar{x} + t_{k} u_{k}) \forall k \in N \end{matrix})\} \end{matrix}

is referred to as the limiting subdifferential of $φ$ at $\bar{x}$ in direction u. We note that $\partial φ (\bar{x} ; 0) = \partial φ (\bar{x})$ and $\partial φ (\bar{x} ; u) \subset \partial φ (\bar{x})$ for all $u \in X$ . Furthermore, let us mention that, in the definition of the directional limiting subdifferential, we can equivalently replace the requirement $η_{k} \in \hat{\partial} φ (\bar{x} + t_{k} u_{k})$ by $η_{k} \in \partial φ (\bar{x} + t_{k} u_{k})$ for each $k \in N$ . This can be easily checked by means of a classical diagonal sequence argument. Hence, the directional limiting subdifferential also enjoys a certain kind of robustness.

Graphical derivatives

Below, we introduce three different graphical derivatives of a set-valued mapping. While the standard graphical derivative is well known from the literature, the concepts of graphical pseudo-derivative and graphical subderivative are, to the best of our knowledge, new.

Definition 2.2

Let $Φ : X ⇉ Y$ be a set-valued mapping possessing a closed graph locally around $(\bar{x}, \bar{y}) \in gph Φ$ .

The graphical derivative of $Φ$ at $(\bar{x}, \bar{y})$ is the mapping $D Φ (\bar{x}, \bar{y}) : X ⇉ Y$ given by
$\begin{matrix} gph D Φ (\bar{x}, \bar{y}) = T_{gph Φ} (\bar{x}, \bar{y}) . \end{matrix}$
In the case where $Φ$ is single-valued at $\bar{x}$ , we use $D Φ (\bar{x}) : X ⇉ Y$ for brevity.
Given $γ \geq 1$ , the graphical pseudo-derivative of order $γ$ of $Φ$ at $(\bar{x}, \bar{y})$ is the mapping $D_{γ} Φ (\bar{x}, \bar{y}) : X ⇉ Y$ which assigns to $u \in X$ the set of all $v \in Y$ such that there are sequences ${u_{k}}_{k \in N} \subset X$ , ${v_{k}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N} \subset R_{+}$ which satisfy $u_{k} \to u$ , $v_{k} \to v$ , $t_{k} ↓ 0$ , and $(\bar{x} + t_{k} u_{k}, \bar{y} + {(t_{k} ∥u_{k}∥)}^{γ} v_{k}) \in gph Φ$ for all $k \in N$ .
The graphical subderivative of $Φ$ at $(\bar{x}, \bar{y})$ is the mapping $D_{s} u b Φ (\bar{x}, \bar{y}) : S_{X} ⇉ S_{Y}$ which assigns to $u \in S_{X}$ the set of all $v \in S_{Y}$ such that there are sequences ${u_{k}}_{k \in N} \subset X$ , ${v_{k}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N}, {τ_{k}}_{k \in N} \subset R_{+}$ which satisfy $u_{k} \to u$ , $v_{k} \to v$ , $t_{k} ↓ 0$ , $τ_{k} ↓ 0$ , $τ_{k} / t_{k} \to \infty$ , and $(\bar{x} + t_{k} u_{k}, \bar{y} + τ_{k} v_{k}) \in gph Φ$ for all $k \in N$ .

Let us note that for every set-valued mapping $Φ : X ⇉ Y$ , whose graph is closed locally around $(\bar{x}, \bar{y}) \in gph Φ$ , we have $D_{1} Φ (\bar{x}, \bar{y}) (u) = D Φ (\bar{x}, \bar{y}) (u)$ for all $u \in S_{X}$ . Furthermore, for each $γ > 1$ , one obtains the trivial estimates

\begin{matrix} dom D_{γ} Φ (\bar{x}, \bar{y}) \subset ker D Φ (\bar{x}, \bar{y}) \end{matrix}

and

\begin{matrix} \forall u \in S_{X} : D_{s} u b Φ (\bar{x}, \bar{y}) (u) \subset D Φ (\bar{x}, \bar{y}) (0) \end{matrix}

2.3

right from the definition of these objects.

In the course of the paper, we are mainly interested in the graphical (sub)derivative associated with so-called normal cone mappings. In the next lemma, we present some corresponding upper estimates.

Lemma 2.4

Let $D \subset Y$ be a nonempty, closed, convex set such that the (single-valued) projection operator onto D, denoted by $Π_{D} : Y \to Y$ , is directionally differentiable. Fix $\bar{y} \in D$ and ${\bar{y}}^{*} \in N_{D} (\bar{y})$ . Then, for arbitrary $u \in Y$ , we find

\begin{matrix} D N_{D} (\bar{y}, {\bar{y}}^{*}) (u) & \subset {v \in Y | Π_{D}^{'} (\bar{y} + {\bar{y}}^{*} ; u + v) = u}, \end{matrix}

and for $u \in S_{Y}$ , we find

\begin{matrix} D_{s} u b N_{D} (\bar{y}, {\bar{y}}^{*}) (u) & \subset {v \in S_{Y} | Π_{D}^{'} (\bar{y} + {\bar{y}}^{*} ; v) = 0, ⟨ u, v ⟩ \geq 0} . \end{matrix}

Above, $Π_{D}^{'} (y, v)$ denotes the directional derivative of $Π_{D}$ at $y \in Y$ in direction $v \in Y$ .

Proof

By convexity of D, we have the well-known equivalence

\begin{matrix} \forall y, y^{*} \in Y : y^{*} \in N_{D} (y) ⟺ Π_{D} (y + y^{*}) = y . \end{matrix}

In the remainder of the proof, we set $\tilde{y} : = \bar{y} + {\bar{y}}^{*}$ for brevity. Next, let us fix $u, v \in Y$ as well as ${u_{k}}_{k \in N}, {v_{k}}_{k \in N} \subset Y$ and ${τ_{k}}_{k \in N}, {ε_{k}}_{k \in N} \subset R_{+}$ such that $u_{k} \to u$ , $v_{k} \to v$ , $τ_{k} ↓ 0$ , and ${\bar{y}}^{*} + τ_{k} v_{k} \in N_{D} (\bar{y} + τ_{k} ε_{k} u_{k})$ , i.e., $Π_{D} (\tilde{y} + τ_{k} ε_{k} u_{k} + τ_{k} v_{k}) = \bar{y} + τ_{k} ε_{k} u_{k}$ , for each $k \in N$ . Using $Π_{D} (\tilde{y}) = \bar{y}$ , we find

\begin{matrix} \forall k \in N : ε_{k} u_{k} = \frac{Π_{D} (\tilde{y} + τ_{k} ε_{k} u_{k} + τ_{k} v_{k}) - Π_{D} (\tilde{y})}{τ_{k}} . \end{matrix}

2.4

In the case where $v \in D N_{D} (\bar{y}, {\bar{y}}^{*}) (u)$ holds, we can choose $ε_{k} = 1$ for each $k \in N$ , and taking the limit $k \to \infty$ in (2.4) while exploiting directional differentiability and Lipschitzness of $Π_{D}$ yields $Π_{D}^{'} (\tilde{y} ; u + v) = u$ . This shows the first estimate.

Now, assume that $v \in D_{s} u b N_{D} (\bar{y}, {\bar{y}}^{*}) (u)$ is valid. Then $ε_{k} ↓ 0$ and $u, v \in S_{Y}$ can be postulated, and taking the limit $k \to \infty$ in (2.4) shows $Π_{D}^{'} (\tilde{y} ; v) = 0$ . By nature of the projection, we have

\begin{matrix} ⟨ \tilde{y} + τ_{k} ε_{k} u_{k} + τ_{k} v_{k} - Π_{D} (\tilde{y} + τ_{k} ε_{k} u_{k} + τ_{k} v_{k}), Π_{D} (\tilde{y}) - Π_{D} (\tilde{y} + τ_{k} ε_{k} u_{k} + τ_{k} v_{k}) ⟩ \leq 0 \end{matrix}

for each $k \in N$ . Exploiting (2.4), this is equivalent to

\begin{matrix} ⟨ \tilde{y} + τ_{k} v_{k} - Π_{D} (\tilde{y}), Π_{D} (\tilde{y}) - Π_{D} (\tilde{y} + τ_{k} ε_{k} u_{k} + τ_{k} v_{k}) ⟩ \leq 0 \end{matrix}

for each $k \in N$ . Some rearrangements and the characterization of the projection lead to

\begin{matrix} τ_{k} ⟨ v_{k}, Π_{D} (\tilde{y}) - Π_{D} (\tilde{y} + τ_{k} ε_{k} u_{k} + τ_{k} v_{k}) ⟩ \\ \leq ⟨ \tilde{y} - Π_{D} (\tilde{y}), Π_{D} (\tilde{y} + τ_{k} ε_{k} u_{k} + τ_{k} v_{k}) - Π_{D} (\tilde{y}) ⟩ \leq 0 . \end{matrix}

Division by $τ_{k}^{2} ε_{k}$ and (2.4), thus, give us $⟨ v_{k}, u_{k} ⟩ \geq 0$ for each $k \in N$ , and taking the limit, we obtain $⟨ u, v ⟩ \geq 0$ which shows the second estimate. $□$

Let us note that it has been shown in [75, Theorem 3.1, Corollary 3.1] that the estimate on the graphical derivative of the normal cone mapping $N_{D}$ holds as an equality in the situation where D is the convex cone of positive semidefinite symmetric matrices, and that the presented proof extends to arbitrary convex cones as long as the associated projection operator is directionally differentiable. This result can also be found in slightly more general form in [67, Theorem 3.3]. In order to make the estimates from Lemma 2.4 explicit, one needs to be in position to characterize the directional derivative of the projection onto the convex set D. This is easily possible if D is polyhedral, see [44] and Remark 3.1, but even in nonpolyhedral situations, e.g., where D is the second-order cone or the cone of positive semidefinite symmetric matrices, closed formulas for this directional derivative are available in the literature, see [69, Lemma 2] and [73, Theorem 4.7], respectively.

The following technical result will become handy later on.

Lemma 2.5

Let $D \subset Y$ be nonempty and closed, and fix $\bar{y} \in D$ . Then the following assertions hold.

For each $u \in Y$ , we have $D N_{D} (\bar{y}, 0) (u) \subset N_{D} (\bar{y} ; u)$ .
For each $u \in S_{Y}$ , we have $D_{s} u b N_{D} (\bar{y}, 0) (u) \subset N_{D} (\bar{y} ; u)$ .

Proof

We only prove validity of the first assertion. The second one can be shown in analogous fashion.

Fix $u \in Y$ and $v \in D N_{D} (\bar{y}, 0) (u)$ . Then we find sequences ${u_{k}}_{k \in N}, {v_{k}}_{k \in N} \subset Y$ and ${t_{k}}_{k \in N} \subset R_{+}$ with $u_{k} \to u$ , $v_{k} \to v$ , $t_{k} ↓ 0$ , and $t_{k} v_{k} \in N_{D} (\bar{y} + t_{k} u_{k})$ for each $k \in N$ . Since, for each $k \in N$ , $N_{D} (\bar{y} + t_{k} u_{k})$ is a cone, we find $v_{k} \in N_{D} (\bar{y} + t_{k} u_{k})$ , and $v \in N_{D} (\bar{y} ; u)$ follows by robustness of the directional limiting normal cone, see Lemma 2.1. $□$

In the next two results, we investigate the special situation $Y : = R^{m}$ in detail. First, in the case where we consider the normal cone mapping associated with polyhedral sets, there is no difference between graphical derivative and graphical subderivative as the subsequent lemma shows.

Lemma 2.6

Let $D \subset R^{m}$ be a polyhedral set. Then $gph N_{D}$ is polyhedral as well, and for arbitrary $(\bar{y}, {\bar{y}}^{*}) \in gph N_{D}$ and $u, v \in R^{m} \ {0}$ , we have

\begin{matrix} v \in D N_{D} (\bar{y}, {\bar{y}}^{*}) (u) ⟺ v / ∥v∥ \in D_{s} u b N_{D} (\bar{y}, {\bar{y}}^{*}) (u / ∥u∥) . \end{matrix}

Proof

It follows from [1, Theorem 2] that there exist finitely many convex polyhedral sets $D_{1}, \dots, D_{ℓ} \subset R^{m}$ and closed, convex, polyhedral cones $K_{1}, \dots, K_{ℓ} \subset R^{m}$ such that $gph N_{D} = ⋃_{i = 1}^{ℓ} D_{i} \times K_{i}$ . Particularly, $gph N_{D}$ is polyhedral.

Next, consider some nonzero $u, v \in R^{m}$ with $v / ∥v∥ \in D_{s} u b N_{D} (\bar{y}, {\bar{y}}^{*}) (u / ∥u∥)$ . Then we find ${{\tilde{u}}_{k}}_{k \in N}, {{\tilde{v}}_{k}}_{k \in N} \subset R^{m}$ and ${{\tilde{t}}_{k}}_{k \in N}, {τ_{k}}_{k \in N} \subset R_{+}$ such that $u_{k} : = {\tilde{u}}_{k} ∥u∥ \to u$ , $v_{k} : = {\tilde{v}}_{k} ∥v∥ \to v$ , $t_{k} : = {\tilde{t}}_{k} / ∥u∥ ↓ 0$ , $τ_{k} ↓ 0$ , $τ_{k} / t_{k} \to \infty$ , as well as $(\bar{y} + t_{k} u_{k}, {\bar{y}}^{*} + (τ_{k} / ∥v∥) v_{k}) \in gph N_{D}$ for all $k \in N$ . Thus, we can pick $j \in {1, \dots, ℓ}$ and a subsequence (without relabeling) such that $(\bar{y} + t_{k} u_{k}, {\bar{y}}^{*} + (τ_{k} / ∥v∥) v_{k}) \in D_{j} \times K_{j}$ and $τ_{k} / ∥v∥ > t_{k}$ for all $k \in N$ . By convexity of $K_{j}$ , we also have $(\bar{y} + t_{k} u_{k}, {\bar{y}}^{*} + t_{k} v_{k}) \in D_{j} \times K_{j}$ which shows $v \in D N_{D} (\bar{y}, {\bar{y}}^{*}) (u)$ . The converse implication can be proven in analogous fashion by multiplying the null sequence in the domain space with another null sequence. $□$

The next lemma shows how the graphical derivative of normal cone mappings associated with Cartesian products of polyhedral sets can be computed.

Lemma 2.7

Fix some $ℓ \in N$ . For each $i \in {1, \dots, ℓ}$ , let $D_{i} \subset R^{m_{i}}$ for some $m_{i} \in N$ be polyhedral. Set $D : = \prod_{i = 1}^{ℓ} D_{i}$ , $m : = \sum_{i = 1}^{ℓ} m_{i}$ , and $L : = {1, \dots, ℓ}$ . Then we have

\begin{matrix} gph N_{D} = {((y_{1}, \dots, y_{ℓ}), (y_{1}^{*}, \dots, y_{ℓ}^{*})) \in R^{m} \times R^{m} | \forall i \in L : (y_{i}, y_{i}^{*}) \in gph N_{D_{i}}}, \end{matrix}

and for arbitrary $\bar{y} : = ({\bar{y}}_{1}, \dots, {\bar{y}}_{ℓ}), {\bar{y}}^{*} : = ({\bar{y}}_{1}^{*}, \dots, {\bar{y}}_{ℓ}^{*}) \in R^{m}$ satisfying $(\bar{y}, {\bar{y}}^{*}) \in gph N_{D}$ as well as $u : = (u_{1}, \dots, u_{ℓ}) \in R^{m}$ , we find

\begin{matrix} D N_{D} (\bar{y}, {\bar{y}}^{*}) (u) = {v = (v_{1}, \dots, v_{ℓ}) \in R^{m} | \forall i \in L : v_{i} \in D N_{D_{i}} ({\bar{y}}_{i}, {\bar{y}}_{i}^{*}) (u_{i})} . \end{matrix}

Proof

The representation of $gph N_{D}$ is a simple consequence of the product rule for the computation of limiting normals, see e.g. [66, Proposition 1.4], and does not rely on the polyhedrality of the underlying sets. Thus, $gph N_{D}$ is, up to a permutation of components, the same as $\prod_{i = 1}^{ℓ} gph N_{D_{i}}$ . Since, for each $i \in L$ , $gph N_{D_{i}}$ is polyhedral by Lemma 2.6, the same has to hold for $gph N_{D}$ . The final formula of the lemma is a simple consequence of Lemma 2.3 and [71, Exercise 6.7]. $□$

Coderivatives, pseudo-coderivatives, and super-coderivatives

In the subsequently stated definition, we first recall the notion of regular and limiting coderivative of a set-valued mapping before introducing its so-called directional pseudo-coderivative. The latter will be of essential importance in the course of the paper. It corresponds to a minor modification of the notion of directional pseudo-coderivative introduced by Gfrerer in [37, Definition 2], which we recall as well.

Definition 2.3

Let $Φ : X ⇉ Y$ be a set-valued mapping possessing a closed graph locally around $(\bar{x}, \bar{y}) \in gph Φ$ . Furthermore, let $(u, v) \in X \times Y$ be a pair of directions.

The regular and limiting coderivative of $Φ$ at $(\bar{x}, \bar{y})$ are the set-valued mappings ${\hat{D}}^{*} Φ (\bar{x}, \bar{y}) : Y ⇉ X$ and $D^{*} Φ (\bar{x}, \bar{y}) : Y ⇉ X$ given, respectively, by
$\begin{matrix} \forall y^{*} \in Y : {\hat{D}}^{*} Φ (\bar{x}, \bar{y}) (y^{*}) & : = {x^{*} \in X | (x^{*}, - y^{*}) \in {\hat{N}}_{gph Φ} (\bar{x}, \bar{y})}, \\ D^{*} Φ (\bar{x}, \bar{y}) (y^{*}) & : = {x^{*} \in X | (x^{*}, - y^{*}) \in N_{gph Φ} (\bar{x}, \bar{y})} . \end{matrix}$
The set-valued mapping $D^{*} Φ ((\bar{x}, \bar{y}) ; (u, v)) : Y ⇉ X$ given by
$\begin{matrix} \forall y^{*} \in Y : D^{*} Φ ((\bar{x}, \bar{y}) ; (u, v)) (y^{*}) \\ : = {x^{*} \in X | (x^{*}, - y^{*}) \in N_{gph Φ} ((\bar{x}, \bar{y}) ; (u, v))} \end{matrix}$
is the limiting coderivative of $Φ$ at $(\bar{x}, \bar{y})$ in direction (u, v). If $Φ$ is single-valued at $\bar{x}$ , we use ${\hat{D}}^{*} Φ (\bar{x}), D^{*} Φ (\bar{x}), D^{*} Φ (\bar{x} ; (u, v)) : Y ⇉ X$ for brevity.
Given $γ \geq 1$ and $u \in S_{X}$ , the pseudo-coderivative of order $γ$ of $Φ$ at $(\bar{x}, \bar{y})$ in direction (u, v) is the mapping $D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, v)) : Y ⇉ X$ which assigns to $y^{*} \in Y$ the set of all $x^{*} \in X$ such that there are sequences ${u_{k}}_{k \in N}, {x_{k}^{*}}_{k \in N} \subset X$ , ${v_{k}}_{k \in N}, {y_{k}^{*}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N} \subset R_{+}$ which satisfy $u_{k} \to u$ , $v_{k} \to v$ , $t_{k} ↓ 0$ , $x_{k}^{*} \to x^{*}$ , $y_{k}^{*} \to y^{*}$ , and
$\begin{matrix} \forall k \in N : (x_{k}^{*}, - \frac{y_{k}^{*}}{{(t_{k} ∥u_{k}∥)}^{γ - 1}}) \in {\hat{N}}_{gph Φ} (\bar{x} + t_{k} u_{k}, \bar{y} + {(t_{k} ∥u_{k}∥)}^{γ} v_{k}) . \end{matrix}$ 2.5
In the case $γ : = 1$ , this definition recovers the one of $D^{*} Φ ((\bar{x}, \bar{y}) ; (u, v))$ .
Given $γ \geq 1$ and $u \in S_{X}$ , Gfrerer’s pseudo-coderivative of order $γ$ of $Φ$ at $(\bar{x}, \bar{y})$ in direction (u, v) is the mapping ${\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, v)) : Y ⇉ X$ which assigns to $y^{*} \in Y$ the set of all $x^{*} \in X$ such that there are sequences ${u_{k}}_{k \in N}, {x_{k}^{*}}_{k \in N} \subset X$ , ${v_{k}}_{k \in N}, {y_{k}^{*}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N} \subset R_{+}$ which satisfy $u_{k} \to u$ , $v_{k} \to v$ , $t_{k} ↓ 0$ , $x_{k}^{*} \to x^{*}$ , $y_{k}^{*} \to y^{*}$ , and
$\begin{matrix} \forall k \in N : (x_{k}^{*}, - \frac{y_{k}^{*}}{{(t_{k} ∥u_{k}∥)}^{γ - 1}}) \in {\hat{N}}_{gph Φ} (\bar{x} + t_{k} u_{k}, \bar{y} + t_{k} v_{k}) . \end{matrix}$ 2.6
Again, for $γ : = 1$ , we recover the definition of $D^{*} Φ ((\bar{x}, \bar{y}) ; (u, v))$ .

Let $Φ : X ⇉ Y$ be a set-valued mapping whose graph is closed locally around $(\bar{x}, \bar{y}) \in gph Φ$ and fix a pair of directions $(u, v) \in S_{X} \times Y$ , $(x^{*}, y^{*}) \in X \times Y$ , and $γ > 1$ . Then we obtain the trivial relations

\begin{matrix} x^{*} \in D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, v)) (y^{*}) ⟹ \{\begin{matrix} 0 \in D Φ (\bar{x}, \bar{y}) (u), 0 \in D^{*} Φ (\bar{x}, \bar{y}) (y^{*}), \\ 0 \in D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (y^{*}), \\ v \in D_{γ} Φ (\bar{x}, \bar{y}) (u), \\ x^{*} \in {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (y^{*}) . \end{matrix}) \end{matrix}

2.7

Note also that the mappings $D^{*} Φ ((\bar{x}, \bar{y}) ; (u, v))$ and ${\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, v))$ have a nonempty graph if and only if $v \in D Φ (\bar{x}, \bar{y}) (u)$ while the mapping $D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, v))$ has a nonempty graph if and only if $v \in D_{γ} Φ (\bar{x}, \bar{y}) (u)$ .

Since the (directional) limiting coderivative is defined via the (directional) limiting normal cone, it possesses a robust behavior as well. In the subsequent lemma, we show a somewhat robust behavior of the directional pseudo-coderivatives under consideration, which will be important later on. Basically, we prove that one can replace the regular by the limiting normal cone in (2.5) and (2.6) without changing the resulting pseudo-coderivative. The technical proof, which is based on a standard diagonal sequence argument, is presented in Appendix A for the purpose of completeness.

Lemma 2.8

Definition 2.3 (b) and Definition 2.3 (c) can equivalently be formulated in terms of limiting normals.

To illustrate the pseudo-coderivatives from Definition 2.3, we revisit Example 2.1.

Example 2.2

For $γ > 1$ , we consider the mapping $Φ : R ⇉ R$ , given by $Φ (x) : = {| x |^{γ}}$ , $x \in R$ , already discussed in Example 2.1. Set $(\bar{x}, \bar{y}) : = (0, 0)$ as well as $u : = \pm 1$ and choose $v \in R$ arbitrarily. First, $v \in D Φ (\bar{x}, \bar{y}) (u)$ by definition requires sequences ${t_{k}}_{k \in N} \subset R_{+}$ and ${u_{k}}_{k \in N}, {v_{k}}_{k \in N} \subset R$ satisfying $t_{k} ↓ 0$ , $u_{k} \to u$ , $v_{k} \to v$ , and $t_{k} v_{k} = (t_{k} | u_{k} {|)}^{γ}$ for all $k \in N$ , showing $v = 0$ . Thus, we fix $v : = 0$ to find $D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (y^{*}) = {0}$ for all $y^{*} \in R$ as the defining sequences ${x_{k}^{*}}_{k \in N}, {y_{k}^{*}}_{k \in N} \subset R$ satisfy $x_{k}^{*} \to x^{*}$ , $y_{k}^{*} \to y^{*}$ , and $x_{k}^{*} = γ (t_{k} | u_{k} {|)}^{γ - 1} sgn (u_{k}) y_{k}^{*}$ for all $k \in N$ . Furthermore, ${\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (y^{*}) = {γ sgn (u) y^{*}}$ holds for each $y^{*} \in R$ as the defining sequences ${x_{k}^{*}}_{k \in N}, {y_{k}^{*}}_{k \in N} \subset R$ satisfy $x_{k}^{*} \to x^{*}$ , $y_{k}^{*} \to y^{*}$ , and $x_{k}^{*} = γ sgn (u_{k}) y_{k}^{*}$ for all $k \in N$ . Using similar arguments as above, one can check that $v \in D_{γ} Φ (\bar{x}, \bar{y}) (u)$ yields $v = 1$ , and for $v : = 1$ , we get $D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 1)) (y^{*}) = {γ sgn (u) y^{*}}$ for all $y^{*} \in R$ .

Below, we introduce yet another concept of coderivative which will become important in Sect. 5.3.

Definition 2.4

Let $Φ : X ⇉ Y$ be a set-valued mapping with a closed graph and fix $(\bar{x}, \bar{y}) \in gph Φ$ and $(u, v) \in S_{X} \times S_{Y}$ . The super-coderivative of $Φ$ at $(\bar{x}, \bar{y})$ in direction (u, v) is the mapping $D_{s}^{*} u p Φ ((\bar{x}, \bar{y}) ; (u, v)) : Y ⇉ X$ , which assigns to $y^{*} \in Y$ the set of all $x^{*} \in X$ such that there are sequences ${u_{k}}_{k \in N}, {x_{k}^{*}}_{k \in N} \subset X$ , ${v_{k}}_{k \in N}, {y_{k}^{*}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N}, {τ_{k}}_{k \in N} \subset R_{+}$ which satisfy $u_{k} \to u$ , $v_{k} \to v$ , $x_{k}^{*} \to x^{*}$ , $y_{k}^{*} \to y^{*}$ , $t_{k} ↓ 0$ , $τ_{k} ↓ 0$ , and $τ_{k} / t_{k} \to 0$ such that

\begin{matrix} x_{k}^{*} \in {\hat{D}}^{*} Φ (\bar{x} + t_{k} u_{k}, \bar{y} + τ_{k} v_{k}) (((t_{k} ∥u_{k}∥) / (τ_{k} ∥v_{k}∥)) y_{k}^{*}) \end{matrix}

2.8

holds for all $k \in N$ .

We start with some remarks regarding Definition 2.4. First, observe that we only exploit the super-coderivative w.r.t. unit directions $(u, v) \in S_{X} \times S_{Y}$ which also means that ${u_{k}}_{k \in N} \subset X$ and ${v_{k}}_{k \in N} \subset Y$ can be chosen such that $u_{k} \neq 0$ and $v_{k} \neq 0$ hold for all $k \in N$ . Particularly, condition (2.8) is reasonable.

Second, we would like to note that $x^{*} \in D_{s}^{*} u p Φ ((\bar{x}, \bar{y}) ; (u, v)) (y^{*})$ implies the existence of ${u_{k}}_{k \in N} \subset X$ , ${v_{k}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N}, {τ_{k}}_{k \in N} \subset R_{+}$ which satisfy $u_{k} \to u$ , $v_{k} \to v$ , $t_{k} ↓ 0$ , $τ_{k} ↓ 0$ , and $τ_{k} / t_{k} \to 0$ as well as $(\bar{x} + t_{k} u_{k}, \bar{y} + τ_{k} v_{k}) \in gph Φ$ for all $k \in N$ . Thus, in the light of Definition 2.2 (c) of the graphical subderivative, one might be tempted to say that the pair (u, v) belongs to the graph of the graphical super-derivative of $Φ$ at $(\bar{x}, \bar{y})$ . This justifies the terminology in Definition 2.4.

Let us briefly discuss the relation between pseudo-coderivatives and the novel super-coderivative from Definition 2.4. Consider $γ > 1$ and $x^{*} \in D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, v)) (y^{*})$ for $(u, v) \in S_{X} \times S_{Y}$ and $y^{*} \in Y^{*}$ . Setting $τ_{k} : = {(t_{k} ∥u_{k}∥)}^{γ}$ for each $k \in N$ , where ${t_{k}}_{k \in N} \subset R_{+}$ and ${u_{k}}_{k \in N} \subset X$ are the sequences from the definition of the pseudo-coderivative, we get $x^{*} \in D_{s}^{*} u p Φ ((\bar{x}, \bar{y}) ; (u, v)) (y^{*})$ since $t_{k}^{γ - 1} {∥u_{k}∥}^{γ} \to 0$ .

In the subsequent lemma, we comment on the converse inclusion which, to some extent, holds in the presence of a qualification condition in terms of the pseudo-coderivative.

Lemma 2.9

Let $(\bar{x}, \bar{y}) \in gph Φ$ , $(u, v) \in S_{X} \times S_{Y}$ , $y^{*} \in Y$ , and $γ > 1$ be fixed. Furthermore, assume that $ker D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) \subset {0}$ holds. Then there exists $α > 0$ such that

\begin{matrix} D_{s}^{*} u p Φ ((\bar{x}, \bar{y}) ; (u, v)) (y^{*}) & \subset {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (0) \cup D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, α v)) (y^{*} / α) \\ \cup Im D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) \\ \subset Im {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) . \end{matrix}

Proof

Let $x^{*} \in D_{s}^{*} u p Φ ((\bar{x}, \bar{y}) ; (u, v)) (y^{*})$ be arbitrarily chosen. Then we find sequences ${u_{k}}_{k \in N}, {x_{k}^{*}}_{k \in N} \subset X$ , ${v_{k}}_{k \in N}, {y_{k}^{*}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N}, {τ_{k}}_{k \in N} \subset R_{+}$ which satisfy $u_{k} \to u$ , $v_{k} \to v$ , $x_{k}^{*} \to x^{*}$ , $y_{k}^{*} \to y^{*}$ , $t_{k} ↓ 0$ , $τ_{k} ↓ 0$ , and $τ_{k} / t_{k} \to 0$ as well as (2.8) for all $k \in N$ . This also gives us

\begin{matrix} x_{k}^{*} \in {\hat{D}}^{*} Φ (\bar{x} + t_{k} u_{k}, \bar{y} + {(t_{k} ∥u_{k}∥)}^{γ} \frac{τ_{k} v_{k}}{{(t_{k} ∥u_{k}∥)}^{γ}}) ({(t_{k} ∥u_{k}∥)}^{1 - γ}, \frac{{(t_{k} ∥u_{k}∥)}^{γ}}{τ_{k} ∥v_{k}∥}, y_{k}^{*}) \end{matrix}

2.9

for all $k \in N$ . Set ${\tilde{y}}_{k}^{*} : = {(t_{k} ∥u_{k}∥)}^{γ} / (τ_{k} ∥v_{k}∥) y_{k}^{*}$ for each $k \in N$ . In the case where ${{\tilde{y}}_{k}^{*}}_{k \in N}$ is not bounded, we have $(τ_{k} ∥v_{k}∥) / {(t_{k} ∥u_{k}∥)}^{γ} \to 0$ along a subsequence (without relabeling), and taking the limit in

\begin{matrix} x_{k}^{*} / ‖ {\tilde{y}}_{k}^{*} ‖ \in {\hat{D}}^{*} Φ (\bar{x} + t_{k} u_{k}, \bar{y} + {(t_{k} ∥u_{k}∥)}^{γ} \frac{τ_{k} v_{k}}{{(t_{k} ∥u_{k}∥)}^{γ}}) ({(t_{k} ∥u_{k}∥)}^{1 - γ} {\tilde{y}}_{k}^{*} / ‖ {\tilde{y}}_{k}^{*} ‖) \end{matrix}

yields that $ker D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ contains a nonzero element, which is a contradiction. Hence, ${{\tilde{y}}_{k}^{*}}_{k \in N}$ is bounded.

For each $k \in N$ , we set $α_{k} : = τ_{k} ∥v_{k}∥ / {(t_{k} ∥u_{k}∥)}^{γ}$ . First, suppose that ${α_{k}}_{k \in N}$ is not bounded. Then, along a subsequence (without relabeling), we may assume $α_{k} \to \infty$ . By boundedness of ${y_{k}^{*}}_{k \in N}$ , ${\tilde{y}}_{k}^{*} \to 0$ follows. Rewriting (2.9) yields

\begin{matrix} x_{k}^{*} \in {\hat{D}}^{*} Φ (\bar{x} + t_{k} u_{k}, \bar{y} + t_{k} \frac{τ_{k} v_{k}}{t_{k}}) ({(t_{k} ∥u_{k}∥)}^{1 - γ}, {\tilde{y}}_{k}^{*}) \end{matrix}

for each $k \in N$ , and taking the limit $k \to \infty$ while respecting $τ_{k} / t_{k} \to 0$ , thus, gives $x^{*} \in {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (0)$ . In the case where ${α_{k}}_{k \in N}$ converges to some $α > 0$ (along a subsequence without relabeling), we can simply take the limit $k \to \infty$ in (2.9) in order to find $x^{*} \in D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, α v)) (y^{*} / α)$ . Finally, let us consider the case $α_{k} \to 0$ (along a subsequence without relabeling). Then, by boundedness of ${{\tilde{y}}_{k}^{*}}_{k \in N}$ , taking the limit $k \to \infty$ in (2.9) gives $x^{*} \in Im D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ . Thus, we have shown the first inclusion.

The second inclusion follows by the upper estimate (2.7) for the pseudo-coderivative. $□$

Sufficient conditions for pseudo-(sub)regularity

Graphical derivative and (directional) limiting coderivative are powerful tools for studying regularity properties of set-valued mappings, such as (strong) metric regularity and subregularity, as well as their inverse counterparts of Lipschitzness, such as Aubin property and (isolated) calmness. Indeed, given a closed-graph set-valued mapping $Φ : X ⇉ Y$ , metric regularity and strong metric subregularity at some point $(\bar{x}, \bar{y}) \in gph Φ$ are characterized, respectively, by

\begin{matrix} ker D^{*} Φ (\bar{x}, \bar{y}) & = {0}, \end{matrix}

2.10a

\begin{matrix} ker D Φ (\bar{x}, \bar{y}) & = {0}, \end{matrix}

2.10b

see e.g. [60, 66, 71] for the definition of these Lipschitzian properties as well as the above results. Let us mention that (2.10a) is referred to as Mordukhovich criterion in the literature, while (2.10b) is called Levy–Rockafellar criterion.

For fixed $u \in S_{X}$ , we will refer to

\begin{matrix} ker D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) \subset {0}, \end{matrix}

2.11

which implies that $Φ$ is metrically subregular at $(\bar{x}, \bar{y})$ in direction u, see e.g. [36, Theorem 5], as FOSCMS(u). Note that it is formulated as an inclusion as the left-hand side in (2.11) is empty whenever $u \notin ker D Φ (\bar{x}, \bar{y})$ . Indeed, in this case, $Φ$ is trivially metrically subregular at $(\bar{x}, \bar{y})$ in direction u. Furthermore, whenever (2.11) holds for all $u \in ker D Φ (\bar{x}, \bar{y}) \cap S_{X}$ , which we will refer to as FOSCMS, then $Φ$ is already metrically subregular at $(\bar{x}, \bar{y})$ , see [38, Lemma 2.7]. Above, FOSCMS abbreviates First-Order Sufficient Condition for Metric Subregularity, and this terminology has been coined in [36]. Clearly, each of the conditions from (2.10) is sufficient for FOSCMS. The relations (2.7) suggest that the pseudo-coderivative can be useful particularly in situations where the above regularity properties, which are related to (first-order) coderivatives, fail.

Note that the aforementioned notions of regularity and Lipschitzness express certain linear rate of change of the mapping. Similarly, there is an underlying linearity in the definition of graphical derivative and coderivatives. Take the graphical derivative for instance. Since the same sequence ${t_{k}}_{k \in N}$ appears in the domain as well as in the range space, if $v \in D Φ (\bar{x}, \bar{y}) (u)$ implies that $u \in X$ and $v \in Y$ are both nonzero, it suggests a proportional (linear) rate of change. Thus, in order to characterize pseudo-(sub)regularity of order $γ > 1$ of $Φ$ , it is not very surprising that we need to exploit derivative-like objects based on sub- or superlinear structure. Exemplary, this has been successfully visualized in [37, Corollary 2] by means of Gfrerer’s directional pseudo-coderivative of order $γ > 1$ from Definition 2.3 (c). Here, we show that the fundamental result from [37, Theorem 1(2)] yields also an analogous sufficient condition for metric pseudo-subregularity via the pseudo-coderivative from Definition 2.3 (b).

Lemma 2.10

Let $Φ : X ⇉ Y$ be a set-valued mapping having a closed graph locally around $(\bar{x}, \bar{y}) \in gph Φ$ , fix a direction $u \in S_{X}$ , and some $γ \geq 1$ . Assume that

\begin{matrix} ker D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) \subset {0} \end{matrix}

2.12

holds. Then $Φ$ is metrically pseudo-subregular of order $γ$ at $(\bar{x}, \bar{y})$ in direction u.

Proof

Suppose that $Φ$ is not metrically pseudo-subregular of order $γ$ at $(\bar{x}, \bar{y})$ in direction u. Due to [37, Theorem 1(2)], we find sequences ${t_{k}}_{k \in N} \subset R_{+}$ , ${u_{k}}_{k \in N}, {x_{k}^{*}}_{k \in N} \subset X$ , and ${v_{k}}_{k \in N}, {y_{k}^{*}}_{k \in N} \subset Y$ satisfying (among other things) $t_{k} ↓ 0$ , $u_{k} \to u$ , $t_{k}^{1 - γ} v_{k} \to 0$ , as well as $x_{k}^{*} \to 0$ , such that $| | y_{k}^{*} | | = 1$ and

\begin{matrix} (x_{k}^{*}, - y_{k}^{*} / {(t_{k} ∥u_{k}∥)}^{γ - 1}) \in {\hat{N}}_{gph Φ} ((\bar{x}, \bar{y}) + t_{k} (u_{k}, v_{k})) \end{matrix}

for each $k \in N$ . Let us set ${\tilde{v}}_{k} : = t_{k}^{1 - γ} {∥u_{k}∥}^{- γ} v_{k}$ for each $k \in N$ . Then we have

\begin{matrix} (x_{k}^{*}, - y_{k}^{*} / {(t_{k} ∥u_{k}∥)}^{γ - 1}) \in {\hat{N}}_{gph Φ} (\bar{x} + t_{k} u_{k}, \bar{y} + {(t_{k} ∥u_{k}∥)}^{γ} {\tilde{v}}_{k}) \end{matrix}

for each $k \in N$ and ${\tilde{v}}_{k} \to 0$ from $t_{k}^{1 - γ} v_{k} \to 0$ . Observing that ${y_{k}^{*}}_{k \in N}$ possesses a nonvanishing accumulation point $y^{*} \in Y$ , taking the limit along a suitable subsequence yields $0 \in D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (y^{*})$ which contradicts the assumptions of the lemma. $□$

Let us remark that due to (2.7), condition

\begin{matrix} ker {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) \subset {0} \end{matrix}

2.13

is stronger than (2.12) and, thus, also sufficient for metric pseudo-subregularity of $Φ$ of order $γ \geq 1$ at $(\bar{x}, \bar{y})$ in direction u. By means of [37, Corollary 2], (2.13) is actually equivalent to $Φ$ being metrically pseudo-regular at $(\bar{x}, \bar{y})$ in direction (u, 0). Note that in the case $γ : = 1$ , both conditions (2.12) and (2.13) recover FOSCMS(u). In Example 2.2, (2.12) and (2.13) hold simultaneously. The following example illustrates that (2.12) can be strictly milder than (2.13).

Example 2.3

For $γ > 1$ , we consider the mapping $Φ : R ⇉ R$ given by

\begin{matrix} gph Φ : = {{(x, y) | | x |}^{γ} \leq y \leq {2 | x |}^{γ}} \cap (⋃_{k \in N} R \times {1 / 2^{k}}) . \end{matrix}

Essentially, $gph Φ$ is a closed staircase enclosed by the graphs of the functions $x \mapsto {| x |}^{γ}$ and $x \mapsto {2 | x |}^{γ}$ . Set $(\bar{x}, \bar{y}) : = (0, 0)$ and $u : = 1$ . First, it is easy to see that (2.12) is satisfied, because one can show $ker D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) = \emptyset$ . Indeed, the sequences ${t_{k}}_{k \in N} \subset R_{+}$ and ${u_{k}}_{k \in N}, {v_{k}}_{k \in N} \subset R$ from the definition of the pseudo-coderivative satisfy, among others, $t_{k} ↓ 0$ , $u_{k} \to u$ , $v_{k} \to v$ , and $(t_{k} | u_{k} {|)}^{γ} \leq (t_{k} | u_{k} {|)}^{γ} v_{k} \leq 2 (t_{k} | u_{k} {|)}^{γ}$ for each $k \in N$ . Thus, $D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, v))$ can have a nonempty graph only for $v \in [1, 2]$ . Next, let us argue that (2.13) fails due to $1 \in ker {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ . We consider the sequences ${t_{k}}_{k \in N} \subset R_{+}$ and ${u_{k}}_{k \in N}, {v_{k}}_{k \in N} \subset R$ given by

\begin{matrix} \forall k \in N : t_{k} : = {(\frac{3}{2^{k + 2}})}^{1 / γ}, u_{k} : = 1, v_{k} : = \frac{1}{2^{k}} {(\frac{3}{2^{k + 2}})}^{- 1 / γ} . \end{matrix}

We obviously have $t_{k} ↓ 0$ , $u_{k} \to 1$ , as well as $v_{k} \to 0$ , and one can easily check that $(t_{k} u_{k}, t_{k} v_{k}) \in gph Φ$ holds for all $k \in N$ . By construction, there exist vertical normals to $gph Φ$ at $(t_{k} u_{k}, t_{k} v_{k})$ for each $k \in N$ , so we can choose $x_{k}^{*} : = 0$ and $y_{k}^{*} : = 1$ satisfying (2.6). Taking the limit $k \to \infty$ shows $1 \in ker {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ .

Remark 2.1

Let $Φ : X ⇉ Y$ be a set-valued mapping having locally closed graph around $(\bar{x}, \bar{y}) \in gph Φ$ , and fix some $γ \geq 1$ . Note that if we replace the set $Φ^{- 1} (\bar{y})$ by just the singleton ${\bar{x}}$ in Definition 2.1 of metric pseudo-subregularity, the estimate (2.2) simplifies to

\begin{matrix} {∥x - \bar{x}∥}^{γ} \leq κ dist (\bar{y}, Φ (x)) . \end{matrix}

Asking this to hold for all $x \in B_{ε} (\bar{x})$ and some $ε > 0$ seems like a natural way to define strong metric pseudo-subregularity of order $γ$ of $Φ$ at $(\bar{x}, \bar{y})$ . It is an easy exercise to verify that this condition is satisfied if and only if $ker D_{γ} Φ (\bar{x}, \bar{y}) = {0}$ . This characterization is clearly an extension of the Levy–Rockafellar criterion (2.10b), and it provides a justification for the graphical pseudo-derivative.

Finally, by definition of the pseudo-coderivatives, we easily find the inclusions

\begin{matrix} ker D_{γ + ε}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) & \subset ker D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)), \\ ker {\tilde{D}}_{γ + ε}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) & \subset ker {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) \end{matrix}

for each $γ \geq 1$ and $ε > 0$ . Hence, as $γ$ increases, the qualification conditions (2.12) and (2.13) become weaker.

Pseudo-(sub)regularity of order 2 for constraint mappings

In this section, we address the pseudo-coderivative calculus for so-called constraint mappings $Φ : X ⇉ Y$ which are given by $Φ (x) : = g (x) - D$ for all $x \in X$ , where $g : X \to Y$ is a single-valued continuous function and $D \subset Y$ is a closed set, and apply our findings from Sect. 2.3.4 in order to derive sufficient conditions for directional metric pseudo-(sub)regularity of order 2. Let us emphasize that this representation of $Φ$ will be a standing assumption in the overall section. The constraint mapping $Φ$ plays an important role for the analysis of so-called geometric constraint systems of type $g (x) \in D$ .

Directional pseudo-coderivatives and sufficient conditions

The first lemma of this subsection addresses upper estimates of the regular, limiting, and directional limiting coderivative of constraint mappings. These results are in principle quite standard, with the exception of the lower estimates in (a) and (c), which can be shown using [20, Theorem 3.1] and [18, Lemma 6.1], respectively. However, since we proceed in a fairly mild setting where g is assumed to be merely continuous, we cannot simply rely on change-or-coordinates formulas, see e.g. [71, Exercise 6.7], even for the proof of the standard parts in (a) and (b). Thus, we prove everything using the results from our recent paper [20].

Lemma 3.1

Fix $(x, y) \in gph Φ$ . Then the following statements hold.

For each $y^{*} \in Y$ , we have
$\begin{matrix} {\hat{D}}^{*} Φ (x, y) (y^{*}) \subset \{\begin{matrix} {\hat{D}}^{*} g (x) (y^{*}) & y^{*} \in {\hat{N}}_{D} (g (x) - y), \\ \emptyset & otherwise, \end{matrix}) \end{matrix}$
and the opposite inclusion holds if g is calm at x.
For each $y^{*} \in Y$ , we have
$\begin{matrix} D^{*} Φ (x, y) (y^{*}) \subset \{\begin{matrix} D^{*} g (x) (y^{*}) & y^{*} \in N_{D} (g (x) - y), \\ \emptyset & otherwise, \end{matrix}) \end{matrix}$
and the opposite inclusion holds whenever g is continuously differentiable at x.
For each pair of directions $(u, v) \in X \times Y$ and each $y^{*} \in Y$ , we have
$\begin{matrix} D^{*} Φ ((x, y) ; (u, v)) (y^{*}) \\ \subset \{\begin{matrix} ⋃_{w \in D g (x) (u)} D^{*} g (x ; (u, w)) (y^{*}) & y^{*} \in N_{D} (g (x) - y ; w - v), \\ \emptyset & otherwise \end{matrix}) \end{matrix}$
provided g is calm at x, and the opposite inclusion holds whenever g is continuously differentiable at x.

Proof

For the proof, we observe that $gph Φ = gph g + ({0} \times (- D))$ is valid. Now, we exploit the sum rule from [20]. Therefore, let us introduce the surrogate mapping $M : X \times Y ⇉ (X \times Y) \times (X \times Y)$ given by
$\begin{matrix} M (x, y) : = & \{((x_{1}, y_{1}), (x_{2}, y_{2})) \in gph g \times ({0} \times (- D)) |\begin{matrix} x & = x_{1} + x_{2} \\ y & = y_{1} + y_{2} \end{matrix})\} \\ = & \{\begin{matrix} {((x, g (x)), (0, y - g (x)))} & g (x) - y \in D, \\ \emptyset & otherwise \end{matrix}) \end{matrix}$ 3.1
for all $(x, y) \in X \times Y$ , and observe that $gph Φ = dom M$ holds while M is single-valued and continuous on $gph Φ$ . Now, we find
$\begin{matrix} {\hat{N}}_{gph Φ} (x, y) \subset {\hat{D}}^{*} M ((x, y), ((x, g (x)), (0, y - g (x)))) ((0, 0), (0, 0)) \end{matrix}$
for all $(x, y) \in gph Φ$ from [20, Theorem 3.1], and the converse inclusion holds if g is calm at x since this ensures that M is so-called isolatedly calm at the point of interest, see [20, Corollary 4.4, Section 5.1.1]. Now, computing the regular normal cone to $gph M$ via [20, Lemmas 2.1, 2.2] and applying the definition of the regular coderivative yields the claim.
The proof of the inclusion $\subset$ is similar as the one of the first statement. Again, we exploit the mapping M given in (3.1) and apply [20, Theorem 3.1] while observing that M is so-called inner semicompact w.r.t. its domain at each point $(x, y) \in gph Φ$ by continuity of g. In the presence of continuous differentiability, the converse inclusion $\supset$ follows easily by applying the change-of-coordinates formula provided in [71, Exercise 6.7].
This assertion can be shown in similar way as the second one, see [20, Lemma 2.1] as well.

$□$

Let us note that the upper estimate in (a) was also shown in [15, Lemma 3.2], but it actually follows directly from [71, Exercise 6.44] upon realizing $gph Φ = gph g + ({0} \times (- D))$ . In the case where g is not calm at the reference point, one can still obtain an upper estimate for the directional limiting coderivative from [20, Theorem 3.1] which is slightly more technical since it comprises another union over $w \in D g (x) (0) \cap S_{Y}$ .

Next, we estimate the directional pseudo-coderivatives of order 2 of constraint mappings in terms of initial problem data.

Theorem 3.1

Let g be twice continuously differentiable. Given $(\bar{x}, 0) \in gph Φ$ and a direction $u \in S_{X}$ , let

\begin{matrix} x^{*} \in {\tilde{D}}_{2}^{*} Φ ((\bar{x}, 0) ; (u, v)) (y^{*}) \end{matrix}

for some $v, y^{*} \in Y$ . Then there exists $z^{*} \in Y$ such that

\begin{matrix} x^{*} & = \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*}, \end{matrix}

3.2a

\begin{matrix} y^{*} & \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u - v) \cap ker \nabla g {(\bar{x})}^{*} . \end{matrix}

3.2b

Further specifications of $z^{*}$ satisfying (3.2) are available under additional assumptions.

Each of the following two conditions
$\begin{matrix} D N_{D} (g (\bar{x}), y^{*}) (0) \cap ker \nabla g {(\bar{x})}^{*} = {0}, \end{matrix}$ 3.3a

$\begin{matrix} \nabla g (\bar{x}) u \neq v, & D_{sub} N_{D} (g (\bar{x}), y^{*}) (\frac{\nabla g (\bar{x}) u - v}{∥\nabla g (\bar{x}) u - v∥}) \cap ker \nabla g {(\bar{x})}^{*} = \emptyset \end{matrix}$ 3.3b
implies that we can find $z^{*} \in D N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u - v)$ satisfying (3.2).
If $Y : = R^{m}$ and D is locally polyhedral around $g (\bar{x})$ , then $N_{D} (g (\bar{x}) ; \nabla g (\bar{x})$ $u - v) = N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v)$ , and there are two elements $z_{1}^{*}, z_{2}^{*} \in R^{m}$ satisfying (3.2) (for $z^{*} : = z_{i}^{*}$ with $i = 1, 2$ , respectively) with $z_{1}^{*} \in N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v)$ and $z_{2}^{*} \in T_{N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v)} (y^{*})$ .

Proof

Since $x^{*} \in {\tilde{D}}_{2}^{*} Φ ((\bar{x}, 0) ; (u, v)) (y^{*})$ , we find sequences ${t_{k}}_{k \in N} \subset R_{+}$ , ${u_{k}}_{k \in N}, {x_{k}^{*}}_{k \in N} \subset X$ , and ${v_{k}}_{k \in N}, {y_{k}^{*}}_{k \in N} \subset Y$ with $t_{k} ↓ 0$ , $u_{k} \to u$ , $v_{k} \to v$ , $x_{k}^{*} \to x^{*}$ , $y_{k}^{*} \to y^{*}$ , as well as

\begin{matrix} (x_{k}^{*}, - y_{k}^{*} / τ_{k}) \in {\hat{N}}_{gph Φ} (\bar{x} + t_{k} u_{k}, t_{k} v_{k}) \end{matrix}

for all $k \in N$ where we used $τ_{k} : = t_{k} ∥u_{k}∥$ for brevity of notation. Lemma 3.1 yields $x_{k}^{*} = \nabla g {(\bar{x} + t_{k} u_{k})}^{*} y_{k}^{*} / τ_{k}$ and $y_{k}^{*} \in τ_{k} {\hat{N}}_{D} (g (\bar{x} + t_{k} u_{k}) - t_{k} v_{k})$ for each $k \in N$ . Taking the limit in $τ_{k} x_{k}^{*} = \nabla g {(\bar{x} + t_{k} u_{k})}^{*} y_{k}^{*}$ , we find $y^{*} \in ker \nabla g {(\bar{x})}^{*}$ . Combining this with a Taylor expansion and denoting ${\tilde{w}}_{k} : = g (\bar{x} + t_{k} u_{k}) - t_{k} v_{k}$ gives us

3.4a

3.4b

for each $k \in N$ . We readily obtain $y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u - v)$ , i.e., (3.2b), as well as

\begin{matrix} x^{*} - \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) \in Im \nabla g {(\bar{x})}^{*}, \end{matrix}

i.e., (3.2a), due to the closedness of $Im \nabla g {(\bar{x})}^{*}$ .

In the general case (a), we will us the identity (3.4a) only with the right-hand side $\nabla g {(\bar{x})}^{*} (y_{k}^{*} - y^{*}) / τ_{k}$ , but in the polyhedral case (b), it is also reasonable to take a closer look at the expression $\nabla g {(\bar{x})}^{*} y_{k}^{*} / τ_{k}$ .

Let us now prove (a). Using the notation from above, let us first assume that ${z_{k}^{*}}_{k \in N}$ , given by $z_{k}^{*} : = (y_{k}^{*} - y^{*}) / τ_{k}$ for each $k \in N$ , remains bounded. Then we may pass to a subsequence (without relabeling) so that it converges to some $z^{*} \in Y$ . We get

and $z^{*} \in D N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u - v)$ follows. Clearly, taking the limit in (3.4a) yields (3.2a) as well.

On the other hand, if ${z_{k}^{*}}_{k \in N}$ does not remain bounded, we pass to a subsequence (without relabeling) such that $τ_{k} / ‖ y_{k}^{*} - y^{*} ‖ \to 0$ and ${\hat{z}}_{k}^{*} \to {\hat{z}}^{*}$ for some ${\hat{z}}^{*} \in S_{Y}$ where we used ${\hat{z}}_{k}^{*} : = (y_{k}^{*} - y^{*}) / ‖ y_{k}^{*} - y^{*} ‖$ for each $k \in N$ . Multiplying (3.4a) by $τ_{k} / ‖ y_{k}^{*} - y^{*} ‖$ and taking the limit yields $\nabla g {(\bar{x})}^{*} {\hat{z}}^{*} = 0$ . Taking into account $({\tilde{w}}_{k} - g (\bar{x})) / τ_{k} \to \nabla g (\bar{x}) u - v$ , we get

\begin{matrix} \frac{∥{\tilde{w}}_{k} - g (\bar{x})∥}{‖ y_{k}^{*} - y^{*} ‖} = \frac{∥{\tilde{w}}_{k} - g (\bar{x})∥}{τ_{k}} \frac{τ_{k}}{‖ y_{k}^{*} - y^{*} ‖} \to 0 . \end{matrix}

3.5

Let us assume that $\nabla g (\bar{x}) u \neq v$ . Then, for sufficiently large $k \in N$ , we have ${\tilde{w}}_{k} \neq g (\bar{x})$ , so we can set ${\hat{q}}_{k} : = ({\tilde{w}}_{k} - g (\bar{x})) / ∥{\tilde{w}}_{k} - g (\bar{x})∥$ for any such $k \in N$ and find $\hat{q} \in S_{Y}$ such that ${\hat{q}}_{k} \to \hat{q}$ (along a subsequence without relabeling). Moreover, we have

\begin{matrix} y^{*} + ‖ y_{k}^{*} - y^{*} ‖ {\hat{z}}_{k}^{*} = y_{k}^{*} \in {\hat{N}}_{D} (g (\bar{x}) + ∥{\tilde{w}}_{k} - g (\bar{x})∥ {\hat{q}}_{k}) \end{matrix}

from (3.4b), so that (3.5) yields ${\hat{z}}^{*} \in D_{sub} N_{D} (g (\bar{x}), y^{*}) (\hat{q})$ . This contradicts (3.3b). In the case where $\nabla g (\bar{x}) u = v$ holds, (3.3b) is not applicable. However, we still have

\begin{matrix} y^{*} + ‖ y_{k}^{*} - y^{*} ‖ {\hat{z}}_{k}^{*} = y_{k}^{*} \in {\hat{N}}_{D} (g (\bar{x}) + ‖ y_{k}^{*} - y^{*} ‖ \frac{{\tilde{w}}_{k} - g (\bar{x})}{‖ y_{k}^{*} - y^{*} ‖}), \end{matrix}

so that taking the limit $k \to \infty$ while respecting (3.5) yields ${\hat{z}}^{*} \in D N_{D} (g (\bar{x}), y^{*}) (0)$ which contradicts (3.3a).

In the polyhedral case (b), we will show that one can always replace the potentially unbounded sequences from (3.4a) by bounded ones. To start, we prove that $y_{k}^{*} \in N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v)$ for all sufficiently large $k \in N$ . Lemma 2.2 (a) yields the existence of a neighborhood $V \subset R^{m}$ of 0 such that

\begin{matrix} T_{D} (g (\bar{x})) \cap V = (D - g (\bar{x})) \cap V, \end{matrix}

3.6

as well as the fact that $T_{D} (g (\bar{x}))$ is polyhedral. Thus, from (3.4b) we conclude

for all sufficiently large $k \in N$ .

Next, let us set $K : = N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v)$ for brevity of notation, and note that K is a polyhedral cone. From above we know that $y_{k}^{*} \in K$ holds for all sufficiently large $k \in N$ . Then we also get $y^{*}, y_{k}^{*} / τ_{k} \in K$ and, by Lemma 2.2 (a), $(y_{k}^{*} - y^{*}) / τ_{k} \in T_{K} (y^{*})$ , where $T_{K} (y^{*})$ is also a polyhedral cone. Thus, referring to (3.4a), we may invoke Hoffman’s lemma, see [31, Lemma 3C.4], to find some bounded sequences ${z_{1, k}^{*}}_{k \in N} \subset K$ and ${z_{2, k}^{*}}_{k \in N} \subset T_{K} (y^{*})$ satisfying

for $i = 1, 2$ . Thus, accumulation points $z_{i}^{*} \in R^{m}$ of ${z_{i, k}^{*}}_{k \in N}$ for $i = 1, 2$ satisfy (3.2a) and $z_{1}^{*} \in K$ and $z_{2}^{*} \in T_{K} (y^{*})$ . $□$

Below, we comment on the findings of Theorem 3.1. To start, we illustrate that the additional information on the multiplier $z^{*}$ provided in statements (a) and (b) is the same whenever D is a convex polyhedral set in $Y : = R^{m}$ .

Remark 3.1

We use the notation from Theorem 3.1. Suppose that D is a convex polyhedral set in $Y : = R^{m}$ . First, we claim that

\begin{matrix} N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v) & \subset N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v) + span (y^{*}) \\ = T_{N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v)} (y^{*}) \\ = D N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u - v) . \end{matrix}

The first two relations are straightforward and so let us prove the last one. Based on the so-called reduction lemma, see [31, Lemma 2E.4], and [31, Proposition 2A.3], for each pair $(\bar{z}, {\bar{z}}^{*}) \in gph N_{D}$ , we get

\begin{matrix} (gph N_{D} - (\bar{z}, {\bar{z}}^{*})) \cap O \\ = {(w, w^{*}) | w \in K_{D} (\bar{z}, {\bar{z}}^{*}), w^{*} \in K_{D} {(\bar{z}, {\bar{z}}^{*})}^{\circ}, ⟨ w, w^{*} ⟩ = 0} \cap O, \end{matrix}

where $O \subset R^{m} \times R^{m}$ is a neighborhood of (0, 0) and $K_{D} (\bar{z}, {\bar{z}}^{*}) : = T_{D} (\bar{z}) \cap {[{\bar{z}}^{*}]}^{⊥}$ represents the critical cone to D at $(\bar{z}, {\bar{z}}^{*})$ . By Lemma 2.2 (a), this simply means

\begin{matrix} T_{gph N_{D}} (\bar{z}, {\bar{z}}^{*}) = {(w, w^{*}) | w \in K_{D} (\bar{z}, {\bar{z}}^{*}), w^{*} \in K_{D} {(\bar{z}, {\bar{z}}^{*})}^{\circ}, ⟨ w, w^{*} ⟩ = 0} . \end{matrix}

Thus, $z^{*} \in D N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u - v)$ means $\nabla g (\bar{x}) u - v \in T_{D} (g (\bar{x})) \cap {[y^{*}]}^{⊥}$ , which gives us

\begin{matrix} y^{*} \in N_{D} (g (\bar{x})) \cap {[\nabla g (\bar{x}) u - v]}^{⊥} = N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v), \end{matrix}

and

\begin{matrix} z^{*} \in N_{K_{D} (g (\bar{x}), y^{*})} (\nabla g (\bar{x}) u - v) & = (T_{D} (g (\bar{x})) \cap {[y^{*}]}^{⊥})^{\circ} \cap {[\nabla g (\bar{x}) u - v]}^{⊥} \\ = (N_{D} (g (\bar{x})) + span (y^{*})) \cap {[\nabla g (\bar{x}) u - v]}^{⊥} \\ = N_{D} (g (\bar{x})) \cap {[\nabla g (\bar{x}) u - v]}^{⊥} + span (y^{*}) \\ = N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v) + span (y^{*}) \\ = T_{N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v)} (y^{*}) \end{matrix}

by the basic properties of convex polyhedral cones and Lemma 2.2 (b).

Hence, in the convex polyhedral case, the information on $y^{*}$ and $z^{*}$ from statements (a) and (b) (case $z_{2}^{*}$ ) of Theorem 3.1 is the same, while the information from statement (b) (case $z_{1}^{*}$ ) is seemingly sharper. Let us now demonstrate that it is actually also equivalent to the others.

Note that (3.2b) can be equivalently written as $y^{*} \in N_{D} (g (\bar{x})) \cap {[v]}^{⊥} \cap ker \nabla g {(\bar{x})}^{*}$ due to Lemma 2.2 (b) and ${[\nabla g (\bar{x}) s - v]}^{⊥} \cap ker \nabla g {(\bar{x})}^{*} = {[v]}^{⊥} \cap ker \nabla g {(\bar{x})}^{*}$ for all $s \in X$ . This also means that, for any such $y^{*}$ , the sets

\begin{matrix} A_{1} (y^{*}, v) & : = {s \in X | \nabla g (\bar{x}) s - v \in T_{D} (g (\bar{x}))}, \\ A_{2} (y^{*}, v) & : = {s \in X | \nabla g (\bar{x}) s - v \in K_{D} (g (\bar{x}), y^{*})} \end{matrix}

coincide, and viewing $x^{*}$ , $y^{*}$ , u, and v as parameters, the linear programs are the same for $i = 1, 2$ . On the other hand, (3.2a) with $z^{*} \in N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u - v)$ and $z^{*} \in N_{K_{D} (g (\bar{x}), y^{*})} (\nabla g (\bar{x}) u - v)$ , respectively, precisely characterizes the fact that u is a minimizer of LP(1) and LP(2). Hence, this information on $z^{*}$ is the same.

Some additional comments on Theorem 3.1 are stated subsequently.

Remark 3.2

We use the notation from Theorem 3.1.

Note that, in the case $\nabla g (\bar{x}) u \neq v$ , assumption (3.3b), which is stated in terms of the graphical subderivative, is milder than (3.3a) in terms or the standard graphical derivative, and it preserves the connection to the direction $\nabla g (\bar{x}) u - v$ . Let us also note that the case $\nabla g (\bar{x}) u = v$ is, anyhow, special since this would annihilate the directional information in (3.2b) completely.
If $Y : = R^{m}$ and D is locally polyhedral around $g (\bar{x})$ , conditions (3.3) reduce to
$\begin{matrix} D N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u - v) \cap ker \nabla g {(\bar{x})}^{*} \subset {0} \end{matrix}$
thanks to Lemma 2.6.

In the polyhedral case, we can derive yet sharper information on $z^{*}$ if we start with the new pseudo-coderivative instead of the one utilized by Gfrerer. This is also the main reason for introducing the new definition. Throughout the paper, we will rely on the following result. Particularly, it plays an important role in Proposition 5.4 and Corollary 5.3, which we were not able to get using the estimates from Theorem 3.1.

Theorem 3.2

Let g be twice continuously differentiable. Given $(\bar{x}, 0) \in gph Φ$ , assume that $Y : = R^{m}$ and D is locally polyhedral around $g (\bar{x})$ . For a direction $u \in S_{X}$ , let

\begin{matrix} x^{*} \in D_{2}^{*} Φ ((\bar{x}, 0) ; (u, v)) (y^{*}) \end{matrix}

for some $v, y^{*} \in R^{m}$ . Then there exists $s \in X$ satisfying $y^{*} \in N_{T (u)} (w_{s} (u, v)) \cap ker \nabla g {(\bar{x})}^{*}$ where

\begin{matrix} T (u) : = T_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u), w_{s} (u, v) : = \nabla g (\bar{x}) s + 1 / 2 \nabla^{2} g (\bar{x}) [u, u] - v, \end{matrix}

3.7

together with two elements $z_{1}^{*} \in N_{T (u)} (w_{s} (u, v))$ and $z_{2}^{*} \in T_{N_{T (u)} (w_{s} (u, v))} (y^{*})$ satisfying $x^{*} = \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z_{i}^{*}$ for $i = 1, 2$ . Moreover, $v \in D_{2} Φ (\bar{x}, 0) (u)$ is equivalent to the existence of $s \in X$ with $w_{s} (u, v) \in T (u)$ .

Proof

Similar arguments as in the proof of Theorem 3.1 yield (3.4a) together with $y_{k}^{*} \in {\hat{N}}_{D} (w_{k})$ for each $k \in N$ where

As in the final part of the proof of Theorem 3.1, all we need to show is $y_{k}^{*} \in N_{T (u)} (w_{s} (u, v))$ for all sufficiently large $k \in N$ and some appropriately chosen $s \in X$ .

Noting that D is polyhedral while $T (u)$ is a polyhedral cone, we can apply Lemma 2.2 (a) to find neighborhoods $V, W \subset R^{m}$ of 0 such that (3.6) and

\begin{matrix} T (u) \cap W = T_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u) \cap W = (T_{D} (g (\bar{x})) - \nabla g (\bar{x}) u) \cap W . \end{matrix}

Consequently, we have $w_{k} - g (\bar{x}) \in T_{D} (g (\bar{x}))$ and, hence, also $(w_{k} - g (\bar{x})) / t_{k} \in T_{D} (g (\bar{x}))$ for sufficiently large $k \in N$ . Similarly, we conclude that $z_{k} \in T (u)$ . Taking into account that for each cone K, $q \in K$ , and $α > 0$ , one has $T_{K} (q) = T_{K} (α q)$ , we find

\begin{matrix} T_{D} (w_{k}) & = T_{g (\bar{x}) + T_{D} (g (\bar{x}))} (w_{k}) = T_{T_{D} (g (\bar{x}))} ((w_{k} - g (\bar{x})) / t_{k}) \\ = T_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u + t_{k} z_{k}) = T_{T_{D} (g (\bar{x})) - \nabla g (\bar{x}) u} (z_{k}) = T_{T (u)} (z_{k}) \end{matrix}

for all sufficiently large $k \in N$ , and we obtain $y_{k}^{*} \in {\hat{N}}_{D} (w_{k}) = {\hat{N}}_{T (u)} (z_{k})$ .

Since $T (u)$ is polyhedral, so is $gph N_{T (u)}$ , see Lemma 2.6, and it can be written as the union of finitely many convex polyhedral sets, say $C_{1}, \dots, C_{ℓ} \subset R^{m} \times R^{m}$ . Thus, we have

\begin{matrix} (z_{k}, y_{k}^{*}) \in gph {\hat{N}}_{T (u)} \subset gph N_{T (u)} = ⋃_{j = 1}^{ℓ} C_{j} \end{matrix}

for sufficiently large $k \in N$ . We may pick an index $\bar{j} \in {1, \dots, ℓ}$ such that $(z_{k}, y_{k}^{*}) \in C_{\bar{j}}$ holds for infinitely many $k \in N$ and suppose that $C_{\bar{j}}$ can be represented as $C_{\bar{j}} = {(z, y) | A z + B y \leq c}$ for some matrices A, B, as well as c of appropriate dimensions. Hence, by passing to a subsequence (without relabeling), we get

For each $k \in N$ , a generalized version of Hoffman’s lemma, see [47, Theorem 3], now yields the existence of $s_{k} \in X$ with

for some constant $β > 0$ not depending on k. Thus, ${s_{k}}_{k \in N}$ is bounded and satisfies

We may assume that ${s_{k}}_{k \in N}$ converges to some $s \in X$ . Exploiting (3.7), we infer

for all sufficiently large $k \in N$ from polyhedrality of $T (u)$ and the definition of the limiting normal cone.

To show the second statement, note that $v \in D_{2} Φ (\bar{x}, 0) (u)$ is equivalent to $0 \in D_{2}^{*} Φ ((\bar{x}, 0) ; (u, v)) (0)$ , so that any of these two conditions readily yields the existence of $s \in X$ with $w_{s} (u, v) \in T (u)$ . Conversely, suppose that there exists $s \in X$ with $w_{s} (u, v) \in T (u)$ . Let ${t_{k}}_{k \in N} \subset R_{+}$ be an arbitrary sequence with $t_{k} ↓ 0$ , and define the sequences ${u_{k}}_{k \in N} \subset X$ and ${v_{k}}_{k \in N}, {{\hat{w}}_{k}}_{k \in N} \subset Y$ by $u_{k} : = u + t_{k} s$ and

\begin{matrix} v_{k} : = (g (\bar{x} + t_{k} u_{k}) - {\hat{w}}_{k}) / {(t_{k} ∥u_{k}∥)}^{2}, {\hat{w}}_{k} : = g (\bar{x}) + t_{k} \nabla g (\bar{x}) u + t_{k}^{2} w_{s} (u, v) \end{matrix}

for all $k \in N$ . First, a second-order Taylor expansion together with $∥u_{k}∥ \to 1$ yields $v_{k} \to v$ . Next, using similar arguments as before, polyhedrality of $T_{D} (g (\bar{x}))$ and, locally around $g (\bar{x})$ , D, together with $w_{s} (u, v) \in T (u)$ , yields $g (\bar{x} + t_{k} u_{k}) - {(t_{k} ∥u_{k}∥)}^{2} v_{k} = {\hat{w}}_{k} \in D$ , i.e., $(\bar{x} + t_{k} u_{k}, {(t_{k} ∥u_{k}∥)}^{2} v_{k}) \in gph Φ$ , for sufficiently large $k \in N$ . Taking the limit $k \to \infty$ gives $v \in D_{2} Φ (\bar{x}, 0) (u)$ , and this completes the proof. $□$

Remark 3.3

Let us mention that if $Y : = R^{m}$ and D is locally polyhedral around $g (\bar{x})$ , we get the relations

\begin{matrix} N_{T (u)} (w_{s} (u, v)) & = N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u ; w_{s} (u, v)) \\ \subset N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u) \cap {[w_{s} (u, v)]}^{⊥} \\ \subset N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u) = N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) = N_{T (u)} (0) \end{matrix}

from Lemma 2.2 (b). This also yields $T_{N_{T (u)} (w_{s} (u, v))} (y^{*}) \subset T_{N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u)} (y^{*})$ .

Again, in the convex polyhedral case, the two options provided by Theorem 3.2 coincide. This can be shown using the same arguments as in Remark 3.1 but with the sets

\begin{matrix} \begin{matrix} {\tilde{A}}_{1} (y^{*}, u, v) & : = \{\tilde{s} \in X | w_{\tilde{s}} (u, v) \in T (u)\}, \\ {\tilde{A}}_{2} (y^{*}, u, v) & : = \{\tilde{s} \in X | w_{\tilde{s}} (u, v) \in K_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u, y^{*})\} \end{matrix} \end{matrix}

3.8

which coincide because the required existence of $s \in X$ with $y^{*} \in N_{T (u)} (w_{s} (u, v)) \cap ker \nabla g {(\bar{x})}^{*} = {(T (u))}^{\circ} \cap {[w_{s} (u, v)]}^{⊥} \cap ker \nabla g {(\bar{x})}^{*}$ obviously yields the inclusion $y^{*} \in {[1 / 2 \nabla^{2} g (\bar{x}) [u, u] - v]}^{⊥}$ and, thus, $⟨ y^{*}, w_{\tilde{s}} (u, v) ⟩ = 0$ for all $\tilde{s} \in X$ . This means that our conditions from Theorem 3.2 precisely state that the associated linear programs (LP(i)), $i = 1, 2$ , with $A_{i} (y^{*}, v)$ replaced by ${\tilde{A}}_{i} (y^{*}, u, v)$ , have a solution.

From Theorems 3.1 and 3.2 we obtain the following explicit sufficient conditions for metric pseudo-(sub)regularity of constraint mappings.

Corollary 3.1

Let g be twice continuously differentiable. Consider $(\bar{x}, 0) \in gph Φ$ and a direction $u \in S_{X}$ . The characterization (2.13) of metric pseudo-regularity of order 2 of $Φ$ in direction (u, 0) at $(\bar{x}, 0)$ holds under conditions (a), (b), and (c), while the sufficient condition (2.12) for metric pseudo-subregularity of order 2 of $Φ$ in direction u at $(\bar{x}, 0)$ is valid also under (d).

One has
$\begin{matrix} (\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} = 0, \\ y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) \end{matrix}\} ⟹ y^{*} = 0 . \end{matrix}$
One has
$\begin{matrix} (\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} = 0, \\ y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u), z^{*} \in D N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u) \end{matrix}\} ⟹ y^{*} = 0 . \end{matrix}$ 3.9
Furthermore, we either have
$\begin{matrix} (\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla g {(\bar{x})}^{*} {\hat{z}}^{*} = 0, \\ y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u), {\hat{z}}^{*} \in D N_{D} (g (\bar{x}), y^{*}) (0) \end{matrix}\} ⟹ {\hat{z}}^{*} = 0 \end{matrix}$ 3.10
or $\nabla g (\bar{x}) u \neq 0$ and
$\begin{matrix} (\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla g {(\bar{x})}^{*} {\hat{z}}^{*} = 0, \\ y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) \end{matrix}\} ⟹ {\hat{z}}^{*} \notin D_{s} u b N_{D} (g (\bar{x}), y^{*}) (\frac{\nabla g (\bar{x}) u}{∥\nabla g (\bar{x}) u∥}) . \end{matrix}$ 3.11
It holds $Y : = R^{m}$ , D is locally polyhedral around $g (\bar{x})$ , and
$\begin{matrix} (\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} = 0, \\ y^{*} \in N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u), \\ z^{*} \in N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u) (or z^{*} \in T_{N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u)} (y^{*})) \end{matrix}\} ⟹ y^{*} = 0 . \end{matrix}$ 3.12
It holds $Y : = R^{m}$ , D is locally polyhedral around $g (\bar{x})$ , and for each $s \in X$ one has
$\begin{matrix} (\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} = 0, \\ y^{*} \in N_{T (u)} (w_{s} (u, 0)), \\ z^{*} \in N_{T (u)} (w_{s} (u, 0)) (or z^{*} \in T_{N_{T (u)} (w_{s} (u, 0))} (y^{*})) \end{matrix}\} ⟹ y^{*} = 0 \end{matrix}$ 3.13
where $T (u)$ and $w_{s} (u, 0)$ are given as in (3.7).

Due to Remark 3.3, (3.12) indeed implies validity of (3.13) for arbitrarily chosen $s \in X$ .

Remark 3.4

Let us note that if $Y : = R^{m}$ and D is locally polyhedral around $g (\bar{x})$ , then (3.10) and (3.11) appearing in Corollary 3.1 (b) reduce to

\begin{matrix} (\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla g {(\bar{x})}^{*} {\hat{z}}^{*} = 0, \\ y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u), {\hat{z}}^{*} \in D N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u) \end{matrix}\} ⟹ {\hat{z}}^{*} = 0 \end{matrix}

3.14

thanks to Remark 3.2 (b).

The convex polyhedral case: a comparison with related results

Throughout the subsection, we assume that D is a convex polyhedral set in $Y : = R^{m}$ , and aim to compare our findings, at least partially, with available results from the literature. To start, we recall the definition of directional 2-regularity taken from [9, Definition 1].

Definition 3.1

Set $Y : = R^{m}$ , let D be convex and polyhedral, and fix $(\bar{x}, 0) \in gph Φ$ as well as $u \in X$ . Then the 2-regularity condition is said to hold at $\bar{x}$ in direction u if the following is valid:

\begin{matrix} Im \nabla g (\bar{x}) + \nabla^{2} g (\bar{x}) [u, \nabla g {(\bar{x})}^{- 1} T_{D} (g (\bar{x}))] - T_{D} (g (\bar{x})) = R^{m} . \end{matrix}

3.15

Let us mention that the original definition of directional 2-regularity from [9, Definition 1] is different from the one stated in Definition 3.1. However, both conditions are equivalent by [9, Proposition 1]. Furthermore, it should be noted that, in the setting of Definition 3.1, the 2-regularity condition in direction $u : = 0$ reduces to Robinson’s constraint qualification, see [25, Proposition 2.97]. Observe that, since $Im \nabla g (\bar{x})$ , $\nabla g {(\bar{x})}^{- 1} T_{D} (g (\bar{x}))$ , and $T_{D} (g (\bar{x}))$ are cones, 2-regularity in a nonzero direction u is equivalent to 2-regularity in direction $α u$ for arbitrary $α > 0$ . Hence, it is reasonable to consider merely directions from $S_{X}$ in Definition 3.1. In Proposition 3.1 below, we derive a dual characterization of 2-regularity in direction u, which states that the conditions can be satisfied only for $y^{*} = 0$ . Note that (C $(u, y^{*})$ ) can be stated in a $z^{*}$ -free manner by means of

\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, 0 \in \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} N_{D} (g (\bar{x})), y^{*} \in N_{D} (g (\bar{x})) \end{matrix}

which is why we did not include $z^{*}$ in the abbreviation (C $(u, y^{*})$ ).

Second, we will compare our findings with the ones from [37]. Again, we just consider the situation where D is a convex polyhedral set. In [37, Theorem 2 (2)], pseudo-subregularity of the feasibility mapping $Φ$ of order 2 at $(\bar{x}, 0) \in gph Φ$ in some direction $u \in S_{X}$ which satisfies $\nabla g (\bar{x}) u \in T_{D} (g (\bar{x}))$ (for other directions, the concept is trivial) was shown to be present under the following condition:(C $(u, y^{*})$ )

\begin{matrix} (C (u, y^{*})), y^{*} \in \underset{{\hat{y}}^{*} \in N_{D} (g (\bar{x})) \cap ker \nabla g {(\bar{x})}^{*}}{argmax} 1 / 2 \nabla^{2} ⟨ {\hat{y}}^{*}, g ⟩ (\bar{x}) [u, u] ⟹ y^{*} = 0 . \end{matrix}

3.16

We will now derive alternative representations of (3.12) and (3.15), which are sufficient for directional pseudo-regularity of $Φ$ of order 2, as well as (3.13) and (3.16), being sufficient for directional pseudo-subregularity of $Φ$ of order 2, which allow for a comparison of all these conditions.

To start, let us present a technical lemma, collecting some consequences of having $s \in X$ with $w_{s} (u, 0) \in T (u)$ , see (3.7) for the definition of $w_{s} (u, 0)$ and $T (u)$ .

Lemma 3.2

Set $Y : = R^{m}$ , let D be convex and polyhedral, and fix $(\bar{x}, 0) \in gph Φ$ as well as $u \in S_{X}$ such that $\nabla g (\bar{x}) u \in T_{D} (g (\bar{x}))$ . The existence of $s \in X$ with $w_{s} (u, 0) \in T (u)$ is equivalent to the existence of $\tilde{s} \in X$ with $w_{\tilde{s}} (u, 0) \in T_{D} (g (\bar{x}))$ , and these conditions imply $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] \leq 0$ for arbitrary $y^{*} \in N_{D} (g (\bar{x})) \cap ker \nabla g {(\bar{x})}^{*}$ . If, additionally, (C $(u, y^{*})$ ) holds, then we even have $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] = 0$ .

Proof

Let us start to prove the first assertion. Note that $T (u) = T_{D} (g (\bar{x})) + span (\nabla g (\bar{x}) u)$ holds due to polyhedrality of D yielding polyhedrality of $T_{D} (g (\bar{x}))$ . Hence, if $s \in X$ satisfies $w_{s} (u, 0) \in T (u)$ , then $\tilde{s} : = s + α u$ for some $α \in R$ satisfies $w_{\tilde{s}} (u, 0) = w_{s} (u, 0) + α \nabla g (\bar{x}) u \in T_{D} (g (\bar{x}))$ . The converse relation is trivial due to $T_{D} (g (\bar{x})) \subset T (u)$ .

The second assertion is a consequence of the definition of $w_{s} (u, 0)$ .

To show the final assertion, note that (C $(u, y^{*})$ ) gives

\begin{matrix} \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] = - ⟨ z^{*}, \nabla g (\bar{x}) u ⟩ \geq 0 \end{matrix}

as $z^{*} \in N_{D} (g (\bar{x}))$ and $\nabla g (\bar{x}) u \in T_{D} (g (\bar{x}))$ . $□$

Now, we are in position to state the central result of this subsection.

Proposition 3.1

Set $Y : = R^{m}$ , let D be convex and polyhedral, and fix $(\bar{x}, 0) \in gph Φ$ as well as $u \in S_{X}$ such that $\nabla g (\bar{x}) u \in T_{D} (g (\bar{x}))$ . Then the following statements hold.

The 2-regularity condition (3.15) is equivalent to the implication
$\begin{matrix} (C (u, y^{*})) ⟹ y^{*} = 0 . \end{matrix}$ 3.17
Condition (3.12) is equivalent to
$\begin{matrix} (C (u, y^{*})), ⟨ z^{*}, \nabla g (\bar{x}) u ⟩ = 0 ⟹ y^{*} = 0 . \end{matrix}$ 3.18
Gfrerer’s condition (3.16) and condition (3.13) are both equivalent to
$\begin{matrix} (C (u, y^{*})), w_{s} (u, 0) \in T (u) ⟹ y^{*} = 0 . \end{matrix}$ 3.19

Proof

Let us start to prove (a). If the 2-regularity condition holds at $\bar{x}$ in direction u, then computing the polar cone on both sides of (3.15) while respecting [23, Exercises 3.4(d) and 3.5] gives

\begin{matrix} {0} = N_{D} (g (\bar{x})) \cap ker \nabla g {(\bar{x})}^{*} \cap {y^{*} \in R^{m} | - \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) \in {(\nabla g {(\bar{x})}^{- 1} T_{D} (g (\bar{x})))}^{\circ}} . \end{matrix}

Relying on [23, Exercise 3.5] again while taking convexity and polyhedrality of D (and, thus, of $N_{D} (g (\bar{x}))$ ) into account, we find

\begin{matrix} {0} = N_{D} (g (\bar{x})) \cap ker \nabla g {(\bar{x})}^{*} \cap {y^{*} \in R^{m} | - \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) \in \nabla g {(\bar{x})}^{*} N_{D} (g (\bar{x}))} . \end{matrix}

3.20

Hence, (3.17) holds. Conversely, if (3.17) is valid, then (3.20) holds as well. Computing the polar cone on both sides, we can exploit [23, Exercises 3.4(d) and 3.5] once again in order to obtain

\begin{matrix} cl (Im \nabla g (\bar{x}) + \nabla^{2} g (\bar{x}) [u, \nabla g {(\bar{x})}^{- 1} T_{D} (g (\bar{x}))] - T_{D} (g (\bar{x}))) = R^{m} . \end{matrix}

Finally, one has to observe that the set within the closure operator is a convex polyhedral cone and, thus, closed in order to find validity of the 2-regularity condition at $\bar{x}$ in direction u.

Statement (b) follows immediately from Lemma 2.2 (b).

Finally, let us turn to the proof of statement (c). In order to show the equivalence between conditions (3.16) and (3.19), it suffices to prove that (3.16) is equivalent to

\begin{matrix} (C (u, y^{*})), \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] = 0, w_{\tilde{s}} (u, 0) \in T_{D} (g (\bar{x})) ⟹ y^{*} = 0, \end{matrix}

3.21

since the latter is equivalent to (3.19) by Lemma 3.2. The maximization problem appearing in (3.16) is a linear program whose feasible set is a nonempty, convex polyhedral cone. Furthermore, $y^{*} \in N_{D} (g (\bar{x})) \cap ker \nabla g {(\bar{x})}^{*}$ is a maximizer if and only if

\begin{matrix} \begin{matrix} 1 / 2 \nabla^{2} g (\bar{x}) [u, u] & \in N_{N_{D} (g (\bar{x})) \cap ker \nabla g {(\bar{x})}^{*}} (y^{*}) = {(N_{D} (g (\bar{x})) \cap ker \nabla g {(\bar{x})}^{*})}^{\circ} \cap {[y^{*}]}^{⊥} \\ = (T_{D} (g (\bar{x})) + Im \nabla g (\bar{x})) \cap {[y^{*}]}^{⊥} . \end{matrix} \end{matrix}

Here, we made use of [23, Exercise 3.4(d)] to compute the polar cone of the appearing intersection, and the latter is a polyhedral cone and, thus, closed. This inclusion, in turn, is equivalent to the existence of $\tilde{s} \in X$ such that

\begin{matrix} \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] = 0, w_{\tilde{s}} (u, 0) \in T_{D} (g (\bar{x})), \end{matrix}

showing the claimed equivalence between (3.16) and (3.21) as $y^{*} \in N_{D} (g (\bar{x})) \cap ker \nabla g {(\bar{x})}^{*}$ is already included in (C $(u, y^{*})$ ).

Clearly, (3.19) implies (3.13) by Lemma 2.2 (b) and Remark 3.3, so we only need to verify the converse implication. Thus, let us prove the premise of (3.13) assuming that (C $(u, y^{*})$ ) holds while there exists some $s \in X$ with $w_{s} (u, 0) \in T (u)$ . Particularly, from these two we infer $⟨ z^{*}, \nabla g (\bar{x}) u ⟩ = 0$ with the help of Lemma 3.2, so the premise of (3.18) is valid. Taking into account Remark 3.1, this means that u is a solution of the linear program ( $\hat{LP} (0))$ where we used for some parameter $q \in R^{m}$ .

For arbitrary $q \in R^{m}$ , we claim that whether ( $\hat{LP} (q)$ ) has a solution depends only on its feasibility since, for feasible problems, the issue of boundedness is independent of q. This follows from [17, Lemma 4], stating that, whenever ( $\hat{LP} (q)$ ) is feasible, then it possesses a solution if and only if there does not exist $s \in X$ satisfying $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, s] < 0$ and

\begin{matrix} \nabla g (\bar{x}) s \in {(T_{D} (g (\bar{x})) - q)}^{\infty} = {(T_{D} (g (\bar{x})))}^{\infty} = T_{D} (g (\bar{x})), \end{matrix}

and these conditions are, indeed, independent of q. Above, we have used [71, Exercises 3.12 and 6.34(c)]. Since ( $\hat{LP} (0))$ has a solution, ( $\hat{LP} (q)$ ) has a solution for each $q \in R^{m}$ for which it is feasible. Particularly, Lemma 3.2 thus yields that ( $\hat{LP} (\bar{q}))$ has a solution $\bar{s} \in X$ for $\bar{q} : = \nabla^{2} g (\bar{x}) [u, u]$ .

Finally, we claim that $\bar{s}$ is also a solution of the (feasible) linear program

\begin{matrix} min_{\tilde{s}} {\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, \tilde{s}] | w_{\tilde{s}} (u, 0) \in T (u)}, \end{matrix}

whose feasible set equals the set ${\tilde{A}}_{1} (y^{*}, u, 0)$ from (3.8). As explained just below (3.8), this will confirm the premise of (3.13) and thus conclude the proof. Suppose that $\bar{s}$ is not a solution of this problem, i.e., there exists $\hat{s}$ with $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, \hat{s} - \bar{s}] < 0$ and

\begin{matrix} w_{\hat{s}} (u, 0) \in T (u) = T_{D} (g (\bar{x})) + span (\nabla g (\bar{x}) u) . \end{matrix}

Then $\hat{s} + α u$ is a feasible point of ( $\hat{LP} (\bar{q}))$ for some $α \in R$ , while

\begin{matrix} \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, (\hat{s} + α u) - \bar{s}] = \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, \hat{s} - \bar{s}] < 0 \end{matrix}

follows from $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] = 0$ which holds by Lemma 3.2. The latter, however, means that $\bar{s}$ is not optimal for ( $\hat{LP} (\bar{q}))$ - a contradiction. $□$

Let us mention that the first assertion of Proposition 3.1 generalizes [40, Proposition 2].

As a corollary of Proposition 3.1, we now can easily interrelate the different sufficient conditions for pseudo-(sub)regularity.

Corollary 3.2

Set $Y : = R^{m}$ , let D be convex and polyhedral, and fix $(\bar{x}, 0) \in gph Φ$ as well as $u \in S_{X}$ such that $\nabla g (\bar{x}) u \in T_{D} (g (\bar{x}))$ . Then the following implications hold:

\begin{matrix} (3.15) ⟹ (3.12) ⟹ (3.16) ⟺ (3.13) . \end{matrix}

Particularly, (3.15) implies that $Φ$ is metrically pseudo-regular of order 2 at $(\bar{x}, 0)$ in direction (u, 0). Moreover, if there exists $s \in X$ with $w_{s} (u, 0) \in T (u)$ , all four conditions are equivalent.

Proof

The first implication and the equivalence are immediately clear by Proposition 3.1. In order to show the second implication, we first make use of Proposition 3.1 in order to see that it suffices to verify that (3.18) implies (3.19). This, however, is clear since (C $(u, y^{*})$ ) and the existence of $s \in X$ such that $w_{s} (u, 0) \in T (u)$ imply $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] = 0$ , see Lemma 3.2, and $⟨ z^{*}, \nabla g (\bar{x}) u ⟩ = 0$ follows by (C $(u, y^{*})$ ).

The fact that (3.15) is sufficient for directional pseudo-regularity of $Φ$ now follows from Corollary 3.1. The final statement is obvious from Proposition 3.1. $□$

The following example shows that our sufficient condition (3.12) for directional pseudo-regularity is strictly milder than directional 2-regularity from (3.15).

Example 3.1

Let $g : R \to R^{2}$ and $D_{i} \subset R^{2}$ , $i = 1, 2$ , be given by $g (x) : = (x, - x^{2})$ , $x \in R$ , and

\begin{matrix} D_{1} : = R \times R_{+}, D_{2} : = R_{-} \times R_{+} . \end{matrix}

Observe that $D_{i}$ is a convex polyhedral set for $i = 1, 2$ . We consider the constraint mappings $Φ_{i} : R ⇉ R^{2}$ given by $Φ_{i} (x) : = g (x) - D_{i}$ , $x \in R$ , for $i = 1, 2$ and fix $\bar{x} : = 0$ and $u : = - 1$ . Note that $(\bar{x}, 0) \in gph Φ_{i}$ for $i = 1, 2$ .

Let us start with the investigation of the mapping $Φ_{1}$ . Due to $N_{D_{1}} (g (\bar{x})) = {0} \times R_{-}$ and

\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = y_{1}^{*}, \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} = 2 y_{2}^{*} + z_{1}^{*}, \end{matrix}

one can easily check that (3.17) and (3.18) are both satisfied. Consequently, due to Proposition 3.1, (3.12) and (3.15) hold in parallel.

Let us now consider the mapping $Φ_{2}$ . Clearly, (3.18) remains valid since the appearing variable $z^{*}$ has to be chosen from the set $N_{D_{2}} (g (\bar{x})) \cap {[\nabla g (\bar{x}) u]}^{⊥} = {0} \times R_{-}$ . Hence, due to Proposition 3.1, (3.12) holds (and, thus, pseudo-regularity of order 2 of $Φ_{2}$ in direction u at $(\bar{x}, 0)$ ). However, we have $N_{D_{2}} (g (\bar{x})) = R_{+} \times R_{-}$ , so that choosing $y^{*} : = (0, - 1)$ and $z^{*} : = (2, 0)$ yields a violation of (3.17) in this situation. Consulting Proposition 3.1 once again, (3.15) is violated as well. Let us also note that, for each $s \in R$ , we have

\begin{matrix} w_{s} (u, 0) = \nabla g (\bar{x}) s + 1 / 2 \nabla^{2} g (\bar{x}) [u, u] = (s, - 1) \notin R_{-} \times R_{+} = T_{D_{2}} (g (\bar{x})), \end{matrix}

see Corollary 3.2. Hence, (3.12) is strictly milder than (3.15).

Let us take a closer look at the particular situation where $D : = {0}$ .

Remark 3.5

Set $Y : = R^{m}$ , $D : = {0}$ , and fix $(\bar{x}, 0) \in gph Φ$ as well as $u \in S_{X}$ such that $u \in ker \nabla g (\bar{x})$ . Let us consider the sufficient conditions for directional metric pseudo-(sub)regularity discussed in Proposition 3.1. The constraint qualification (3.17) obviously reduces to

\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} = 0 ⟹ y^{*} = 0, \end{matrix}

3.22

and the latter is equivalent to the 2-regularity condition (3.15) at $\bar{x}$ in direction u by Proposition 3.1. One can easily check that (3.12) also reduces to (3.22). Furthermore, due to Proposition 3.1, (3.13) and (3.16) reduce to

\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} = 0, w_{s} (u, 0) = 0 ⟹ y^{*} = 0, \end{matrix}

and the latter is strictly milder than (3.22) as we will illustrate in Example 3.2 below.

To close the remark, let us mention that whenever (3.22) has to hold for all $u \in S_{X}$ (this implies metric pseudo-subregularity of order 2 of $Φ$ at $(\bar{x}, 0)$ for all unit directions), then either $\nabla g (\bar{x})$ is surjective or the zero operator, see [34, Remark 2.1], i.e., this situation is rather special. We believe, however, that this is mainly because D is trivial and partially due to the precise definition of 2-regularity. Let us point the interested reader to [37, Example 2], which suggests that metric pseudo-subregularity of order 2 in all unit directions might be a reasonable assumption.

The following example, which has been motivated by Remark 3.5, indicates that (3.16) is strictly milder than (3.12).

Example 3.2

Let $g : R^{2} \to R^{3}$ and $D \subset R^{3}$ be given by $g (x) : = (x_{1}^{2}, x_{2}^{2}, x_{1} x_{2})$ , $x \in R^{2}$ , and $D : = {0}$ . We consider the point $\bar{x} : = 0$ . As $\nabla g (\bar{x})$ vanishes while we have $T_{D} (g (\bar{x})) = {0}$ , each direction $u \in S_{R^{2}}$ satisfies $\nabla g (\bar{x}) u \in T_{D} (g (\bar{x}))$ , and we pick any such u. Due to Remark 3.5, (3.12) and (3.15) reduce to

\begin{matrix} y_{1}^{*} (2 u_{1}, 0) + y_{2}^{*} (0, 2 u_{2}) + y_{3}^{*} (u_{2}, u_{1}) = (0, 0) ⟹ y^{*} = 0, \end{matrix}

and since three vectors in $R^{2}$ are always linearly dependent, this condition is trivially violated. On the other hand, (3.13) and (3.16) can be stated as

\begin{matrix} (\begin{matrix} y_{1}^{*} (2 u_{1}, 0) + y_{2}^{*} (0, 2 u_{2}) + y_{3}^{*} (u_{2}, u_{1}) = (0, 0), \\ (u_{1}^{2}, u_{2}^{2}, u_{1} u_{2}) = (0, 0, 0) \end{matrix}\} ⟹ y^{*} = 0, \end{matrix}

and this condition holds as the premise regarding u cannot be satisfied by any $u \in S_{R^{2}}$ .

We close this subsection with some more general remarks about (directional) 2-regularity and Gfrerer’s sufficient condition for metric pseudo-(sub)regularity from [37, Theorem 2].

In this subsection, for simplicity, we restricted ourselves to the convex polyhedral case, but neither our approach nor the other results are limited to this case. The original definition of directional 2-regularity in [9] is stated for merely convex sets D (no polyhedrality is assumed in the latter paper), but involves the radial cone to D which is not necessarily closed for curved sets D. Interestingly, [37, Example 2], already mentioned in Remark 3.5, provides a mapping which is metrically pseudo-regular of order 2 in every direction (u, 0) with $u \neq 0$ , particularly metrically pseudo-subregular of order 2 in every unit direction, but the 2-regularity condition is violated for every direction; the chosen set D in this example is the Euclidean unit ball in $R^{2}$ which is not polyhedral.

Let us mention that [37, Theorem 2] is stated in the general polyhedral case (no convexity is assumed), and it yields the existence of several elements $s \in X$ corresponding to the active components of the set D. Looking into the proof of Theorem 3.2, it seems like we could get a similar result with only minor adjustments, but since we do not need such a result here, we did not develop this approach for the purpose of brevity.

Let us also note that the conditions from statements (a) and (b) of Corollary 3.1 are not covered by [9] (since D does not need to be convex for our findings) or by [37, Theorem 2] (since D does not need to be polyhedral).

Finally, let us point out that the concept of 2-regularity is useful for the design and the convergence analysis of Newton-type methods, aiming to solve smooth and nonsmooth equations, see e.g. [33, 49] and the references therein.

Directional asymptotic stationarity in nonsmooth optimization

This section is devoted to directional asymptotic stationarity conditions and related results. It contains the foundation of our research, Theorem 4.1, which also motivates our considerations in Sect. 5.

For a locally Lipschitz continuous function $φ : X \to R$ , a set-valued mapping $Φ : X ⇉ Y$ with a closed graph, and $\bar{y} \in Im Φ$ , we investigate the rather abstract optimization problem

\begin{matrix} min_{x} {φ (x) | \bar{y} \in Φ (x)} . \end{matrix}

Throughout the section, the feasible set of (P) will be denoted by $F \subset X$ . Clearly, we have $F \neq \emptyset$ from $\bar{y} \in Im Φ$ . Note that the model (P) covers numerous classes of optimization problems from the literature including standard nonlinear problems, problems with geometric (particularly, disjunctive or conic) constraints, problems with (quasi) variational inequality constraints, and bilevel optimization problems. Furthermore, we would like to mention that choosing $\bar{y} : = 0$ would not be restrictive since one could simply consider $\tilde{Φ} : X ⇉ Y$ given by $\tilde{Φ} (x) : = Φ (x) - \bar{y}$ , $x \in X$ , in the case where $\bar{y}$ does not vanish. Optimality conditions and constraint qualifications for problems of this type can be found, e.g., in [36, 63, 65, 76]. A standard notion of stationarity, which applies to (P) and is based on the tools of limiting variational analysis, is the one of M-stationarity.

Definition 4.1

A feasible point $\bar{x} \in F$ of (P) is called M-stationary whenever there is a multiplier $λ \in Y$ such that

\begin{matrix} 0 \in \partial φ (\bar{x}) + D^{*} Φ (\bar{x}, \bar{y}) (λ) . \end{matrix}

Later in Corollary 4.3, we will show that directional metric subregularity of $Φ$ serves as a constraint qualification for M-stationarity. In the following lemma, whose proof is analogous to the one of [14, Lemma 3.1], we point out that directional metric subregularity of $Φ$ implies that penalizing the constraint in (P) with the aid of the distance function yields a directionally exact penalty function.

Lemma 4.1

Let $\bar{x} \in F$ be a local minimizer of (P), and assume that $Φ$ is metrically subregular at $(\bar{x}, \bar{y})$ in direction $u \in S_{X}$ . Then there are constants $ε > 0$ , $δ > 0$ , and $C > 0$ such that $\bar{x}$ is a global minimizer of

\begin{matrix} min_{x} {φ (x) + C dist (\bar{y}, Φ (x)) | x \in \bar{x} + B_{ε, δ} (u)} . \end{matrix}

4.1

Let us note that this result refines well-known findings about classical exact penalization in the presence of metric subregularity, see e.g. [27, 28, 56].

Approaching mixed-order stationarity conditions

To start, let us introduce a quite general notion of critical directions associated with (P).

Definition 4.2

For some feasible point $\bar{x} \in F$ and a pair $(γ_{0}, γ) \in R \times R$ such that $γ_{0} \geq 1$ as well as $γ \geq 1$ , a direction $u \in X$ is called critical of order $(γ_{0}, γ)$ for (P) at $\bar{x}$ whenever there are sequences ${u_{k}}_{k \in N} \subset X$ , ${α_{k}}_{k \in N} \subset R$ , ${v_{k}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N} \subset R_{+}$ satisfying $u_{k} \to u$ , $t_{k} ↓ 0$ , $α_{k} \to 0$ , $v_{k} \to 0$ , and, for all $k \in N$ ,

\begin{matrix} (\bar{x} + t_{k} u_{k}, φ (\bar{x}) + {(t_{k} ∥u_{k}∥)}^{γ_{0}} α_{k}) \in epi φ, (\bar{x} + t_{k} u_{k}, \bar{y} + {(t_{k} ∥u_{k}∥)}^{γ} v_{k}) \in gph Φ . \end{matrix}

4.2

If $(γ_{0}, γ) : = (1, 1)$ , we simply call u a critical direction for (P) at $\bar{x}$ .

Clearly, $u : = 0$ is critical of every order. Moreover, the set of all critical directions of any fixed order is a cone. The most standard case $(γ_{0}, γ) : = (1, 1)$ corresponds to [36, Definition 5]. If $φ$ is directionally differentiable at $\bar{x}$ , it is easily seen that $u \in X$ is critical for (P) at $\bar{x}$ if and only if $φ^{'} (\bar{x} ; u) \leq 0$ and $u \in ker D Φ (\bar{x}, \bar{y})$ , see [72, Proposition 3.5] as well. Let us note that whenever $\bar{x} \in F$ is a feasible point of (P) such that no critical direction for (P) at $\bar{x}$ exists, then $\bar{x}$ is a strict local minimizer of (P). Conversely, there may exist strict local minimizers of (P) such that a critical direction for (P) at this point exists.

While in this paper, we will not go beyond the case $γ_{0} : = 1$ (the case $γ_{0} : = 2$ is briefly mentioned in Lemma 4.3), the situation $γ > 1$ (particularly $γ : = 2$ ) will be very important. For $γ_{0} : = 1$ and arbitrary $γ \geq 1$ , a critical direction $u \in X$ still satisfies $d φ (\bar{x}) (u) \leq 0$ and $u \in ker D_{γ} Φ (\bar{x}, \bar{y})$ , and the converse is valid whenever $φ$ is continuously differentiable at $\bar{x}$ . In the next lemma, we show that if $Φ$ is metrically pseudo-subregular of order $γ$ at $(\bar{x}, \bar{y})$ , then u is actually critical of order $(1, γ^{'})$ for each $γ^{'} \geq 1$ .

Lemma 4.2

Fix a feasible point $\bar{x} \in F$ of (P), $γ \geq 1$ , and a critical direction $u \in X$ of order $(1, γ)$ for (P) at $\bar{x}$ . If $Φ$ is metrically pseudo-subregular of order $γ$ in direction u at $(\bar{x}, \bar{y})$ , then u is critical of order $(1, γ^{'})$ for (P) at $\bar{x}$ for each $γ^{'} \geq 1$ .

Proof

Let ${u_{k}}_{k \in N} \subset X$ , ${α_{k}}_{k \in N} \subset R$ , ${v_{k}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N} \subset R_{+}$ be sequences satisfying $u_{k} \to u$ , $t_{k} ↓ 0$ , $α_{k} \to 0$ , $v_{k} \to 0$ , as well as (4.2) for all $k \in N$ . By metric pseudo-subregularity of order $γ$ of $Φ$ at $(\bar{x}, \bar{y})$ , there is a constant $κ > 0$ such that, for sufficiently large $k \in N$ , we get the existence of ${\tilde{x}}_{k} \in Φ^{- 1} (\bar{y})$ with

\begin{matrix} ∥{\tilde{x}}_{k} - (\bar{x} + t_{k} u_{k})∥ \leq κ \frac{dist (\bar{y}, Φ (\bar{x} + t_{k} u_{k}))}{{(t_{k} ∥u_{k}∥)}^{γ - 1}} \leq κ \frac{{(t_{k} ∥u_{k}∥)}^{γ} ∥v_{k}∥}{{(t_{k} ∥u_{k}∥)}^{γ - 1}} = κ t_{k} ∥u_{k}∥ ∥v_{k}∥ . \end{matrix}

Particularly, we find $‖ ({\tilde{x}}_{k} - \bar{x}) / t_{k} - u_{k} ‖ \to 0$ from $u_{k} \to u$ and $v_{k} \to 0$ . Moreover, Lipschitzianity of $φ$ yields

\begin{matrix} φ ({\tilde{x}}_{k}) & \leq φ ({\tilde{x}}_{k}) - φ (\bar{x} + t_{k} u_{k}) + φ (\bar{x}) + t_{k} ∥u_{k}∥ α_{k} \\ \leq L κ t_{k} ∥u_{k}∥ ∥v_{k}∥ + φ (\bar{x}) + t_{k} ∥u_{k}∥ α_{k} = φ (\bar{x}) + t_{k} ∥u_{k}∥ (L κ ∥v_{k}∥ + α_{k}) \end{matrix}

for some constant $L > 0$ and sufficiently large $k \in N$ . Thus, setting ${\tilde{u}}_{k} : = ({\tilde{x}}_{k} - \bar{x}) / t_{k}$ , ${\tilde{α}}_{k} : = L κ ∥v_{k}∥ + α_{k}$ , and ${\tilde{t}}_{k} : = t_{k}$ for large enough $k \in N$ yields ${\tilde{u}}_{k} \to u$ , ${\tilde{t}}_{k} ↓ 0$ , ${\tilde{α}}_{k} \to 0$ , as well as

\begin{matrix} (\bar{x} + {\tilde{t}}_{k} {\tilde{u}}_{k}, φ (\bar{x}) + {\tilde{t}}_{k} ‖ {\tilde{u}}_{k} ‖ {\tilde{α}}_{k}) \in epi φ, (\bar{x} + {\tilde{t}}_{k} {\tilde{u}}_{k}, \bar{y}) \in gph Φ \end{matrix}

for large enough $k \in N$ , and so u is critical of order $(1, γ^{'})$ for (P) at $\bar{x}$ for each $γ^{'} \geq 1$ . $□$

The following result, inspired by and based on [37, Proposition 2], provides an important interpretation of the notion from Definition 4.2 in terms of the so-called epigraphical map $M_{0} : X ⇉ R$ associated with $φ$ and given by $M_{0} (x) : = φ (x) + R_{+}$ , $x \in X$ . The proof follows simply from the fact that $gph M_{0} = epi φ$ together with Remark 2.1.

Proposition 4.1

Given a feasible point $\bar{x} \in F$ and a pair $(γ_{0}, γ) \in R \times R$ such that $γ_{0} \geq 1$ as well as $γ \geq 1$ , a direction $u \in S_{X}$ is critical of order $(γ_{0}, γ)$ for (P) at $\bar{x}$ if and only if there exist sequences ${u_{k}}_{k \in N} \subset X$ and ${t_{k}}_{k \in N} \subset R_{+}$ such that $u_{k} \to u$ , $t_{k} ↓ 0$ , and

\begin{matrix} \frac{dist (φ (\bar{x}), M_{0} (\bar{x} + t_{k} u_{k}))}{{(t_{k} ∥u_{k}∥)}^{γ_{0}}} \to 0, \frac{dist (\bar{y}, Φ (\bar{x} + t_{k} u_{k}))}{{(t_{k} ∥u_{k}∥)}^{γ}} \to 0 . \end{matrix}

4.3

Moreover, if $γ_{0} = γ$ , this is further equivalent to the condition

\begin{matrix} u \in ker D_{γ} M (\bar{x}, (φ (\bar{x}), \bar{y})) \end{matrix}

for the mapping $M : X ⇉ R \times Y$ given by $M (x) : = M_{0} (x) \times Φ (x)$ , $x \in X$ .

Interestingly, Gfrerer used the conditions (4.3) as a basis of his optimality conditions in [37, Proposition 2], but he did not notice, or at least did not mention, that these conditions actually provide a natural extension of his own notion of a critical direction from [36, Definition 5]. This observation enables us to formulate an extension of the common pattern “for every critical direction there is a multiplier satisfying an FJ-type optimality condition” in Corollary 4.1 below.

Remark 4.1

Gfrerer recognized the importance of considering Cartesian product mappings $M : X ⇉ Y_{0} \times Y_{1} \times \dots \times Y_{s}$ , given by

\begin{matrix} \forall x \in X : M (x) : = M_{0} (x) \times M_{1} (x) \times \dots \times M_{s} (x) \end{matrix}

for the component maps $M_{i} : X ⇉ Y_{i}$ , $i = 0, 1, \dots, s$ , and Euclidean spaces $Y_{0}, Y_{1}, \dots, Y_{s}$ , and to allow different orders $γ_{i} \geq 1$ of pseudo-(sub)regularity of these component mappings, see [37, Definition 1]. In the same manner, he defined his pseudo-coderivative [37, Definition 2]. This was essential for his approach to optimality conditions. For brevity of presentation, we avoid these definitions and bypass explicitly using these notions by applying [37, Proposition 2] in combination with the sufficient conditions for pseudo-subregularity from [37, Theorem 1(2)] to prove Corollary 4.1.

Corollary 4.1

Let $\bar{x} \in F$ be a local minimizer of (P) and let $u \in S_{X}$ be a critical direction of order $(1, γ)$ for (P) at $\bar{x}$ with $γ \geq 1$ . Then there exist multipliers $(0, 0) \neq (α^{*}, λ) \in R_{+} \times Y$ satisfying

\begin{matrix} 0 \in α^{*} \partial φ (\bar{x} ; u) + D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (λ) . \end{matrix}

If the sufficient condition (2.12) for metric pseudo-subregularity of order $γ$ of $Φ$ in direction u at $(\bar{x}, \bar{y})$ holds, then the above condition holds with $α^{*} : = 1$ .

Proof

Applying Proposition 4.1 and then [37, Proposition 2 and Theorem 1(2)] yields an element $z^{*} \in X$ and sequences ${t_{k}}_{k \in N} \subset R_{+}$ , ${u_{k}}_{k \in N}, {x_{k}^{*}}_{k \in N} \subset X$ , ${α_{k}}_{k \in N}, {α_{k}^{*}}_{k \in N} \subset R$ , and ${v_{k}}_{k \in N}, {λ_{k}}_{k \in N} \subset Y$ satisfying (among other things) $t_{k} ↓ 0$ , $u_{k} \to u$ , $α_{k} \to 0$ , $t_{k}^{1 - γ} v_{k} \to 0$ , as well as $x_{k}^{*} \to 0$ , such that, for each $k \in N$ , $∥(α_{k}^{*}, λ_{k})∥ = 1$ and

\begin{matrix} (x_{k}^{*}, - α_{k}^{*}, - λ_{k} / {(t_{k} ∥u_{k}∥)}^{γ - 1}) \in {\hat{N}}_{gph M^{*}} (\bar{x} + t_{k} u_{k}, φ (\bar{x}) + t_{k} α_{k}, \bar{y} + t_{k}^{γ} {\tilde{v}}_{k}), \end{matrix}

4.4

where we used ${\tilde{v}}_{k} : = t_{k}^{1 - γ} v_{k}$ as well as the mapping $M^{*} : X ⇉ R \times Y$ defined by $M^{*} (x) : = M_{0}^{*} (x) \times Φ (x)$ , $x \in X$ , with the perturbed epigraphical mapping $M_{0}^{*} : X ⇉ R$ given by

\begin{matrix} \forall x \in X : M_{0}^{*} (x) : = φ (x) + {| ⟨ z^{*}, x - \bar{x} ⟩ |}^{3} + R_{+} . \end{matrix}

Note that we have $gph M^{*} = (gph M_{0}^{*} \times Y) \cap P (gph Φ \times R)$ , where the permutation mapping $P : X \times Y \times R \to X \times R \times Y$ just swaps the last two components. After replacing the regular by the larger limiting normal cone in (4.4) and noting that $x \mapsto φ (x) + | ⟨ z^{*}, x - \bar{x} ⟩ |^{3}$ is locally Lipschitzian, we can apply the intersection rule for limiting normals from [71, Theorem 6.42]. The latter yields, for each $k \in N$ , $x_{k, 1}^{*}, x_{k, 2}^{*} \in X$ with $x_{k}^{*} = x_{k, 1}^{*} + x_{k, 2}^{*}$ and

\begin{matrix} (x_{k, 1}^{*}, - α_{k}^{*}) & \in N_{gph M_{0}^{*}} (\bar{x} + t_{k} u_{k}, φ (\bar{x}) + t_{k} α_{k}), \\ (x_{k, 2}^{*}, - λ_{k} / {(t_{k} ∥u_{k}∥)}^{γ - 1}) & \in N_{gph Φ} (\bar{x} + t_{k} u_{k}, \bar{y} + t_{k}^{γ} {\tilde{v}}_{k}) . \end{matrix}

Now, local Lipschitzness of $x \mapsto φ (x) + | ⟨ z^{*}, x - \bar{x} ⟩ |^{3}$ together with boundedness of ${α_{k}^{*}}_{k \in N}$ implies boundedness of ${x_{k, 1}^{*}}_{k \in N}$ . This, in turn, gives boundedness of ${x_{k, 2}^{*}}_{k \in N}$ . Since ${λ_{k}}_{k \in N}$ is also bounded, taking the limit along a suitable subsequence yields some $x^{*} \in X$ , $α^{*} \in R$ , and $λ \in Y$ satisfying $(α^{*}, λ) \neq (0, 0)$ as well as

\begin{matrix} x^{*} \in D^{*} M_{0}^{*} ((\bar{x}, φ (\bar{x})) ; (u, 0)) (α^{*}), - x^{*} \in D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (λ) . \end{matrix}

Here, we used the robustness of the directional limiting coderivative, see Lemma 2.1, as well as Lemma 2.8. Taking into account that $x \mapsto | ⟨ z^{*}, x - \bar{x} ⟩ |^{3}$ is smooth with its gradient vanishing at $\bar{x}$ and using Lemma 3.1 (c) as well as [18, Proposition 5.1], we get $α^{*} \geq 0$ and $D^{*} M_{0}^{*} ((\bar{x}, φ (\bar{x})) ; (u, 0)) (α^{*}) \subset α^{*} \partial φ (\bar{x} ; u)$ . This proves the first statement.

Finally, (2.12) clearly implies $α^{*} > 0$ , and by rescaling, we can set $α^{*} : = 1$ . $□$

Mixed-order and asymptotic stationarity conditions

The following result provides asymptotic necessary optimality conditions for (P) which hold in the absence of constraint qualifications. The derived conditions depend on a certain order $γ \geq 1$ . Furthermore, our result specifies how the asymptotic case (d) can be ruled out by metric pseudo-subregularity of $Φ$ of order $γ$ at the reference point.

Theorem 4.1

Let $\bar{x} \in F$ be a local minimizer of (P) and consider $γ \geq 1$ . Then one of the following conditions holds.

The point $\bar{x}$ is M-stationary for (P).
There exists a critical direction $u \in S_{X}$ for (P) at $\bar{x}$ such that
$\begin{matrix} 0 \in \partial φ (\bar{x} ; u) + {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (0) . \end{matrix}$ 4.5
There exist a critical direction $u \in S_{X}$ for (P) at $\bar{x}$ , a nonvanishing multiplier $λ \in Y$ , and $α \geq 0$ such that, for $v : = α λ$ , we have
$\begin{matrix} 0 \in \partial φ (\bar{x} ; u) + D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, v)) (λ) . \end{matrix}$ 4.6
There exist a critical direction $u \in S_{X}$ of order $(γ_{0}, γ)$ for (P) at $\bar{x}$ for each $γ_{0} \geq 1$ , some $y^{*} \in Y$ , and sequences ${x_{k}}_{k \in N}, {η_{k}}_{k \in N} \subset X$ as well as ${y_{k}}_{k \in N} \subset Y$ such that $x_{k} \notin Φ^{- 1} (\bar{y})$ and $y_{k} \neq \bar{y}$ for all $k \in N$ , satisfying the convergence properties
$\begin{matrix} \begin{matrix} x_{k} & \to \bar{x}, & y_{k} & \to \bar{y}, & η_{k} & \to 0, & \frac{x_{k} - \bar{x}}{‖ x_{k} - \bar{x} ‖} & \to u, \\ v_{k}^{γ} & \to 0, & ‖ λ_{k}^{γ} ‖ & \to \infty, & ‖ v_{k}^{γ} ‖ λ_{k}^{γ} & \to y^{*}, \end{matrix} \end{matrix}$
where we used
$\begin{matrix} \forall k \in N : v_{k}^{γ} : = \frac{y_{k} - \bar{y}}{{∥x_{k} - \bar{x}∥}^{γ}}, λ_{k}^{γ} : = k {‖ x_{k} - \bar{x} ‖}^{γ - 1} (y_{k} - \bar{y}), \end{matrix}$ 4.7
as well as
$\begin{matrix} \forall k \in N : η_{k} \in \partial φ (x_{k}) + D^{*} Φ (x_{k}, y_{k}) (\frac{λ_{k}^{γ}}{{∥x_{k} - \bar{x}∥}^{γ - 1}}) . \end{matrix}$ 4.8

Moreover, if $Φ$ is metrically pseudo-subregular of order $γ$ at $(\bar{x}, \bar{y})$ in each direction $u \in ker D Φ (\bar{x}, \bar{y}) \cap S_{X}$ , $\bar{x}$ satisfies one of the alternatives (a), (b), or (c).

Proof

Let $ε > 0$ be chosen such that $φ (x) \geq φ (\bar{x})$ holds for all $x \in F \cap B_{ε} (\bar{x})$ and, for each $k \in N$ , consider the optimization problem For each $k \in N$ , the objective function of (P(k)) is bounded from below, continuous on the closed feasible set of this problem, and coercive in the variable y, so (P(k)) possesses a global minimizer $(x_{k}, y_{k}) \in X \times Y$ . By feasibility of $(\bar{x}, \bar{y})$ for (P(k)), we find

\begin{matrix} \forall k \in N : φ (x_{k}) + \frac{k}{2} {∥y_{k} - \bar{y}∥}^{2} + \frac{1}{2} {∥x_{k} - \bar{x}∥}^{2} \leq φ (\bar{x}) . \end{matrix}

4.9

By boundedness of ${x_{k}}_{k \in N} \subset B_{ε} (\bar{x})$ , we may assume $x_{k} \to \tilde{x}$ for some $\tilde{x} \in B_{ε} (\bar{x})$ . Observing that ${φ (x_{k})}_{k \in N}$ is bounded by continuity of $φ$ , $y_{k} \to \bar{y}$ easily follows from (4.9). Furthermore, the closedness of $gph Φ$ guarantees $(\tilde{x}, \bar{y}) \in gph Φ$ , i.e., $\tilde{x} \in F \cap B_{ε} (\bar{x})$ leading to $φ (\bar{x}) \leq φ (\tilde{x})$ . From (4.9), we find

\begin{matrix} φ (\bar{x}) \leq φ (\tilde{x}) \leq φ (\tilde{x}) + \frac{1}{2} {∥\tilde{x} - \bar{x}∥}^{2} = lim_{k \to \infty} (φ (x_{k}) + \frac{1}{2} {∥x_{k} - \bar{x}∥}^{2}) \leq φ (\bar{x}), \end{matrix}

and $\tilde{x} = \bar{x}$ follows. Thus, we have $x_{k} \to \bar{x}$ .

Let us assume that there is some $k_{0} \in N$ such that $x_{k_{0}}$ is feasible to (P). By (4.9), we find

\begin{matrix} φ (\bar{x}) + \frac{k_{0}}{2} ‖ y_{k_{0}} - \bar{y} ‖^{2} + \frac{1}{2} {‖ x_{k_{0}} - \bar{x} ‖}^{2} \\ \leq φ (x_{k_{0}}) + \frac{k_{0}}{2} ‖ y_{k_{0}} - \bar{y} ‖^{2} + \frac{1}{2} {‖ x_{k_{0}} - \bar{x} ‖}^{2} \leq φ (\bar{x}), \end{matrix}

i.e., $x_{k_{0}} = \bar{x}$ and $y_{k_{0}} = \bar{y}$ . Applying [66, Theorem 6.1], the subdifferential sum rule [66, Theorem 2.19], and the definition of the limiting coderivative to find stationarity conditions of (P(k)) at $(\bar{x}, \bar{y})$ yields $0 \in \partial φ (\bar{x}) + D^{*} Φ (\bar{x}, \bar{y}) (0)$ , which is covered by (a).

Thus, we may assume that $x_{k} \notin F = Φ^{- 1} (\bar{y})$ holds for all $k \in N$ . Particularly, $x_{k} \neq \bar{x}$ and $y_{k} \neq \bar{y}$ is valid for all $k \in N$ in this situation. Assume without loss of generality that ${x_{k}}_{k \in N}$ belongs to the interior of $B_{ε} (\bar{x})$ .

We can apply Fermat’s rule, see [66, Proposition 1.30 (i)], the semi-Lipschitzian sum rule for limiting subgradients from [66, Corollary 2.20], and the definition of the limiting coderivative in order to find

\begin{matrix} \forall k \in N : \bar{x} - x_{k} \in \partial φ (x_{k}) + D^{*} Φ (x_{k}, y_{k}) (k (y_{k} - \bar{y})) . \end{matrix}

4.10

Setting $η_{k} : = \bar{x} - x_{k}$ for each $k \in N$ , we find $η_{k} \to 0$ . Since ${(x_{k} - \bar{x}) / ‖ x_{k} - \bar{x} {‖}}_{k \in N} \subset S_{X}$ , we may assume $(x_{k} - \bar{x}) / ‖ x_{k} - \bar{x} ‖ \to u$ for some $u \in S_{X}$ .

Next, we claim that ${y_{k}^{*}}_{k \in N} \subset Y$ , given by $y_{k}^{*} : = k (y_{k} - \bar{y}) ∥y_{k} - \bar{y}∥ / ∥x_{k} - \bar{x}∥$ for each $k \in N$ , is bounded. Rearranging (4.9), leaving a nonnegative term away, and division by $‖ x_{k} - \bar{x} ‖$ give us

\begin{matrix} \forall k \in N : \frac{φ (x_{k}) - φ (\bar{x})}{‖ x_{k} - \bar{x} ‖} + \frac{k}{2} \frac{‖ y_{k} {- \bar{y} ‖}^{2}}{‖ x_{k} - \bar{x} ‖} \leq 0 . \end{matrix}

4.11

Lipschitzianity of $φ$ yields boundedness of the first fraction, so that the sequence ${k ‖ y_{k} {- \bar{y} ‖}^{2} / ‖ x_{k} - \bar{x} {‖}}_{k \in N}$ must be bounded and, consequently, ${y_{k}^{*}}_{k \in N}$ as well. Thus, we may assume $y_{k}^{*} \to y^{*}$ for some $y^{*} \in Y$ .

Suppose that ${(y_{k} - \bar{y}) / ‖ x_{k} - \bar{x} {‖}}_{k \in N}$ does not converge to zero. This, along a subsequence (without relabeling), yields boundedness of the sequence ${k (y_{k} - \bar{y})}_{k \in N}$ , and taking the limit $k \to \infty$ in (4.10) along yet another subsequence while respecting robustness of the limiting subdifferential and the limiting coderivative yields (a).

Thus, we may assume $(y_{k} - \bar{y}) / ∥x_{k} - \bar{x}∥ \to 0$ for the remainder of the proof. Observe that we have

\begin{matrix} (\bar{x} + ∥x_{k} - \bar{x}∥ (x_{k} - \bar{x}) / ∥x_{k} - \bar{x}∥, \bar{y} + ∥x_{k} - \bar{x}∥ (y_{k} - \bar{y}) / ∥x_{k} - \bar{x}∥) \in gph Φ \end{matrix}

for all $k \in N$ . Additionally, (4.11) yields

\begin{matrix} φ (\bar{x} + ∥x_{k} - \bar{x}∥ (x_{k} - \bar{x}) / ∥x_{k} - \bar{x}∥) - φ (\bar{x}) \leq 0, \end{matrix}

so u is a critical direction of order $(γ_{0}, 1)$ for each $γ_{0} \geq 1$ for (P) at $\bar{x}$ .

In the remainder of the proof, we are going to exploit the sequences ${v_{k}^{γ}}_{k \in N}, {λ_{k}^{γ}}_{k \in N} \subset Y$ given as in (4.7). Observe that $y_{k}^{*} = ‖ v_{k}^{γ} ‖ λ_{k}^{γ}$ , i.e., $λ_{k}^{γ} = y_{k}^{*} {∥x_{k} - \bar{x}∥}^{γ} / ∥y_{k} - \bar{y}∥$ is valid for each $k \in N$ . Note that the optimality condition (4.10) can be rewritten as

\begin{matrix} \forall k \in N : η_{k} \in \partial φ (x_{k}) + D^{*} Φ (x_{k}, \bar{y} + {∥x_{k} - \bar{x}∥}^{γ} v_{k}^{γ}) (\frac{λ_{k}^{γ}}{{∥x_{k} - \bar{x}∥}^{γ - 1}}) . \end{matrix}

4.12

Now, we need to distinguish three options.

Let us assume that $λ_{k}^{γ} \to 0$ . Using $t_{k} : = ∥x_{k} - \bar{x}∥$ , we can reformulate (4.12) as

\begin{matrix} \forall k \in N : η_{k} \in \partial φ (x_{k}) + D^{*} Φ (\bar{x} + t_{k} \frac{x_{k} - \bar{x}}{∥x_{k} - \bar{x}∥}, \bar{y} + t_{k} \frac{y_{k} - \bar{y}}{∥x_{k} - \bar{x}∥}) (\frac{λ_{k}^{γ}}{t_{k}^{γ - 1}}) . \end{matrix}

Taking the limit $k \to \infty$ while respecting robustness of the directional limiting subdifferential as well as Lemma 2.8 yields (b) since $(y_{k} - \bar{y}) / ∥x_{k} - \bar{x}∥ \to 0$ and u has already been shown to be critical for (P) at $\bar{x}$ .

If ${λ_{k}^{γ}}_{k \in N}$ remains bounded but, along a subsequence (without relabeling), stays away from zero, we also get boundedness of ${v_{k}^{γ}}_{k \in N}$ from boundedness of ${y_{k}^{*}}_{k \in N}$ , and taking the limit along a convergent subsequence (without relabeling) in (4.12) while respecting robustness of the directional limiting subdifferential and Lemma 2.8 yields precisely (4.6), where $λ, v \in Y$ with $λ \neq 0$ satisfy $λ_{k}^{γ} \to λ$ and $v_{k}^{γ} \to v$ , respectively, and using $α_{k} : = {(k {∥x_{k} - \bar{x}∥}^{2 γ - 1})}^{- 1}$ for all $k \in N$ as well as (4.7), we find $v_{k}^{γ} = α_{k} λ_{k}^{γ}$ for all $k \in N$ , $α_{k} \to ∥v∥ / ∥λ∥ = : α$ , and $v = α λ$ . Criticality of u for (P) at $\bar{x}$ has been shown above. Thus, situation (c) has been verified.

If ${λ_{k}^{γ}}_{k \in N}$ is not bounded, we pass to a subsequence (without relabeling) such that $‖ λ_{k}^{γ} ‖ \to \infty$ and so we also get $v_{k}^{γ} \to 0$ along this subsequence by boundedness of ${y_{k}^{*}}_{k \in N}$ . This means that u is actually critical of order $(γ_{0}, γ)$ for (P) at $\bar{x}$ and so all conditions stated in (d) have been verified since (4.8) follows from (4.12).

Finally, let us argue that option (d) can be avoided, i.e., that the sequence ${λ_{k}^{γ}}_{k \in N}$ from above remains bounded if we assume that $Φ$ is metrically pseudo-subregular of order $γ$ in direction u at $(\bar{x}, \bar{y})$ . By boundedness of ${y_{k}^{*}}_{k \in N}$ , we immediately obtain the boundedness of ${λ_{k}^{γ}}_{k \in N}$ unless we have $v_{k}^{γ} \to 0$ . Thus, let us assume the latter. By metric pseudo-subregularity of $Φ$ , there is a constant $κ > 0$ such that, for sufficiently large $k \in N$ , we get the existence of ${\tilde{x}}_{k} \in Φ^{- 1} (\bar{y})$ with

\begin{matrix} ∥x_{k} - {\tilde{x}}_{k}∥ \leq κ \frac{dist (\bar{y}, Φ (x_{k}))}{{∥x_{k} - \bar{x}∥}^{γ - 1}} \leq κ \frac{∥y_{k} - \bar{y}∥}{{∥x_{k} - \bar{x}∥}^{γ - 1}} = κ ∥x_{k} - \bar{x}∥ ‖ v_{k}^{γ} ‖ . \end{matrix}

4.13

Particularly, we find $‖ x_{k} - {\tilde{x}}_{k} ‖ \to 0$ from $v_{k}^{γ} \to 0$ , and ${\tilde{x}}_{k} \to \bar{x}$ follows. Since $(x_{k}, y_{k})$ is a global minimizer of (P(k)), we get

\begin{matrix} \frac{φ (x_{k}) - φ ({\tilde{x}}_{k})}{∥x_{k} - \bar{x}∥} + \frac{k}{2} \frac{{∥y_{k} - \bar{y}∥}^{2}}{∥x_{k} - \bar{x}∥} + \frac{1}{2} \frac{{∥x_{k} - \bar{x}∥}^{2} - {∥{\tilde{x}}_{k} - \bar{x}∥}^{2}}{∥x_{k} - \bar{x}∥} \leq 0 \end{matrix}

for all sufficiently large $k \in N$ . Due to $v_{k}^{γ} \neq 0$ for all $k \in N$ , rearranging the above estimate and using (4.7) as well as (4.13) yield

\begin{matrix} ‖ λ_{k}^{γ} ‖ = \frac{k {∥y_{k} - \bar{y}∥}^{2}}{∥x_{k} - \bar{x}∥ ‖ v_{k}^{γ} ‖} \leq \frac{2 κ | φ (x_{k}) - φ ({\tilde{x}}_{k}) |}{∥x_{k} - {\tilde{x}}_{k}∥} + \frac{κ | {∥x_{k} - \bar{x}∥}^{2} - {∥{\tilde{x}}_{k} - \bar{x}∥}^{2} |}{∥x_{k} - {\tilde{x}}_{k}∥} . \end{matrix}

Boundedness of ${λ_{k}^{γ}}_{k \in N}$ thus follows from Lipschitzianity of $φ$ and the estimate

\begin{matrix} | {∥x_{k} - \bar{x}∥}^{2} - {∥{\tilde{x}}_{k} - \bar{x}∥}^{2} | \leq ∥x_{k} - {\tilde{x}}_{k}∥ (∥{\tilde{x}}_{k} - \bar{x}∥ + ∥x_{k} - \bar{x}∥) . \end{matrix}

This completes the proof. $□$

Let us note that for the price of some more technicalities in the proof, involving the fuzzy sum rule for the regular subdifferential, see e.g. [66, Exercise 2.26], it is possible to formulate statement (d) in terms of the regular tools of variational analysis, see [21, Theorem 4.3] which is a preprint version of this paper. This more involved approach then allows for an easier comparison to available results in the literature which are partially stated in infinite dimensions, see e.g. [37], where the limiting tools are of limited use and sequential characterizations in terms of the regular tools are, thus, preferred. However, for our purposes, the way Theorem 4.1 has been formulated will be enough to proceed.

In the rest of this subsection, we discuss some applications of Theorem 4.1, which are then further developed in the rest of the paper. First, we focus on mixed-order stationarity conditions, involving first-order generalized derivatives of the objective function and pseudo-coderivatives of order $γ$ , and enhance the result of Corollary 4.1 as follows.

Corollary 4.2

Let $\bar{x} \in F$ be a local minimizer of (P) and consider $γ \geq 1$ . Then the following assertions hold.

If $Φ$ is metrically pseudo-subregular of order $γ$ at $(\bar{x}, \bar{y})$ in each unit direction, then one of the alternatives (a), (b), or (c) of Theorem 4.1 holds.
If there are no critical directions of order $(1, γ)$ for (P) at $\bar{x}$ , then one of the alternatives (a), (b), or (c) from Theorem 4.1 is valid. If there exists a critical directions $u \in S_{X}$ of order $(1, γ)$ for (P) at $\bar{x}$ satisfying (2.12), then even
$\begin{matrix} 0 \in \partial φ (\bar{x} ; u) + D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (λ) \end{matrix}$ 4.14
holds for some $λ \in Y$ .

Proof

The first assertion follows directly from Theorem 4.1. Let us now prove the second assertion. Theorem 4.1 says that either one of the alternatives (a), (b), or (c) holds, or there exists a critical direction $u \in S_{X}$ of order $(1, γ)$ for (P) at $\bar{x}$ (with certain properties). If among these critical directions, there is one that satisfies (2.12), Corollary 4.1 yields (4.14). $□$

We conjecture that the sufficient condition (2.12) can be weakened to just pseudo-subregularity of $Φ$ in Corollary 4.2 (b). However, it would require a different proof to show this, so we will not explore this option. For $γ : = 1$ , such a result is known to hold, see [36, Theorem 7].

Note that (4.14) is covered by the alternative (b) (if $λ = 0$ , see (2.7)) or (c) (if $λ \neq 0$ ) of Theorem 4.1. Hence, the optimality conditions from Corollary 4.2 give either M-stationarity of the underlying local minimizer or validity of alternative (b) or (c) of Theorem 4.1 for some critical direction (of order (1, 1) or $(1, γ)$ ).

Remark 4.2

Corollary 4.2 offers two distinct paths to an optimality condition of type “M-stationarity or (4.14)”, both with some advantages and disadvantages.

Assuming pseudo-subregularity in each unit direction yields this type of condition by ruling out the alternative (d) of Theorem 4.1. However, this can sometimes be an undesirable type of assumption as pointed out in Remark 3.5.
The refined assumptions in Corollary 4.2 (b) are clearly milder, but they depend on a critical direction (of order $(1, γ)$ ), which in turn depends also on the objective function, not just on $Φ$ . These assumptions do not rule out the alternative (d). Instead, they just secure that (a), (b), or (c) from Theorem 4.1 holds.

These two types of assumptions will be prevalent throughout this section.

Recall that all the assumptions in Corollary 4.2 become less restrictive as $γ$ increases, see Sect. 2.3.4 as well. On the contrary, with increasing $γ$ , the involved pseudo-coderivatives become more difficult to handle which, exemplary, can be seen for constraint mappings when comparing the cases $γ : = 1$ and $γ : = 2$ from Sect. 3. In this regard, in Corollary 4.2, $γ$ should be chosen as small as possible such that the exploited qualification condition is valid.

In Sect. 4.3, we work out the conditions from Corollary 4.2 for $γ : = 2$ in the setting where $Φ$ is a constraint mapping as the appearing pseudo-coderivatives actually can be computed, see Sect. 3, and, hence, we obtain conditions in terms of initial problem data. In Sect. 4.4, we further apply these results to two specific problem classes and compare them with similar results based on 2-regularity.

Theorem 4.1 also opens a way to the identification of new conditions which guarantee that local minimizers of (P) are M-stationary. One of the most prominent conditions that implies this is metric subregularity, and the corresponding result, which we state next, can be obtained simply by setting $γ : = 1$ in Corollary 4.2, taking also into account [36, Theorem 7]. For us, this result serves as a basis for comparison. Later on, we will derive new conditions, which are independent of (directional) metric subregularity, but which are milder than various known sufficient conditions for metric subregularity.

Corollary 4.3

A local minimizer $\bar{x} \in F$ of (P) is M-stationary if one of the following conditions holds.

The mapping $Φ$ is metrically subregular at $(\bar{x}, \bar{y})$ in each unit direction.
There are no critical directions for (P) at $\bar{x}$ , or there is a critical direction $u \in S_{X}$ for (P) at $\bar{x}$ and $Φ$ is metrically subregular at $(\bar{x}, \bar{y})$ in direction u, in which case there is a multiplier $λ \in Y$ such that
$\begin{matrix} 0 \in \partial φ (\bar{x} ; u) + D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (λ) . \end{matrix}$

Let us now discuss two novel approaches to M-stationarity. The first approach corresponds to using Corollary 4.2 with $γ > 1$ and then making sure that the derived mixed-order conditions in terms of pseudo-coderivatives actually yield M-stationarity. To formalize the idea, we introduce the following assumption.

Assumption 4.1

Given $u \in S_{X}$ and $γ \geq 1$ , we say that $A^{γ} (u)$ holds if (2.12) is satisfied and

\begin{matrix} {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (0) \cup ⋃_{w \in S_{Y}} D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, α w)) (β w) \subset Im D^{*} Φ (\bar{x}, \bar{y}) \end{matrix}

4.15

is valid for all $α, β \geq 0$ .

Let us mention that

\begin{matrix} Im {\tilde{D}}_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) \subset Im D^{*} Φ (\bar{x}, \bar{y}) \end{matrix}

4.16

is a sufficient condition for (4.15) due to (2.7). Assumption $A^{γ} (u)$ leads to the problem of how to compute or estimate the appearing pseudo-coderivatives. As mentioned above, for $γ : = 2$ and in the setting where $Φ$ is a constraint mapping, these objects can be computed and assumption $A^{γ} (u)$ can be made explicit. We discuss this case in detail in Sect. 5.3.2, where we show that $A^{2} (u)$ is (strictly) weaker than FOSCMS(u) as well as its refinement SOSCMS(u) in the polyhedral case. Here, we just explain how $A^{γ} (u)$ can be used to secure M-stationarity and how to compare it with sufficient conditions for metric subregularity.

To proceed, let $\bar{x} \in F$ be a local minimizer of (P) and consider $γ \geq 1$ . Assuming that $A^{γ} (u)$ holds in every unit direction u implies that options (b) or (c) from Theorem 4.1 yield M-stationarity of $\bar{x}$ , and that option (d) cannot occur. Thus, we end up with $\bar{x}$ being M-stationary. Similarly, if $A^{γ} (u)$ holds in a critical direction $u \in S_{X}$ of order $(1, γ)$ for (P) at $\bar{x}$ , (4.14) is satisfied. Due to (4.15), this also shows M-stationarity of $\bar{x}$ . Thus, we obtain the following from Corollary 4.2.

Corollary 4.4

Let $\bar{x} \in F$ be a local minimizer of (P) and consider $γ \geq 1$ . Then each of the following conditions implies that $\bar{x}$ is M-stationary.

Condition $A^{γ} (u)$ holds in each unit direction u.
There are no critical directions for (P) at $\bar{x}$ , or there is a critical direction $u \in S_{X}$ of order $(1, γ)$ for (P) at $\bar{x}$ such that $A^{γ} (u)$ holds.

In the following remark, we compare our approach from Corollary 4.4 with the results from Corollary 4.3 in the presence of any sufficient condition for directional metric subregularity.

Remark 4.3

Due to Corollary 4.3, directional metric subregularity serves as a constraint qualification guaranteeing M-stationarity of local minimizers. However, given $u \in S_{X}$ , metric subregularity in direction u is difficult to verify, so it is often replaced by some stronger condition which is easier to check - exemplary, FOSCMS(u). Let us label such a sufficient condition as SCMS(u). Clearly, Corollary 4.3 can be restated in terms of SCMS(u). Suppose that we can show that $A^{γ} (u)$ is milder than SCMS(u) for every $u \in S_{X}$ (even strictly milder for some u). Naturally, option (a) from Corollary 4.4 then provides a (strictly) milder assumption than requiring SCMS(u) to hold for all unit directions. However, does an analogous relationship hold for the more complicated option (b) from Corollary 4.4? Both approaches yield M-stationarity if there are no critical directions. If there is a critical direction $u \in S_{X}$ such that SCMS(u) holds, then Lemma 4.2 yields that u is actually critical of order $(1, γ)$ and, thus, the milder assumption $A^{γ} (u)$ from the case (b) of Corollary 4.4 can be applied. This means that our approach via Corollary 4.4 is indeed better than an approach via any sufficient condition for metric subregularity in direction u which is stronger that $A^{γ} (u)$ .

The second approach to M-stationarity can be called “asymptotic” and is based on the following result, a generalization of [63, Theorem 3.9], which reinspects Theorem 4.1 in the situation $γ : = 1$ . Particularly, we exploit that, in this case, both notions of a pseudo-coderivative from Definition 2.3 coincide with the directional limiting coderivative.

Corollary 4.5

Let $\bar{x} \in F$ be a local minimizer of (P). Then $\bar{x}$ is M-stationary or there exist a critical direction $u \in S_{X}$ for (P) at $\bar{x}$ , some $y^{*} \in Y$ , and sequences ${x_{k}}_{k \in N}, {η_{k}}_{k \in N} \subset X$ as well as ${y_{k}}_{k \in N} \subset Y$ such that $x_{k} \notin Φ^{- 1} (\bar{y})$ and $y_{k} \neq \bar{y}$ for all $k \in N$ , satisfying the convergence properties

\begin{matrix} x_{k} & \to \bar{x}, & y_{k} & \to \bar{y}, & η_{k} & \to 0, \end{matrix}

4.17a

\begin{matrix} \frac{x_{k} - \bar{x}}{‖ x_{k} - \bar{x} ‖} & \to u, & \frac{y_{k} - \bar{y}}{‖ x_{k} - \bar{x} ‖} & \to 0, \end{matrix}

4.17b

\begin{matrix} k \frac{‖ y_{k} - \bar{y} ‖}{‖ x_{k} - \bar{x} ‖} (y_{k} - \bar{y}) & \to y^{*}, & k ‖ y_{k} - \bar{y} ‖ & \to \infty, \end{matrix}

4.17c

and

\begin{matrix} \forall k \in N : η_{k} \in \partial φ (x_{k}) + D^{*} Φ (x_{k}, y_{k}) (k (y_{k} - \bar{y})) . \end{matrix}

4.18

The above result shows that each local minimizer of (P) either is M-stationary or satisfies asymptotic stationarity conditions w.r.t. a certain critical direction and an unbounded sequence of multiplier estimates ${λ_{k}}_{k \in N}$ given by

\begin{matrix} \forall k \in N : λ_{k} : = k (y_{k} - \bar{y}) . \end{matrix}

4.19

Note that in the case where ${λ_{k}}_{k \in N}$ would be bounded, one could simply take the limit in (4.18) along a suitable subsequence and, respecting the convergences from (4.17a), would end up with M-stationarity again taking into account robustness of the limiting subdifferential and coderivative. Thus, divergence of the multiplier estimates is natural since not all local minimizers of (P) are M-stationary in general, see [63, Lemma 3.4] as well. Related results in nondirectional form can be found in [58, 63]. The story of asymptotic stationarity conditions in variational analysis, however, can be traced back to [57, 59]. This concept has been rediscovered as a valuable tool for the analysis of convergence properties for solution algorithms associated with standard nonlinear optimization problems in [3, 7], and extensions were made to disjunctive, conic, and even infinite-dimensional optimization, see e.g. [2, 4, 26, 70] and the references therein.

The sequential information from (4.17) describes in great detail what must “go wrong” if M-stationarity fails. We will refer to (4.17a), (4.17b), and (4.17c) as basic, directional, and multiplier (sequential) information, respectively. Clearly, one can secure M-stationarity of a local minimizer by ruling out the second alternative in Corollary 4.5 and, as we will show, various known constraint qualifications for M-stationarity indeed do precisely that. Let us mention here two such conditions. Rescaling (4.18) by $∥λ_{k}∥$ , for ${λ_{k}}_{k \in N}$ as given in (4.19), and taking the limit $k \to \infty$ leads to a contradiction with the Mordukhovich criterion (2.10a), i.e., metric regularity of $Φ$ at $(\bar{x}, \bar{y})$ . Respecting also the directional information (4.17b) yields a contradiction with FOSCMS(u) at $(\bar{x}, \bar{y})$ .

In both cases, we have essentially discarded the multiplier information (4.17c) which deserves some remarks. We have used $∥λ_{k}∥ \to \infty$ , but this information is not really very important since, as we already explained, if the multipliers remain bounded, we end up with M-stationarity anyway. The fact that ${λ_{k} ∥y_{k} - \bar{y}∥ / ∥x_{k} - \bar{x}∥}_{k \in N}$ converges tells us how fast the multipliers ${λ_{k}}_{k \in N}$ blow up. We note that the concept of super-coderivatives from Definition 2.4 collects this information, and we will come back to it in Sect. 5.3, where it is used to design constraint qualifications for M-stationarity. As we will show in Sect. 5.3, this approach is closely related to the hypothesis $A^{γ} (u)$ which we formulated in Assumption 4.1, and its role as a constraint qualification has already been illustrated in Corollary 4.4.

Finally, note that $(y_{k} - \bar{y}) / ‖ y_{k} - \bar{y} ‖ = λ_{k} / ‖ λ_{k} ‖$ means that the multipliers precisely capture the direction from which ${y_{k}}_{k \in N}$ converges to $\bar{y}$ . Particularly, we find $⟨ λ_{k} / ‖ λ_{k} ‖, (y_{k} - \bar{y}) / ‖ y_{k} - \bar{y} ‖ ⟩ = 1$ , which is clearly more restrictive than the condition $⟨ λ_{k} / ‖ λ_{k} ‖, (y_{k} - \bar{y}) / ‖ y_{k} - \bar{y} ‖ ⟩ \to 1$ . The latter convergence, which is used in the sufficient condition for metric subregularity in [37, Corollary 1], can be recast as $(y_{k} - \bar{y}) / ‖ y_{k} - \bar{y} ‖ - λ_{k} / ‖ λ_{k} ‖ \to 0$ . This information is respected by the new constraint qualifications which we are going to suggest in Sect. 5.

Mixed-order necessary optimality conditions for optimization problems with geometric constraints in the case $γ : = 2$

In this part, we apply Corollary 4.2 with $γ : = 2$ to the case where $Φ : X ⇉ Y$ is given in the form of a constraint mapping, i.e., $Φ (x) : = g (x) - D$ , $x \in X$ , holds where $g : X \to Y$ is twice continuously differentiable and $D \subset Y$ is a closed set. Since, in Sect. 3, we computed the pseudo-coderivative and the graphical pseudo-derivative of order 2 of $Φ$ , we are able to derive explicit conditions in terms of initial problem data. For that purpose, we assume $\bar{y} : = 0$ in (P) throughout the section which can be done without loss of generality.

We start with a description of critical directions of order (1, 2) and (2, 2).

Lemma 4.3

Fix $\bar{x} \in F$ and let $u \in S_{X}$ be a critical direction of order (1, 2) of (P) at $\bar{x}$ . Suppose that $Y : = R^{m}$ and D is locally polyhedral around $g (\bar{x})$ . Then

\begin{matrix} u \in C^{1, 2} (\bar{x}) : = {u \in X | d φ (\bar{x}) (u) \leq 0, \exists s \in X : w_{s} (u, 0) \in T (u)}, \end{matrix}

where $w_{s} (u, 0)$ and $T (u)$ are defined in (3.7). If $φ$ is continuously differentiable, $C^{1, 2} (\bar{x})$ corresponds precisely to the set of critical directions of order (1, 2) of (P) at $\bar{x}$ . Moreover, if $φ$ is even twice continuously differentiable at $\bar{x}$ , the set of all critical directions of order (2, 2) of (P) at $\bar{x}$ equals

\begin{matrix} {u \in X | \exists s \in X : \nabla φ (\bar{x}) s + 1 / 2 \nabla^{2} φ (\bar{x}) [u, u] \in T_{R_{-}} (\nabla φ (\bar{x}) u), w_{s} (u, 0) \in T (u)} . \end{matrix}

4.20

Proof

A critical direction u of order (1, 2) of (P) at $\bar{x}$ satisfies $d φ (\bar{x}) (u) \leq 0$ and $0 \in D_{2} Φ (\bar{x}, 0) (u)$ , with equivalence being valid if $φ$ is continuously differentiable at $\bar{x}$ . Hence, the first statement follows from Theorem 3.2.

By Proposition 4.1, a direction u is critical of order (2, 2) of (P) at $\bar{x}$ if and only if $u \in ker D_{2} M (\bar{x}, (φ (\bar{x}), 0))$ for $M : X ⇉ R \times R^{m}$ given by $M (x) : = (φ (x), g (x)) - (R_{-} \times D)$ , $x \in X$ . Hence, Theorem 3.2 can be applied again, yielding the second statement. $□$

Remark 4.4

Note that Lemma 4.3 shows that the set of directions $C_{2} (\bar{x})$ from [13, Theorem 3] and its extension labeled second-order tightened critical cone in [9, Theorem 3] actually correspond to $C^{1, 2} (\bar{x})$ , while the set of directions used in [37, Theorem 3(2)] corresponds to the one in (4.20). We believe that interpreting these directions as critical (of some order) is very natural. Moreover, our approach justifies the name. Indeed, as already mentioned, our definition of criticality is an extension of the one stated in [36, Definition 5]. More importantly, we have shown in Corollary 4.2 that in the absence of nonzero critical directions (of order $(1, γ)$ for some $γ \geq 1$ ), the corresponding mixed-order optimality conditions (involving a pseudo-coderivative of order $γ$ ) are satisfied without any additional assumptions.

Based on Theorems 3.1 and 3.2 as well as Corollary 4.2, we obtain the following result.

Proposition 4.2

Let $\bar{x} \in F$ be a local minimizer of (P).

If (3.9), as well as (3.10) or, in the case $\nabla g (\bar{x}) u \neq 0$ , (3.11) hold for every unit direction, then $\bar{x}$ is M-stationary or there exist a critical direction $u \in S_{X}$ and
$\begin{matrix} y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) \cap ker \nabla g {(\bar{x})}^{*}, z^{*} \in D N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u) \end{matrix}$ 4.21
such that
$\begin{matrix} 0 \in \partial φ (\bar{x} ; u) + \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} . \end{matrix}$ 4.22
If there exists a critical direction $u \in S_{X}$ of order (1, 2) of (P) at $\bar{x}$ satisfying (3.9), as well as (3.10) or, in the case $\nabla g (\bar{x}) u \neq 0$ , (3.11), then there exist $y^{*}, z^{*} \in Y$ satisfying (4.21) and (4.22) for this u.
Let $Y : = R^{m}$ and D be locally polyhedral around $g (\bar{x})$ . If either $C^{1, 2} (\bar{x}) = {0}$ or if (3.13) holds for every unit direction, then $\bar{x}$ is M-stationary or there exist a critical direction $u \in S_{X}$ , $s \in X$ , $y^{*}, z_{i}^{*} \in R^{m}$ for $i = 1, 2$ , and $α \geq 0$ , satisfying $\nabla g {(\bar{x})}^{*} y^{*} = 0$ ,
$\begin{matrix} y^{*}, z_{1}^{*} \in N_{T (u)} (w_{s} (u, v)), z_{2}^{*} \in T_{N_{T (u)} (w_{s} (u, v))} (y^{*}), \end{matrix}$
and (4.22) (with $z^{*} = z_{i}^{*}, i = 1, 2$ ), where $v : = α y^{*}$ , and $w_{s} (u, v)$ and $T (u)$ have been defined in (3.7). If there exists $u \in C^{1, 2} (\bar{x}) \cap S_{X}$ satisfying (3.13), then there exist $s \in X$ and
$\begin{matrix} y^{*}, z_{1}^{*} \in N_{T (u)} (w_{s} (u, 0)), z_{2}^{*} \in T_{N_{T (u)} (w_{s} (u, 0))} (y^{*}) \end{matrix}$
satisfying (4.22) (with $z^{*} = z_{i}^{*}, i = 1, 2$ ) as well as $\nabla g {(\bar{x})}^{*} y^{*} = 0$ .

Proof

For the proof of (a), in the first alternative, we apply Corollary 3.1 in order to verify that (2.12) holds for every unit direction. Corollary 4.2 in turn yields that $\bar{x}$ is M-stationary or one of the cases (b) and (c) from Theorem 4.1 holds. In the case of Theorem 4.1 (b), however, from Theorem 3.1 (a) we get $0 \in \partial φ (\bar{x} ; u) + \nabla g {(\bar{x})}^{*} z^{*}$ with

\begin{matrix} z^{*} \in D N_{D} (g (\bar{x}), 0) (\nabla g (\bar{x}) u) \subset N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) \subset N_{D} (g (\bar{x})), \end{matrix}

see Lemma 2.5, and M-stationarity of $\bar{x}$ follows. In the case of Theorem 4.1 (c), from (2.7) and Theorem 3.1 (a) we precisely obtain $y^{*}$ and $z^{*}$ as stated. Similarly, the second alternative follows from successively applying Corollaries 3.1 and 4.2, (2.7), and Theorem 3.1 (a).

For the proof of (b), we first would like to hint to Lemma 4.3. In the first alternative, taking into account Corollary 3.1, Corollary 4.2 yields that $\bar{x}$ is M-stationary or one of the cases (b) and (c) from Theorem 4.1 holds. As before, in the case of Theorem 4.1 (b), from Theorem 3.1 (b) we get $0 \in \partial φ (\bar{x} ; u) + \nabla g {(\bar{x})}^{*} z^{*}$ with $z^{*} \in N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u) \subset N_{D} (g (\bar{x}))$ , see Lemma 2.2, and M-stationarity of $\bar{x}$ follows. In the case of Theorem 4.1 (c), from Theorem 3.2 we precisely obtain $y^{*}, z_{1}^{*}$ , and $z_{2}^{*}$ as stated. The second alternative follows from Corollaries 3.1 and 4.2 as well as Theorem 3.2. $□$

Similar optimality conditions involving a mixture of first- and second-order derivatives were proposed e.g. in [9, 11–13, 37]. Let us now explain that in the convex polyhedral case, where $Y : = R^{m}$ holds while D is convex and polyhedral, all these optimality conditions are the same and can be stated simply as follows: If there exists $u \in C^{1, 2} (\bar{x}) \cap S_{X}$ satisfying (3.17), then there are $y^{*}, z^{*} \in R^{m}$ satisfying

\begin{matrix} \nabla φ (\bar{x}) + \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} = 0, \nabla g {(\bar{x})}^{*} y^{*} = 0, y^{*}, z^{*} \in N_{D} (g (\bar{x})) \end{matrix}

4.23

(for a fair comparison, we assume that $φ$ is continuously differentiable).

In Examples 3.1 and 3.2, we have shown that the 2-regularity assumption (3.17) used in [9] is, in general, strictly stronger than our condition (3.12), which is, in turn, strictly stronger than the mutually equivalent conditions (3.16) from [37] and (3.13) from Corollary 3.1. However, as shown in Corollary 3.2, all these assumptions are equivalent if applied to a critical direction u of order (1, 2), i.e, $u \in C^{1, 2} (\bar{x})$ , as this yields the existence of $s \in X$ with $w_{s} (u, 0) \in T (u)$ .

Clearly, although the aforementioned qualification conditions are equivalent, the optimality conditions may differ due to the additional information regarding the multipliers. However, this is also not the case, and it can be shown following the proof of Proposition 3.1 (c). First, as mentioned above, we automatically have $s \in X$ with $w_{s} (u, 0) \in T (u)$ from $u \in C^{1, 2} (\bar{x})$ , which can be added to (4.23). Now, we are in the same situation as when proving Proposition 3.1 (c), but we have to work with (4.23) instead of (C $(u, y^{*})$ ). From $u \in C^{1, 2} (\bar{x})$ we also get $\nabla φ (\bar{x}) u \leq 0$ , while $w_{s} (u, 0) \in T (u)$ and Lemma 3.2 yield $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] \leq 0$ , and $⟨ z^{*}, \nabla g (\bar{x}) u ⟩ \leq 0$ follows from $z^{*} \in N_{D} (g (\bar{x}))$ and $\nabla g (\bar{x}) u \in T_{D} (g (\bar{x}))$ , which is implicitly required due to $w_{s} (u, 0) \in T (u)$ . Thus, multiplying the essential equation of (4.23) by u, the three nonpositive terms sum up to zero, so they all must vanish. Hence, the arguments which we used to prove Proposition 3.1 (c) also work with (C $(u, y^{*})$ ) replaced by (4.23).

Applications

In this subsection, we highlight some aspects of our results from Sect. 4.3 in two popular settings of optimization theory. More precisely, we focus on the feasible regions of complementarity-constrained and nonlinear semidefinite problems. As mentioned at the end of Sect. 4.3, we do not obtain any new insights for standard nonlinear programs as these can be reformulated with the aid of a constraint mapping where the involved set is convex and polyhedral. Hence, we do not specify our findings for this elementary setting for brevity of presentation but refer the interested reader to [12, 13] where the associated mixed-order optimality conditions and constraint qualifications are worked out.

Mathematical programs with complementarity constraints

Let us introduce

\begin{matrix} C : = (R_{+} \times {0}) \cup ({0} \times R_{+}), \end{matrix}

the so-called complementarity angle. For twice continuously differentiable functions $G, H : X \to R^{m}$ with components $G_{1}, \dots, G_{m} : X \to R$ and $H_{1}, \dots, H_{m} : X \to R$ , we address the constraint region given by

\begin{matrix} (G_{i} (x), H_{i} (x)) \in C i \in I \end{matrix}

MPCC

where $I : = {1, \dots, m}$ . The latter is distinctive for so called mathematical programs with complementarity constraints which have been studied intensively throughout the last decades, see e.g. [62, 68] for some classical references. We observe that (MPCC) can be formulated via a constraint map using $D : = C^{m}$ . Note that standard inequality and equality constraints can be added without any difficulties due to Lemmas 2.3 and 2.7 when taking the findings from [12, 13] into account. Here, we omit them for brevity of presentation.

Fix some feasible point $\bar{x} \in X$ of (MPCC). A critical direction $u \in S_{X}$ of the associated problem (P) necessarily needs to satisfy

\begin{matrix} \begin{matrix} \nabla G_{i} (\bar{x}) u & = 0 & i \in I^{0 +} (\bar{x}), \\ \nabla H_{i} (\bar{x}) u & = 0 & i \in I^{+ 0} (\bar{x}), \\ (\nabla G_{i} (\bar{x}) u, \nabla H_{i} (\bar{x}) u) & \in C & i \in I^{00} (\bar{x}), \end{matrix} \end{matrix}

4.24

where we used the well-known index sets

\begin{matrix} I^{0 +} (\bar{x}) : = & {i \in I | G_{i} (\bar{x}) = 0, H_{i} (\bar{x}) > 0}, \\ I^{+ 0} (\bar{x}) : = & {i \in I | G_{i} (\bar{x}) > 0, H_{i} (\bar{x}) = 0}, \\ I^{00} (\bar{x}) : = & {i \in I | G_{i} (\bar{x}) = 0, H_{i} (\bar{x}) = 0} . \end{matrix}

We start with an illustration of Proposition 4.2 (a). Thanks to Remark 3.4, we need to check the constraint qualifications (3.9) and (3.14), and these can be specified to the present setting with the aid of Lemmas 2.2, 2.3 and 2.7. For brevity of presentation, we abstain from a discussion of the case where critical directions of order (1, 2) are involved. Based on the representation

\begin{matrix} gph N_{C} = (R_{+} \times {0} \times {0} \times R) \cup ({0} \times R_{+} \times R \times {0}) \cup ({0} \times {0} \times R_{-} \times R_{-}), \end{matrix}

some elementary calculations show

\begin{matrix} D N_{C} ((a, b), (μ, ν)) (v) = \{\begin{matrix} {0} \times R & a > 0, b = μ = 0, v_{2} = 0, \\ R \times {0} & a = ν = 0, b > 0, v_{1} = 0, \\ R^{2} & a = b = 0, μ, ν < 0, v = 0, \\ {0} \times R & a = b = μ = 0, ν < 0, v_{1} > 0, v_{2} = 0, \\ R_{-} \times R & a = b = μ = 0, ν < 0, v = 0, \\ R \times {0} & a = b = ν = 0, μ < 0, v_{1} = 0, v_{2} > 0, \\ R \times R_{-} & a = b = ν = 0, μ < 0, v = 0, \\ {0} \times R & a = b = μ = 0, ν > 0, v_{1} \geq 0, v_{2} = 0, \\ R \times {0} & a = b = ν = 0, μ > 0, v_{1} = 0, v_{2} \geq 0, \\ {0} \times R & a = b = μ = ν = 0, v_{1} > 0, v_{2} = 0, \\ R \times {0} & a = b = μ = ν = 0, v_{1} = 0, v_{2} > 0, \\ N_{C} (0) & a = b = μ = ν = 0, v = 0, \\ \emptyset & otherwise \end{matrix}) \end{matrix}

4.25

for arbitrary $((a, b), (μ, ν)) \in gph N_{C}$ and $v \in R^{2}$ . Consequently, for $u \in S_{X}$ satisfying (4.24), (3.9) reduces to

\begin{matrix} (\begin{matrix} \nabla G {(\bar{x})}^{*} μ + \nabla H {(\bar{x})}^{*} ν = 0, \\ \sum_{i = 1}^{m} (μ_{i} \nabla^{2} G_{i} (\bar{x}) + ν_{i} \nabla^{2} H_{i} (\bar{x})) u \\ + \nabla G {(\bar{x})}^{*} \tilde{μ} + \nabla H {(\bar{x})}^{*} \tilde{ν} = 0, \\ \forall i \in I^{+ 0} (\bar{x}) \cup I_{+ 0}^{00} (\bar{x}, u) : μ_{i} = 0, \\ \forall i \in I^{0 +} (\bar{x}) \cup I_{0 +}^{00} (\bar{x}, u) : ν_{i} = 0, \\ \forall i \in I_{00}^{00} (\bar{x}, u) : μ_{i}, ν_{i} \leq 0 or μ_{i} ν_{i} = 0, \\ \forall i \in I : ({\tilde{μ}}_{i}, {\tilde{ν}}_{i}) \in D N_{C} (({\bar{G}}_{i}, {\bar{H}}_{i}), (μ_{i}, ν_{i})) (\nabla {\bar{G}}_{i} u, \nabla {\bar{H}}_{i} u) \end{matrix}\} ⟹ μ = ν = 0, \end{matrix}

4.26

while (3.14) reads as

\begin{matrix} (\begin{matrix} \nabla G {(\bar{x})}^{*} μ + \nabla H {(\bar{x})}^{*} ν = 0, \nabla G {(\bar{x})}^{*} \tilde{μ} + \nabla H {(\bar{x})}^{*} \tilde{ν} = 0, \\ \forall i \in I^{+ 0} (\bar{x}) \cup I_{+ 0}^{00} (\bar{x}, u) : μ_{i} = 0, \\ \forall i \in I^{0 +} (\bar{x}) \cup I_{0 +}^{00} (\bar{x}, u) : ν_{i} = 0, \\ \forall i \in I_{00}^{00} (\bar{x}, u) : μ_{i}, ν_{i} \leq 0 or μ_{i} ν_{i} = 0, \\ \forall i \in I : ({\tilde{μ}}_{i}, {\tilde{ν}}_{i}) \in D N_{C} (({\bar{G}}_{i}, {\bar{H}}_{i}), (μ_{i}, ν_{i})) (\nabla {\bar{G}}_{i} u, \nabla {\bar{H}}_{i} u) \end{matrix}\} ⟹ \tilde{μ} = \tilde{ν} = 0 . \end{matrix}

4.27

Above, for each $i \in I$ , we used ${\bar{G}}_{i} : = G_{i} (\bar{x})$ , ${\bar{H}}_{i} : = H_{i} (\bar{x})$ , $\nabla {\bar{G}}_{i} u : = \nabla G_{i} (\bar{x}) u$ , and $\nabla {\bar{H}}_{i} u : = \nabla H_{i} (\bar{x}) u$ for brevity as well as the index sets

\begin{matrix} I_{0 +}^{00} (\bar{x}, u) & : = {i \in I^{00} (\bar{x}) | \nabla {\bar{G}}_{i} u = 0, \nabla {\bar{H}}_{i} u > 0}, \\ I_{+ 0}^{00} (\bar{x}, u) & : = {i \in I^{00} (\bar{x}) | \nabla {\bar{G}}_{i} u > 0, \nabla {\bar{H}}_{i} u = 0}, \\ I_{00}^{00} (\bar{x}, u) & : = {i \in I^{00} (\bar{x}) | \nabla {\bar{G}}_{i} u = 0, \nabla {\bar{H}}_{i} u = 0} . \end{matrix}

The first assertion of Proposition 4.2 (a) now yields that whenever $\bar{x}$ is a local minimizer for the associated problem (P) and for each $u \in S_{X}$ satisfying (4.24), (4.26) and (4.27) hold, then $\bar{x}$ is either M-stationary, i.e., there are multipliers $μ, ν \in R^{m}$ satisfying

\begin{matrix} 0 \in \partial φ (\bar{x}) + \nabla G {(\bar{x})}^{*} μ + \nabla H {(\bar{x})}^{*} ν, \\ \forall i \in I^{+ 0} (\bar{x}) : μ_{i} = 0, \\ \forall i \in I^{0 +} (\bar{x}) : ν_{i} = 0, \\ \forall i \in I^{00} (\bar{x}) : μ_{i}, ν_{i} \leq 0 or μ_{i} ν_{i} = 0, \end{matrix}

or we find $u \in S_{X}$ satisfying (4.24) and $d φ (\bar{x}) (u) \leq 0$ as well as multipliers $μ, ν, \tilde{μ}, \tilde{ν} \in R^{m}$ such that

\begin{matrix} \begin{matrix} 0 \in \partial φ (\bar{x} ; u) + \sum_{i = 1}^{m} (μ_{i} \nabla^{2} G_{i} (\bar{x}) + ν_{i} \nabla^{2} H_{i} (\bar{x})) u + \nabla G {(\bar{x})}^{*} \tilde{μ} + \nabla H {(\bar{x})}^{*} \tilde{ν}, \\ 0 = \nabla G {(\bar{x})}^{*} μ + \nabla H {(\bar{x})}^{*} ν, \\ \forall i \in I^{+ 0} (\bar{x}) \cup I_{+ 0}^{00} (\bar{x}, u) : μ_{i} = 0, \\ \forall i \in I^{0 +} (\bar{x}) \cup I_{0 +}^{00} (\bar{x}, u) : ν_{i} = 0, \\ \forall i \in I_{00}^{00} (\bar{x}, u) : μ_{i}, ν_{i} \leq 0 or μ_{i} ν_{i} = 0, \\ \forall i \in I : ({\tilde{μ}}_{i}, {\tilde{ν}}_{i}) \in D N_{C} (({\bar{G}}_{i}, {\bar{H}}_{i}), (μ_{i}, ν_{i})) (\nabla {\bar{G}}_{i} u, \nabla {\bar{H}}_{i} u) . \end{matrix} \end{matrix}

4.28

For brevity, we present the results from Proposition 4.2 (b) only in simplified form, where $w_{s} (u, v)$ is replaced by 0, see Remark 3.3 as well, and we do not comment on the cases where critical directions of order (1, 2) are involved, but this would clearly yield further refinements.

In order to characterize (3.13), we observe that

\begin{matrix} N_{T_{C} ({\bar{G}}_{i}, {\bar{H}}_{i})} (\nabla {\bar{G}}_{i} u, \nabla {\bar{H}}_{i} u) = \{\begin{matrix} {0} \times R & i \in I^{+ 0} (\bar{x}) \cup I_{+ 0}^{00} (\bar{x}, u), \\ R \times {0} & i \in I^{0 +} (\bar{x}) \cup I_{0 +}^{00} (\bar{x}, u), \\ N_{C} (0) & i \in I_{00}^{00} (\bar{x}, u) \end{matrix}) \end{matrix}

is valid for each $i \in I$ . For each pair $(μ_{i}, ν_{i}) \in N_{T_{C} ({\bar{G}}_{i}, {\bar{H}}_{i})} (\nabla {\bar{G}}_{i} u, \nabla {\bar{H}}_{i} u)$ , elementary calculations and a comparison with (4.25) show

\begin{matrix} T_{N_{T_{C} ({\bar{G}}_{i}, {\bar{H}}_{i})} (\nabla {\bar{G}}_{i} u, \nabla {\bar{H}}_{i} u)} (μ_{i}, ν_{i}) & = \{\begin{matrix} {0} \times R & i \in I^{+ 0} (\bar{x}) \cup I_{+ 0}^{00} (\bar{x}, u), \\ R \times {0} & i \in I^{0 +} (\bar{x}) \cup I_{0 +}^{00} (\bar{x}, u), \\ R^{2} & i \in I_{00}^{00} (\bar{x}, u), μ_{i} < 0, ν_{i} < 0, \\ R_{-} \times R & i \in I_{00}^{00} (\bar{x}, u), μ_{i} = 0, ν_{i} < 0, \\ R \times R_{-} & i \in I_{00}^{00} (\bar{x}, u), μ_{i} < 0, ν_{i} = 0, \\ {0} \times R & i \in I_{00}^{00} (\bar{x}, u), μ_{i} = 0, ν_{i} > 0, \\ R \times {0} & i \in I_{00}^{00} (\bar{x}, u), μ_{i} > 0, ν_{i} = 0, \\ N_{C} (0) & i \in I_{00}^{00} (\bar{x}, u), μ_{i} = ν_{i} = 0 \end{matrix}) \\ = D N_{C} (({\bar{G}}_{i}, {\bar{H}}_{i}), (μ_{i}, ν_{i})) (\nabla {\bar{G}}_{i}, \nabla {\bar{H}}_{i}) . \end{matrix}

Thus, validity of (4.26) for each $u \in S_{X}$ satisfying (4.24) is already enough to infer that whenever $\bar{x}$ is a local minimizer, then it is either M-stationary or there are $u \in S_{X}$ satisfying (4.24) as well as $d φ (\bar{x}) (u) \leq 0$ and multipliers $μ, ν, \tilde{μ}, \tilde{ν} \in R^{m}$ solving the stationarity conditions (4.28).

Let us further note that Proposition 4.2 (b) also allows for the consideration of a qualification and stationarity condition where simply $({\tilde{μ}}_{i}, {\tilde{ν}}_{i}) \in N_{T_{C} ({\bar{G}}_{i}, {\bar{H}}_{i})} (\nabla {\bar{G}}_{i} u, \nabla {\bar{H}}_{i} u)$ has to hold for all $i \in I$ , see Remark 3.3 again. One can easily check that there is no general inclusion between $N_{T_{C} ({\bar{G}}_{i}, {\bar{H}}_{i})} (\nabla {\bar{G}}_{i} u, \nabla {\bar{H}}_{i} u)$ and $T_{N_{T_{C} ({\bar{G}}_{i}, {\bar{H}}_{i})} (\nabla {\bar{G}}_{i} u, \nabla {\bar{H}}_{i} u)} (μ_{i}, ν_{i})$ , i.e., this procedure leads to conditions not related to (4.26) and (4.28) which are, however, easier to evaluate.

The following example illustrates a situation where (4.26) is valid while (4.27) is violated, i.e., where Proposition 4.2 (b) is applicable while Proposition 4.2 (a) is not. This provides yet another justification of a separate consideration of the polyhedral situation.

Example 4.1

Let us consider (MPCC) with $X : = R$ , $m : = 1$ , and $G (x) : = x$ as well as $H (x) : = x^{2}$ for all $x \in R$ . We are interested in the unique feasible point $\bar{x} : = 0$ of this system. The only direction from the unit sphere that satisfies (4.24) is $u : = 1$ . Hence, (4.26) reduces to

\begin{matrix} (\begin{matrix} μ = 0, 2 ν + \tilde{μ} = 0, \\ (\tilde{μ}, \tilde{ν}) \in D N_{C} ((0, 0), (μ, ν)) (1, 0) \end{matrix}\} ⟹ μ = ν = 0 . \end{matrix}

Let the premise be valid and assume $ν \neq 0$ . This gives $D N_{C} ((0, 0), (0, ν)) (1, 0) = {0} \times R$ due to (4.25), i.e., $\tilde{μ} = 0$ , and, thus, $ν = 0$ which yields a contradiction. Hence, this constraint qualification holds. However, (4.27) is given by

\begin{matrix} (\begin{matrix} μ = 0, \tilde{μ} = 0, \\ (\tilde{μ}, \tilde{ν}) \in D N_{C} ((0, 0), (μ, ν)) (1, 0) \end{matrix}\} ⟹ \tilde{μ} = \tilde{ν} = 0, \end{matrix}

and one can easily check with the aid of (4.25) that the premise holds for $(μ, ν) : = (\tilde{μ}, \tilde{ν}) : = (0, 1)$ , i.e., this condition is violated.

Finally, we would like to refer the interested reader to [51, Section 6] and [50] where the theory of 2–regularity is first extended to mappings which are once but not twice differentiable and then applied to a suitable reformulation of complementarity constraints as a system of once but not twice differentiable equations. We abstain from a detailed comparison of our findings with the ones from [50, 51] for the following reasons. First, in these papers, a different way of stating the system of complementarity constraints is used, and it would be laborious to transfer the results to the formulation (MPCC). Second, at least in [51], some additional assumptions are used to simplify the calculations while we do not need to assume anything artificial to make the calculus accessible. Third, the final characterization of 2-regularity obtained in these papers does not comprise any second-order derivatives of the involved data functions and, thus, is anyhow clearly different from (4.26). Let us, however, close with the remark that the system of necessary optimality conditions derived in [50, Theorem 4.2] is closely related to (4.28).

Semidefinite programming

Let us consider the Hilbert space $S_{m}$ of all real symmetric matrices equipped with the standard (Frobenius) inner product. We denote by $S_{m}^{+}$ and $S_{m}^{-}$ the cone of all positive and negative semidefinite matrices in $S_{m}$ , respectively. The foundations of variational analysis in this space can be found, e.g., in [25, Section 5.3]. For some twice continuously differentiable mapping $g : X \to S_{m}$ , we investigate the constraint system

\begin{matrix} g (x) \in S_{m}^{+} . \end{matrix}

NLSD

It is well known that the closed, convex cone $S_{m}^{+}$ is not polyhedral. Nevertheless, the constraint (NLSD), associated with so-called nonlinear semidefinite programming, can be encoded via a constraint map. Subsequently, we merely illustrate the first assertion of Proposition 4.2 (a). As $S_{m}^{+}$ is not polyhedral, Lemma 4.3 cannot be used for a characterization of critical directions of order (1, 2).

Let $\bar{x} \in X$ be feasible to (NLSD) and, for some $u \in S_{X}$ , fix $Ω \in N_{S_{m}^{+}} (g (\bar{x}) ; \nabla g (\bar{x}) u)$ . For later use, fix an orthogonal matrix $P \in R^{m \times m}$ and a diagonal matrix $Λ \in R^{m \times m}$ whose diagonal elements $λ_{1}, \dots, λ_{m}$ are ordered nonincreasingly such that $g (\bar{x}) + Ω = P Λ P^{⊤}$ . The index sets corresponding to the positive, zero, and negative entries on the main diagonal of $Λ$ are denoted by $α$ , $β$ , and $γ$ , respectively. We emphasize that, here and throughout the subsection, $α$ is a constant index set while $β$ and $γ$ depend on the precise choice of $Ω$ . Subsequently, we use $Q^{P} : = P^{⊤} Q P$ and $Q_{IJ}^{P} : = {(Q^{P})}_{IJ}$ for each matrix $Q \in S_{m}$ and index sets $I, J \subset {1, \dots, m}$ where $M_{IJ}$ is the submatrix of $M \in S_{m}$ which possesses only those rows and columns of $M$ whose indices can be found in I and J, respectively.

The above constructions yield

\begin{matrix} g (\bar{x}) = P max (Λ, O) P^{⊤}, Ω = P min (Λ, O) P^{⊤} \end{matrix}

where $max$ and $min$ have to be understood in entrywise fashion and $O$ is an all-zero matrix of appropriate dimensions. Due to

\begin{matrix} \nabla g (\bar{x}) u \in T_{S_{m}^{+}} (g (\bar{x})) = \{Q \in S_{m} | Q_{β \cup γ, β \cup γ}^{P} \in S_{| β \cup γ |}^{+}\}, \end{matrix}

we find

\begin{matrix} 0 & = ⟨ Ω, \nabla g (\bar{x}) u ⟩ = trace (Ω \nabla g (\bar{x}) u) = trace (P min (Λ, O) P^{⊤} P {[\nabla g (\bar{x}) u]}^{P} P^{⊤}) \\ = trace (min (Λ, O) {[\nabla g (\bar{x}) u]}^{P}) = \sum_{i \in γ} \underset{< 0}{\underset{⏟}{λ_{i}}} \underset{\geq 0}{\underset{⏟}{{[\nabla g (\bar{x}) u]}_{i, i}^{P}}} \end{matrix}

which directly gives us ${[\nabla g (\bar{x}) u]}_{β γ}^{P} = O$ , ${[\nabla g (\bar{x}) u]}_{γ γ}^{P} = O$ , and ${[\nabla g (\bar{x}) u]}_{β β}^{P} \in S_{| β |}^{+}$ . Furthermore, we note

\begin{matrix} N_{S_{m}^{+}} (g (\bar{x})) = \{\tilde{Ω} \in S_{m} | {\tilde{Ω}}_{α α}^{P} = O, {\tilde{Ω}}_{α β}^{P} = O, {\tilde{Ω}}_{α γ}^{P} = O, {\tilde{Ω}}_{β \cup γ, β \cup γ}^{P} \in S_{| β \cup γ |}^{-}\} . \end{matrix}

Finally, let $Ξ_{α γ} \in R^{| α | \times | γ |}$ be the matrix given by

\begin{matrix} \forall i \in α \forall j \in γ : {[Ξ_{α γ}]}_{ij} : = - \frac{λ_{j}}{λ_{i}} . \end{matrix}

It is well known that the projection onto $S_{m}^{+}$ is directionally differentiable. With the aid of Lemma 2.4 and [75, Corollary 3.1], we find

\begin{matrix} D N_{S_{m}^{+}} (g (\bar{x}), Ω) (\nabla g (\bar{x}) u) \\ = \{\tilde{Ω} \in S_{m} |\begin{matrix} {\tilde{Ω}}_{α α}^{P} = O, {\tilde{Ω}}_{α β}^{P} = O, {\tilde{Ω}}_{α γ}^{P} = Ξ_{α γ} ∙ {[\nabla g (\bar{x}) u]}_{α γ}^{P}, \\ {\tilde{Ω}}_{β β}^{P} \in S_{| β |}^{-}, ⟨ {\tilde{Ω}}_{β β}^{P}, {[\nabla g (\bar{x}) u]}_{β β}^{P} ⟩ = 0 \end{matrix})\}, \end{matrix}

and if $\nabla g (\bar{x}) u \neq O$ , we obtain

\begin{matrix} D_{s} u b N_{S_{m}^{+}} (g (\bar{x}), Ω) (\frac{\nabla g (\bar{x}) u}{∥\nabla g (\bar{x}) u∥}) \\ \subset \{\tilde{Ω} \in S_{m} | \begin{matrix} {\tilde{Ω}}_{α α}^{P} = O, {\tilde{Ω}}_{α β}^{P} = O, {\tilde{Ω}}_{α γ}^{P} = O, \\ {\tilde{Ω}}_{β β}^{P} \in S_{| β |}^{-}, ⟨ {\tilde{Ω}}_{β β}^{P}, {[\nabla g (\bar{x}) u]}_{β β}^{P} ⟩ = 0 \end{matrix}\} . \end{matrix}

Above, $∙$ represents the Hadamard, i.e., entrywise product. Note that validity of the final orthogonality condition in the estimate for the graphical subderivative follows from Lemma 2.4 since $\tilde{Ω} \in D_{s} u b N_{S_{m}^{+}} (g (\bar{x}), Ω) (\nabla g (\bar{x}) u / ∥\nabla g (\bar{x}) u∥)$ and $∥\nabla g (\bar{x}) u∥ > 0$ yield

\begin{matrix} 0 & \leq ⟨ \tilde{Ω}, \nabla g (\bar{x}) u ⟩ = trace (\tilde{Ω} \nabla g (\bar{x}) u) = trace (P {\tilde{Ω}}^{P} P^{⊤} P {[\nabla g (\bar{x}) u]}^{P} P^{⊤}) \\ = trace ({\tilde{Ω}}^{P} {[\nabla g (\bar{x}) u]}^{P}) = trace ({\tilde{Ω}}_{β β}^{P} {[\nabla g (\bar{x}) u]}_{β β}^{P}) \leq 0 \end{matrix}

due to ${\tilde{Ω}}_{α α}^{P} = O$ , ${\tilde{Ω}}_{α β}^{P} = O$ , ${\tilde{Ω}}_{α γ}^{P} = O$ , ${\tilde{Ω}}_{β β}^{P} \in S_{| β |}^{-}$ , ${[\nabla g (\bar{x}) u]}_{β γ}^{P} = O$ , ${[\nabla g (\bar{x}) u]}_{γ γ}^{P} = O$ , and ${[\nabla g (\bar{x}) u]}_{β β}^{P} \in S_{| β |}^{+}$ . Thus, for each $u \in S_{X}$ , (3.9) takes the form

\begin{matrix} (\begin{matrix} \nabla g {(\bar{x})}^{*} Ω = 0, \nabla^{2} ⟨ Ω, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} \tilde{Ω} = 0, \\ Ω_{α α}^{P} = O, Ω_{α β}^{P} = O, Ω_{α γ}^{P} = O, Ω_{β \cup γ, β \cup γ}^{P} \in S_{| β \cup γ |}^{-}, \\ {[\nabla g (\bar{x}) u]}_{β γ}^{P} = O, {[\nabla g (\bar{x}) u]}_{γ γ}^{P} = O, {[\nabla g (\bar{x}) u]}_{β β}^{P} \in S_{| β |}^{+}, \\ {\tilde{Ω}}_{α α}^{P} = O, {\tilde{Ω}}_{α β}^{P} = O, {\tilde{Ω}}_{α γ}^{P} = Ξ_{α γ} ∙ {[\nabla g (\bar{x}) u]}_{α γ}^{P}, \\ {\tilde{Ω}}_{β β}^{P} \in S_{| β |}^{-}, ⟨ {\tilde{Ω}}_{β β}^{P}, {[\nabla g (\bar{x}) u]}_{β β}^{P} ⟩ = 0 \end{matrix}\} ⟹ Ω = O, \end{matrix}

while (3.10) and (3.11) (the latter in the case $\nabla g (\bar{x}) u \neq O$ ) are both implied by

\begin{matrix} (\begin{matrix} \nabla g {(\bar{x})}^{*} Ω = 0, \nabla g {(\bar{x})}^{*} \tilde{Ω} = 0, \\ Ω_{α α}^{P} = O, Ω_{α β}^{P} = O, Ω_{α γ}^{P} = O, Ω_{β \cup γ, β \cup γ}^{P} \in S_{| β \cup γ |}^{-}, \\ {[\nabla g (\bar{x}) u]}_{β γ}^{P} = O, {[\nabla g (\bar{x}) u]}_{γ γ}^{P} = O, {[\nabla g (\bar{x}) u]}_{β β}^{P} \in S_{| β |}^{+}, \\ {\tilde{Ω}}_{α α}^{P} = O, {\tilde{Ω}}_{α β}^{P} = O, {\tilde{Ω}}_{α γ}^{P} = O, \\ {\tilde{Ω}}_{β β}^{P} \in S_{| β |}^{-}, ⟨ {\tilde{Ω}}_{β β}^{P}, {[\nabla g (\bar{x}) u]}_{β β}^{P} ⟩ = 0 \end{matrix}\} ⟹ \tilde{Ω} = O . \end{matrix}

In the case where $\bar{x}$ is a local minimizer of the associated problem (P), validity of these conditions for each $u \in S_{X}$ guarantees that $\bar{x}$ is either M-stationary (we omit stating this well-known system here) or we find $u \in S_{X}$ and $Ω, \tilde{Ω} \in S_{m}$ such that

\begin{matrix} 0 \in \partial φ (\bar{x} ; u) + \nabla^{2} ⟨ Ω, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} \tilde{Ω}, \nabla g {(\bar{x})}^{*} Ω = 0, \\ Ω_{α α}^{P} = O, Ω_{α β}^{P} = O, Ω_{α γ}^{P} = O, Ω_{β \cup γ, β \cup γ}^{P} \in S_{| β \cup γ |}^{-}, \\ d φ (\bar{x}) (u) \leq 0, {[\nabla g (\bar{x}) u]}_{β γ}^{P} = O, {[\nabla g (\bar{x}) u]}_{γ γ}^{P} = O, {[\nabla g (\bar{x}) u]}_{β β}^{P} \in S_{| β |}^{+}, \\ {\tilde{Ω}}_{α α}^{P} = O, {\tilde{Ω}}_{α β}^{P} = O, {\tilde{Ω}}_{α γ}^{P} = Ξ_{α γ} ∙ {[\nabla g (\bar{x}) u]}_{α γ}^{P}, \\ {\tilde{Ω}}_{β β}^{P} \in S_{| β |}^{+}, ⟨ {\tilde{Ω}}_{β β}^{P}, {[\nabla g (\bar{x}) u]}_{β β}^{P} ⟩ = 0 . \end{matrix}

Directional asymptotic regularity in nonsmooth optimization

In this section, we focus on (directional) asymptotic regularity conditions, which essentially correspond to conditions ensuring that (directional) asymptotic stationarity from Corollary 4.5, which serves as a necessary optimality condition for (P) even in the absence of constraint qualifications, translates into M-stationarity. We provide a comprehensive comparison of (directional) asymptotic regularity with various known constraint qualifications. Throughout the section, we consider a set-valued mapping $Φ : X ⇉ Y$ with a closed graph.

On the concept of directional asymptotic regularity

Based on Corollary 4.5, the following definition introduces concepts which may serve as (directional) qualification conditions for the model problem (P).

Definition 5.1

Let $(\bar{x}, \bar{y}) \in gph Φ$ be fixed.

The map $Φ$ is said to be asymptotically regular at $(\bar{x}, \bar{y})$ whenever the following condition holds: for every sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ , ${x_{k}^{*}}_{k \in N} \subset X$ , and ${λ_{k}}_{k \in N} \subset Y$ as well as $x^{*} \in X$ satisfying $x_{k} \to \bar{x}$ , $y_{k} \to \bar{y}$ , $x_{k}^{*} \to x^{*}$ , and $x_{k}^{*} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ for all $k \in N$ , we find $x^{*} \in Im D^{*} Φ (\bar{x}, \bar{y})$ .
For the fixed direction $u \in S_{X}$ , $Φ$ is said to be asymptotically regular at $(\bar{x}, \bar{y})$ in direction u whenever the following condition holds: for every sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ , ${x_{k}^{*}}_{k \in N} \subset X$ , and ${λ_{k}}_{k \in N} \subset Y$ as well as $x^{*} \in X$ and $y^{*} \in Y$ satisfying $x_{k} \notin Φ^{- 1} (\bar{y})$ , $y_{k} \neq \bar{y}$ , and $x_{k}^{*} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ for each $k \in N$ as well as the convergences
$\begin{matrix} \begin{matrix} x_{k} & \to \bar{x}, & y_{k} & \to \bar{y}, & x_{k}^{*} & \to x^{*}, \\ \frac{x_{k} - \bar{x}}{∥x_{k} - \bar{x}∥} & \to u, & \frac{y_{k} - \bar{y}}{∥x_{k} - \bar{x}∥} & \to 0, \\ \frac{∥y_{k} - \bar{y}∥}{∥x_{k} - \bar{x}∥} λ_{k} & \to y^{*}, & ∥λ_{k}∥ & \to \infty, & \frac{y_{k} - \bar{y}}{∥y_{k} - \bar{y}∥} - \frac{λ_{k}}{∥λ_{k}∥} & \to 0, \end{matrix} \end{matrix}$ 5.1
we find $x^{*} \in Im D^{*} Φ (\bar{x}, \bar{y})$ .
For the fixed direction $u \in S_{X}$ , $Φ$ is said to be strongly asymptotically regular at $(\bar{x}, \bar{y})$ in direction u whenever the following condition holds: for every sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ , ${x_{k}^{*}}_{k \in N} \subset X$ , and ${λ_{k}}_{k \in N} \subset Y$ as well as $x^{*} \in X$ and $y^{*} \in Y$ satisfying $x_{k} \notin Φ^{- 1} (\bar{y})$ , $y_{k} \neq \bar{y}$ , and $x_{k}^{*} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ for each $k \in N$ as well as the convergences (5.1), we have $x^{*} \in Im D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ .

Before commenting in detail on these conditions, we would like to emphasize that they can be equivalently formulated in terms of limiting coderivatives completely. The mainly technical proof of this result can be found in Appendix A.

Proposition 5.1

Definition 5.1 can equivalently be formulated in terms of limiting normals.

Having Proposition 5.1 available, let us briefly note that asymptotic regularity of a set-valued mapping $Φ : X ⇉ Y$ at some point $(\bar{x}, 0) \in gph Φ$ in the sense of Definition 5.1 equals AM-regularity of the set $Φ^{- 1} (0)$ at $\bar{x}$ mentioned in [63, Remark 3.17]. The concepts of directional asymptotic regularity from Definition 5.1 (c) and (c) are new.

In the subsequent remark, we summarize some obvious relations between the different concepts from Definition 5.1.

Remark 5.1

Let $(\bar{x}, \bar{y}) \in gph Φ$ be fixed. Then the following assertions hold.

Let $u \in S_{X}$ be arbitrarily chosen. If $Φ$ is strongly asymptotically regular at $(\bar{x}, \bar{y})$ in direction u, it is asymptotically regular at $(\bar{x}, \bar{y})$ in direction u.
If $Φ$ is asymptotically regular at $(\bar{x}, \bar{y})$ , then it is asymptotically regular at $(\bar{x}, \bar{y})$ in each direction from $S_{X}$ .

We note that strong asymptotic regularity in each unit direction is indeed not related to asymptotic regularity. On the one hand, the subsequently stated example, taken from [63, Example 3.15], shows that asymptotic regularity does not imply strong asymptotic regularity in each unit direction. On the other hand, Example 5.2 from below illustrates that strong asymptotic regularity in each unit direction does not yield asymptotic regularity.

Example 5.1

We consider $Φ : R ⇉ R$ given by

\begin{matrix} \forall x \in R : Φ (x) : = \{\begin{matrix} R & if x \leq 0, \\ [x^{2}, \infty) & if x > 0 \end{matrix}) \end{matrix}

at $(\bar{x}, \bar{y}) : = (0, 0)$ . It is demonstrated in [63, Example 3.15] that $Φ$ is asymptotically regular at $(\bar{x}, \bar{y})$ . We find $T_{gph Φ} (\bar{x}, \bar{y}) = {(u, v) \in R^{2} | u \leq 0 or v \geq 0}$ so $(\pm 1, 0) \in T_{gph Φ} (\bar{x}, \bar{y})$ . Let us consider $u : = 1$ . Then we find $Im D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) = {0}$ . Taking $x^{*} : = 1$ , $y^{*} : = 1 / 2$ , as well as

\begin{matrix} \forall k \in N : x_{k} : = \frac{1}{k}, y_{k} : = \frac{1}{k^{2}}, x_{k}^{*} : = 1, λ_{k} : = \frac{k}{2}, \end{matrix}

we have $x_{k}^{*} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ for all $k \in N$ as well as the convergences (5.1). However, due to $x_{k}^{*} \to x^{*} \notin Im D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ , $Φ$ is not strongly asymptotically regular at $(\bar{x}, \bar{y})$ in direction u.

Combining Corollary 4.5 with the concepts from Definition 5.1, we immediately obtain the following result which motivates our interest in directional asymptotic regularity.

Corollary 5.1

Let $\bar{x} \in F$ be a local minimizer of (P) such that, for each critical direction $u \in S_{X}$ for (P) at $\bar{x}$ , $Φ$ is asymptotically regular at $(\bar{x}, \bar{y})$ in direction u. Then $\bar{x}$ is M-stationary.

Proof

Due to Corollary 4.5, it suffices to consider the situation where there are a critical direction $u \in S_{X}$ for (P) at $\bar{x}$ and $y^{*} \in Y$ as well as sequences ${x_{k}}_{k \in N}, {η_{k}}_{k \in N}, {x_{k}^{*}}_{k \in N} \subset X$ and ${y_{k}}_{k \in N} \subset Y$ such that $x_{k} \notin Φ^{- 1} (\bar{y})$ , $y_{k} \neq \bar{y}$ , $x_{k}^{*} \in \partial φ (x_{k})$ , and

\begin{matrix} η_{k} - x_{k}^{*} \in D^{*} Φ (x_{k}, y_{k}) (k (y_{k} - \bar{y})) \end{matrix}

for all $k \in N$ as well as the convergences (4.17) are valid.

Since $φ$ is a locally Lipschitz continuous function, ${x_{k}^{*}}_{k \in N}$ is bounded, see e.g. [66, Theorem 1.22], and, thus, converges (along a subsequence), to some point $x^{*} \in X$ which belongs to $\partial φ (\bar{x})$ by robustness of the limiting subdifferential.

We can set $λ_{k} : = k (y_{k} - \bar{y})$ for each $k \in N$ and obtain $λ_{k} ‖ y_{k} - \bar{y} ‖ / ‖ x_{k} - \bar{x} ‖ \to y^{*}$ and $‖ λ_{k} ‖ \to \infty$ from (4.17c) as well as $(y_{k} - \bar{y}) / ‖ y_{k} - \bar{y} ‖ = λ_{k} / ‖ λ_{k} ‖$ for each $k \in N$ by construction. Additionally, $η_{k} - x_{k}^{*} \in D^{*} Φ (x_{k}, y_{k}) (λ_{k})$ is valid for each $k \in N$ .

Now, asymptotic regularity of $Φ$ at $(\bar{x}, \bar{y})$ in direction u, Proposition 5.1, and the remaining convergences from (4.17) yield $- x^{*} \in Im D^{*} Φ (\bar{x}, \bar{y})$ , i.e., there exists $λ \in Y$ such that $- x^{*} \in D^{*} Φ (\bar{x}, \bar{y}) (λ)$ . Recalling $x^{*} \in \partial φ (\bar{x})$ shows the claim. $□$

In the light of Remark 5.1 (b), our result from Corollary 5.1 improves [63, Theorem 3.9] by a directional refinement of the constraint qualification since it suffices to check asymptotic regularity w.r.t. particular directions.

We point out that, unlike typical constraint qualifications, (directional) asymptotic regularity allows the existence of sequences satisfying (5.1) as long as the limit $x^{*}$ is included in $Im D^{*} Φ (\bar{x}, \bar{y})$ which is enough for M-stationarity.

Remark 5.2

Corollary 5.1 requires asymptotic regularity in every (critical) unit direction. Taking into account Remark 4.2, we could also consider an alternative approach to secure M-stationarity, demanding either that there does not exist a critical direction together with the sequences from Definition 5.1 (c), or, in the case of existence, that $Φ$ is asymptotically regular at least in one of these critical directions. For brevity of presentation, we abstain from developing this approach further.

Since (directional) asymptotic regularity (w.r.t. all critical unit directions) yields M-stationarity of a local minimizer by Corollary 5.1, in the remaining part of the paper, we put it into context of other common assumptions that work as a constraint qualification for M-stationarity associated with problem (P). Let us clarify here some rather simple or known connections.

A polyhedral mapping is asymptotically regular at each point of its graph.
Metric regularity implies asymptotic regularity.
Strong metric subregularity implies asymptotic regularity.
FOSCMS does not imply asymptotic regularity, but it implies strong asymptotic regularity in each unit direction.
Metric subregularity does not imply asymptotic regularity in each unit direction. However, if the map of interest is metrically subregular at every point of its graph near the reference point with a uniform constant, then strong asymptotic regularity in each unit direction follows.
Neither asymptotic regularity nor strong directional asymptotic regularity yields the directional exact penalty property of Lemma 4.1.

Statements (a) and (b) were shown in [63, Theorems 3.10 and 3.12]. Let us now argue that strong metric subregularity (the “inverse” property associated with isolated calmness), see [31], also implies asymptotic regularity at the point. This follows easily from the discussion above [20, Corollary 4.6], which yields that the domain of the limiting coderivative, at the point where the mapping is isolatedly calm, is the whole space. Equivalently, the range of the limiting coderivative, at the point where the mapping is strongly metrically subregular, is the whole space and asymptotic regularity thus follows trivially. Thus, statement (c) follows.

Regarding (d), the fact that FOSCMS implies strong asymptotic regularity in each unit direction easily follows by similar arguments that show that metric regularity implies asymptotic regularity, see [63, Lemma 3.11, Theorem 3.12]. Actually, it can be proved that validity of FOSCMS(u) for some unit direction u implies strong asymptotic regularity in direction u. For constraint mappings, this also follows from Corollary 5.3 from below.

The following example shows that FOSCMS does not imply asymptotic regularity.

Example 5.2

Let $Φ : R ⇉ R$ be given by

\begin{matrix} \forall x \in R : Φ (x) : = \{\begin{matrix} [x, \infty) & if x \leq 0, \\ [\frac{1}{k} - \frac{1}{k} (x - \frac{1}{k}), \infty) & if x \in (\frac{1}{k + 1}, \frac{1}{k}] for some k \in N, \\ \emptyset & otherwise. \end{matrix}) \end{matrix}

Then ${(1 / k, 1 / k)}_{k \in N} \subset gph Φ$ converges to $(\bar{x}, \bar{y}) : = (0, 0)$ and

\begin{matrix} N_{gph Φ} (1 / k, 1 / k) = {(x^{*}, y^{*}) \in R^{2} | y^{*} \leq 0, y^{*} \leq k x^{*}} \end{matrix}

is valid showing that $Im D^{*} Φ (1 / k, 1 / k) = R$ holds for all $k \in N$ . On the other hand, we have

\begin{matrix} N_{gph Φ} (0, 0) = {(x^{*}, y^{*}) \in R^{2} | x^{*} \geq 0, y^{*} \leq 0}, \end{matrix}

and, thus, $Im D^{*} Φ (0, 0) = R_{+}$ . This means that $Φ$ is not asymptotically regular at $(\bar{x}, \bar{y})$ .

On the other hand, we find

\begin{matrix} T_{gph Φ} (\bar{x}, \bar{y}) = {(u, v) \in R^{2} | u \leq v} . \end{matrix}

Each pair $(u, 0) \in T_{gph Φ} (\bar{x}, \bar{y})$ with $u \neq 0$ satisfies $u < 0$ , i.e., the direction (u, 0) points into the interior of $gph Φ$ . Thus, we have $N_{gph Φ} ((\bar{x}, \bar{y}), (u, 0)) = {(0, 0)}$ which shows that FOSCMS is valid.

Regarding (e), let us fix $(\bar{x}, \bar{y}) \in gph Φ$ and note that metric subregularity of $Φ$ on a neighborhood of $(\bar{x}, \bar{y})$ (restricted to $gph Φ$ ) with a uniform constant $κ > 0$ is clearly milder than metric regularity at $(\bar{x}, \bar{y})$ since it is automatically satisfied, e.g., for polyhedral mappings. To see that it implies asymptotic regularity, consider sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ , ${x_{k}^{*}}_{k \in N} \subset X$ , and ${λ_{k}}_{k \in N} \subset Y$ as well as $x^{*} \in X$ and $y^{*} \in Y$ satisfying $x_{k}^{*} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ for each $k \in N$ and the convergences (5.1) for some unit direction $u \in S_{X}$ . Due to [20, Theorem 3.2] and $- x_{k}^{*} \in dom {\hat{D}}^{*} Φ^{- 1} (y_{k}, x_{k})$ , we find $x_{k}^{*} \in {\hat{N}}_{Φ^{- 1} (y_{k})} (x_{k}) \subset N_{Φ^{- 1} (y_{k})} (x_{k})$ for each $k \in N$ . Furthermore, [20, Theorem 3.2] also gives the existence of ${\tilde{λ}}_{k} \in Y$ with $‖ {\tilde{λ}}_{k} ‖ \leq κ ‖ x_{k}^{*} ‖$ and $x_{k}^{*} \in D^{*} Φ (x_{k}, y_{k}) ({\tilde{λ}}_{k})$ . Noting that ${x_{k}^{*}}_{k \in N}$ converges, this shows that there is an accumulation point $λ \in Y$ of ${{\tilde{λ}}_{k}}_{k \in N}$ which satisfies $x^{*} \in D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0)) (λ)$ by robustness of the directional limiting coderivative, see Lemma 2.1. Hence, $Φ$ is strongly asymptotically regular at $(\bar{x}, \bar{y})$ in direction u. Note that for the above arguments to work, we only need uniform metric subregularity along all sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ converging to $(\bar{x}, \bar{y})$ from direction (u, 0).

The following example shows that metric subregularity in the neighborhood of the point of interest does not imply asymptotic regularity in each unit direction.

Example 5.3

We consider the mapping $Φ : R ⇉ R$ given by

\begin{matrix} \forall x \in R : Φ (x) : = {0, x^{2}} . \end{matrix}

Due to $Φ^{- 1} (0) = R$ , $Φ$ is metrically subregular at all points (x, 0) where $x \in R$ is arbitrary. Furthermore, at all points $(x, x^{2})$ where $x \neq 0$ holds, the Mordukhovich criterion (2.10a) shows that $Φ$ is metrically regular. Thus, $Φ$ is metrically subregular at each point of its graph. Note that the moduli of metric subregularity tend to $\infty$ along the points $(x, x^{2})$ as $x ↓ 0$ or $x ↑ 0$ .

Let us consider the point $(\bar{x}, \bar{y}) : = (0, 0)$ where we have $N_{gph Φ} (\bar{x}, \bar{y}) = {0} \times R$ and, thus, $Im D^{*} Φ (\bar{x}, \bar{y}) = {0}$ . Choosing $x^{*} : = 1$ , $y^{*} : = 1 / 2$ , as well as

\begin{matrix} \forall k \in N : x_{k} : = \frac{1}{k}, y_{k} : = \frac{1}{k^{2}}, x_{k}^{*} : = 1, λ_{k} : = \frac{k}{2}, \end{matrix}

we have $x_{k}^{*} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ for all $k \in N$ as well as the convergences (5.1) for $u : = 1$ . Due to $x_{k}^{*} \to x^{*} \notin Im D^{*} Φ (\bar{x}, \bar{y})$ , $Φ$ is not asymptotically regular at $(\bar{x}, \bar{y})$ in direction u.

Finally, let us address item (f) with the aid of an example.

Example 5.4

Let us define $φ : R \to R$ and $Φ : R ⇉ R$ by means of

\begin{matrix} \forall x \in R : φ (x) : = - x, Φ (x) : = \{\begin{matrix} R & if x \leq 0, \\ [x^{2}, \infty) & if x = \frac{1}{k} for some k \in N, \\ \emptyset & otherwise. \end{matrix}) \end{matrix}

Furthermore, we fix $\bar{y} : = 0$ . One can easily check that $\bar{x} : = 0$ is the uniquely determined global minimizer of the associated problem (P). Furthermore, we have $Im D^{*} Φ (\bar{x}, \bar{y}) = Im D^{*} Φ ((\bar{x}, \bar{y}) ; (1, 0)) = R$ which shows that $Φ$ is asymptotically regular at $(\bar{x}, \bar{y})$ as well as strongly asymptotically regular at $(\bar{x}, \bar{y})$ in direction 1. Furthermore, it is obvious that $Φ$ is strongly asymptotically regular at $(\bar{x}, \bar{y})$ in direction $- 1$ . Finally, let us mention that $Φ$ fails to be metrically subregular at $(\bar{x}, \bar{y})$ in direction 1.

Now, define $x_{k} : = 1 / k$ for each $k \in N$ and observe that for each constant $C > 0$ and sufficiently large $k \in N$ , we have $φ (x_{k}) + C dist (\bar{y}, Φ (x_{k})) = - 1 / k + C / k^{2} < 0 = φ (\bar{x})$ , i.e., $\bar{x}$ is not a minimizer of (4.1) for any choice of $C > 0$ , $ε > 0$ , $δ > 0$ , and $u : = 1$ .

Directional pseudo- and quasi-normality

In this section, we connect asymptotic regularity with the notions of pseudo- and quasi-normality. Note that the latter concepts have been introduced for standard nonlinear programs in [24, 46], and extensions to more general geometric constraints have been established in [43]. Furthermore, problem-tailored notions of these conditions have been coined e.g. for so-called cardinality-, complementarity-, and switching-constrained optimization problems, see [52, 54, 61]. Let us point out that these conditions are comparatively mild constraint qualifications and sufficient for the presence of metric subregularity of the associated constraint mapping, see e.g. [43, Theorem 5.2]. Here, we extend pseudo- and quasi-normality from the common setting of geometric constraint systems to arbitrary set-valued mappings and comment on the qualitative properties of these conditions. Naturally, we aim for directional versions of these concepts, which, in the setting of geometric constraints, were recently introduced in [15] and further explored in [16].

On the general concept of directional pseudo- and quasi-normality

The definition below introduces the notions of our interest.

Definition 5.2

Fix $(\bar{x}, \bar{y}) \in gph Φ$ and a direction $u \in S_{X}$ .

We say that pseudo-normality in direction u holds at $(\bar{x}, \bar{y})$ if there does not exist a nonzero vector $λ \in ker D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ satisfying the following condition: there are sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ with $x_{k} \neq \bar{x}$ for all $k \in N$ and ${λ_{k}}_{k \in N} \subset Y$ , ${η_{k}}_{k \in N} \subset X$ , such that
$\begin{matrix} \begin{matrix} x_{k} & \to \bar{x}, & y_{k} & \to \bar{y}, & λ_{k} & \to λ, \\ η_{k} & \to 0, & \frac{x_{k} - \bar{x}}{∥x_{k} - \bar{x}∥} & \to u, & \frac{y_{k} - \bar{y}}{∥x_{k} - \bar{x}∥} & \to 0, \end{matrix} \end{matrix}$ 5.2
and $η_{k} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ as well as $⟨ λ, y_{k} - \bar{y} ⟩ > 0$ for all $k \in N$ .
Let $E : = {e_{1}, \dots, e_{m}} \subset Y$ be an orthonormal basis of $Y$ . We say that quasi-normality in direction u holds at $(\bar{x}, \bar{y})$ w.r.t. $E$ if there does not exist a nonzero vector $λ \in ker D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ satisfying the following condition: there are sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ with $x_{k} \neq \bar{x}$ for all $k \in N$ and ${λ_{k}}_{k \in N} \subset Y$ , ${η_{k}}_{k \in N} \subset X$ , such that we have the convergences from (5.2) and, for all $k \in N$ and $i \in {1, \dots, m}$ , $η_{k} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ as well as $⟨ λ, e_{i} ⟩ ⟨ y_{k} - \bar{y}, e_{i} ⟩ > 0$ if $⟨ λ, e_{i} ⟩ \neq 0$ .

In the case where the canonical basis is chosen in $Y : = R^{m}$ , the above concept of quasi-normality is a direct generalization of the original notion from [24] which was coined for standard nonlinear problems and neglected directional information. Let us just mention that a reasonable, basis-independent definition of quasi-normality would require that there exists some basis w.r.t. which the mapping of interest is quasi-normal, see also Theorem 5.1.

Note that the sequence ${y_{k}}_{k \in N}$ in the definition of directional pseudo- and quasi-normality needs to satisfy $y_{k} \neq \bar{y}$ for all $k \in N$ . In the definition of directional pseudo-normality, this is clear from $⟨ λ, y_{k} - \bar{y} ⟩ > 0$ for all $k \in N$ . Furthermore, in the definition of directional quasi-normality, observe that $λ \neq 0$ implies the existence of $j \in {1, \dots, m}$ such that $⟨ λ, e_{j} ⟩ \neq 0$ holds, so that $⟨ y_{k} - \bar{y}, e_{j} ⟩ \neq 0$ is necessary for each $k \in N$ .

In the following lemma, we show the precise relation between directional pseudo- and quasi-normality.

Lemma 5.1

Fix $(\bar{x}, \bar{y}) \in gph Φ$ and some direction $u \in S_{X}$ . Then $Φ$ is pseudo-normal at $(\bar{x}, \bar{y})$ in direction u if and only if $Φ$ is quasi-normal at $(\bar{x}, \bar{y})$ in direction u w.r.t. each orthonormal basis of $Y$ .

Proof

$[⟹]$ Let $Φ$ be pseudo-normal at $(\bar{x}, \bar{y})$ in direction u, let $E : = {e_{1}, \dots, e_{m}} \subset Y$ be an orthonormal basis of $Y$ , and pick $λ \in ker D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ as well as sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ with $x_{k} \neq \bar{x}$ for all $k \in N$ and ${λ_{k}}_{k \in N} \subset Y$ , ${η_{k}}_{k \in N} \subset X$ , satisfying the convergences (5.2) and, for all $k \in N$ and $i \in {1, \dots, m}$ , $η_{k} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ as well as $⟨ λ, e_{i} ⟩ ⟨ y_{k} - \bar{y}, e_{i} ⟩ > 0$ if $⟨ λ, e_{i} ⟩ \neq 0$ . Observing that we have

\begin{matrix} ⟨ λ, y_{k} - \bar{y} ⟩ & = 〈\sum_{i = 1}^{m} ⟨ λ, e_{i} ⟩ e_{i}, \sum_{j = 1}^{m} ⟨ y_{k} - \bar{y}, e_{j} ⟩ e_{j}〉 \\ = \sum_{i = 1}^{m} \sum_{j = 1}^{m} ⟨ λ, e_{i} ⟩ ⟨ y_{k} - \bar{y}, e_{j} ⟩ ⟨ e_{i}, e_{j} ⟩ \\ = \sum_{i = 1}^{m} ⟨ λ, e_{i} ⟩ ⟨ y_{k} - \bar{y}, e_{i} ⟩, \end{matrix}

validity of pseudo-normality at $(\bar{x}, \bar{y})$ in direction u gives $λ = 0$ , i.e., $Φ$ is quasi-normal at $(\bar{x}, \bar{y})$ in direction u w.r.t. $E$ .

$[⟸]$ Assume that $Φ$ is quasi-normal at $(\bar{x}, \bar{y})$ in direction u w.r.t. each orthonormal basis of $Y$ . Suppose that $Φ$ is not pseudo-normal at $(\bar{x}, \bar{y})$ in direction u. Then we find some nonzero $λ \in ker D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ as well as sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ with $x_{k} \neq \bar{x}$ for all $k \in N$ and ${λ_{k}}_{k \in N} \subset Y$ , ${η_{k}}_{k \in N} \subset X$ , satisfying the convergences (5.2) and $η_{k} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ as well as $⟨ λ, y_{k} - \bar{y} ⟩ > 0$ for all $k \in N$ . Noting that $λ$ does not vanish, we can construct an orthonormal basis $E_{λ} : = {e_{1}^{λ}, \dots, e_{m}^{λ}}$ of $Y$ with $e_{1}^{λ} : = λ / ∥λ∥$ . Note that, for $i \in {1, \dots, m}$ , we have $⟨ λ, e_{i}^{λ} ⟩ \neq 0$ if and only if $i = 1$ by construction of $E_{λ}$ . Furthermore, we find

\begin{matrix} ⟨ λ, e_{1}^{λ} ⟩ ⟨ y_{k} - \bar{y}, e_{1}^{λ} ⟩ = ∥λ∥ ⟨ λ / ∥λ∥, y_{k} - \bar{y} ⟩ = ⟨ λ, y_{k} - \bar{y} ⟩ > 0 . \end{matrix}

This, however, contradicts quasi-normality of $Φ$ at $(\bar{x}, \bar{y})$ in direction u w.r.t. $E_{λ}$ . $□$

Let us note that [24, Example 1] shows in the nondirectional situation of standard nonlinear programming that pseudo-normality might be more restrictive than quasi-normality w.r.t. the canonical basis in $R^{m}$ . On the other hand, due to Lemma 5.1, there must exist another basis such that quasi-normality w.r.t. this basis fails since pseudo-normality fails. This depicts that validity of quasi-normality indeed may depend on the chosen basis. In [15], the authors define directional quasi-normality for geometric constraints in Euclidean spaces in componentwise fashion although this is somehow unclear in situations where the image space is different from $R^{m}$ . Exemplary, in the $\frac{1}{2} m (m + 1)$ -dimensional space $S_{m}$ of all real symmetric $m \times m$ -matrices, the canonical basis, which seems to be associated with a componentwise calculus, comprises precisely $\frac{1}{2} (m - 1) m$ matrices with precisely two nonzero entries. Our definition of quasi-normality from Definition 5.2 gives some more freedom since the choice of the underlying basis allows to rotate the coordinate system.

Following the arguments in [16, Section 3.2], it also might be reasonable to define intermediate conditions bridging pseudo- and quasi-normality. In the light of this paper, however, the concepts from Definition 5.2 are sufficient for our purposes.

As the following theorem shows, directional quasi- and, thus, pseudo-normality also serve as sufficient conditions for strong directional asymptotic regularity and directional metric subregularity which explains our interest in these conditions. Both statements follow once we clarify that pseudo- and quasi-normality are in fact specifications of the multiplier sequential information in (5.1), namely the convergence $(y_{k} - \bar{y}) / ‖ y_{k} - \bar{y} ‖ - λ_{k} / ‖ λ_{k} ‖ \to 0$ .

Theorem 5.1

If $Φ : X ⇉ Y$ is quasi-normal in direction $u \in S_{X}$ at $(\bar{x}, \bar{y}) \in gph Φ$ w.r.t. some orthonormal basis $E : = {e_{1}, \dots, e_{m}} \subset Y$ of $Y$ , then it is also strongly asymptotically regular as well as metrically subregular in direction u at $(\bar{x}, \bar{y})$ .

Proof

Fix arbitrary sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ , ${x_{k}^{*}}_{k \in N} \subset X$ , and ${λ_{k}}_{k \in N} \subset Y$ as well as $x^{*} \in X$ and $y^{*} \in Y$ satisfying $x_{k} \notin Φ^{- 1} (\bar{y})$ , $y_{k} \neq \bar{y}$ , and $x_{k}^{*} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ for each $k \in N$ as well as the convergences (5.1). Let us define $w_{k} : = (y_{k} - \bar{y}) / ∥y_{k} - \bar{y}∥$ and ${\tilde{λ}}_{k} : = λ_{k} / ∥λ_{k}∥$ for each $k \in N$ . The requirements from (5.1) imply that ${w_{k}}_{k \in N}$ and ${{\tilde{λ}}_{k}}_{k \in N}$ converge, along a subsequence (without relabeling), to the same nonvanishing limit which we will call $λ \in S_{Y}$ . Moreover, given $i \in {1, \dots, m}$ with $⟨ λ, e_{i} ⟩ \neq 0$ , for sufficiently large $k \in N$ , we get $⟨ w_{k}, e_{i} ⟩ \neq 0$ and

\begin{matrix} 0 < ⟨ λ, e_{i} ⟩ ⟨ w_{k}, e_{i} ⟩ = ⟨ λ, e_{i} ⟩ ⟨ y_{k} - \bar{y}, e_{i} ⟩ / ∥y_{k} - \bar{y}∥ . \end{matrix}

Observing that we have $x_{k}^{*} / ∥λ_{k}∥ \to 0$ from (5.1), we find $λ \in ker D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ by definition of the directional limiting coderivative. This contradicts validity of quasi-normality of $Φ$ at $(\bar{x}, \bar{y})$ in direction u w.r.t. $E$ . Particularly, such sequences ${(x_{k}, y_{k})}_{k \in N}$ , ${x_{k}^{*}}_{k \in N}$ , and ${λ_{k}}_{k \in N}$ cannot exist which means that $Φ$ is strongly asymptotically regular in direction u at $(\bar{x}, \bar{y})$ .

The claim about metric subregularity now follows from [37, Corollary 1], since the only difference from quasi-normality is the requirement

\begin{matrix} ⟨ λ_{k} / ‖ λ_{k} ‖, (y_{k} - \bar{y}) / ‖ y_{k} - \bar{y} ‖ ⟩ \to 1 \end{matrix}

which is the same as $(y_{k} - \bar{y}) / ‖ y_{k} - \bar{y} ‖ - λ_{k} / ‖ λ_{k} ‖ \to 0$ as mentioned in the comments after Corollary 4.5. $□$

Relying on this result, [36, Theorem 7] yields that directional pseudo- and quasi-normality provide constraint qualifications for (P) which ensure validity of directional M-stationarity at local minimizers.

We would like to point the reader’s attention to the fact that nondirectional versions of pseudo- and quasi-normality are not comparable with the nondirectional version of asymptotic regularity. This has been observed in the context of standard nonlinear programming, see [5, Sections 4.3, 4.4]. The reason is that the standard version of asymptotic regularity makes no use of the multiplier information (4.17c).

In [22, Section 4.2], which is a preprint version of this paper, our new notions of directional pseudo- and quasi-normality from Definition 5.2 are worked out for so-called optimization problem with equilibrium constraints which cover models with variational inequality constraints, see e.g. [32, 62, 68], or bilevel optimization problems, see e.g. [29, 30].

Directional pseudo- and quasi-normality for geometric constraint systems

Let us now also justify the terminology by showing that the new notions from Definition 5.2 coincide with directional pseudo- and quasi-normality in the case of standard constraint mappings as studied in [16].

We start with a general result relying on calmness of the constraint function. Note that we consider $\bar{y} : = 0$ for simplicity of notation. Furthermore, we only focus on the concept of directional quasi-normality in our subsequently stated analysis. Analogous results hold for directional pseudo-normality.

Proposition 5.2

A constraint mapping $Φ : X ⇉ Y$ given by $Φ (x) : = g (x) - D$ , $x \in X$ , where $g : X \to Y$ is a continuous function which is calm in direction $u \in S_{X}$ at $\bar{x} \in X$ such that $(\bar{x}, 0) \in gph Φ$ and $D \subset Y$ is closed, is quasi-normal in direction u at $(\bar{x}, 0)$ w.r.t. some orthonormal basis $E : = {e_{1}, \dots, e_{m}} \subset Y$ of $Y$ provided there do not exist a direction $v \in Y$ and a nonzero vector $λ \in N_{D} (g (\bar{x}) ; v)$ with $0 \in D^{*} g (\bar{x} ; (u, v)) (λ)$ satisfying the following condition: there are sequences ${x_{k}}_{k \in N} \subset X$ with $x_{k} \neq \bar{x}$ for all $k \in N$ , ${z_{k}}_{k \in N} \subset D$ , ${λ_{k}}_{k \in N} \subset Y$ , and ${η_{k}}_{k \in N} \subset X$ satisfying $x_{k} \to \bar{x}$ , $z_{k} \to g (\bar{x})$ , $λ_{k} \to λ$ , $η_{k} \to 0$ ,

\begin{matrix} \frac{x_{k} - \bar{x}}{∥x_{k} - \bar{x}∥} \to u, \frac{z_{k} - g (\bar{x})}{∥x_{k} - \bar{x}∥} \to v, \frac{g (x_{k}) - g (\bar{x})}{∥x_{k} - \bar{x}∥} \to v, \end{matrix}

5.3

and, for all $k \in N$ and $i \in {1, \dots, m}$ , $η_{k} \in {\hat{D}}^{*} g (x_{k}) (λ_{k})$ , $λ_{k} \in {\hat{N}}_{D} (z_{k})$ , as well as $⟨ λ, e_{i} ⟩ ⟨ g (x_{k}) - z_{k}, e_{i} ⟩ > 0$ if $⟨ λ, e_{i} ⟩ \neq 0$ .

Moreover, if g is even calm near $\bar{x}$ , the two conditions are equivalent.

Proof

$[⟸]$ Choose $λ \in ker D^{*} Φ ((\bar{x}, 0) ; (u, 0))$ and sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ with $x_{k} \neq \bar{x}$ for all $k \in N$ and ${λ_{k}}_{k \in N} \subset Y$ , ${η_{k}}_{k \in N} \subset X$ satisfying (5.2) with $\bar{y} : = 0$ and, for all $k \in N$ and $i \in {1, \dots, m}$ , $η_{k} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ as well as $⟨ λ, e_{i} ⟩ ⟨ y_{k}, e_{i} ⟩ > 0$ if $⟨ λ, e_{i} ⟩ \neq 0$ . Applying Lemma 3.1 (a) yields $η_{k} \in {\hat{D}}^{*} g (x_{k}) (λ_{k})$ and $λ_{k} \in {\hat{N}}_{D} (g (x_{k}) - y_{k})$ for each $k \in N$ . The assumed calmness of g at $\bar{x}$ in direction u yields boundedness of the sequence ${(g (x_{k}) - g (\bar{x})) / ∥x_{k} - \bar{x}∥}_{k \in N}$ , i.e., along a subsequence (without relabeling) it converges to some $v \in Y$ . Note also that $(u, v) \in T_{gph g} (\bar{x}, g (\bar{x}))$ , i.e., $v \in D g (\bar{x}) (u)$ , and that ${(x_{k}, g (x_{k}))}_{k \in N}$ converges to $(\bar{x}, g (\bar{x}))$ from direction (u, v). Setting $z_{k} : = g (x_{k}) - y_{k}$ for each $k \in N$ , we get $z_{k} \to g (\bar{x})$ by continuity of g as well as $λ_{k} \in {\hat{N}}_{D} (z_{k})$ and $⟨ λ, e_{i} ⟩ ⟨ g (x_{k}) - z_{k}, e_{i} ⟩ > 0$ if $⟨ λ, e_{i} ⟩ \neq 0$ for each $k \in N$ and $i \in {1, \dots, m}$ . Moreover, we have

\begin{matrix} \frac{z_{k} - g (\bar{x})}{∥x_{k} - \bar{x}∥} = \frac{g (x_{k}) - g (\bar{x})}{∥x_{k} - \bar{x}∥} - \frac{y_{k}}{∥x_{k} - \bar{x}∥} \to v - 0 = v \end{matrix}

and $v \in T_{D} (g (\bar{x}))$ follows as well. Finally, taking the limit yields $λ \in N_{D} (g (\bar{x}) ; v)$ and $0 \in D^{*} g (\bar{x} ; (u, v)) (λ)$ , so that the assumptions of the proposition imply $λ = 0$ . Consequently, $Φ$ is quasi-normal in direction u at $(\bar{x}, 0)$ w.r.t. $E$ .

$[⟹]$ Assume that quasi-normality in direction u holds at $(\bar{x}, 0)$ w.r.t. $E$ and that g is calm around $\bar{x}$ . Suppose that there are some $v \in Y$ , $λ \in N_{D} (g (\bar{x}) ; v)$ with $0 \in D^{*} g (\bar{x} ; (u, v)) (λ)$ , and sequences ${x_{k}}_{k \in N} \subset X$ with $x_{k} \neq \bar{x}$ for all $k \in N$ and ${z_{k}}_{k \in N} \subset D$ , ${λ_{k}}_{k \in N} \subset Y$ , ${η_{k}}_{k \in N} \subset X$ with $x_{k} \to \bar{x}$ , $z_{k} \to g (\bar{x})$ , $λ_{k} \to λ$ , $η_{k} \to 0$ , (5.3), and, for all $k \in N$ and $i \in {1, \dots, m}$ , $η_{k} \in {\hat{D}}^{*} g (x_{k}) (λ_{k})$ , $λ_{k} \in {\hat{N}}_{D} (z_{k})$ , as well as $⟨ λ, e_{i} ⟩ ⟨ g (x_{k}) - z_{k}, e_{i} ⟩ > 0$ as soon as $⟨ λ, e_{i} ⟩ \neq 0$ . Set $y_{k} : = g (x_{k}) - z_{k}$ for each $k \in N$ . Then we have $y_{k} \to 0$ ,

\begin{matrix} \frac{y_{k}}{∥x_{k} - \bar{x}∥} = \frac{g (x_{k}) - z_{k}}{∥x_{k} - \bar{x}∥} = \frac{g (x_{k}) - g (\bar{x})}{∥x_{k} - \bar{x}∥} - \frac{z_{k} - g (\bar{x})}{∥x_{k} - \bar{x}∥} \to v - v = 0, \end{matrix}

and, for all $k \in N$ and $i \in {1, \dots, m}$ , $λ_{k} \in {\hat{N}}_{D} (g (x_{k}) - y_{k})$ as well as $⟨ λ, e_{i} ⟩ ⟨ y_{k}, e_{i} ⟩ > 0$ if $⟨ λ, e_{i} ⟩ \neq 0$ . Since $η_{k} \in {\hat{D}}^{*} g (x_{k}) (λ_{k})$ , calmness of g at $x_{k}$ implies $η_{k} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ due to Lemma 3.1 (a), and taking the limit yields $λ \in ker D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ . Thus, the assumed quasi-normality of $Φ$ at $(\bar{x}, 0)$ in direction u w.r.t. $E$ yields $λ = 0$ and the claim follows. $□$

If g is continuously differentiable, the situation becomes a bit simpler and we precisely recover the notion of directional quasi-normality for geometric constraint systems as discussed in [16, Definition 3.4].

Corollary 5.2

A constraint mapping $Φ : X ⇉ Y$ given by $Φ (x) = g (x) - D$ , $x \in X$ , where $g : X \to Y$ is continuously differentiable and $D \subset Y$ is closed, is quasi-normal in direction $u \in S_{X}$ at $(\bar{x}, 0) \in gph Φ$ w.r.t. some orthonormal basis ${e_{1}, \dots, e_{m}} \subset Y$ of $Y$ if and only if there does not exist a nonzero vector $λ \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u)$ with $\nabla g {(\bar{x})}^{*} λ = 0$ satisfying the following condition: there are sequences ${x_{k}}_{k \in N} \subset X$ with $x_{k} \neq \bar{x}$ for all $k \in N$ , ${z_{k}}_{k \in N} \subset D$ , and ${λ_{k}}_{k \in N} \subset Y$ satisfying $x_{k} \to \bar{x}$ , $z_{k} \to g (\bar{x})$ , $λ_{k} \to λ$ ,

\begin{matrix} \frac{x_{k} - \bar{x}}{∥x_{k} - \bar{x}∥} \to u, \frac{z_{k} - g (\bar{x})}{∥x_{k} - \bar{x}∥} \to \nabla g (\bar{x}) u, \end{matrix}

5.4

and, for all $k \in N$ and $i \in {1, \dots, m}$ , $λ_{k} \in {\hat{N}}_{D} (z_{k})$ and $⟨ λ, e_{i} ⟩ ⟨ g (x_{k}) - z_{k}, e_{i} ⟩ > 0$ if $⟨ λ, e_{i} ⟩ \neq 0$ .

In [16, Section 3.3], it has been reported that under additional conditions on the set D, we can drop the sequences ${z_{k}}_{k \in N}$ and ${λ_{k}}_{k \in N}$ from the characterization of directional quasi-normality in Corollary 5.2. Particularly, this can be done for so-called ortho-disjunctive programs which cover, e.g., standard nonlinear, complementarity-, cardinality-, or switching-constrained optimization problems. In this regard, Corollary 5.2 reveals that some results from [24, 46, 52, 54, 61] are covered by our general concept from Definition 5.2.

Let us briefly compare our results with the approach from [15].

Remark 5.3

Let us consider the setting discussed in Proposition 5.2. The directional versions of pseudo- and quasi-normality from [15] operate with all nonzero pairs of directions (u, v), rather than just a fixed u. The advantage is that calmness of g plays no role. The reason is, however, that the authors in [15] only derive statements regarding metric subregularity, but not metric subregularity in some fixed direction. Calmness of g is needed precisely for preservation of directional information. We believe that it is useful to know how to verify if a mapping is metrically subregular in a specific direction since only some directions play a role in many situations. We could drop the calmness assumption from Proposition 5.2, but, similarly as in [18, Theorem 3.1], additional directions of the type (0, v) for a nonzero v would appear. Clearly, such directions are included among all nonzero pairs (u, v), but the connection to the original direction u would have been lost.

Sufficient conditions for asymptotic regularity via pseudo-coderivatives

The role of super-coderivatives

We start this section by interrelating the concept of super-coderivatives from Definition 2.4 and asymptotic regularity. Fix $(\bar{x}, \bar{y}) \in gph Φ$ and choose ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ , ${x_{k}^{*}}_{k \in N} \subset X$ , and ${λ_{k}}_{k \in N} \subset Y$ as well as $x^{*} \in X$ and $y^{*} \in Y$ satisfying $x_{k} \notin Φ^{- 1} (\bar{y})$ , $y_{k} \neq \bar{y}$ , and $x_{k}^{*} \in {\hat{D}}^{*} Φ (x_{k}, y_{k}) (λ_{k})$ for all $k \in N$ as well as the convergences (5.1). For each $k \in N$ , we set $t_{k} : = ∥x_{k} - \bar{x}∥$ , $τ_{k} : = ∥y_{k} - \bar{y}∥$ ,

\begin{matrix} u_{k} : = \frac{x_{k} - \bar{x}}{∥x_{k} - \bar{x}∥}, v_{k} : = \frac{y_{k} - \bar{y}}{∥y_{k} - \bar{y}∥}, y_{k}^{*} : = \frac{∥y_{k} - \bar{y}∥}{∥x_{k} - \bar{x}∥} λ_{k}, \end{matrix}

and find $τ_{k} / t_{k} \to 0$ as well as

\begin{matrix} \forall k \in N : x_{k}^{*} \in {\hat{D}}^{*} Φ (\bar{x} + t_{k} u_{k}, \bar{y} + τ_{k} v_{k}) ((t_{k} / τ_{k}) y_{k}^{*}) . \end{matrix}

Along a subsequence (without relabeling), $v_{k} \to v$ holds for some $v \in S_{Y}$ . Thus, taking the limit $k \to \infty$ , we have $x^{*} \in D_{s}^{*} u p Φ ((\bar{x}, \bar{y}) ; (u, v)) (y^{*})$ by definition of the super-coderivative. Moreover, from (5.1) we also know that $y^{*} = ∥y^{*}∥ v$ . Consequently, we come up with the following lemma.

Lemma 5.2

Let $(\bar{x}, \bar{y}) \in gph Φ$ and $u \in S_{X}$ be fixed. If

\begin{matrix} ⋃_{v \in S_{Y}} D_{s}^{*} u p Φ ((\bar{x}, \bar{y}) ; (u, v)) (β v) \subset Im D^{*} Φ (\bar{x}, \bar{y}) \end{matrix}

holds for all $β \geq 0$ , then $Φ$ is asymptotically regular at $(\bar{x}, \bar{y})$ in direction u. If the above estimate holds for all $β \geq 0$ with $Im D^{*} Φ (\bar{x}, \bar{y})$ replaced by $Im D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ , then $Φ$ is strongly asymptotically regular at $(\bar{x}, \bar{y})$ in direction u.

The next result, which is based on hypothesis $A^{γ} (u)$ , see Assumption 4.1, follows as a corollary of Lemmas 2.9 and 5.2, and gives new sufficient conditions for directional asymptotic regularity. Note that strong directional asymptotic regularity can be handled analogously by employing an adjusted version of $A^{γ} (u)$ where $Im D^{*} Φ (\bar{x}, \bar{y})$ in the right-hand side of (4.15) is replaced by $Im D^{*} Φ ((\bar{x}, \bar{y}) ; (u, 0))$ .

Theorem 5.2

Let $(\bar{x}, \bar{y}) \in gph Φ$ , $u \in S_{X}$ , and $γ > 1$ be fixed. If $A^{γ} (u)$ holds, then $Φ$ is asymptotically regular at $(\bar{x}, \bar{y})$ in direction u.

In the case where the pseudo-coderivatives involved in the construction of $A^{γ} (u)$ can be computed or estimated from above, new applicable sufficient conditions for (strong) directional asymptotic regularity are provided by Theorem 5.2. Particularly, in situations where $Φ$ is given in form of a constraint mapping and $γ : = 2$ is fixed, we can rely on the results obtained in Sect. 3 in order to make the findings of Theorem 5.2 more specific. This will be done in the next subsection.

The case of constraint mappings

Throughout the section, we assume that $Φ : X ⇉ Y$ is given by $Φ (x) : = g (x) - D$ , $x \in X$ , where $g : X \to Y$ is a twice continuously differentiable function and $D \subset Y$ is a closed set. Furthermore, for simplicity of notation, we fix $\bar{y} : = 0$ which is not restrictive as already mentioned earlier.

We start with a general result which does not rely on any additional structure of the set D.

Theorem 5.3

Let $(\bar{x}, 0) \in gph Φ$ as well as $u \in S_{X}$ be fixed. Assume that (3.9) holds, as well as (3.10) or, in the case $\nabla g (\bar{x}) u \neq 0$ , (3.11). If, for each $x^{*} \in X$ and $y^{*}, z^{*} \in Y$ satisfying

\begin{matrix} x^{*} & = \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*}, \end{matrix}

5.5a

\begin{matrix} y^{*} & \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) \cap ker \nabla g {(\bar{x})}^{*}, \end{matrix}

5.5b

\begin{matrix} z^{*} & \in D N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u), \end{matrix}

5.5c

there is some $λ \in N_{D} (g (\bar{x}))$ such that $x^{*} = \nabla g {(\bar{x})}^{*} λ$ , then $Φ$ is asymptotically regular at $(\bar{x}, 0)$ in direction u. Moreover, $Φ$ is even strongly asymptotically regular at $(\bar{x}, 0)$ in direction u if $λ$ can be chosen from $N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u)$ .

Proof

Theorem 3.1 (a) implies $ker {\tilde{D}}_{2}^{*} Φ ((\bar{x}, 0) ; (u, 0)) = {0}$ (and so, due to (2.7), also $ker D_{2}^{*} Φ ((\bar{x}, 0) ; (u, 0)) = {0}$ ) as well as that for $x^{*} \in Im {\tilde{D}}_{2}^{*} Φ ((\bar{x}, 0) ; (u, 0))$ , we find $y^{*}, z^{*} \in Y$ satisfying (5.5). The assumptions guarantee that we can find $λ \in N_{D} (g (\bar{x}))$ such that $x^{*} = \nabla g {(\bar{x})}^{*} λ \in Im D^{*} Φ (\bar{x}, 0)$ where we used Lemma 3.1 (b). It follows $Im {\tilde{D}}_{2}^{*} Φ ((\bar{x}, 0) ; (u, 0)) \subset Im D^{*} Φ (\bar{x}, 0)$ . Thus, Theorem 5.2 shows that $Φ$ is asymptotically regular at $(\bar{x}, 0)$ in direction u. The statement regarding strong asymptotic regularity follows in analogous way while respecting Lemma 3.1 (c). $□$

We note that (3.10) is stronger than (3.11) when $\nabla g (\bar{x}) u \neq 0$ holds, see (2.3). Naturally, this means that it is sufficient to check (3.10) regardless whether $\nabla g (\bar{x}) u$ vanishes or not. In the case $\nabla g (\bar{x}) u \neq 0$ , however, it is already sufficient to check the milder condition (3.11). This will be important later on, see Proposition 5.3 and Remark 5.4 below.

Note also that we implicitly relied on condition (4.16) (with $\bar{y} : = 0$ and $γ : = 2$ ) in the proof of Theorem 5.3, and not on the milder refined condition (4.15) (again with $\bar{y} : = 0$ and $γ : = 2$ ) which appears in the statement of $A^{γ} (u)$ . This happened due to the generality of the setting in Theorem 5.3. In the polyhedral situation, (4.15) can be employed to obtain the following improved result.

Theorem 5.4

Let $(\bar{x}, 0) \in gph Φ$ as well as $u \in S_{X}$ be fixed. Let $Y : = R^{m}$ and let D be polyhedral locally around $g (\bar{x})$ . Assume that condition (3.13) holds for each $s \in X$ . If, for each $x^{*}, s \in X$ , $y^{*}, z^{*} \in R^{m}$ , and $α \geq 0$ satisfying (5.5a) and

\begin{matrix} \begin{matrix} y^{*} & \in N_{T (u)} (w_{s} (u, v)) \cap ker \nabla g {(\bar{x})}^{*}, \\ z^{*} & \in N_{T (u)} (w_{s} (u, v)) (or z^{*} \in T_{N_{T (u)} (w_{s} (u, v))} (y^{*})), \end{matrix} \end{matrix}

5.6

where $v : = α y^{*}$ , and $T (u)$ as well as $w_{s} (u, v)$ have been defined in (3.7), there is some $λ \in N_{D} (g (\bar{x}))$ such that $x^{*} = \nabla g {(\bar{x})}^{*} λ$ , then $Φ$ is asymptotically regular at $(\bar{x}, 0)$ in direction u. Moreover, $Φ$ is even strongly asymptotically regular at $(\bar{x}, 0)$ in direction u if $λ$ can be chosen from $N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u)$ .

Proof

Due to Theorem 3.2, (3.13) yields $ker D_{2}^{*} Φ ((\bar{x}, 0) ; (u, 0)) \subset {0}$ in the present situation. Now, fix $x^{*} \in {\tilde{D}}_{2}^{*} Φ ((\bar{x}, 0) ; (u, 0)) (0)$ . Then Theorem 3.1 (b) shows the existence of $z^{*} \in N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u)$ such that $x^{*} = \nabla g {(\bar{x})}^{*} z^{*}$ . Let us now consider the case $x^{*} \in D_{2}^{*} Φ ((\bar{x}, 0) ; (u, \bar{α} w)) (\bar{β} w)$ for some $w \in S_{R^{m}}$ and $\bar{α}, \bar{β} \geq 0$ . If $\bar{β} = 0$ holds, we can employ (2.7) to find $x^{*} \in {\tilde{D}}_{2}^{*} Φ ((\bar{x}, 0) ; (u, 0)) (0)$ and, thus, the above argumentation applies. Thus, let us consider $\bar{β} > 0$ and set $α : = \bar{α} / \bar{β}$ . Then we have $x^{*} \in D_{2}^{*} Φ ((\bar{x}, 0) ; (u, v)) (y^{*})$ for $v = α y^{*}$ with $v : = \bar{α} w$ and $y^{*} : = \bar{β} w$ . Theorem 3.2 implies the existence of $s \in X$ such that (5.5a) and (5.6) hold with $v = α y^{*}$ . Now, the postulated assumptions guarantee the existence of $λ \in N_{D} (g (\bar{x}))$ such that $x^{*} = \nabla g {(\bar{x})}^{*} λ$ . Respecting Lemma 3.1 (b), this shows (4.15) with $\bar{y} : = 0$ and $γ : = 2$ . Thus, Theorem 5.2 yields that $Φ$ is asymptotically regular at $(\bar{x}, 0)$ in direction u. The statement regarding strong asymptotic regularity follows analogously. $□$

Due to Corollary 5.1, Theorems 5.3 and 5.4 provide constraint qualifications for M-stationarity. Interestingly, one can easily check that the same conditions can also be obtained from Proposition 4.2 by demanding that any mixed-order stationary point is already M-stationary.

In the remaining part of the section, we prove that the assumptions of Theorem 5.3 are not stronger than FOSCMS(u) while the assumptions of Theorem 5.4 are strictly weaker than the so-called Second-Order Sufficient Condition for Metric Subregularity (SOSCMS) in direction u.

Given a point $\bar{x} \in X$ with $(\bar{x}, 0) \in gph Φ$ , Lemma 3.1 (c) shows that the condition

\begin{matrix} u \in S_{X}, \nabla g (\bar{x}) u \in T_{D} (g (\bar{x})), \nabla g {(\bar{x})}^{*} y^{*} = 0, y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) ⟹ y^{*} = 0 \end{matrix}

equals FOSCMS in the current setting. In the case where D is locally polyhedral around $g (\bar{x})$ , the refined condition

\begin{matrix} (\begin{matrix} u \in S_{X}, \nabla g (\bar{x}) u \in T_{D} (g (\bar{x})), \nabla g {(\bar{x})}^{*} y^{*} = 0, \\ \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] \geq 0, y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) \end{matrix}\} ⟹ y^{*} = 0, \end{matrix}

is referred to as SOSCMS in the literature. As these names suggest, both conditions are sufficient for metric subregularity of $Φ$ at $(\bar{x}, 0)$ , see [39, Corollary 1]. Particularly, they provide constraint qualifications for M-stationarity of local minimizers. Again, with the aid of Lemma 3.1 (c), one can easily check that

\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) ⟹ y^{*} = 0 \end{matrix}

equals FOSCMS(u) in the present setting, and

\begin{matrix} \nabla g {(\bar{x})}^{*} y^{*} = 0, \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] \geq 0, y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u) ⟹ y^{*} = 0 \end{matrix}

will be denoted by SOSCMS(u). Each of the conditions FOSCMS(u) and SOSCMS(u) is sufficient for metric subregularity of $Φ$ at $(\bar{x}, 0)$ in direction u.

Proposition 5.3

Consider $(\bar{x}, 0) \in gph Φ$ and $u \in S_{X}$ . Under FOSCMS(u) all assumptions of Theorem 5.3 are satisfied.

Proof

Let $y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u)$ be such that $\nabla g {(\bar{x})}^{*} y^{*} = 0$ . Then FOSCMS(u) yields $y^{*} = 0$ and so (3.9) is satisfied. Moreover, we only need to show the remaining assertions for $y^{*} = 0$ .

Assume that $\nabla g (\bar{x}) u \neq 0$ holds. Suppose now that (3.11) is violated, i.e., there exists ${\hat{z}}^{*} \in D_{sub} N_{D} (g (\bar{x}), 0) (q)$ for $q : = \nabla g (\bar{x}) u / ∥\nabla g (\bar{x}) u∥$ with $\nabla g {(\bar{x})}^{*} {\hat{z}}^{*} = 0$ . By Lemma 2.5 and FOSCMS(u), we thus get ${\hat{z}}^{*} = 0$ which is a contradition since ${\hat{z}}^{*} \in S_{Y}$ by Definition 2.2. Similarly, in the case $\nabla g (\bar{x}) u = 0$ , we can verify (3.10) which reduces to

\begin{matrix} \nabla g {(\bar{x})}^{*} {\hat{z}}^{*} = 0, {\hat{z}}^{*} \in D N_{D} (g (\bar{x}), 0) (0) ⟹ {\hat{z}}^{*} = 0 . \end{matrix}

Applying Lemma 2.5 again, we get ${\hat{z}}^{*} \in N_{D} (g (\bar{x}))$ which implies ${\hat{z}}^{*} = 0$ since FOSCMS(u) corresponds to the Mordukhovich criterion due to $\nabla g (\bar{x}) u = 0$ . Thus, we have shown that (3.10) or, in the case $\nabla g (\bar{x}) u \neq 0$ , (3.11) holds.

Validity of the last assumption follows immediately since $z^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u)$ is obtained from Lemma 2.5, and so we can just take $λ : = z^{*}$ due to $y^{*} = 0$ . $□$

Remark 5.4

Note that for $u \in S_{X}$ satisfying $\nabla g (\bar{x}) u \neq 0$ , we have the trivial upper estimate $D_{sub} N_{D} (g (\bar{x}), y^{*}) (\nabla g (\bar{x}) u / ∥\nabla g (\bar{x}) u∥) \subset D N_{D} (g (\bar{x}), y^{*}) (0)$ . Hence, in Theorem 5.3, it is possible to replace validity of (3.10) or, in the case $\nabla g (\bar{x}) u \neq 0$ , (3.11) by the slightly stronger assumption that (3.10) has to hold (even in the case $\nabla g (\bar{x}) u \neq 0$ ). However, we cannot show anymore that FOSCMS(u) is sufficient for this stronger assumption to hold, i.e., dropping directional information comes for a price.

Proposition 5.4

Let $(\bar{x}, 0) \in gph Φ$ as well as $u \in S_{X}$ be fixed, let $Y : = R^{m}$ , and let D be polyhedral locally around $g (\bar{x})$ . If SOSCMS(u) is valid, then the assumptions of Theorem 5.4 are satisfied.

Proof

The key step is to realize that if $y^{*} \in N_{T (u)} (w_{s} (u, v)) \cap ker \nabla g {(\bar{x})}^{*}$ for some $s \in X$ and $v \in R^{m}$ , then we get

\begin{matrix} \frac{1}{2} \nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] = ⟨ w_{s} (u, v), y^{*} ⟩ + ⟨ v, y^{*} ⟩ = ⟨ v, y^{*} ⟩ \end{matrix}

by Remark 3.3 and $\nabla g {(\bar{x})}^{*} y^{*} = 0$ , and $y^{*} \in N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u)$ also holds, again by Remark 3.3.

Then (3.13) follows because for $y^{*} \in N_{T (u)} (w_{s} (u, 0)) \cap ker \nabla g {(\bar{x})}^{*}$ , the relation $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] = 0$ is obtained, and SOSCMS(u) yields $y^{*} = 0$ .

Next, for arbitrary $y^{*} \in N_{T (u)} (w_{s} (u, v)) \cap ker \nabla g {(\bar{x})}^{*}$ with $s \in X$ and $v : = α y^{*}$ for some $α \geq 0$ , we get $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) [u, u] = 2 ⟨ v, y^{*} ⟩ = 2 α {‖ y^{*} ‖}^{2} \geq 0$ , so SOSCMS(u) can still be applied to give $y^{*} = 0$ . Now, we can always take $λ : = z^{*}$ since $z^{*} \in N_{T (u)} (w_{s} (u, v)) \subset N_{D} (g (\bar{x}) ; \nabla g (\bar{x}) u)$ . $□$

We immediately arrive at the following corollary.

Corollary 5.3

The constraint mapping $Φ$ is strongly asymptotically regular at $(\bar{x}, 0) \in gph Φ$ in direction $u \in S_{X}$ if FOSCMS(u) holds or if $Y : = R^{m}$ , D is locally polyhedral around $g (\bar{x})$ , and SOSCMS(u) holds.

The following example shows that our new conditions from Theorem 5.4 are in fact strictly milder than SOSCMS.

Example 5.5

Let $g : R \to R^{2}$ and $D \subset R^{2}$ be given by $g (x) : = (x, - x^{2})$ , $x \in R$ , and $D : = (R_{+} \times R) \cup (R \times R_{+})$ . Observe that D is a polyhedral set. We consider the constraint map $Φ : R ⇉ R^{2}$ given by $Φ (x) : = g (x) - D$ , $x \in R$ . We note that $Φ^{- 1} (0) = [0, \infty)$ holds. Hence, fixing $\bar{x} : = 0$ , we can easily check that $Φ$ is metrically subregular at $(\bar{x}, 0)$ in direction 1 but not in direction $- 1$ , i.e., FOSCMS and SOSCMS must be violated.

First, we claim that all the assumptions from Theorem 5.4 are satisfied for $u = \pm 1$ . Taking into account Remark 3.3, it suffices to verify these assumptions for $w_{s} (u, v)$ replaced by 0. Let us fix $u = \pm 1$ , $y^{*}, z^{*} \in N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u)$ such that $\nabla g {(\bar{x})}^{*} y^{*} = 0$ and $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) + \nabla g {(\bar{x})}^{*} z^{*} = x^{*}$ for $x^{*} \in R$ . We have $\nabla g (\bar{x}) u = (u, 0)$ , $\nabla^{2} ⟨ y^{*}, g ⟩ (\bar{x}) (u) = - 2 y_{2}^{*} u$ , and

\begin{matrix} N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u) = \{\begin{matrix} {0} \times R_{-} & u = - 1, \\ {(0, 0)} & u = 1 . \end{matrix}) \end{matrix}

Thus, for $u = 1$ , we have $y^{*} = 0$ regardless of $x^{*}$ . Hence, condition (3.13) holds trivially and we can choose $λ : = z^{*}$ to find $x^{*} = \nabla g {(\bar{x})}^{*} λ$ as well as $λ \in N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u)$ . For $u = - 1$ , we get $y_{1}^{*} = z_{1}^{*} = 0$ and $y_{2}^{*} \leq 0$ . Thus, if $x^{*} = 0$ , from $- 2 y_{2}^{*} u + z_{1}^{*} = 0$ we deduce $y_{2}^{*} = 0$ , and (3.13) follows. For arbitrary $x^{*} \in R$ , we get $x^{*} = - 2 y_{2}^{*} u + z_{1}^{*} = 2 y_{2}^{*} \leq 0$ and we can choose $λ : = (x^{*}, 0) \in N_{D} (g (\bar{x}))$ to obtain $\nabla g {(\bar{x})}^{*} λ = x^{*}$ . Note, however, that $(x^{*}, 0) \notin N_{T_{D} (g (\bar{x}))} (\nabla g (\bar{x}) u) = {0} \times R_{-}$ unless $x^{*} = 0$ .

Regarding the assumptions of Theorem 5.3, let us just mention, without providing the details, that (3.10) and (3.11) fail since the graphical (sub)derivative is too large. Particularly, this clarifies that these assumptions are not necessary e.g. in the polyhedral setting, but not because they would be satisfied automatically.

Concluding remarks

In this paper, we enriched the general concepts of asymptotic stationarity and regularity with the aid of tools from directional limiting variational analysis. Our central result Theorem 4.1 states that, even in the absence of any constraint qualification, local minimizers of a rather general optimization problem are M-stationary, mixed-order stationary in terms of a suitable pseudo-coderivative, or asymptotically stationary in a critical direction (of a certain order). By ruling out the last option, we were in position to distill new mixed-order necessary optimality conditions. Some novel upper estimates for the second-order directional pseudo-coderivative of constraint mappings were successfully employed to make these results fully explicit in the presence of geometric constraints. Our findings also gave rise to the formulation of directional notions of asymptotic regularity for set-valued mappings. These conditions have been shown to serve as constraint qualifications guaranteeing M-stationarity of local minimizers in nonsmooth optimization. We embedded these new qualification conditions into the landscape of constraint qualifications which are already known from the literature, showing that these conditions are comparatively mild. Noting that directional asymptotic regularity might be difficult to check in practice, we then focused on the derivation of applicable sufficient conditions for its validity. First, we suggested directional notions of pseudo- and quasi-normality for that purpose which have been shown to generalize related concepts for geometric constraint systems to arbitrary set-valued mappings. Second, with the aid of so-called super- and pseudo-coderivatives, sufficient conditions for the presence of directional asymptotic regularity for geometric constraint systems in terms of first- and second-order derivatives of the associated mapping as well as standard variational objects associated with the underlying set were derived. These sufficient conditions turned out to be not stronger than the First- and Second-Order Sufficient Condition for Metric Subregularity from the literature.

In this paper, we completely neglected to study the potential value of directional asymptotic regularity in numerical optimization which might be a promising topic of future research. Furthermore, it has been shown in [63] that nondirectional asymptotic regularity can be applied nicely as a qualification condition in the limiting variational calculus. Most likely, directional asymptotic regularity may play a similar role in the directional limiting calculus. Finally, it seems desirable to further develop the calculus for pseudo-coderivatives for mappings which possess a more difficult structure than constraint mappings.

Acknowledgements

The authors would like to thank the referees and the associated editor for valuable comments which helped to improve the presentation of the material. Particularly, the authors are grateful to one of the reviewers who pointed out the close relationship with 2-regularity and suggested Example 4.1. Some critical remarks from another reviewer about the presentation of an earlier version of Proposition 4.2, that are thankfully acknowledged, led to improvements which allowed for a better comparison with related results from the literature in Sections 3.2 and 4.3. The research of Matúš Benko was supported by the Austrian Science Fund (FWF) under grant P32832-N as well as by the infrastructure of the Institute of Computational Mathematics, Johannes Kepler University Linz, Austria.

Missing proofs

Proof of Lemma 2.8

We only verify the (more technical) assertion regarding Definition 2.3 (b) as the proof for the assertion which addresses Definition 2.3 (c) follows in similar (but slightly easier) fashion.

Thus, fix $x^{*} \in X$ and $y^{*} \in Y$ as well as ${u_{k}}_{k \in N}, {x_{k}^{*}}_{k \in N} \subset X$ , ${v_{k}}_{k \in N}, {y_{k}^{*}}_{k \in N} \subset Y$ , and ${t_{k}}_{k \in N} \subset R_{+}$ which satisfy $u_{k} \to u$ , $v_{k} \to v$ , $t_{k} ↓ 0$ , $x_{k}^{*} \to x^{*}$ , $y_{k}^{*} \to y^{*}$ , and

\begin{matrix} \forall k \in N : (x_{k}^{*}, - \frac{y_{k}^{*}}{(t_{k} ‖ u_{k} {‖)}^{γ - 1}}) \in N_{gph Φ} (\bar{x} + t_{k} u_{k}, \bar{y} + (t_{k} ‖ u_{k} {‖)}^{γ} v_{k}) . \end{matrix}

By definition of the limiting normal cone, for each $k \in N$ , we find ${x_{k, ℓ}}_{ℓ \in N}, {x_{k, ℓ}^{*}}_{ℓ \in N} \subset X$ and ${y_{k, ℓ}}_{ℓ \in N}, {y_{k, ℓ}^{*}}_{ℓ \in N} \subset Y$ such that $x_{k, ℓ} \to \bar{x} + t_{k} u_{k}$ , $y_{k, ℓ} \to \bar{y} + (t_{k} ‖ u_{k} {‖)}^{γ} v_{k}$ , $x_{k, ℓ}^{*} \to x_{k}^{*}$ , and $y_{k, ℓ}^{*} \to y_{k}^{*} / (t_{k} ‖ u_{k} {‖)}^{γ - 1}$ as $ℓ \to \infty$ and $(x_{k, ℓ}^{*}, - y_{k, ℓ}^{*}) \in {\hat{N}}_{gph Φ} (x_{k, ℓ}, y_{k, ℓ})$ for all $ℓ \in N$ .

For each $k \in N$ , let us define sequences ${u_{k, ℓ}}_{ℓ \in N} \subset X$ and ${v_{k, ℓ}}_{ℓ \in N}, {{\hat{y}}_{k, ℓ}^{*}}_{ℓ \in N} \subset Y$ by means of

\begin{matrix} \forall ℓ \in N : u_{k, ℓ} : = \frac{x_{k, ℓ} - \bar{x}}{t_{k}}, v_{k, ℓ} : = \frac{y_{k, ℓ} - \bar{y}}{(t_{k} ‖ u_{k, ℓ} {‖)}^{γ}}, {\hat{y}}_{k, ℓ}^{*} : = (t_{k} ‖ u_{k, ℓ} {‖)}^{γ - 1} y_{k, ℓ}^{*} . \end{matrix}

This gives

\begin{matrix} \forall ℓ \in N : (x_{k, ℓ}^{*}, - \frac{{\hat{y}}_{k, ℓ}^{*}}{(t_{k} ‖ u_{k, ℓ} {‖)}^{γ - 1}}) \in {\hat{N}}_{gph Φ} (\bar{x} + t_{k} u_{k, ℓ}, \bar{y} + (t_{k} ‖ u_{k, ℓ} {‖)}^{γ} v_{k, ℓ}) . \end{matrix}

A.1

Furthermore, we have the convergences $u_{k, ℓ} \to u_{k}$ , $v_{k, ℓ} \to v_{k}$ , and ${\hat{y}}_{k, ℓ}^{*} \to y_{k}^{*}$ as $ℓ \to \infty$ by construction. Thus, for each $k \in N$ , we find an index $ℓ (k) \in N$ such that

\begin{matrix} ‖ u_{k, ℓ (k)} - u_{k} ‖ \leq \frac{1}{k}, ‖ v_{k, ℓ (k)} - v_{k} ‖ \leq \frac{1}{k}, \\ ‖ x_{k, ℓ (k)}^{*} - x_{k}^{*} ‖ \leq \frac{1}{k}, ‖ {\hat{y}}_{k, ℓ (k)}^{*} - y_{k}^{*} ‖ \leq \frac{1}{k} . \end{matrix}

Let us set ${\tilde{u}}_{k} : = u_{k, ℓ (k)}$ , ${\tilde{v}}_{k} : = v_{k, ℓ (k)}$ , ${\tilde{x}}_{k}^{*} : = x_{k, ℓ (k)}^{*}$ , and ${\tilde{y}}_{k}^{*} : = {\hat{y}}_{k, ℓ (k)}^{*}$ for each $k \in N$ . The above estimates and $u_{k} \to u$ , $v_{k} \to v$ , $x_{k}^{*} \to x^{*}$ , as well as $y_{k}^{*} \to y^{*}$ give ${\tilde{u}}_{k} \to u$ , ${\tilde{v}}_{k} \to v$ , ${\tilde{x}}_{k}^{*} \to x^{*}$ , as well as ${\tilde{y}}_{k}^{*} \to y^{*}$ . Additionally, (A.1) guarantees

\begin{matrix} \forall k \in N : ({\tilde{x}}_{k}^{*}, - \frac{{\tilde{y}}_{k}^{*}}{(t_{k} ‖ {\tilde{u}}_{k} {‖)}^{γ - 1}}) \in {\hat{N}}_{gph Φ} (\bar{x} + t_{k} {\tilde{u}}_{k}, \bar{y} + (t_{k} ‖ {\tilde{u}}_{k} {‖)}^{γ} {\tilde{v}}_{k}) . \end{matrix}

By definition of the directional pseudo-coderivative, $x^{*} \in D_{γ}^{*} Φ ((\bar{x}, \bar{y}) ; (u, v)) (y^{*})$ , is obtained and this shows the claim. $□$

Proof of Proposition 5.1

For nondirectional asymptotic regularity, the proof is standard and follows from a simple diagonal sequence argument. The proof for strong directional asymptotic regularity parallels the one for directional asymptotic regularity which is presented below.

Since one implication is clear by definition of the regular and limiting coderivative, we only show the other one. Therefore, let $Φ$ be asymptotically regular at $(\bar{x}, \bar{y})$ in direction u. Let us fix sequences ${(x_{k}, y_{k})}_{k \in N} \subset gph Φ$ , ${x_{k}^{*}}_{k \in N} \subset X$ , and ${λ_{k}}_{k \in N} \subset Y$ as well as $x^{*} \in X$ and $y^{*} \in Y$ satisfying $x_{k} \notin Φ^{- 1} (\bar{y})$ , $y_{k} \neq \bar{y}$ , and $x_{k}^{*} \in D^{*} Φ (x_{k}, y_{k}) (λ_{k})$ for each $k \in N$ as well as the convergences (5.1). For each $k \in N$ , we find sequences ${(x_{k, ℓ}, y_{k, ℓ})}_{ℓ \in N} \subset gph Φ$ , ${x_{k, ℓ}^{*}}_{ℓ \in N} \subset X$ , and ${λ_{k, ℓ}}_{ℓ \in N} \subset Y$ with $x_{k, ℓ} \to x_{k}$ , $x_{k, ℓ}^{*} \to x_{k}^{*}$ , $y_{k, ℓ} \to y_{k}$ , and $λ_{k, ℓ} \to λ_{k}$ as $ℓ \to \infty$ as well as $x_{k, ℓ}^{*} \in {\hat{D}}^{*} Φ (x_{k, ℓ}, y_{k, ℓ}) (λ_{k, ℓ})$ for each $ℓ \in N$ . Observing that $Φ^{- 1} (\bar{y})$ is closed, its complement is open so that $x_{k, ℓ} \notin Φ^{- 1} (\bar{y})$ holds for sufficiently large $ℓ \in N$ . Furthermore, since $∥x_{k} - \bar{x}∥ > 0$ and $∥y_{k} - \bar{y}∥ > 0$ are valid, we can choose an index $ℓ (k) \in N$ so large such that the estimates

\begin{matrix} \begin{matrix} ‖ x_{k, ℓ (k)} - x_{k} ‖ & < \frac{1}{k} ∥x_{k} - \bar{x}∥, & ‖ x_{k, ℓ (k)}^{*} - x_{k}^{*} ‖ & < \frac{1}{k}, \\ ‖ y_{k, ℓ (k)} - y_{k} ‖ & < \frac{1}{k} ∥y_{k} - \bar{y}∥, & ‖ λ_{k, ℓ (k)} - λ_{k} ‖ & < \frac{1}{k} \end{matrix} \end{matrix}

and $x_{k, ℓ (k)} \notin Φ^{- 1} (\bar{y})$ as well as $y_{k, ℓ (k)} \neq \bar{y}$ are valid. For each $k \in N$ , we set ${\tilde{x}}_{k} : = x_{k, ℓ (k)}$ , ${\tilde{x}}_{k}^{*} : = x_{k, ℓ (k)}^{*}$ , ${\tilde{y}}_{k} : = y_{k, ℓ (k)}$ , and ${\tilde{λ}}_{k} : = λ_{k, ℓ (k)}$ . Clearly, we have ${\tilde{x}}_{k} \to \bar{x}$ , ${\tilde{y}}_{k} \to \bar{y}$ , ${\tilde{x}}_{k}^{*} \to x^{*}$ , $‖ {\tilde{λ}}_{k} ‖ \to \infty$ , ${({\tilde{x}}_{k}, {\tilde{y}}_{k})}_{k \in N} \subset gph Φ$ , and ${\tilde{x}}_{k} \notin Φ^{- 1} (\bar{y})$ , ${\tilde{y}}_{k} \neq \bar{y}$ , as well as ${\tilde{x}}_{k}^{*} \in {\hat{D}}^{*} Φ ({\tilde{x}}_{k}, {\tilde{y}}_{k}) ({\tilde{λ}}_{k})$ for each $k \in N$ by construction. Furthermore, we find

\begin{matrix} ∥{\tilde{x}}_{k} - \bar{x}∥ \geq ∥x_{k} - \bar{x}∥ - ∥{\tilde{x}}_{k} - x_{k}∥ \geq \frac{k - 1}{k} ∥x_{k} - \bar{x}∥ \end{matrix}

for each $k \in N$ . With the above estimates at hand, we obtain

\begin{matrix} ∥\frac{x_{k} - \bar{x}}{‖ x_{k} - \bar{x} ‖} - \frac{{\tilde{x}}_{k} - \bar{x}}{∥{\tilde{x}}_{k} - \bar{x}∥}∥ & = ∥\frac{x_{k} - {\tilde{x}}_{k}}{∥x_{k} - \bar{x}∥} + ({\tilde{x}}_{k} - \bar{x}) (\frac{1}{‖ x_{k} - \bar{x} ‖} - \frac{1}{‖ {\tilde{x}}_{k} - \bar{x} ‖})∥ \\ \leq \frac{∥x_{k} - {\tilde{x}}_{k}∥}{∥x_{k} - \bar{x}∥} + \frac{∥{\tilde{x}}_{k} - \bar{x}∥ ∥x_{k} - {\tilde{x}}_{k}∥}{∥x_{k} - \bar{x}∥ ∥{\tilde{x}}_{k} - \bar{x}∥} \leq \frac{2}{k} \end{matrix}

and

\begin{matrix} \begin{matrix} ∥\frac{y_{k} - \bar{y}}{‖ x_{k} - \bar{x} ‖} - \frac{{\tilde{y}}_{k} - \bar{y}}{∥{\tilde{x}}_{k} - \bar{x}∥}∥ & = ∥\frac{y_{k} - {\tilde{y}}_{k}}{∥x_{k} - \bar{x}∥} + ({\tilde{y}}_{k} - \bar{y}) (\frac{1}{‖ x_{k} - \bar{x} ‖} - \frac{1}{‖ {\tilde{x}}_{k} - \bar{x} ‖})∥ \\ \leq \frac{∥y_{k} - {\tilde{y}}_{k}∥}{∥x_{k} - \bar{x}∥} + \frac{∥{\tilde{y}}_{k} - \bar{y}∥ ∥x_{k} - {\tilde{x}}_{k}∥}{∥x_{k} - \bar{x}∥ ∥{\tilde{x}}_{k} - \bar{x}∥} \\ \leq \frac{1}{k} \frac{∥y_{k} - \bar{y}∥}{∥x_{k} - \bar{x}∥} + \frac{1}{k - 1} \frac{∥{\tilde{y}}_{k} - y_{k}∥ + ∥y_{k} - \bar{y}∥}{∥x_{k} - \bar{x}∥} \\ \leq (\frac{1}{k} + \frac{1}{k (k - 1)} + \frac{1}{k - 1}) \frac{∥y_{k} - \bar{y}∥}{∥x_{k} - \bar{x}∥} \\ = \frac{2}{k - 1} \frac{∥y_{k} - \bar{y}∥}{∥x_{k} - \bar{x}∥}, \end{matrix} \end{matrix}

A.2

so that, with the aid of (5.1), we find $({\tilde{x}}_{k} - \bar{x}) / ∥{\tilde{x}}_{k} - \bar{x}∥ \to u$ as well as $({\tilde{y}}_{k} - \bar{y}) / ∥{\tilde{x}}_{k} - \bar{x}∥ \to 0$ . With the aid of (A.2),

\begin{matrix} ∥\frac{‖ {\tilde{y}}_{k} - \bar{y} ‖}{‖ {\tilde{x}}_{k} - \bar{x} ‖} {\tilde{λ}}_{k} - \frac{∥y_{k} - \bar{y}∥}{∥x_{k} - \bar{x}∥} λ_{k}∥ & \leq \frac{‖ {\tilde{y}}_{k} - \bar{y} ‖}{‖ {\tilde{x}}_{k} - \bar{x} ‖} ‖ {\tilde{λ}}_{k} - λ_{k} ‖ + |\frac{‖ {\tilde{y}}_{k} - \bar{y} ‖}{‖ {\tilde{x}}_{k} - \bar{x} ‖} - \frac{∥y_{k} - \bar{y}∥}{∥x_{k} - \bar{x}∥}| ∥λ_{k}∥ \\ \leq \frac{1}{k} \frac{‖ {\tilde{y}}_{k} - \bar{y} ‖}{‖ {\tilde{x}}_{k} - \bar{x} ‖} + \frac{2}{k - 1} \frac{∥y_{k} - \bar{y}∥}{∥x_{k} - \bar{y}∥} ∥λ_{k}∥ \end{matrix}

is obtained, which gives ${\tilde{λ}}_{k} ‖ {\tilde{y}}_{k} - \bar{y} ‖ / ‖ {\tilde{x}}_{k} - \bar{x} ‖ \to y^{*}$ . Similar as above, we find

\begin{matrix} ∥\frac{{\tilde{y}}_{k} - \bar{y}}{∥{\tilde{y}}_{k} - \bar{y}∥} - \frac{y_{k} - \bar{y}}{∥y_{k} - \bar{y}∥}∥ \leq \frac{2}{k} \end{matrix}

and

\begin{matrix} ‖ {\tilde{λ}}_{k} / ‖ {\tilde{λ}}_{k} ‖ - λ_{k} / ∥λ_{k}∥ ‖ \leq 2 ‖ λ_{k} - {\tilde{λ}}_{k} ‖ / ∥λ_{k}∥ \leq 2 / (k ∥λ_{k}∥), \end{matrix}

so that (5.1) gives us

\begin{matrix} lim_{k \to \infty} (\frac{{\tilde{y}}_{k} - \bar{y}}{∥{\tilde{y}}_{k} - \bar{y}∥} - \frac{{\tilde{λ}}_{k}}{‖ {\tilde{λ}}_{k} ‖}) = lim_{k \to \infty} (\frac{y_{k} - \bar{y}}{∥y_{k} - \bar{y}∥} - \frac{λ_{k}}{∥λ_{k}∥}) = 0 . \end{matrix}

Now, since $Φ$ is asymptotically regular at $(\bar{x}, \bar{y})$ in direction u, we obtain $x^{*} \in Im D^{*} Φ (\bar{x}, \bar{y})$ . $□$

Funding

Open Access funding enabled and organized by Projekt DEAL.

Declarations

Conflict of interest

No potential Conflict of interest was reported by the authors.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Adam, L., Červinka, M., Pištěk, M.: Normally admissible stratifications and calculation of normal cones to a finite union of polyhedral sets. Set-Valued Var. Anal. 24, 207–229 (2016). 10.1007/s11228-015-0325-8 [Google Scholar]
2.Andreani, R., Gómez, W., Haeser, G., Mito, L.M., Ramos, A.: On optimality conditions for nonlinear conic programming. Math. Oper. Res. 47(3), 2160–2185 (2021). 10.1287/moor.2021.1203 [Google Scholar]
3.Andreani, R., Haeser, G., Martínez, J.M.: On sequential optimality conditions for smooth constrained optimization. Optimization 60(5), 627–641 (2011). 10.1080/02331930903578700 [Google Scholar]
4.Andreani, R., Haeser, G., Secchin, L.D., Silva, P.J.S.: New sequential optimality conditions for mathematical programs with complementarity constraints and algorithmic consequences. SIAM J. Optim. 29(4), 3201–3230 (2019). 10.1137/18M121040X [Google Scholar]
5.Andreani, R., Martínez, J.M., Ramos, A., Silva, P.J.S.: A cone-continuity constraint qualification and algorithmic consequences. SIAM J. Optim. 26(1), 96–110 (2016). 10.1137/15M1008488 [Google Scholar]
6.Andreani, R., Martínez, J.M., Ramos, A., Silva, P.J.S.: Strict constraint qualifications and sequential optimality conditions for constrained optimization. Math. Oper. Res. 43(3), 693–717 (2018). 10.1287/moor.2017.0879 [Google Scholar]
7.Andreani, R., Martínez, J.M., Svaiter, B.F.: A new sequential optimality condition for constrained optimization and algorithmic consequences. SIAM J. Optim. 20(6), 3533–3554 (2010). 10.1137/090777189
8.Arutyunov, A.V., Izmailov, A.F.: Covering on a convex set in the absence of Robinson’s regularity. SIAM J. Optim. 30(1), 604–629 (2020). 10.1137/19M1256634 [Google Scholar]
9.Arutyunov, A.V., Avakov, E.R., Izmailov, A.F.: Necessary optimality conditions for constrained optimization problems under relaxed constraint qulifications. Math. Program. 114, 37–68 (2008). 10.1007/s10107-006-0082-4
10.Aubin, J.P., Frankowska, H.: Set-valued Analysis. Modern Birkhäuser Classics, Birkhäuser, Boston, (2009). 10.1007/978-0-8176-4848-0, reprint of the 1990 edition
11.Avakov, E.R.: Extremum conditions for smooth problems with equality-type constraints. USSR Comput. Math. Math. Phys. 25(3), 24–32 (1985). 10.1016/0041-5553(85)90069-2 [Google Scholar]
12.Avakov, E.R.: Necessary extremum conditions for smooth anormal problems with equality- and inequality-type constraints. Math. Notes Acad. Sci. USSR 45, 431–437 (1989). 10.1007/BF01158229 [Google Scholar]
13.Avakov, E.R., Arutyunov, A.V., Izmailov, A.F.: Necessary conditions for an extremum in a mathematical programming problem. Proc. Stekalov Inst. Math. 256, 2–25 (2007). 10.1134/S0081543807010014 [Google Scholar]
14.Bai, K., Ye, J.J.: Directional necessary optimality conditions for bilevel programs. Math. Oper. Res. 47(2), 1169–1191 (2022). 10.1287/moor.2021.1164 [Google Scholar]
15.Bai, K., Ye, J.J., Zhang, J.: Directional quasi-/pseudo-normality as sufficient conditions for metric subregularity. SIAM J. Optim. 29(4), 2625–2649 (2019). 10.1137/18M1232498 [Google Scholar]
16.Benko, M., Červinka, M., Hoheisel, T.: Sufficient conditions for metric subregularity of constraint systems with applications to disjunctive and ortho-disjunctive programs. Set-Valued Var. Anal. 30, 1143–177 (2022). 10.1007/s11228-020-00569-7 [Google Scholar]
17.Benko, M., Gfrerer, H.: New verifiable stationarity concepts for a class of mathematical programs with disjunctive constraints. Optimization 67(1), 1–23 (2018). 10.1080/02331934.2017.1387547 [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Benko, M., Gfrerer, H., Outrata, J.V.: Calculus for directional limiting normal cones and subdifferentials. Set-Valued Var. Anal. 27(3), 713–745 (2019). 10.1007/s11228-018-0492-5 [Google Scholar]
19.Benko, M., Gfrerer, H., Ye, J.J., Zhang, J., Zhou, J.C.: Second-order optimality conditions for general nonconvex optimization problems and variational analysis of disjunctive systems. SIAM J. Optim. 33(4), 2625–2653 (2023). 10.1137/22M1484742 [Google Scholar]
20.Benko, M., Mehlitz, P.: Calmness and calculus: two basic patterns. Set-Valued Var. Anal. 30, 81–117 (2022a). 10.1007/s11228-021-00589-x
21.Benko, M., Mehlitz, P.: On the directional asymptotic approach in optimization theory Part A: approximate, M-, and mixed-order stationarity. Tech. rep., preprint arXiv, (2022b) arxiv:2204.13932
22.Benko, M., Mehlitz, P.: On the directional asymptotic approach in optimization theory Part B: constraint qualifications. Tech. rep., preprint arXiv, (2022c) arxiv:2205.00775 [DOI] [PMC free article] [PubMed]
23.Bertsekas, D., Nedić, A., Ozdaglar, A.E.: Convex analysis and optimization. Athena Scientific, Belmont (2003) [Google Scholar]
24.Bertsekas, D.P., Ozdaglar, A.E.: Pseudonormality and a Lagrange multiplier theory for constrained optimization. J. Optim. Theory Appl. 114, 287–343 (2002). 10.1023/A:1016083601322 [Google Scholar]
25.Bonnans, J.F., Shapiro, A.: Perturbation Analysis of Optimization Problems. Springer, New York (2000). 10.1007/978-1-4612-1394-9
26.Börgens, E., Kanzow, C., Mehlitz, P., Wachsmuth, G.: New constraint qualifications for optimization problems in Banach spaces based on asymptotic KKT conditions. SIAM J. Optim. 30(4), 2956–2982 (2020). 10.1137/19M1306804 [Google Scholar]
27.Burke, J.V.: Calmness and exact penalization. SIAM J. Control. Optim. 29(2), 493–497 (1991). 10.1137/0329027 [Google Scholar]
28.Clarke, F.: Optimization and Nonsmooth Analysis. Wiley, New York (1983). 10.1137/1.9781611971309 [Google Scholar]
29.Dempe, S.: Foundations of Bilevel Programming. Kluwer, Dordrecht (2002). 10.1007/b101970 [Google Scholar]
30.Dempe, S., Kalashnikov, V., Pérez-Valdéz, G., Kalashnykova, N.: Bilevel Programming Problems - Theory. Algorithms and Applications to Energy Networks, Springer, Berlin (2015). 10.1007/978-3-662-45827-3 [Google Scholar]
31.Dontchev, A.L., Rockafellar, R.T.: Implicit Functions and Solution Mappings. Springer, Heidelberg (2014). 10.1007/978-0-387-87821-8 [Google Scholar]
32.Facchinei, F., Pang, J.S.: Finite-Dimensional Variational Inequalities and Complementarity Problems. Springer, New York (2003). 10.1007/b97543 [Google Scholar]
33.Fischer, A., Izmailov, A.F., Jelitte, M.: Newton-type methods near critical solutions of piecewise smooth nonlinear equations. Comput. Optim. Appl. 80, 587–615 (2021). 10.1007/s10589-021-00306-2 [Google Scholar]
34.Fischer, A., Izmailov, A.F., Jelitte, M.: Behavior of Newton-type methods near critical solutions of nonlinear equations with semismooth derivatives. J. Optim. Theory Appl. (2023). 10.1007/s10957-023-02350-w [Google Scholar]
35.Gfrerer, H.: Second-order necessary conditions for nonlinear optimization problems with abstract constraints: the degenerate case. SIAM J. Optim. 18(2), 589–612 (2007). 10.1137/050641387 [Google Scholar]
36.Gfrerer, H.: On directional metric regularity, subregularity and optimality conditions for nonsmooth mathematical programs. Set-Valued Var. Anal. 21(2), 151–176 (2013). 10.1007/s11228-012-0220-5 [Google Scholar]
37.Gfrerer, H.: On metric pseudo-(sub)regularity of multifunctions and optimality conditions for degenerated mathematical programs. Set-Valued Var. Anal. 22(1), 79–115 (2014). 10.1007/s11228-013-0266-z [Google Scholar]
38.Gfrerer, H.: Optimality conditions for disjunctive programs based on generalized differentiation with application to mathematical programs with equilibrium constraints. SIAM J. Optim. 24(2), 898–931 (2014). 10.1137/130914449 [Google Scholar]
39.Gfrerer, H., Klatte, D.: Lipschitz and Hölder stability of optimization problems and generalized equations. Math. Program. 158, 35–75 (2016). 10.1007/s10107-015-0914-1 [Google Scholar]
40.Gfrerer, H., Outrata, J.V.: On computation of limiting coderivatives of the normal-cone mapping to inequality systems and their applications. Optimization 65(4), 671–700 (2016). 10.1080/02331934.2015.1066372 [Google Scholar]
41.Gfrerer, H., Ye, J.J.: New constraint qualifications for mathematical programs with equilibrium constraints via variational analysis. SIAM J. Optim. 27(2), 842–865 (2017). 10.1137/16M1088752 [Google Scholar]
42.Gfrerer, H., Ye, J.J., Zhou, J.: Second-order optimality conditions for nonconvex set-constrained optimization problems. Math. Oper. Res. 47(3), 2344–2365 (2022). 10.1287/moor.2021.1211 [Google Scholar]
43.Guo, L., Ye, J.J., Zhang, J.: Mathematical programs with geometric constraints in Banach spaces: enhanced optimality, exact penalty, and sensitivity. SIAM J. Optim. 23(4), 2295–2319 (2013). 10.1137/130910956 [Google Scholar]
44.Haraux, A.: How to differentiate the projection on a convex set in Hilbert space. Some applications to variational inequalities. J. Math. Soc. Jpn. 29(4), 615–631 (1977). 10.2969/jmsj/02940615 [Google Scholar]
45.Helou, E.S., Santos, S.A., Simões, L.E.A.: A new sequential optimality condition for constrained nonsmooth optimization. SIAM J. Optim. 30(2), 1610–1637 (2020). 10.1137/18M1228608 [Google Scholar]
46.Hestenes, M.R.: Optimization Theory - the Finite-Dimensional Case. Wiley, New York (1975) [Google Scholar]
47.Ioffe, A.D.: Regular points of Lipschitz functions. Trans. Am. Math. Soc. 251, 61–69 (1979). 10.1090/S0002-9947-1979-0531969-6 [Google Scholar]
48.Ioffe, A.D.: Variational Analysis of Regular Mappings. Springer, Cham (2017). 10.1007/978-3-319-64277-2 [Google Scholar]
49.Izmailov, A.F., Kurennoy, A.S., Solodov, M.V.: Critical solutions of nonlinear equations: local attraction for Newton-type methods. Math. Program. 167, 355–379 (2018). 10.1007/s10107-017-1128-5 [Google Scholar]
50.Izmailov, A.F., Solodov, M.V.: Complementarity constraint qualification via the theory of 2-regularity. SIAM J. Optim. 13(2), 368–385 (2002). 10.1137/S1052623499365292 [Google Scholar]
51.Izmailov, A.F., Solodov, M.V.: The theory of 2-regularity for mappings with Lipschitzian derivatives and its applications to optimality conditions. Math. Oper. Res. 27(3), 614–635 (2002). 10.1287/moor.27.3.614.308 [Google Scholar]
52.Kanzow, C., Raharja, A.B., Schwartz, A.: An augmented Lagrangian method for cardinality-constrained optimization problems. J. Optim. Theory Appl. 189, 793–813 (2021). 10.1007/s10957-021-01854-7 [Google Scholar]
53.Kanzow, C., Raharja, A.B., Schwartz, A.: Sequential optimality conditions for cardinality-constrained optimization problems with applications. Comput. Optim. Appl. 80, 185–211 (2021). 10.1007/s10589-021-00298-z [Google Scholar]
54.Kanzow, C., Schwartz, A.: Mathematical programs with equilibrium constraints: enhanced Fritz John-conditions, new constraint qualifications, and improved exact penalty results. SIAM J. Optim. 20(5), 2730–2753 (2010). 10.1137/090774975 [Google Scholar]
55.Kanzow, C., Steck, D., Wachsmuth, D.: An augmented Lagrangian method for optimization problems in Banach spaces. SIAM J. Control. Optim. 56(1), 272–291 (2018). 10.1137/16M1107103 [Google Scholar]
56.Klatte, D., Kummer, B.: Constrained minima and Lipschitzian penalties in metric spaces. SIAM J. Optim. 13(2), 619–633 (2002). 10.1137/S105262340139625X [Google Scholar]
57.Kruger, A.Y.: Generalized differentials of nonsmooth functions and necessary conditions for an extremum. Sibirian Math. J. 26, 370–379 (1985) [Google Scholar]
58.Kruger, A.Y., Mehlitz, P.: Optimality conditions, approximate stationarity, and applications-a story beyond Lipschitzness. ESAIM Control Optim. Calc. Var. 28, 42 (2022). 10.1051/cocv/2022024 [Google Scholar]
59.Kruger, A.Y., Mordukhovich, B.S.: Extremal points and the Euler equation in nonsmooth optimization problems. Doklady Akademii Nauk BSSR 24(8), 684–687 (1980) [Google Scholar]
60.Levy, A.B.: Implicit multifunction theorems for the sensitivity analysis of variational conditions. Math. Program. 74, 333–350 (1996). 10.1007/BF02592203 [Google Scholar]
61.Liang, Y.C., Ye, J.J.: Optimality conditions and exact penalty for mathematical programs with switching constraints. J. Optim. Theory Appl. 190, 1–31 (2021). 10.1007/s10957-021-01879-y [Google Scholar]
62.Luo, Z.Q., Pang, J.S., Ralph, D.: Mathematical Programs with Equilibrium Constraints. Cambridge University Press, Cambridge (1996). 10.1017/CBO9780511983658 [Google Scholar]
63.Mehlitz, P.: Asymptotic stationarity and regularity for nonsmooth optimization problems. J. Nonsmooth Anal. Optim. 1,6575, (2020) 10.46298/jnsao-2020-6575 [DOI]
64.Mehlitz, P.: Asymptotic regularity for Lipschitzian nonlinear optimization problems with applications to complementarity constrained and bilevel programming. Optimization 72(1), 277–320 (2023). 10.1080/02331934.2022.2031190 [Google Scholar]
65.Mordukhovich, B.S.: Variational Analysis and Generalized Differentiation, Part I: Basic Theory. Applications. Springer, Berlin, Part II (2006). 10.1007/3-540-31247-1
66.Mordukhovich, B.S.: Variational Analysis and Applications. Springer, Cham (2018). 10.1007/978-3-319-92775-6
67.Mordukhovich, B.S., Outrata, J.V., Ramírez, C.H.: Second-order variational analysis in conic programming with applications to optimality and stability. SIAM J. Optim. 25(1), 76–101 (2015). 10.1137/120903221 [Google Scholar]
68.Outrata, J.V., Kočvara, M., Zowe, J.: Nonsmooth Approach to Optimization Problems with Equilibrium Constraints. Kluwer Academic, Dordrecht (1998). 10.1007/978-1-4757-2825-5 [Google Scholar]
69.Outrata, J.V., Sun, D.: On the coderivative of the projection operator onto the second-order cone. Set-Valued Anal. 16, 999–1014 (2008). 10.1007/s11228-008-0092-x [Google Scholar]
70.Ramos, A.: Mathematical programs with equilibrium constraints: a sequential optimality condition, new constraint qualifications and algorithmic consequences. Optim. Methods Softw. 36, 45–81 (2021). 10.1080/10556788.2019.1702661 [Google Scholar]
71.Rockafellar, R.T., Wets, R.J.B.: Variational Analysis, Grundlehren der mathematischen Wissenschaften, vol. 317. Springer, Berlin (1998). 10.1007/978-3-642-02431-3 [Google Scholar]
72.Shapiro, A.: On concepts of directional differentiability. J. Optim. Theory Appl. 66, 477–478 (1990). 10.1007/BF00940933 [Google Scholar]
73.Sun, D., Sun, J.: Semismooth matrix-valued functions. Math. Oper. Res. 27(1), 150–169 (2002). 10.1287/moor.27.1.150.342 [Google Scholar]
74.Tret’ Yakov, A.A.: Necessary and sufficient conditions for optimality of -th order. USSR Comput. Math. Math. Phys. 24(1), 123–127 (1984). 10.1016/0041-5553(84)90132-0 [Google Scholar]
75.Wu, J., Zhang, L., Zhang, Y.: Mathematical programs with semidefinite cone complementarity constraints: constraint qualifications and optimality conditions. Set-Valued Var. Anal. 22, 155–187 (2014). 10.1007/s11228-013-0242-7 [Google Scholar]
76.Ye, J.J., Ye, X.Y.: Necessary optimality conditions for optimization problems with variational inequality constraints. Math. Oper. Res. 22(4), 977–997 (1997). 10.1287/moor.22.4.977 [Google Scholar]

[CR1] 1.Adam, L., Červinka, M., Pištěk, M.: Normally admissible stratifications and calculation of normal cones to a finite union of polyhedral sets. Set-Valued Var. Anal. 24, 207–229 (2016). 10.1007/s11228-015-0325-8 [Google Scholar]

[CR2] 2.Andreani, R., Gómez, W., Haeser, G., Mito, L.M., Ramos, A.: On optimality conditions for nonlinear conic programming. Math. Oper. Res. 47(3), 2160–2185 (2021). 10.1287/moor.2021.1203 [Google Scholar]

[CR3] 3.Andreani, R., Haeser, G., Martínez, J.M.: On sequential optimality conditions for smooth constrained optimization. Optimization 60(5), 627–641 (2011). 10.1080/02331930903578700 [Google Scholar]

[CR4] 4.Andreani, R., Haeser, G., Secchin, L.D., Silva, P.J.S.: New sequential optimality conditions for mathematical programs with complementarity constraints and algorithmic consequences. SIAM J. Optim. 29(4), 3201–3230 (2019). 10.1137/18M121040X [Google Scholar]

[CR5] 5.Andreani, R., Martínez, J.M., Ramos, A., Silva, P.J.S.: A cone-continuity constraint qualification and algorithmic consequences. SIAM J. Optim. 26(1), 96–110 (2016). 10.1137/15M1008488 [Google Scholar]

[CR6] 6.Andreani, R., Martínez, J.M., Ramos, A., Silva, P.J.S.: Strict constraint qualifications and sequential optimality conditions for constrained optimization. Math. Oper. Res. 43(3), 693–717 (2018). 10.1287/moor.2017.0879 [Google Scholar]

[CR7] 7.Andreani, R., Martínez, J.M., Svaiter, B.F.: A new sequential optimality condition for constrained optimization and algorithmic consequences. SIAM J. Optim. 20(6), 3533–3554 (2010). 10.1137/090777189

[CR8] 8.Arutyunov, A.V., Izmailov, A.F.: Covering on a convex set in the absence of Robinson’s regularity. SIAM J. Optim. 30(1), 604–629 (2020). 10.1137/19M1256634 [Google Scholar]

[CR9] 9.Arutyunov, A.V., Avakov, E.R., Izmailov, A.F.: Necessary optimality conditions for constrained optimization problems under relaxed constraint qulifications. Math. Program. 114, 37–68 (2008). 10.1007/s10107-006-0082-4

[CR10] 10.Aubin, J.P., Frankowska, H.: Set-valued Analysis. Modern Birkhäuser Classics, Birkhäuser, Boston, (2009). 10.1007/978-0-8176-4848-0, reprint of the 1990 edition

[CR11] 11.Avakov, E.R.: Extremum conditions for smooth problems with equality-type constraints. USSR Comput. Math. Math. Phys. 25(3), 24–32 (1985). 10.1016/0041-5553(85)90069-2 [Google Scholar]

[CR12] 12.Avakov, E.R.: Necessary extremum conditions for smooth anormal problems with equality- and inequality-type constraints. Math. Notes Acad. Sci. USSR 45, 431–437 (1989). 10.1007/BF01158229 [Google Scholar]

[CR13] 13.Avakov, E.R., Arutyunov, A.V., Izmailov, A.F.: Necessary conditions for an extremum in a mathematical programming problem. Proc. Stekalov Inst. Math. 256, 2–25 (2007). 10.1134/S0081543807010014 [Google Scholar]

[CR14] 14.Bai, K., Ye, J.J.: Directional necessary optimality conditions for bilevel programs. Math. Oper. Res. 47(2), 1169–1191 (2022). 10.1287/moor.2021.1164 [Google Scholar]

[CR15] 15.Bai, K., Ye, J.J., Zhang, J.: Directional quasi-/pseudo-normality as sufficient conditions for metric subregularity. SIAM J. Optim. 29(4), 2625–2649 (2019). 10.1137/18M1232498 [Google Scholar]

[CR16] 16.Benko, M., Červinka, M., Hoheisel, T.: Sufficient conditions for metric subregularity of constraint systems with applications to disjunctive and ortho-disjunctive programs. Set-Valued Var. Anal. 30, 1143–177 (2022). 10.1007/s11228-020-00569-7 [Google Scholar]

[CR17] 17.Benko, M., Gfrerer, H.: New verifiable stationarity concepts for a class of mathematical programs with disjunctive constraints. Optimization 67(1), 1–23 (2018). 10.1080/02331934.2017.1387547 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Benko, M., Gfrerer, H., Outrata, J.V.: Calculus for directional limiting normal cones and subdifferentials. Set-Valued Var. Anal. 27(3), 713–745 (2019). 10.1007/s11228-018-0492-5 [Google Scholar]

[CR19] 19.Benko, M., Gfrerer, H., Ye, J.J., Zhang, J., Zhou, J.C.: Second-order optimality conditions for general nonconvex optimization problems and variational analysis of disjunctive systems. SIAM J. Optim. 33(4), 2625–2653 (2023). 10.1137/22M1484742 [Google Scholar]

[CR20] 20.Benko, M., Mehlitz, P.: Calmness and calculus: two basic patterns. Set-Valued Var. Anal. 30, 81–117 (2022a). 10.1007/s11228-021-00589-x

[CR21] 21.Benko, M., Mehlitz, P.: On the directional asymptotic approach in optimization theory Part A: approximate, M-, and mixed-order stationarity. Tech. rep., preprint arXiv, (2022b) arxiv:2204.13932

[CR22] 22.Benko, M., Mehlitz, P.: On the directional asymptotic approach in optimization theory Part B: constraint qualifications. Tech. rep., preprint arXiv, (2022c) arxiv:2205.00775 [DOI] [PMC free article] [PubMed]

[CR23] 23.Bertsekas, D., Nedić, A., Ozdaglar, A.E.: Convex analysis and optimization. Athena Scientific, Belmont (2003) [Google Scholar]

[CR24] 24.Bertsekas, D.P., Ozdaglar, A.E.: Pseudonormality and a Lagrange multiplier theory for constrained optimization. J. Optim. Theory Appl. 114, 287–343 (2002). 10.1023/A:1016083601322 [Google Scholar]

[CR25] 25.Bonnans, J.F., Shapiro, A.: Perturbation Analysis of Optimization Problems. Springer, New York (2000). 10.1007/978-1-4612-1394-9

[CR26] 26.Börgens, E., Kanzow, C., Mehlitz, P., Wachsmuth, G.: New constraint qualifications for optimization problems in Banach spaces based on asymptotic KKT conditions. SIAM J. Optim. 30(4), 2956–2982 (2020). 10.1137/19M1306804 [Google Scholar]

[CR27] 27.Burke, J.V.: Calmness and exact penalization. SIAM J. Control. Optim. 29(2), 493–497 (1991). 10.1137/0329027 [Google Scholar]

[CR28] 28.Clarke, F.: Optimization and Nonsmooth Analysis. Wiley, New York (1983). 10.1137/1.9781611971309 [Google Scholar]

[CR29] 29.Dempe, S.: Foundations of Bilevel Programming. Kluwer, Dordrecht (2002). 10.1007/b101970 [Google Scholar]

[CR30] 30.Dempe, S., Kalashnikov, V., Pérez-Valdéz, G., Kalashnykova, N.: Bilevel Programming Problems - Theory. Algorithms and Applications to Energy Networks, Springer, Berlin (2015). 10.1007/978-3-662-45827-3 [Google Scholar]

[CR31] 31.Dontchev, A.L., Rockafellar, R.T.: Implicit Functions and Solution Mappings. Springer, Heidelberg (2014). 10.1007/978-0-387-87821-8 [Google Scholar]

[CR32] 32.Facchinei, F., Pang, J.S.: Finite-Dimensional Variational Inequalities and Complementarity Problems. Springer, New York (2003). 10.1007/b97543 [Google Scholar]

[CR33] 33.Fischer, A., Izmailov, A.F., Jelitte, M.: Newton-type methods near critical solutions of piecewise smooth nonlinear equations. Comput. Optim. Appl. 80, 587–615 (2021). 10.1007/s10589-021-00306-2 [Google Scholar]

[CR34] 34.Fischer, A., Izmailov, A.F., Jelitte, M.: Behavior of Newton-type methods near critical solutions of nonlinear equations with semismooth derivatives. J. Optim. Theory Appl. (2023). 10.1007/s10957-023-02350-w [Google Scholar]

[CR35] 35.Gfrerer, H.: Second-order necessary conditions for nonlinear optimization problems with abstract constraints: the degenerate case. SIAM J. Optim. 18(2), 589–612 (2007). 10.1137/050641387 [Google Scholar]

[CR36] 36.Gfrerer, H.: On directional metric regularity, subregularity and optimality conditions for nonsmooth mathematical programs. Set-Valued Var. Anal. 21(2), 151–176 (2013). 10.1007/s11228-012-0220-5 [Google Scholar]

[CR37] 37.Gfrerer, H.: On metric pseudo-(sub)regularity of multifunctions and optimality conditions for degenerated mathematical programs. Set-Valued Var. Anal. 22(1), 79–115 (2014). 10.1007/s11228-013-0266-z [Google Scholar]

[CR38] 38.Gfrerer, H.: Optimality conditions for disjunctive programs based on generalized differentiation with application to mathematical programs with equilibrium constraints. SIAM J. Optim. 24(2), 898–931 (2014). 10.1137/130914449 [Google Scholar]

[CR39] 39.Gfrerer, H., Klatte, D.: Lipschitz and Hölder stability of optimization problems and generalized equations. Math. Program. 158, 35–75 (2016). 10.1007/s10107-015-0914-1 [Google Scholar]

[CR40] 40.Gfrerer, H., Outrata, J.V.: On computation of limiting coderivatives of the normal-cone mapping to inequality systems and their applications. Optimization 65(4), 671–700 (2016). 10.1080/02331934.2015.1066372 [Google Scholar]

[CR41] 41.Gfrerer, H., Ye, J.J.: New constraint qualifications for mathematical programs with equilibrium constraints via variational analysis. SIAM J. Optim. 27(2), 842–865 (2017). 10.1137/16M1088752 [Google Scholar]

[CR42] 42.Gfrerer, H., Ye, J.J., Zhou, J.: Second-order optimality conditions for nonconvex set-constrained optimization problems. Math. Oper. Res. 47(3), 2344–2365 (2022). 10.1287/moor.2021.1211 [Google Scholar]

[CR43] 43.Guo, L., Ye, J.J., Zhang, J.: Mathematical programs with geometric constraints in Banach spaces: enhanced optimality, exact penalty, and sensitivity. SIAM J. Optim. 23(4), 2295–2319 (2013). 10.1137/130910956 [Google Scholar]

[CR44] 44.Haraux, A.: How to differentiate the projection on a convex set in Hilbert space. Some applications to variational inequalities. J. Math. Soc. Jpn. 29(4), 615–631 (1977). 10.2969/jmsj/02940615 [Google Scholar]

[CR45] 45.Helou, E.S., Santos, S.A., Simões, L.E.A.: A new sequential optimality condition for constrained nonsmooth optimization. SIAM J. Optim. 30(2), 1610–1637 (2020). 10.1137/18M1228608 [Google Scholar]

[CR46] 46.Hestenes, M.R.: Optimization Theory - the Finite-Dimensional Case. Wiley, New York (1975) [Google Scholar]

[CR47] 47.Ioffe, A.D.: Regular points of Lipschitz functions. Trans. Am. Math. Soc. 251, 61–69 (1979). 10.1090/S0002-9947-1979-0531969-6 [Google Scholar]

[CR48] 48.Ioffe, A.D.: Variational Analysis of Regular Mappings. Springer, Cham (2017). 10.1007/978-3-319-64277-2 [Google Scholar]

[CR49] 49.Izmailov, A.F., Kurennoy, A.S., Solodov, M.V.: Critical solutions of nonlinear equations: local attraction for Newton-type methods. Math. Program. 167, 355–379 (2018). 10.1007/s10107-017-1128-5 [Google Scholar]

[CR50] 50.Izmailov, A.F., Solodov, M.V.: Complementarity constraint qualification via the theory of 2-regularity. SIAM J. Optim. 13(2), 368–385 (2002). 10.1137/S1052623499365292 [Google Scholar]

[CR51] 51.Izmailov, A.F., Solodov, M.V.: The theory of 2-regularity for mappings with Lipschitzian derivatives and its applications to optimality conditions. Math. Oper. Res. 27(3), 614–635 (2002). 10.1287/moor.27.3.614.308 [Google Scholar]

[CR52] 52.Kanzow, C., Raharja, A.B., Schwartz, A.: An augmented Lagrangian method for cardinality-constrained optimization problems. J. Optim. Theory Appl. 189, 793–813 (2021). 10.1007/s10957-021-01854-7 [Google Scholar]

[CR53] 53.Kanzow, C., Raharja, A.B., Schwartz, A.: Sequential optimality conditions for cardinality-constrained optimization problems with applications. Comput. Optim. Appl. 80, 185–211 (2021). 10.1007/s10589-021-00298-z [Google Scholar]

[CR54] 54.Kanzow, C., Schwartz, A.: Mathematical programs with equilibrium constraints: enhanced Fritz John-conditions, new constraint qualifications, and improved exact penalty results. SIAM J. Optim. 20(5), 2730–2753 (2010). 10.1137/090774975 [Google Scholar]

[CR55] 55.Kanzow, C., Steck, D., Wachsmuth, D.: An augmented Lagrangian method for optimization problems in Banach spaces. SIAM J. Control. Optim. 56(1), 272–291 (2018). 10.1137/16M1107103 [Google Scholar]

[CR56] 56.Klatte, D., Kummer, B.: Constrained minima and Lipschitzian penalties in metric spaces. SIAM J. Optim. 13(2), 619–633 (2002). 10.1137/S105262340139625X [Google Scholar]

[CR57] 57.Kruger, A.Y.: Generalized differentials of nonsmooth functions and necessary conditions for an extremum. Sibirian Math. J. 26, 370–379 (1985) [Google Scholar]

[CR58] 58.Kruger, A.Y., Mehlitz, P.: Optimality conditions, approximate stationarity, and applications-a story beyond Lipschitzness. ESAIM Control Optim. Calc. Var. 28, 42 (2022). 10.1051/cocv/2022024 [Google Scholar]

[CR59] 59.Kruger, A.Y., Mordukhovich, B.S.: Extremal points and the Euler equation in nonsmooth optimization problems. Doklady Akademii Nauk BSSR 24(8), 684–687 (1980) [Google Scholar]

[CR60] 60.Levy, A.B.: Implicit multifunction theorems for the sensitivity analysis of variational conditions. Math. Program. 74, 333–350 (1996). 10.1007/BF02592203 [Google Scholar]

[CR61] 61.Liang, Y.C., Ye, J.J.: Optimality conditions and exact penalty for mathematical programs with switching constraints. J. Optim. Theory Appl. 190, 1–31 (2021). 10.1007/s10957-021-01879-y [Google Scholar]

[CR62] 62.Luo, Z.Q., Pang, J.S., Ralph, D.: Mathematical Programs with Equilibrium Constraints. Cambridge University Press, Cambridge (1996). 10.1017/CBO9780511983658 [Google Scholar]

[CR63] 63.Mehlitz, P.: Asymptotic stationarity and regularity for nonsmooth optimization problems. J. Nonsmooth Anal. Optim. 1,6575, (2020) 10.46298/jnsao-2020-6575 [DOI]

[CR64] 64.Mehlitz, P.: Asymptotic regularity for Lipschitzian nonlinear optimization problems with applications to complementarity constrained and bilevel programming. Optimization 72(1), 277–320 (2023). 10.1080/02331934.2022.2031190 [Google Scholar]

[CR65] 65.Mordukhovich, B.S.: Variational Analysis and Generalized Differentiation, Part I: Basic Theory. Applications. Springer, Berlin, Part II (2006). 10.1007/3-540-31247-1

[CR66] 66.Mordukhovich, B.S.: Variational Analysis and Applications. Springer, Cham (2018). 10.1007/978-3-319-92775-6

[CR67] 67.Mordukhovich, B.S., Outrata, J.V., Ramírez, C.H.: Second-order variational analysis in conic programming with applications to optimality and stability. SIAM J. Optim. 25(1), 76–101 (2015). 10.1137/120903221 [Google Scholar]

[CR68] 68.Outrata, J.V., Kočvara, M., Zowe, J.: Nonsmooth Approach to Optimization Problems with Equilibrium Constraints. Kluwer Academic, Dordrecht (1998). 10.1007/978-1-4757-2825-5 [Google Scholar]

[CR69] 69.Outrata, J.V., Sun, D.: On the coderivative of the projection operator onto the second-order cone. Set-Valued Anal. 16, 999–1014 (2008). 10.1007/s11228-008-0092-x [Google Scholar]

[CR70] 70.Ramos, A.: Mathematical programs with equilibrium constraints: a sequential optimality condition, new constraint qualifications and algorithmic consequences. Optim. Methods Softw. 36, 45–81 (2021). 10.1080/10556788.2019.1702661 [Google Scholar]

[CR71] 71.Rockafellar, R.T., Wets, R.J.B.: Variational Analysis, Grundlehren der mathematischen Wissenschaften, vol. 317. Springer, Berlin (1998). 10.1007/978-3-642-02431-3 [Google Scholar]

[CR72] 72.Shapiro, A.: On concepts of directional differentiability. J. Optim. Theory Appl. 66, 477–478 (1990). 10.1007/BF00940933 [Google Scholar]

[CR73] 73.Sun, D., Sun, J.: Semismooth matrix-valued functions. Math. Oper. Res. 27(1), 150–169 (2002). 10.1287/moor.27.1.150.342 [Google Scholar]

[CR74] 74.Tret’ Yakov, A.A.: Necessary and sufficient conditions for optimality of -th order. USSR Comput. Math. Math. Phys. 24(1), 123–127 (1984). 10.1016/0041-5553(84)90132-0 [Google Scholar]

[CR75] 75.Wu, J., Zhang, L., Zhang, Y.: Mathematical programs with semidefinite cone complementarity constraints: constraint qualifications and optimality conditions. Set-Valued Var. Anal. 22, 155–187 (2014). 10.1007/s11228-013-0242-7 [Google Scholar]

[CR76] 76.Ye, J.J., Ye, X.Y.: Necessary optimality conditions for optimization problems with variational inequality constraints. Math. Oper. Res. 22(4), 977–997 (1997). 10.1287/moor.22.4.977 [Google Scholar]

PERMALINK

On the directional asymptotic approach in optimization theory

Matúš Benko

Patrick Mehlitz

Abstract

Introduction

Notation and preliminaries

Basic notation

Fundamentals of variational analysis

Lemma 2.1

Lemma 2.2

Lemma 2.3

Definition 2.1

Example 2.1

Generalized differentiation

Subdifferentials

Graphical derivatives

Definition 2.2

Lemma 2.4

Proof

Lemma 2.5

Proof

Lemma 2.6

Proof

Lemma 2.7

Proof

Coderivatives, pseudo-coderivatives, and super-coderivatives

Definition 2.3

Lemma 2.8

Example 2.2

Definition 2.4

Lemma 2.9

Proof

Sufficient conditions for pseudo-(sub)regularity

Lemma 2.10

Proof

Example 2.3

Remark 2.1

Pseudo-(sub)regularity of order 2 for constraint mappings

Directional pseudo-coderivatives and sufficient conditions

Lemma 3.1

Proof

Theorem 3.1

Proof

Remark 3.1

Remark 3.2

Theorem 3.2

Proof

Remark 3.3

Corollary 3.1

Remark 3.4

The convex polyhedral case: a comparison with related results

Definition 3.1

Lemma 3.2

Proof

Proposition 3.1

Proof

Corollary 3.2

Proof

Example 3.1

Remark 3.5

Example 3.2

Directional asymptotic stationarity in nonsmooth optimization

Definition 4.1

Lemma 4.1

Approaching mixed-order stationarity conditions

Definition 4.2

Lemma 4.2

Proof

Proposition 4.1

Remark 4.1

Corollary 4.1

Proof

Mixed-order and asymptotic stationarity conditions

Theorem 4.1

Proof

Corollary 4.2

Proof

Remark 4.2

Corollary 4.3

Mixed-order necessary optimality conditions for optimization problems with geometric constraints in the case $γ : = 2$