Skip to main content
Springer logoLink to Springer
. 2021 Feb 18;194(1-2):191–227. doi: 10.1007/s10107-021-01623-4

Complexity of linear relaxations in integer programming

Gennadiy Averkov 1, Matthias Schymura 1,
PMCID: PMC9237013  PMID: 35782488

Abstract

For a set X of integer points in a polyhedron, the smallest number of facets of any polyhedron whose set of integer points coincides with X is called the relaxation complexity rc(X). This parameter, introduced by Kaibel & Weltge (2015), captures the complexity of linear descriptions of X without using auxiliary variables. Using tools from combinatorics, geometry of numbers, and quantifier elimination, we make progress on several open questions regarding rc(X) and its variant rcQ(X), restricting the descriptions of X to rational polyhedra. As our main results we show that rc(X)=rcQ(X) when: (a) X is at most four-dimensional, (b) X represents every residue class in (Z/2Z)d, (c) the convex hull of X contains an interior integer point, or (d) the lattice-width of X is above a certain threshold. Additionally, rc(X) can be algorithmically computed when X is at most three-dimensional, or X satisfies one of the conditions (b), (c), or (d) above. Moreover, we obtain an improved lower bound on rc(X) in terms of the dimension of X.

Mathematics Subject Classification: Primary: 90C10, Secondary: 90C57, 52B05, 52B20, 03C10

Introduction

Encoding discrete optimization instances as integer vectors satisfying a system of linear constraints is a fundamental principle of combinatorial optimization, successfully applied in a multitude of cases in the last six decades. This approach establishes a connection to integer programming and, by this, allows to use linear-programming based solution techniques. In the course of development, classical combinatorial optimization problems have been endowed with standard formulations as integer programs. For example, the formulation based on the subtour elimination constraints is the standard formulation of the traveling salesman problem (cf. [1]). Nevertheless, we would like to draw attention to the fact that if a certain discrete set of feasible solutions is represented as a set X of integer vectors, then a priori there are many different ways to describe X via a system of linear inequalities in integer variables. Thus, it makes sense to investigate the family of all possible representations of X using integer-programming constraints in order to detect the most interesting ones. So far, there have been many results about the tight descriptions based on the facet-defining inequalities for the convex hull of X. The interest in tight descriptions is easily motivated by the fact that knowing the inequalities that describe the convex hull of X allows to optimize a linear objective over X exactly using linear programming. However, since tight descriptions can have a very large size, we believe that one should not only focus on the tightness but also investigate possibilities of finding formulations of small size. While the size of the description is determined by the number of constraints and the size of the coefficients, here we only address problems related to the number of constraints. Regarding the size of the coefficients, we refer to the recent work [20, 21] of Hojny. The study of extended formulations [15, 24] is yet another interesting line of research, but here we focus on the formulations in the original space.

We work at the level of general integer programming, which means that we consider an arbitrary finite set XZd of integer points in a polyhedron and investigate possibilities to describe this set within the integer lattice Zd by a possibly small number of linear inequalities in integer variables. Our research is motivated by a recent contribution of Kaibel & Weltge [25] (see also Weltge’s PhD thesis [38]), who posed a number of fundamental problems in this context. We also point out that, in a similar spirit, descriptions of sets of 0/1 points within the discrete hypercube {0,1}d were considered by Jeroslow [23] in the 1970s and are an important subject in the theory of social choice [19, 36].

We now introduce some notation and present our results.

Definition 1.1

For a system of linear constraints Axb,Ux=v with real coefficients, consider

X=xZd:Axb,Ux=v. 1

We call the system Axb,Ux=v, as well as the polyhedron that it defines, a relaxation of X within the integer lattice Zd.

When (1) holds and Axb is a system of k inequalities, we say that X is described by k linear inequalities within Zd. If the coefficients of AUbv are rational numbers, we say that X is described by k rational linear inequalities within Zd. See Fig. 1 for an illustration.

Fig. 1.

Fig. 1

Descriptions of {0,1}2 by four and three (rational) linear inequalities within Z2

The minimal k such that X can be described by k linear inequalities (resp. rational linear inequalities) within Zd is called the relaxation complexity of X (resp. rational relaxation complexity of X) within Zd and denoted by rc(X) (resp. rcQ(X)).

We call a set XZd satisfying conv(X)Zd=X lattice-convex. It is easy to see that a finite subset X of Zd has a relaxation within Zd if and only if X is lattice-convex. Since in combinatorial optimization one is usually interested in finite subsets XZd, we deal with finite lattice-convex sets in the sequel.

The two values rc(X) and rcQ(X) were recently introduced by Kaibel & Weltge [25]. In terms of polyhedra, rc(X) and rcQ(X) is the minimum number of facets of a polyhedron (resp. rational polyhedron) P satisfying X=PZd. Kaibel & Weltge posed the following three general questions for arbitrary finite lattice-convex sets:

  1. Does the inequality rc(X)dim(X)+1 hold?

  2. Do rc(X) and rcQ(X) coincide?

  3. Are rc(X) and rcQ(X) algorithmically computable?

We note that a positive answer to (Q2) implies a positive answer to (Q1), because every rational relaxation of X is necessarily bounded within the affine hull of X, and thus rcQ(X)dim(X)+1. On the other hand, a positive answer to (Q2) simplifies but does not automatically resolve (Q3), because (Q3) is open for both rc(X) and rcQ(X). Regarding (Q3) we point out that this question is open to the extent that we do not even know rc(X) for very concrete and simple looking examples, like the set Δd={0,e1,,ed} of lattice points of the standard simplex.

Our first contribution concerns (Q1) and provides an improved lower bound on the relaxation complexity in terms of the dimension:

Theorem 1.2

Let XZd be a finite lattice-convex set of dimension dim(X)4. Then,

rc(X)>log2(dim(X))-log2log2(dim(X)).

We prove this bound in Sect. 2. In Sect. 3, we show that for d4, Question (Q1) can be answered affirmatively.

Weltge [38, Sect. 7.5] showed that in dimension two, rc(X) and rcQ(X) coincide and are computable. Already passing from dimension two to dimension three both Questions (Q2) and (Q3) get considerably harder. As our main contributions, we give an affirmative answer to both questions in various settings: First, we successfully treat small dimensions in full generality.

Theorem 1.3

Let X be a finite lattice-convex subset of Zd.

  1. If d4, then rc(X)=rcQ(X).

  2. If d3, then rc(X) and rcQ(X) can be computed algorithmically.

Second, we identify large families of sets X in arbitrary dimensions that are defined by natural conditions and for which Questions (Q2) and (Q3) can be answered positively.

Theorem 1.4

Let X be a finite lattice-convex subset of Zd satisfying one of the following conditions:

  1. The points in X represent every residue class in (Z/2Z)d.

  2. The polytope conv(X) contains interior lattice points.

  3. The lattice-width of X is bigger than the finiteness threshold width w(d), introduced in [10].

Then, rc(X) and rcQ(X) coincide and can be computed algorithmically.

The value of w(d) is known in small dimensions. We have w(1)=w(2)=0, w(3)=1, and w(4)=2. Theorem 1.3(a) is based on the peculiarity that in dimensions at most four every relaxation of X is bounded, whereas for dimensions d5 this is not necessarily the case. We obtain this result using a description of maximal lattice-free sets provided by Lovász [27]. Once the boundedness of every relaxation is established, it is not hard to deduce that a relaxation P of X having k facets can be modified to a rational relaxation of X that still has k facets by slightly moving the facets out and perturbing the facet normals to rational normals. The details will be discussed in Sect. 3.

The proof of Theorem 1.3(b) relies on Theorem 1.4, so we discuss this first. To prove Theorem 1.4, we investigate the structure of the set of so-called observers of X. We say that a point yZd\X outside a lattice-convex set XZd is an observer of X if the set X{y} is lattice-convex as well.

It turns out that whenever P is a (rational) polyhedron that contains X as a subset, but that does not contain any observer of X, then P is actually a relaxation of X. Moreover, if X satisfies any of the three conditions in Theorem 1.4, then the set of observers of X is finite and can be computed. It thus remains to determine the minimum number k of inequalities that is sufficient to separate X from its observers. The latter task can be carried out algorithmically using mixed-integer linear programming as an auxiliary tool. This approach is developed in Sect. 4.

For proving Theorem 1.3(b) it suffices to consider sets X of dimension three, since for dim(X)2 we can use the computability of the relaxation complexity in dimension two established by Weltge [38] in his thesis. As a first step towards computability of rc(X) in dimension three, we characterize those X that have finitely many observers, and deal with them as in Theorem 1.4 as discussed before. If the set of observers of X is infinite, then it still turns out to be structured enough for an algorithmic treatment. To this end, we need to solve a special quantifier elimination problem for mixed-integer linear quantified expressions, which we find interesting in its own right. These arguments will be laid out in Sect. 6.

Basic notation and terminology. The affine and convex hull of a set XRd are denoted by aff(X) and conv(X), respectively. The line segment with endpoints a,bRd is written as [a,b]:=conv({a,b}). By dim(X) we denote the dimension of X, which we define to be the dimension of the affine hull of X. For a positive integer m, we write [m]={1,,m}. We use standard terminology from polyhedral theory such as polyhedron, vertex, face and facet and basic notions of the geometry of numbers, such as lattice. A lattice point is a point of the integer lattice Zd. A non-zero lattice point uZd\{0} is called primitive if the only lattice points of the line segment [0, u] are its two endpoints. We define an affine lattice to be a translation of a lattice by an arbitrary translation vector. By e1,,ed we denote the standard unit vectors of Rd. Two sets A,BRd are called unimodularly equivalent, if there is a unimodular transformation f:RdRd such that A=f(B), where a unimodular transformation is a transformation of the form f(x)=Ux+t, with U being an integral d×d-matrix of determinant ±1, and tZd. Elements of Rd are interpreted as columns in analytic expressions. For more information and basic properties for all of these concepts we refer to the textbooks of Schrijver [33] and Gruber [17].

Lower bounds on rc(X) in terms of the dimension of X

In this section, we prove Theorem 1.2. Although our result is still far from answering (Q1), it is the best lower bound known so far. Our argument is inspired by the proof and the result of Weltge [38, Prop. 8.1.4] who established the implication

dim(X)k!rc(X)k

for X={0,e1,,ed} by induction on k. In fact, Weltge’s argument can be applied for an arbitrary finite lattice-convex set X. We are able to replace k! with a single-exponential function in k by replacing his inductive argument with a pigeonhole-principle type argument. As in the proof of Weltge, we need the following auxiliary result.

Lemma 2.1

([2, Lem. 4]) Every unbounded full-dimensional polyhedron PRd that contains a lattice point in its interior contains infinitely many lattice points in its interior.

Theorem 2.2

Let XZd be a finite lattice-convex set satisfying

dim(X)(k-1)kk/2

for some integer value k2. Then rc(X)k+1.

Proof

By restricting considerations to the affine hull of X, without loss of generality, we can assume that X is full-dimensional and thus d=dim(X)(k-1)kk/2. Assuming that X has a relaxation

P=xRd:a1xβ1,,akxβk

given by k linear inequalities, we derive a contradiction. To this end, fix p1,,pd+1 to be d+1 affinely independent points in X. With each j[d+1] we associate the set

Ij:=i[k]:aipj=βi

of indices of the inequalities of the relaxation of X that are active on pj. None of the sets I1,,Id+1 is empty. Indeed, the relaxation P is unbounded, because since k2, we have k(k-1)kk/2d. Thus, if Ij were empty, then pj would be an interior lattice point of P. By Lemma 2.1 it then follows that P contains infinitely many lattice points, a contradiction.

To explain the proof idea, we first show a weaker assertion, namely that

d=dim(X)(k-1)2krc(X)k+1. 2

This implication is weaker than what we actually want to prove because, as 2kkk/2, the assumption in (2) is stronger. Now, the inequality dim(X)(k-1)2k implies that I1,,Id+1 is a list of at least (k-1)2k+1 subsets of [k]. There are 2k different subsets of [k] in total. So, by a variant of the pigeonhole principle there exists a subset I[k] that occurs in the list I1,,Id+1 at least (d+1)/2k times, which means that the set

J:=j[d+1]:Ij=I

has at least (d+1)/2k elements. We have

d+12k(k-1)2k+12k>k-1,

so that |J|k. Without loss of generality assume that I1==Ik=I. Let H:=aff({p1,,pk}). The inequality ajxbj holds with equality for all xH and all jI. Hence, the polyhedron

Q=xH:ajxbjfor allj[k]\I

is a relaxation of the (k-1)-dimensional set HX within the affine lattice HZd. The relaxation is given by k-|I|k-1 inequalities. Since the points p1,,pd+1X were chosen to be affinely independent, we have dim(H)=k-1, and so the relaxation Q is unbounded. By construction, the k-|I| inequalities defining the relaxation Q hold strictly on p1,,pk. Thus, p1,,pk belong to the relative interior of Q, which contradicts Lemma 2.1.

For the improved assertion, the approach is similar, but we use Sperner’s theorem on the size of antichains in the boolean lattice to strengthen the argument. The idea is that we do not need to have all of the points p1,,pk in the relative interior of Q, but rather just one of them, in order to obtain a contradiction.

To this end, observe that the family of sets {I1,,Id+1} is partially ordered by inclusion, and consider the m-element subfamily {S1,,Sm} of all inclusion-minimal elements of {I1,,Id+1}. This subfamily consists of pairwise incomparable subsets of [k] and thus forms what is called an antichain in the boolean lattice of subsets of [k]. Sperner’s theorem [34] asserts that mkk/2. As each of the sets I1,,Id+1 contains one of the inclusion-minimal sets S1,,Sm we have

[d+1]=t=1mj[d+1]:StIj.

Consequently,

d+1=t=1mj[d+1]:StIjt=1mj[d+1]:StIjm·maxt[m]j[d+1]:StIj.

We have thus shown the existence of an index t[m] such that St is contained in at least (d+1)/m sets from the list I1,,Id+1. Without loss of generality, we assume that this holds for the set S1. Using the lower bound on dim(X), we see that

d+1m(k-1)kk/2+1kk/2>k-1.

Thus, at least k of the sets from the list I1,,Id+1 contain S1 as a subset, and we may assume that S1Ij, for j[k], and that S1=I1. We can now repeat the above argument, replacing the set I by S1, in order to find that the point p1 lies in the relative interior of the corresponding relaxation Q of HX within HZd.

Let us derive an explicit lower bound on rc(X) in terms of dim(X), which goes to infinity, when dim(X).

Proof of Theorem 1.2

Recall that we want to prove that

rc(X)>log2(dim(X))-log2log2(dim(X)).

In fact, we will see that this already follows from the weaker statement (2) which does not rely on Sperner’s Theorem. So, let k2 be maximal such that dim(X)(k-1)2k. Then, by (2) and since dim(X)<k2k+1, we get

rc(X)k+1>log2(dim(X))-log2(k).

The claimed inequality follows, since k2 implies dim(X)(k-1)2k2k, and thus log2(k)log2log2(dim(X)).

Remark 2.3

One way to measure progress on Question (Q1) is to find a smallest possible function f(k) such that rc(X)k holds whenever dim(X)f(k). Weltge’s recursive approach (generalized to arbitrary X) yields f(k)=k!, while our non-recursive strategy leads roughly to f(k)=kk/22k. In this setting, the improvement may appear significant. If one however measures improvement using a function F satisfying rc(X)F(dim(X)), then Weltge’s lower bound on rc(X) is of order log(dim(X))/loglog(dim(X)), which we improve by a factor of loglog(dim(X)), up to lower order terms. Our main motivation for presenting the proof of Theorem 2.2 is to make the connection to extremal combinatorics via Sperner’s Theorem.

The role of rationality in dimensions d4

This part is mainly devoted to proving Theorem 1.3(a), that is, showing that rc(X)=rcQ(X), for all at most four-dimensional lattice-convex sets X. We also see how the developed methods enable us to answer (Q1) affirmatively in these dimensions. Our main observation is that there is a qualitative difference between low and high dimensions: We show that in dimensions up to four, relaxations of finite sets are necessarily bounded, while in higher dimensions this is not necessarily the case.

The arguments are based on the notion of maximal lattice-free sets. We call a k-dimensional closed convex set LRd such that aff(L)Zd is a k-dimensional affine lattice, a k-dimensional lattice-free set if the relative interior of L does not contain points of Zd. Further, we call such a set L a k-dimensional maximal lattice-free set, if L is not properly contained in another k-dimensional lattice-free set.

Proposition 3.1

Every k-dimensional lattice-free set is a subset of a maximal k-dimensional lattice-free set.

Proof

This is well-known and can be easily derived from Zorn’s lemma, or by a topological argument [6, Prop. 3.1]; see also Basu et al. [8, Cor. 2.2] for a more constructive proof.

The following structural result has been formulated by Lovász in [27, Sect. 3]. A complete proof can be found in [2] and [8].

Theorem 3.2

Every d-dimensional maximal lattice-free set L is a polyhedron. If L is bounded, then L has at most 2d facets and the relative interior of each facet contains a point of the lattice Zd. If L is unbounded, then L is unimodularly equivalent to L×Rm for some m{1,,d-1} and some bounded (m-d)-dimensional maximal lattice-free set LRd-m.

The main insight towards the aforementioned results is to show that relaxations of low-dimensional finite lattice-convex sets are always bounded. We say that a vector rRd is a recession vector of a polyhedron PRd if P+rP. If r0, the ray γ=λr:λ0 in direction of r is called a recession ray. The set of all recession vectors of P is called the recession cone rec(P) of P.

Lemma 3.3

Let XZd be a finite lattice-convex set and let one of the following conditions hold:

  1. d=2, dim(X)1,

  2. d=3, dim(X)2,

  3. d=4, dim(X)=4.

Then, every relaxation P of X is bounded.

Proof

Let the polyhedron P be a relaxation of X, which means PZd=X. We assume to the contrary that P is unbounded, that is, by the basic theory of convexity there is a recession ray γ of P, with recession vector rrec(P)\{0} (cf. [33]).

Case 1: d=2,dim(X)1. We consider two distinct points ab of X. If the ray γ is parallel to [ab], say r=b-a, then a+k(b-a), with kZ0, are infinitely many lattice points that are contained in P, which is a contradiction. If γ is not parallel to [ab], consider the two-dimensional set [a,b]+γP. If [a,b]+γ is not lattice-free, then Lemma 2.1 yields that [a,b]+γ contains infinitely many interior lattice points, which contradicts PZ2=X. If [a,b]+γ is lattice-free, then by Proposition 3.1 there is a two-dimensional maximal lattice-free set L containing [a,b]+γ. In view of Theorem 3.2, L is unimodularly equivalent to [0,1]×R. Thus, the recession cone of L is a rational line. On the other hand, the recession cone of P contains γ as a subset. Hence, the ray γ has a rational direction. It follows that a+γ contains infinitely many points of Z2, which again contradicts PZ2=X.

Case 2: d=3,dim(X)2. We consider three affinely independent points a,b,cX and the triangle T=conv({a,b,c}). If γ is parallel to the plane affinely spanned by abc, then using Case 1 for the subset aff({a,b,c})X of the two-dimensional affine lattice aff({a,b,c})Z3, we arrive at a contradiction. Otherwise, T+γ is a three-dimensional subset of P. If T+γ is not lattice-free, then Lemma 2.1 yields that T+γ contains infinitely many lattice points, which is a contradiction. If T+γ is lattice-free, then let L be a three-dimensional maximal lattice-free set containing T+γ. By Theorem 3.2, L is unimodularly equivalent to a set of the form L×Rm, where m{1,2} and L is a bounded (3-m)-dimensional maximal lattice-free set.

If m=1, then the recession cone of L is a rational line and so γ, being a subset of the recession cone of P is a rational ray. This shows that a+γ contains infinitely many points of Z3, again a contradiction.

If m=2, then the boundary of L is a union of two parallel planes and one of these planes, which we denote by H, contains at least two of the points abc. The ray γ is parallel to H, and HX is a lattice-convex set of dimension at least one in the affine lattice HZ3. Applying the assertion of Case 1 to the set HX in HZ3 yields the desired contradiction.

Case 3: d=4,dim(X)=4. We pick five affinely independent points p0,,p4 in X and consider the simplex S=conv({p0,,p4}). Clearly, S+γ is four-dimensional. If S+γ is not lattice-free, we get a contradiction just as in the previous cases. If S+γ is lattice-free, again analogously to the previous cases, we find a maximal four-dimensional lattice-free set LS+γ. This set is unimodularly equivalent to L×Rm, where m{1,2,3} and L is a bounded (4-m)-dimensional maximal lattice-free set. Without loss of generality, we assume that L=L×Rm.

If m=1, then the recession cone of L is a rational line, and so γ is a ray in a rational direction. In this case, p0+γ contains infinitely many points of Z4, which is a contradiction.

If m=2, then L is a bounded two-dimensional maximal lattice-free set. Consider the projection map π:R4R2, π(x1,x2,x3,x4)=(x1,x2). If two of the points π(p0),,π(p4) coincide, say q=π(p0)=π(p1), then the fiber H:=π-1(q) is a two-dimensional affine space containing the points p0 and p1. Thus, we can use Case 1 for the set XH of dimension at least one in the two-dimensional affine lattice HZ4 to arrive at a contradiction. Thus, we can assume that the five points qi=π(pi), with i{0,1,,4}, are pairwise distinct lattice points in L.

Maximal lattice-free sets in dimension two are completely classified (see [5, Thm. 4] and the references therein). The classification restricts L as follows:

  1. L is a triangle or a quadrilateral.

  2. If L is a quadrilateral, then L contains exactly four lattice points.

  3. If L is a triangle, then all but two lattice points of L are contained in the same edge, or L is unimodularly equivalent to conv({0,2e1,2e2}).

Since q0,,q4 are five distinct points, L cannot be a quadrilateral. Thus, L is a triangle and we conclude that three of these five points, say q0,q1,q2, lie in the same edge of L. Then p0,p1,p2 lie in the same facet of L. Let us denote by H the hyperplane in R4 spanned by the facet of L that contains p0,p1,p2. The set HX is a lattice-convex set of dimension at least two in the three-dimensional affine lattice HZ4. Thus, we can use assertion of Case 2 to arrive at a contradiction.

If m=3, then the boundary of L is a union of two parallel hyperplanes and one of them, say H, contains at least three of the affinely independent points p0,,p4. The ray γ is parallel to H, and HX is a lattice-convex set of dimension at least two in the three-dimensional affine lattice HZ4. Applying the assertion of Case 2 to the set HX in HZ4, yields the desired contradiction.

Lemma 3.3 is constrained to small dimensions. However, we will see that it already provides the full picture on boundedness of arbitrary relaxations. The proof of the following auxiliary result presents a general construction hidden behind the example of Kaibel & Weltge [25, Ex. 1] (cf. [38, Sect. 7.3]).

Lemma 3.4

Let dmk be integers such that dm2 and d-mkmin(d,2d-m-1). Then, the k-dimensional lattice-convex set X:={0,e1,,ek} of Zd has an unbounded relaxation.

Proof

Fix a 0/1 matrix A{0,1}(d-m)×d whose first k columns are pairwise distinct 0/1 vectors linearly spanning Rd-m, and whose last d-k columns have all entries equal to 0. Since the kernel of A has dimension m2, we can choose an irrational line g within this kernel and which satisfies gZd={0}. We claim that the unbounded polyhedron P:=conv(X)+g is a relaxation of X. The inclusion XPZd is clear. To show the converse, consider xconv(X) and ug such that x+uZd. We need to show x+uX. Clearly, Ax=A(x+u)Zd-m and AxQ:=conv(Az:zX). Consequently, v:=AxQZd-m is a vertex of the 0/1 polytope Q. The pre-image of the vertex v in conv(X) under the linear map defined by A is a face of conv(X). Since each vertex of conv(X) is sent to a different vertex of Q, the mentioned pre-image is a 0-dimensional face. This means xX. It follows that u=(x+u)-xZd and, by the choice of g, we conclude that u=0. Thus, x+u=xX.

We can now show that Lemma 3.3 lists all the pairs (d,dim(X)) for which every relaxation of an arbitrary lattice-convex set XZd is bounded:

Theorem 3.5

For integers d and k with d1,k0 and dk, the following conditions are equivalent:

  • (i)

    Every relaxation of every k-dimensional finite lattice-convex set of Zd is bounded.

  • (ii)

    Every relaxation of the k-dimensional set {0,e1,,ek} of Zd is bounded.

  • (iii)

    Either d3 and kd-1, or d=k=4.

Proof

The implication (i) (ii) is trivial. The implication (ii) (iii) follows from Lemma 3.4: We assume that (iii) does not hold and provide a k-dimensional finite lattice-convex set of Zd that has an unbounded relaxation. In the case kd-2, pick a k-dimensional finite lattice-convex set X of Zk and an irrational hyperplane H in the space Rd-k (of dimension at least 2) that satisfies HZd-k={0}. The set conv(X)×H is an unbounded relaxation of the k-dimensional subset X×{0}d-k of Zd. We are left with the case k{d-1,d}. Since (iii) is not fulfilled, we have either k=d5 or k=d-1 and d4. In each of these cases, we use Lemma 3.4 with m=2. The remaining implication (iii) (i) is exactly the statement of Lemma 3.3.

Returning to the main focus of this section, we use Lemma 3.3 and show now that rationality does not play a distinguished role in dimensions at most four, proving Theorem 1.3(a).

Theorem 3.6

Let d4. Then, rc(X)=rcQ(X) for every finite lattice-convex set XZd.

Proof

For computing rc(X) and rcQ(X), we can pass to the affine lattice aff(X)Zd. Thus, without loss of generality we can assume that X is a d-dimensional lattice-convex set in Zd. The inequality rc(X)rcQ(X) is trivial and so we need to show rcQ(X)rc(X).

Lemma 3.3 implies that, under our assumptions, every relaxation P of X is bounded. Choose a relaxation P with k:=rc(X) facets. It suffices to prove the existence of a rational relaxation with at most k facets. First note that by slightly increasing the right hand sides of the inequality description of P, we can assume that X is contained in the interior of P. Let F1,,Fk be the facets of P. Each of these facets is disjoint with X and so, for each i[k], there is a hyperplane Hi that separates X from Fi, which means that Hi determines halfspaces Hi+ and Hi- such that X lies in the interior of Hi+ and Fi lies in the interior of Hi-. Since P is bounded, Hi can be chosen to be a rational hyperplane. It follows that R:=i=1kHi+ is a rational relaxation of X satisfying RP and having at most k facets. This shows rcQ(X)rc(X).

Remark 3.7

The proof of Theorem 3.6 works for every XZd such that every of its relaxations is bounded, independently of the dimension d.

For the sake of a discussion of (Q1) in small dimensions, observe that in view of Theorem 2.2, a lattice-convex set XZd is guaranteed to satisfy rc(X)k+1, for k=3 and k=4, only if dim(X)6 and dim(X)18, respectively. We solve (Q1) in these cases, by providing optimal bounds.

Corollary 3.8

Let d4. Then, rc(X)d+1 holds for every d-dimensional finite lattice-convex set XZd.

Proof

By Lemma 3.3, every relaxation of X is bounded. Every bounded d-dimensional polyhedron has at least d+1 facets. This gives rc(X)d+1.

As a further consequence, we get that rc(Δd)=d+1, for every d4, where Δd={0,e1,,ed} is the set of lattice points of the standard simplex. Weltge [38, Prob. 11] (cf. [25]) conjectures that this identity holds in arbitrary dimension. However, even for this particular case we need to develop new tools, because Theorem 3.5 applied to d=k5 shows that Δd has an unbounded relaxation for every d5 (cf. [25, Ex. 1]).

Conditions on a lattice-convex set to have finitely many observers

This section is concerned with the proof of Theorem 1.4. Our argument is split up into two main parts: First, we study the set of observers of a lattice-convex set XZd, which is a subset of the lattice points outside of X that is of course necessary, but more importantly, also sufficient to be separated by the minimal number of inequalities. By applying techniques from the Geometry of Numbers we find that the set of observers of X is finite if (a) X is parity-complete, (b) conv(X) is not lattice-free, or (c) the lattice-width of X is not too small. We introduce these notions below.

In the second part, we explain how mixed-integer linear programming (MILP) can be used to compute the minimal number of inequalities that are needed to separate X from a finite subset YZd\X. We moreover argue that for the separation problem for such finite sets X and Y, there is no loss of generality to restrict to rational linear descriptions.

The set of observers of a lattice-convex set

Definition 4.1

Let XZd be a finite lattice-convex set. We say that a point yZd\X observes X if conv(X{y})Zd=X{y}, that is, X{y} is lattice-convex as well. Write

Obs(X):=yZd\X:yobservesX

for the set of points that observe X. See also Fig. 2.

Fig. 2.

Fig. 2

A lattice-convex set X with an observer y and a non-observer z

Our notion of observers is inspired by Weltge’s definition of a guard set for X, which is a set GZd\X with the property that for every pZd\X we have Gconv(X{p}). Indeed, every guard set G contains Obs(X), so that the set of observers is the smallest guard set with respect to inclusion. Weltge proved that for two-dimensional lattice-convex sets XZ2 there is always a finite guard set, and thus in particular there are only finitely many observers.

Proposition 4.2

(Weltge [38, Prop. 7.5.6 & Thm. 7.5.7]) If XZ2 is a full-dimensional finite lattice-convex set, then Obs(X) is finite and can be computed algorithmically.

However, if XZd is such that dim(X)<d, then |Obs(X)|=. Indeed, every lattice point in a neighboring lattice plane to aff(X) is an observer of X. Even more, in dimensions d3, there are full-dimensional lattice-convex sets that have infinitely many observers. One example is the set Δd={0,e1,,ed} of lattice points of the standard simplex.

For the sake of convenient notation, we extend the definition of the relaxation complexity as follows: For XYZd, the minimal k such that X can be separated from Y\X by k linear inequalities (resp. rational linear inequalities) is denoted by rc(X,Y) (resp. rcQ(X,Y)). So, in particular rc(X)=rc(X,Zd) and rcQ(X)=rcQ(X,Zd).

The utility of the concept of observers stems from the fact that for a polyhedron P to be a relaxation of X, it suffices that P separates X from Obs(X). This follows directly from the definition of Obs(X).

Proposition 4.3

Let XZd be a finite lattice-convex set and let Axb be a system of linear inequalities. The following conditions are equivalent:

  • (i)

    The system Axb separates X from Zd\X.

  • (ii)

    The system Axb separates X from Obs(X).

In particular,

rc(X)=rc(X,XObs(X))andrcQ(X)=rcQ(X,XObs(X)).

If Obs(X) is finite and can be computed, then it serves as a finite certificate for rc(X) and rcQ(X), that allows to algorithmically determine the minimal size relaxation of X. Before we develop this algorithm in Sect. 4.2, we derive three sufficient conditions on a lattice-convex set XZd under which there are only finitely many observers.

Parity-complete sets

For every fundamental cell F of Zd, that is, for every unimodular image of the unit cube [0,1]d, each residue class in (Z/2Z)d has a representative that is a vertex of F. A generic lattice-convex set XZd with sufficiently many points will contain a fundamental cell of Zd. These observations motivate the following class of examples and show its abundance.

We call XZd parity-complete if for every lattice point z in the affine hull of X there exists an xX congruent to z modulo 2, which means that (x-z)/2Zd.

Theorem 4.4

Let XZd be a full-dimensional finite parity-complete and lattice-convex set. Then, Obs(X)2X-X and in particular Obs(X) is finite and computable.

Proof

Let zZd be an observer of X. Since X is parity-complete, there exists xX satisfying (x+z)/2Zd. We conclude that x:=(x+z)/2X, since otherwise z would not be an observer. We thus have z=2x-x2X-X.

Existence of interior lattice points

A second class of lattice-convex sets with only finitely many observers is given by those X for which conv(X) is not lattice-free. Before we can prove this result we need to revisit some crucial results in the Geometry of Numbers.

We call the convex hull of finitely many lattice points a lattice polytope, as usual. Blichfeldt’s theorem [12] is a classical upper bound on the number of lattice points in a full-dimensional lattice polytope PRd in terms of its volume. It states that

|PZd|d!vol(P)+d. 3

Lower bounds on |PZd| exist, if P is not lattice-free. However, the best possible such bound is still not known and this problem received a considerable amount of interest in the last years. The best result to date is due to [4] and reads

vol(P)cdd|int(P)Zd|, 4

where cd=d(2d+1)(s2d+1-1) and (si)iZ>0 is the Sylvester sequence. This sequence is recursively defined by s1=2 and si=1+s1··si-1, for i2. It grows double-exponentially and satisfies the upper bound si22i-1.

The proof of Inequality (4) is based on estimating the coefficient of asymmetry ca(P,y) of the polytope P with respect to an interior point yint(P). This magnitude is defined as

ca(P,y):=maxx=1maxλ>0:y+λxPmaxλ>0:y-λxP.

In [4, Thm. 1.4] it is proven that there exists a lattice point yint(P)Zd such that

ca(P,y)cd=d(2d+1)(s2d+1-1). 5

With these preparations we can now formulate and prove our anticipated description of the set of observers of a lattice-convex set X with the property that conv(X) is not lattice-free. Our arguments are somewhat similar to those used in [3, Thm. 12].

Theorem 4.5

Let XZd be a finite lattice-convex set such that conv(X) is not lattice-free. Then,

Obs(X)X+cd·conv(X-X),

where cd=d(2d+1)(s2d+1-1).

In particular, Obs(X) is finite and we have the explicit bound

|Obs(X)|cd·|X|,

where cd=d!(1+cd)2d2ddO(14dd5d22d·22d).

Proof

Let pObs(X) and write Qp=conv(X{p}). Further, let yint(Qp)Zd be a lattice point satisfying (5), that is, ca(Qp,y)d(2d+1)(s2d+1-1). Since p was taken to be an observer, we necessarily have that yX. Let zconv(X) be the intersection point of conv(X) with the ray in direction y-p and emanating from p. Then, by the definition of ca(Qp,y), there is a positive number 0<λca(Qp,y) such that p-y=λ(y-z). Thus,

p=y+λ(y-z)X+λ·conv(X-X)X+ca(Qp,y)·conv(X-X),

so that by (5) the claimed inclusion holds with cd=d(2d+1)(s2d+1-1).

In order to estimate the number of observers of X, we first assume without loss of generality that 0X. This implies XX-X, which in turn gives X+cd·conv(X-X)(1+cd)·conv(X-X). Blichfeldt’s bound (3) applied to P=conv(X-X) gives us

|Obs(X)||((1+cd)·conv(X-X))Zd|d!(1+cd)dvol(conv(X-X))+dd!(1+cd)d2ddvol(conv(X))+d 6
d!(1+cd)d2ddcdd·|X|+dcd·|X|. 7

For the inequality (6) we observe that by conv(X-X)conv(X)-conv(X) we can apply the Rogers-Shephard inequality vol(K-K)2ddvol(K) for the volume of a d-dimensional compact convex set (cf. [32]). For (7) we use the fact that X is lattice-convex, and employ the volume bound (4). The claimed asymptotic growth of the dimensional constant cd follows by that of cd, Stirling’s approximation of d!, and by the approximation 2ddO(4dπd).

Remark 4.6

The best-known bound (5) on the minimal coefficient of asymmetry of an interior lattice point y in a lattice polytope PRd is certainly quite far from optimal. Pikhurko [30] proposes that the optimal bound should rather read

ca(P,y)sd2-2,

which would be an enormous improvement given the double-exponential growth rate of the Sylvester sequence.

Sets of large lattice-width

Our third class of examples with only finitely many observers is informally described as those X that cannot be sandwiched between two parallel lattice planes of small distance. More precisely, for an integral vector uZd\{0} the width of a subset SRd in direction u is defined as

w(S,u)=maxxSux-minxSux,

and the lattice-width of S is defined as

w(S)=minuZd\{0}w(S,u).

In order to describe our result, we moreover need a concept introduced by Blanco et al. [10]: The finiteness threshold width is the constant w(d)Z>0 such that for every nZ>0, up to unimodular equivalence, all but finitely many lattice d-polytopes with n lattice points have lattice-width at most w(d). In [10] it is proven that d-2w(d)O(d32), and the authors obtain the exact values w(3)=1 and w(4)=2, the former being shown already in [11].

Theorem 4.7

Let XZd be a full-dimensional finite lattice-convex set. If w(X)>w(d), then Obs(X) is finite.

Proof

Assume for contradiction that |Obs(X)|=. Let pObs(X) and write Qp=conv(X{p}). Observe that |QpZd|=|X|+1=:n and w(Qp)w(X)>w(d). Now, for every M0, there is an observer pObs(X) outside the box [-M,M]d. Since X is full-dimensional the corresponding polytope Qp has a facet F such that the height of p over F is lower bounded by an increasing function in M. As a consequence there are infinitely many possible values for the volume of Qp, and hence there are infinitely many unimodularly non-equivalent lattice polytopes Qp, which all have exactly n lattice points and lattice-width >w(d). This contradicts the definition of the finiteness threshold width.

Unlike in Theorem 4.4 and Theorem 4.5, the proof of Theorem 4.7 does not provide a mean to algorithmically compute the set Obs(X) in the case that w(X)>w(d). Computability would follow if we are given an explicit upper bound f(wdn) on the volume of the finitely many lattice d-polytopes with n lattice points and lattice-width w>w(d). To the best of our knowledge such an explicit volume bound has not been proven by the time of writing. Moreover, it is not clear how to determine the constant w(d) algorithmically in a given dimension d, and thus how to algorithmically decide the condition w(X)>w(d) in Theorem 4.7.

However, in a later section we develop a general algorithm that computes Obs(X) under the sole assumption that this set is finite (see Theorem 5.1).

Separation of two finite sets using MILP

In this section, we address the following computational problem.

Definition 4.8

(Separation Problem) Let X,YQd be finite subsets and let kZ>0. We say that X is separated from Y by k linear inequalities, if there is a system Axb of k linear inequalities such that Axb is fulfilled for every xX, and not fulfilled for any yY. We call the computational problem of determining such a system the problem of separation of X from Y by k linear inequalities.

Phrased geometrically, the problem asks to determine a d-dimensional polyhedron P with at most k facets satisfying XP and PY=. In view of this interpretation it is clear that the separation problem is strongly related to the notion of the relaxation complexity. The difference is that in contrast to the relaxation complexity, where Y=Zd\X is an infinite set, in the setting of the separation problem both X and Y are finite. This makes the problem much more accessible from the algorithmic perspective.

Various special versions of this problem were considered in the literature. For example, if X{0,1}d and Y={0,1}d\X, then in game theory the minimal k such that X can be separated from Y by k linear inequalities is called the the dimension of a game (cf. [36]), or threshold number of a game (cf. [19]). In our notation this number equals rc(X,{0,1}d), and in yet a different language, Jeroslow [23] proved that rc(X,{0,1}d)2d-1 and exhibited examples that attain equality. Hojny [20, 21] studied relaxations within {0,1}d with respect to the size of the coefficients used. Megiddo [28] studies the computational complexity of variants of the separation problem, depending on which parameters are given as part of the input.

If we were only interested in solving the separation problem theoretically, we could employ a brute-force approach. For instance, one could assign for each point yY a hyperplane that is supposed to separate y from X, and then check a linear program for feasibility. This would lead to |Y|k assignments to test, a possibly very large number. However, we want to follow a more practical approach that could lead to a usable implementation, and show how to reduce the separation problem to mixed-integer linear programming (MILP). To this end, we introduce the parameter

ρ:=max{x:xX}, 8

where x denotes the maximum norm of x, so that X[-ρ,ρ]d. We also fix the big-M parameter

M:=2(dρ+1). 9

We formulate a MILP that uses binary variables to encode the decision whether a given inequality separates X from a given point of Y. The real variables of the MILP are the coefficients of the system Axb, and the lower bound μ is the margin by which an inequality of Axb not valid on a point yY is violated:

(SEP-MILP)maximizeμsubject toAxbxX,1sy1yY,Ay+M(1-sy)b+1μyY,A[-1,1]k×db[-dρ,dρ]kμ[0,1]sy{0,1}kyY.

Proposition 4.9

Let X and Y be non-empty finite subsets of Qd and let kZ>0. Then the following conditions are equivalent:

  • (i)

    The set X can be separated from Y by a system of k linear inequalities.

  • (ii)

    The set X can be separated from Y by a system of k rational linear inequalities.

  • (iii)

    The mixed-integer linear problem (SEP-MILP) with k(d+1)+1 real and |Y|k binary variables and parameters ρ and M, given by (8) and (9), respectively, has a strictly positive optimal value.

Furthermore, the following statements hold:

  1. If A,b,μ,sy(yY) is a feasible solution of (SEP-MILP) with a strictly positive optimal value μ>0, then Axb is a system of k linear inequalities that separates X from Y.

  2. If k and |Y| are fixed, then (SEP-MILP), and by this also the separation problem with input X, Y and k, can be solved in polynomial time.

Proof

(i) (iii): If X can be separated from Y by a system Axb of k linear inequalities, then we can rescale each inequality of the system by an appropriate non-negative value to ensure A[-1,1]k×d, which means that each coefficient of the left-hand side lies in the range [-1,1]. Afterwards, we can change b to ensure that each inequality of Axb is attained with equality on some point of X, by appropriately decreasing the respective right-hand side. After these modifications, we have b[-dρ,dρ]d and Ax[-dρ,dρ]d for every xX.

We now show that A and b constructed above can be extended to a feasible solution of (SEP-MILP) that has a positive objective value μ>0. For each yY, there exists an index i=i(y)[k] such that the i-th inequality aixbi of the system Axb is violated on y, for i=i(y). We fix sy=ei(y) and μ=min{1}{ai(y)y-bi(y):yY}>0. It is not hard to check that the above choice of A,b,μ,sy(yY) is feasible. In fact, Axb holds for every xX by construction, 1sy1 holds since sy{e1,,ek}. Let us check that Ay+M(1-sy)b+1μ holds, too. Consider the i-th inequality aiy+M(1-sy,i)bi+μ. If sy,i=0, then the left-hand side is at least ρd+2 in view of aiy-ρd and M=2(ρd+1), while the right-hand side is at most ρd+1 in view of biρd and μ1. If sy,i=1, then the i-th inequality of Axb is not valid by the margin, which is at least μ, and so we see that aiy+M(1-sy,i)=aipbi+μ, as desired.

(iii) (ii): It is clear that the set of feasible solutions of (SEP-MILP) is a union of finitely many rational polyhedra. This shows that in case of feasibility, the problem always has a rational optimal solution. It is straightforward to see that such a rational optimal solution yields a rational system Axb of inequalities that separates X from Y.

The implication (ii) (i) is clear.

Claim (a) follows from the interpretation of the constraints of (SEP-MILP) that has been given in the proof above. As for assertion (b), note that if k and |Y| are fixed, the number of possible choices of the variables sy(yY) is 2k|Y|, which is a constant. Thus, by enumeration of all possible choices, solving (SEP-MILP) gets reduced to solving 2k|Y| linear programs.

As a consequence of the above studies, we can now prove Parts (a) and (b) of Theorem 1.4.

Corollary 4.10

Let X,YZd be finite sets such that X is lattice-convex and XY. Then, rc(X,Y)=rcQ(X,Y) and this number can be computed algorithmically.

In particular, rc(X)=rcQ(X) is computable if X is parity-complete or conv(X) is not lattice-free.

Proof

Follows from Propositions 4.9 and 4.3, and from the finiteness and computability of Obs(X) in the case that X is parity-complete or conv(X) is not lattice-free, established in Theorem 4.4 and Theorem 4.5, respectively.

Based on Theorem 4.7 we prove Theorem 1.4(c) along the same lines. At this point however, we are missing computability of Obs(X) in the case of lattice-width w(X)>w(d). This will be completed with Theorem 5.1 below.

Mixed-integer quantifier elimination and applications to the computation of the relaxation complexity

The purpose of this section is two-fold. First, we describe an algorithm that computes the set of observers under the sole assumption that it is finite. Besides the interest in its own right, this result will complete the proof of Theorem 1.4 as described at the end of the previous section.

Second, we utilize the theory of quantifier elimination towards deciding computability of the relaxation complexity of specially structured lattice-convex sets. For this purpose, we develop quantifier elimination for a special mixed-integer version of quantified Boolean combinations of linear inequalities. This will be one of the key ingredients in our proof of Theorem 1.3(b), since it enables us to algorithmically compute the relaxation complexity for three-dimensional lattice-convex sets that have infinitely many observers, that is, those that cannot be dealt with using the tools from Sect. 4.2.

Computing finite sets of observers

Here we prove a result that was already hinted at in the end of Sect. 4.1. It will also complete the proof of Theorem 1.4(c).

Theorem 5.1

If a finite lattice-convex set XZd is such that Obs(X) is finite, then there is an algorithm that computes Obs(X) and rc(X)=rcQ(X).

Our proof involves the following algorithm that maintains a potentially infinite set S of lattice points. We will see how such a set can actually be represented using finite data. graphic file with name 10107_2021_1623_Figa_HTML.jpg

Proof of Theorem 5.1

It suffices to show that we can compute the set Obs(X), because if this is done then Corollary 4.10 implies that rc(X)=rcQ(X) and that this number is computable. A pseudocode of our algorithm to compute Obs(X) is given in Algorithm 1, and an illustration of the workings of one iteration of the main loop is given in Fig. 3.

Fig. 3.

Fig. 3

An illustration of one iteration of the while-loop in Algorithm 1

We first argue that Algorithm 1 is correct: Observe that in every stage of the algorithm the set S consists of those lattice points for which we have currently decided whether or not they belong to the set of observers Obs(X). At the initialization step in Line 1 this is clear, because none of the points in X is an observer. If a point qZd\S is found, then Qq may or may not contain lattice points besides X{q}. The loop through Lines 36 successively reduces Qq until QqZd=X{q}, which means that qObs(X). By definition of the cone Cq in Line 9, q is the only observer it contains. Thus, we can safely disregard the other lattice points in Cq and thus augment S accordingly in Line 11. This establishes the claimed invariance property of S. These arguments also show that each new iteration in the while loop through Lines 213 provides us with a new observer. Since by assumption Obs(X) is finite, this means that Algorithm 1 terminates after finitely many steps, and that indeed the returned set O in Line 14 equals Obs(X).

Finally, we need to make sure that the conditions qZd\S in Line 2 and pQqZd\(X{q}) in Line 4 are decidable and that the points q and p can be determined algorithmically in the cases when the respective condition is fulfilled. We start with Line 4. The condition pQqZd\(X{q}) can be handled easily in a brute-force fashion. Given X and q, one can determine a bounding box containing X and q, iterate through all integer points p in this box and check if pQq and pX{q}. Checking pQq=conv(X{q}), for given p and X{q}, can be carried out using linear programming. It is clear that less brute-force approaches can also be employed, but we do not need to discuss such issues to prove our computability assertion.

In order to computationally check the condition qZd\S in Line 2, we can use integer linear programming. We introduce the Boolean formula B:Rd{true,false} defined by B(q):=xX(q=x) which models the condition in question as a disjunction of linear equations. More precisely, we have qZd\S if and only if qZd:¬B(q). Each update in Line 11 corresponds to an update of the Boolean formula in the form B(q):=B(q)i=1m(aiqbi). The formula ¬B(q) can be brought into the disjunctive form ¬B(q)=j=1M(Ujqwj), where each Ujqwj is a system of linear inequalities with integer coefficients in the variable q. Having carried out such a transformation, checking qZd:¬B(q) boils down to M feasibility problems qZd:(Ujqwj) in integer linear programming.

In Fig. 3 we illustrate the first iteration of the while-loop in Algorithm 1 on the lattice-convex set X={0,e1,e2,-e1-e2}Z2. The points of the set S of safely handled lattice points is colored green, the observers of X, that are not yet dealt with, are colored red, and the remaining lattice points in Z2 are colored black. On the left, we see that every observer of X is at lattice-distance one from conv(X), and that the algorithm starts with the initialization S=X. In the middle, a point qZ2\S is taken and conv(X{q}) contains the additional lattice point p, which turns out to be an observer of X. The picture on the right illustrates the cone Cq, with q now being the computed observer in the step before, and that the lattice points in Cq are added to the safely handled points S.

A special quantifier elimination problem

This section is devoted to proving Theorem 5.3, which is a special case of the mixed-integer linear quantifier elimination problem. We need this in Sect. 6 in order to settle the computability of rc(X) for three-dimensional lattice-convex sets X, in those situations where X has infinitely many observers, but which are still structured enough to retain control over. In particular, we establish computability of rc(X) for lattice-convex sets XZd whose possibly infinitely many observers are distributed on finitely many parallel lines (see Corollary 5.4).

Before we are able to describe the details of these findings, we need to review the language and some background on the theory of quantifier elimination: Let f1,,fm:RdR be affine functions in d variables with coefficients in Q, and let B:{true,false}m{true,false} be a Boolean function in m variables. We call the function C:Rd{true,false} defined by

C(x):=B((f1(x)0),,(fm(x)0)),xRd,

a Boolean combination of linear inequalities, for short BCLI. More generally, we also allow to use fi(x)0 with {=,,,>,<} in any combination, which does not increase the expressive power of BCPI’s, because fi(x)>0 is a negation of fi(x)0, fi(x)0 is equivalent to -fi(x)0, fi(x)=0 can be expressed as conjuction of fi(x)0 and -fi(x)0, etc.

It has been of great interest to decide the validity of quantified expressions of the form

Q1x1R1QkxkRk:C(x), 10

where Q1,,Qk{,}, R1,,Rk{R,Z}. The unquantified variables xk+1,,xd are usually called free variables. If in every fragment QixiRi in (10) the ring Ri=R, we say that we consider a real quantified expression; if Ri=Z in every fragment we call the expression integer, and if both cases occur we call it a mixed-integer quantified expression.

A very successful approach is to investigate whether a corresponding expression (10) admits quantifier elimination, which means that there is an algorithm that constructs a BCLI D(y) that is equivalent to (10). The presumably first result in this direction is what is nowadays called Fourier-Motzkin elimination in linear programming. This is originally phrased for existential quantifiers only (and for a system of non-strict inequalities), but can be adjusted to establish that every real quantified expression for BCLI admits quantifier elimination (cf. Schrijver [33, §12.2]). Much more generally, Tarski [35] showed that every real quantified expression is decidable, where the functions f1,,fm are moreover allowed to be polynomials of any degree (cf. Basu, Pollack & Roy [9]).

Regarding the case of pure integer quantifications, a landmark result is the decidability of Presburger arithmetic, meaning that every integer quantified expression for BCLI admits quantifier elimination (cf. Presburger’s original work [31] and the excellent survey article by Haase [18]). On the negative side, Jeroslow [22] showed that Quadratic Integer Programming is undecidable. Liberti [26] discusses undecidability of general mixed-integer nonlinear programming in great detail.

In view of these results one may ask whether every mixed-integer quantified expression for BCLI is decidable, and if such formulas even admit quantifier elimination. Weispfennig [37] indeed answered both questions in the affirmative. An alternative decision procedure based on finite automata has been designed by Boigelot, Jodogne & Wolper [13].

Theorem 5.2

([37, Thm. 3.1] & [13, Sect. 6]) Let C(x) be a BCLI and consider the mixed-integer quantified expression

Q1x1R1QkxkRk:C(x),

where Q1,,Qk{,} and R1,,Rk{R,Z}. Then, there exists an algorithm that constructs a BCLI D(y) equivalent to it.

Motivated by an application to determining rc(X,XY) for specially structured infinite sets Y, we need a very special instance of this result, where only one inner variable is allowed to be quantified over the integers. For the sake of a self-contained presentation, we present our own proof of this basic instance.

Theorem 5.3

Let C(yz) be a BCLI in k+1 variables (y,z)Rk×R. There is an algorithm that decides the validity of the quantified statement

yRkzZ:C(y,z). 11

Proof

The main idea is to reformulate the statement as

yRkvZn:D(y,v), 12

for some BCLI D(yv). Once this is achieved, we can reorder the existential quantifiers

vZnyRk:D(y,v),

then eliminate the quantifiers over real variables by Fourier-Motzkin elimination, and obtain a formula

vZn:E(v),

for some BCLI E(v). In this latter formula we then bring E(v) into a disjunctive normal form and convert all inequalities into the form . Decidability of integer linear programming (cf. Borosh & Treybig [14] or Schrijver [33, Ch. 17 & 18]) finally shows decidability of the equivalent original quantified statement (11).

For constructing a BCLI D(yv) such that (11) is equivalent to (12) we proceed as follows: We first write C(yz) as the disjunction C(y,z)=j=1sCj(y,z), where each Cj(y,z) is a conjunction of inequalities. Each inequality involved in C(yz) is either an inequality that depends only on y, or it involves z and can be written as either a lower or an upper bound on z: zl(y) or z>l(y) or zu(y) or z<u(y). We assume that all the inequalities involving z are written in this form.

When we fix yRk the geometric situation is as follows: Each Cj(y,z) defines the set Sj(y):=tR:Cj(y,t), which is an interval (possibly closed, open, or half-open, and possibly equal to the whole R or empty, in degenerate situations). Our formula (11) is now equivalent to

yRk:(j=1sSj(y)Z). 13

It remains to phrase the condition j=1sSj(y)Z as a purely existential integer quantified statement. How do we phrase that a family of intervals covers the integers? If there exists a conjunction Cj(y,z) that does not depend on z, then the validity of Cj(y,z) means that Sj(y)=R, so that Sj(y)Z. If there is no such Cj(y,z), then there must be an interval infinite to the left, and an interval infinite to the right, which together cover all but finitely many points of Z. The remaining points are covered by the remaining intervals.

Assume Sj1(y) is the interval infinite to the left. We can pick the maximal integer value v1Z in this interval, the successor v1+1 will then be covered by some other interval Sj2(y). If the interval Sj2(y) is finite, there is a maximal integer value v2Z in this interval, whose successor v2+1 will be covered by some third interval. Repeating this process, one eventually reaches the last finite interval, with maximal integer value vn, say. Its successor vn+1 will be covered by some interval Sjn+1(y), which is infinite to the right. All this is summarized as the formula D(yv) defined by

Cj1(y,v1)i=2nCji(y,vi-1+1)Cji(y,vi)Cjn+1(y,vn+1),

which is valid for some n<s and some j1,,jn+1{1,,s}. Moreover, Cj1(y,v1) contains only upper bounds on z, Cjn+1(y,vn+1) contains only lower bounds on z, while Cji(y,vi-1+1) and Cji(y,vi), with 1<in, contain both lower and upper bounds on z.

We thus proved that j=1sSj(y)Z is equivalent to the expression vZn:D(y,v), which finishes the proof.

We now obtain our desired application to determining rc(X,XY) for specially structured infinite sets Y. More precisely, we establish computability of rc(X,XY) for the case that the possibly infinitely many lattice points in Y lie on finitely many parallel lines. However, if those lines are allowed not to be parallel, then we do not know whether the corresponding relaxation complexity is computable.

Corollary 5.4

Let kN, let XZd be a finite lattice-convex set, and let YZd\X be of the form Y=Y0(L1Zd)(LmZd), where Y0 is finite and L1,,Lm are lines, each containing a lattice point pi, and all being parallel to some common primitive vector uZd\{0}. Then, there is an algorithm that decides whether there is a system of k linear inequalities f1(x)0,,fk(x)0, that is satisfied for every xX and is not satisfied for every yY.

In particular, for given X and Y as above, there exists an algorithm that determines rc(X,XY).

Proof

The lattice points of Li are parametrized as pi+zu with zZ. A priori, the expression fj(pi+zu) that will show up in our considerations, is non-linear when we view fj as a variable vector (in the vector space of affine functions on Rd) and z as an integer variable. When we write fj(x)=ajx+bj, for some ajRd and bjR, we have

fj(pi+zu)=aj(pi+zu)+bj=(aju)z+ajpi+bj.

At this point, by rescaling fj, that is, rescaling the vector (aj,bj)Rd+1, we can always assume that aju=ϵj, where ϵj{-1,0,1}, is a linear equality in aj. The three choices for each ϵj produce 3k choices for ϵ=(ϵ1,,ϵk). For a given choice, the expression fj(pi+zu)=ϵjz+ajpi+bj is linear in the variables z,aj,bj. Therefore, for a fixed ϵ{-1,0,1}k, we can model the statement that there is a system of k linear inequalities f1(x)0,,fk(x)0, that is satisfied for every xX and is not satisfied for every yY, by the following mixed-integer quantified expression:

aRd×kbRkzZ:Cϵ(a,b,z), 14

where a=(a1,,ak) and b=(b1,,bk). The BCLI Cϵ(a,b,z) appearing in this expression is defined by

Cϵ(a,b,z)=CX(a,b)C0(a,b)i=1mCiϵ(a,b,z),

with the following constituents:

CX(a,b)=xX(f1(x)0)(fk(x)0)

and

C0(a,b)=yY0¬(f1(y)0)(fk(y)0),

model that the linear system is satisfied by all xX, and by none of the yY0, respectively. And, for each i[m], the BCLI

Ciϵ(a,b,z)=¬(f1(pi+zu)0)(fk(pi+zu)0)=¬(ϵ1z+a1pi+b10)(ϵkz+akpi+bk0),

models that the linear system is not satisfied for the lattice point pi+zu on the line Li.

Since the expression (14) has the right form to apply Theorem 5.3, we obtain decidability of the representation of X within XY with k linear inequalities, by considering 3k such expressions, one for each ϵ{-1,0,1}k. By a standard binary search, we can thus compute rc(X,XY).

Computability of the relaxation complexity for d=3

We are now well-prepared to demonstrate computability of the relaxation complexity of three-dimensional lattice-convex sets. We rephrase Theorem 1.3(b) for the reader’s convenience.

Theorem 6.1

For every full-dimensional finite lattice-convex set XZ3, there is a finite algorithm that computes rc(X).

Given a d-dimensional lattice-free lattice polytope P, there exists a lattice-free lattice polyhedron M with PM and such that M is inclusion-maximal among all lattice-free lattice polyhedra (see [7] and [29]). Moreover, the same sources show that there are only finitely many choices for M, up to unimodular equivalence.

In dimension three the possible choices for M have been classified in the series of papers [7] and [5]: Up to unimodular equivalence, these choices are the slab [0,1]×R2, the toblerone M0=conv({0,2e1,2e2})×R, and 12 bounded lattice polytopes M1,,M12 of volume at most 6 and lattice-width at least 2.

Our proof of Theorem 6.1 is based on this classification and on a case distinction on the structural properties of the observers of the given lattice-convex set X. To this end, given an observer pObs(X), we write

Qp=conv(X{p}).

Note that Qp is a full-dimensional lattice polytope and that p is one of its vertices. Further, we write DXZd\{0} for the set of lattice-width directions of XZd, that is, uZd\{0} is contained in DX if and only if w(X)=w(X,u). The set DX is the explicitly computable set

DX=(w(X)·(X-X))Zd\{0},

where Y:=xRd:xy1for allyY denotes the polar of a set YRd. Also, DX has at most 3d-1 elements as shown in [16]. Each such uDX corresponds to a pair of parallel supporting lattice planes Hu+ and Hu- of conv(X) with outer normals u and -u, respectively. We say that uDX has type (ij) if dim(XHu+)=i and dim(XHu-)=j, or vice versa.

With this notation, we can now describe the structure of the proof of Theorem 6.1. We distinguish different types of observers pObs(X). In most of the cases we describe a finite and computable search space for the respective observer p, which then allows us to use Corollary 4.10 and Proposition 4.3 to compute rc(X). Only in the Cases 2.2.1, 2.2.2, and 2.2.3 we need to employ a different method to compute rc(X) either directly or by an alternative algorithm. The cases are as follows:

  • Case 1: Qp is not lattice-free

  • Case 2: Qp is lattice-free
    • Case 2.1: w(Qp)>1
      • Case 2.1.1: up to unimodular equivalence, Qp is contained in M0
      • Case 2.1.2: up to unimodular equivalence, Qp is contained in one of the polytopes M1,,M12
    • Case 2.2: w(Qp)=1
      • Case 2.2.1: there exists a lattice-width direction of X of type (1, 1)
      • Case 2.2.2: there exists a lattice-width direction of X of type (2, 0)
      • Case 2.2.3: there exists a lattice-width direction of X of type (2, 1)
      • Case 2.2.4: every lattice-width direction of X is of type (2, 2)

Let us now get into the details.

Details for Case 1

Theorem 4.5 and its proof show that if Qp is not lattice-free, then pX+cd·conv(X-X), with the constant cd=d(2d+1)(s2d+1-1). Hence, the set

S1=X+c3·conv(X-X)Z3,

with the exact value c3=21·(s7-1)=2.23651195966947·1014, may be taken as a finite search space in this case, which of course is explicitly computable. Clearly, the constant c3 chosen above is tremendously large. This is due to our limited knowledge on the best upper bound on the coefficient of asymmetry in (5). If the conjectured bound in Remark 4.6 were true, we could replace c3 with the constant s32-2=47.

Details for Case 2.1.1

By assumption, we have w(Qp)2 and a unimodular image of Qp is contained in the toblerone M0, which has lattice-width 2. Therefore, w(Qp)=2. In order to find a finite search space that is guaranteed to contain the observer p, we need to identify all unimodular copies of Qp that lie in M0, or equivalently, all unimodular copies of M0 that contain Qp.

Lemma 6.2

For a subset YZ3 of lattice-width w(Y)=2, let

T(Y):=φ(M0):Yφ(M0)andφ:R3R3is a unimodular map

be the family of unimodular copies of the toblerone M0 that contain Y, and assume that T(Y). Then, T(Y) is finite and explicitly computable.

Proof

First of all, the toblerones TT(Y) are in correspondence with pairs (uv) of lattice-width directions of Y such that

  • (i)

    u+v is a lattice-width direction of Y as well, and

  • (ii)

    {u,v} is a basis of the two-dimensional lattice lin({u,v})Z3.

The lattice-width directions of Y are the non-zero vectors in the set Z32(Y-Y) which is finite and computable. This already shows the claimed finiteness of T(Y).

Condition (i) on a pair (uv) can be checked by a membership test of u+v in 2(Y-Y). Condition (ii) can be computationally checked by computing the row-style Hermite normal form of the (3×2)-matrix with columns uv. The set {u,v} is a basis of lin({u,v})Z3 if and only if this Hermite normal form equals the matrix with columns e1,e2 (cf. [33, Ch. 4] for more details).

Let us now fix an arbitrary toblerone T=φ(M0)T(Qp), where φ is a suitable unimodular map with Qpφ(M0). With the notation of the proof of Lemma 6.2, we write w=u+v for a pair (uv) of lattice-width directions of Qp corresponding to T. Moreover, denote the three unbounded facets of T that are orthogonal to uv and w by Fu,Fv and Fw, respectively. Let π be the orthogonal projection onto the hyperplane lin({u,v}). There are two options: Either π(φ-1(X))=conv({0,2e1,2e2}) or exactly one of the vertices 0, 2e1, 2e2 is missing in the projection π(φ-1(X)). Indeed, if two such vertices were missing, then w(Qp)=1, contradicting our assumption.

Write Q=conv(X) as usual. In the first case, all the intersections QFu, QFv, and QFw are lattice polygons of dimension 1 or 2, and the observer p may a priori lie anywhere on the facets Fu, Fv, and Fw. Without loss of generality, assume that pFu, and let E be an edge of QFu with the property that the intersection of conv(E{p}) and QFu is one-dimensional. Since p is an observer of X, it can only lie in the next parallel lattice line to aff(E), as otherwise the lattice triangle conv(E{p}) would have too large an area and contain additional lattice points. This follows, for instance, from Pick’s Theorem that relates the area of a lattice polygon to the number of lattice points that it contains (cf. [17, Sect. 19]).

In the second case, again all the intersections QFu, QFv, and QFw are lattice polygons of dimension 1 or 2, but now the observer p is constrained to lie in an unbounded edge, say FuFv, of the toblerone  T. Again, let E be an edge of QFu or QFv with the property described before. Similarly as in the first case, p must lie in the lattice line aff(E) or in the next parallel lattice line to it, in order not to contradict the property of being an observer.

Summarizing our considerations, in both cases there is only a finite and explicitly computable set ST of lattice points that are candidates for the observer p. Hence a finite and computable search space that is guaranteed to contain the observer p is

S2.1.1=TT(Qp)ST.

Details for Case 2.1.2

Since by assumption a unimodular copy of Qp is contained in one of the lattice polytopes M1,,M12 of volume at most 6, the observer pObs(X) cannot lie too far away from X in the following sense: Write Q=conv(X)=xR3:aixbi,1im, where the aiZ3 are primitive outer normal vectors of the facets of Q, and where biZ for 1im. Since every facet F of Q is a lattice polygon and the pyramid Fp=conv(F{p}) is a lattice 3-polytope of volume at most 6, the point p can lie at most 36 parallel lattice hyperplanes away from aff(F) (note that a lattice 3-simplex has volume at least 1/6). Therefore, the finite search space for p in this case can be taken as

S2.1.2=xR3:aixbi+36,1imZ3.

Details for Case 2.2.1

If there is a lattice-width direction of X of type (1, 1), then conv(X) is a tetrahedron and as such a rational relaxation of X with four facets. Since X is also full-dimensional, Corollary 3.8 yields that rcQ(X)=rc(X)=4.

Details for Case 2.2.2

For sets X with a lattice-width direction of type (2, 0) we need an auxiliary lemma that is valid in every dimension.

Lemma 6.3

Let XZ2 be a finite lattice-convex set and let xZ3 be a point whose last coordinate equals 1. If rc(X)4, then the lattice-convex set X=(X×{0}){x}Z3 satisfies rc(X)=rc(X).

Proof

Let PR2 be a polyhedron with m=rc(X)4 facets such that PZ2=X, and let a1,,am be a set of outer normal vectors of the facets of P. Observe that in view of Lemma 3.3 this relaxation P of X is necessarily bounded. After a perturbation of the facets of P, we may assume that there are a1,a2,a3, say, such that the origin 0 is contained in the relative interior of conv({a1,a2,a3}). Writing F1,F2,F3 for the corresponding facets of P, this yields that the hyperplanes Hi=aff(Fi{x}), 1i3, separate X from all points in zZ3:z3>0\X. This holds since x has height 1 over P.

Without loss of generality, we may assume that the projection of x onto the first two coordinates is contained in P. In this setting we can separate X from the lattice points with negative last coordinate by taking as additional hyperplane normals ai=(ai,0), for every i5, and a4=(a4,-M), for some large enough M such that the corresponding hyperplane is almost horizontal and thus cuts off all lattice points “below” P.

The lattice points outside of X which have vanishing last coordinate are separated from X, because in the process above we just tilted all the facets of P, thus leaving the separation within the plane {xR3:x3=0} unchanged.

We showed so far that rc(X)m=rc(X). If this would be strict however, then we would find a relaxation of X within Z3 with k<m hyperplanes, whose restriction to the plane {xR3:x3=0} would lead to a relaxation of X within Z2 with k hyperplanes, a contradiction to the definition of m.

Now, if XZ3 has a lattice-width direction of type (2, 0), then up to unimodular equivalence, there is some two-dimensional finite lattice-convex set XZ2 and some xZ3 with last coordinate equal to 1, such that X=(X×{0}){x}. In view of Weltge’s [38, Thm. 7.5.7] treatment of planar lattice-convex sets, we know that rc(X)=rcQ(X)3 and that this number is computable. So, if rc(X)4, then Lemma 6.3 implies that rc(X)=rc(X) is computable. If, however, rc(X)=3 and PR2 is a relaxation of X within Z2 having three facets, then P=conv(P×{0}){x} is a relaxation of X within Z3 having four facets. Since X is full-dimensional, Corollary 3.8 guarantees that every of its relaxations need to have at least four facets and thus rc(X)=rcQ(X)=4.

Details for Case 2.2.3

Since we dealt with the cases of lattice-width directions of type (1, 1) and (2, 0) before, we assume in the sequel that X does not have any of such lattice-width directions, and hence every uDX is of type either (2, 1) or (2, 2). By assumption we have w(Qp)=1, and thus also w(X)=1. In general for sets XZ3 of lattice-width w(X)=1, the observers are contained in finitely many affine subspaces. Recall that for each uDX, we denote by Hu+ and Hu- the parallel supporting lattice planes of conv(X).

Lemma 6.4

Let XZ3 be a full-dimensional lattice-convex set of lattice-width w(X)=1. Then, there is a finite subset Y0Z3 such that

Obs(X)Y0uDX(Hu+Z3)(Hu-Z3).

More precisely, if every lattice-width direction of X is of type either (2, 1) or (2, 2), then there are lattice lines L1,,Lm such that

Obs(X)Y0(L1Z3)(LmZ3),

and the set Y0 and a lattice point on and the direction of every line Li can be computed explicitly.

Proof

For pObs(X), let Qp=conv(X{p}) be as above. If w(Qp)>1, then by Case 1 and Case 2.1 there are only finitely many choices for the observer p, and we may take the finite set in the claim as

Y0=S1S2.1.1S2.1.2.

If pObs(X) is such that w(Qp)=1, then pHu+Hu- for some uDX, and the claimed inclusion follows.

Now assume that every lattice-width direction of X is of type either (2, 1) or (2, 2). Whenever dim(Hu±X)=2, then by Proposition 4.2 the corresponding plane contains only finitely many observers of X, which are computable and which we may include into the finite set Y0. If dim(Hu+X)=1, say, then every observer of X that is contained in Hu+ lies on one of three consecutive parallel lattice lines. To see this, let L0=aff(Hu+X) and let L+ and L- be the two neighboring lattice lines to L0 in Hu+. Every lattice point on either L+ or L- is in fact an observer, and there are exactly two observers in L0 (the lattice points next to the endpoints of conv(Hu+X)). If pHu+Z3 is not contained in either of the three lines L-,L0,L+, then conv(Hu+X){p} is a lattice triangle in Hu+ that is not unimodularly equivalent to conv({0,e1,e2}). This triangle must therefore have an additional lattice point, so that, in fact, p cannot be an observer of X. Computability of the lines L-,L0, and L+ follows immediately from their definition.

Now, if X has a lattice-width direction of type (2, 1), then we use Proposition 4.3 for the set

Y=Y0(L1Z3)(LmZ3)

from Lemma 6.4, and obtain that rc(X)=rc(X,XY). If the lattice lines L1,,Lm are parallel, we can apply Corollary 5.4 and compute rc(X) by the quantifier elimination procedure outlined in Sect. 5.2. The proof of Lemma 6.4 shows that this holds if all the one-dimensional sets Hu+X, where uDX are the lattice-width directions of X of type (2, 1), are parallel.

If the lattice lines L1,,Lm are not parallel, then X belongs to an explicit parametrized family. For its description, we fix some more notation: For a lattice-width direction uDX of type (2, 1), write F(Xu) for the set of those xX such that ux is maximized on X. Also, we assume in the sequel that u is oriented such that dim(F(X,u))=1, and thus dim(F(X,-u))=2.

Lemma 6.5

Let XZ3 be a finite lattice-convex set admitting linearly independent lattice-width directions u,vDX of type (2, 1) such that F(Xu) and F(Xv) are not parallel. Then, X is unimodularly equivalent to

Xa,b={e1,e2,e1+e2}ke3:k=a,,b,

for some a,bZ with ab and {a,b}{0}.

Proof

First of all, we can apply a unimodular transformation and assume that both u and v are orthogonal to e3. Let π:R3R2 be the projection π(x1,x2,x3)=(x1,x2) that forgets the last coordinate. Since w(X,u)=w(X,v)=1, the projection π(X) is unimodularly equivalent to {0,e1,e2} or {0,e1,e2,e1+e2}. The first case cannot happen, because then both F(Xu) and F(Xv) would be parallel to e3, contradicting our assumption.

So, we may assume that π(X)={0,e1,e2,e1+e2}, and that u=e1 and v=e2. Since F(X,e1) and F(X,e2) are not parallel, one of them, say F(X,e1), is not parallel to e3. Then,

π(F(X,-e1))={0,e2}andπ(F(X,e1))={e1,e1+e2}.

This implies that F(X,e2) is also not parallel to e3. Indeed, if F(X,e2) would be parallel to e3, then π(F(X,e2)) would be a single point, either equal to {e2} or {e1+e2}. In the first case, the set F(X,e2) would be two-dimensional, and in the second case, F(X,e1) would be two-dimensional, a contradiction either way. We conclude that

π(F(X,e1))={e1,e1+e2}andπ(F(X,e2))={e2,e1+e2}.

Furthermore, F(X,e1) and F(X,e2) share exactly one point that gets projected onto e1+e2. Thus, we have that F(X,e1)={p,q} and F(X,e2)={q,r}, where π(p)=e1, π(q)=e1+e2, and π(r)=e2.

Applying a suitable unimodular transformation, we may thus assume that p=e1, q=e1+e2, and r=e2, and thus X=Xa,b, for some a,bZ.

In order to finish up Case 2.2.3 of the proof of Theorem 6.1, we explicitly determine the relaxation complexity of the exceptional examples in Lemma 6.5.

Lemma 6.6

For every a,bZ with ab and {a,b}{0}, the relaxation complexity of the set

Xa,b={e1,e2,e1+e2}ke3:k=a,,b

is given by rc(Xa,b)=rcQ(Xa,b)=4.

Proof

The lower bound rc(Xa,b)4 follows by Corollary 3.8 since Xa,b is clearly full-dimensional. For the upper bound, we construct an explicit relaxation of Xa,b with four facets.

It is enough to consider the following cases:

  1. 0=a<b,

  2. 0<ab,

  3. a<0<b.

Case 0=a<b: The triangle T with vertices

(-1/2,-1/2),(1/4,7/4),(7/4,1/4)

is a relaxation of [0,1]2 with the property that the vertex 0 of [0,1]2 lies in the interior of T, while the other three vertices lie in the boundary of T. It is clear that the tetrahedron P with base T×{0} and apex be3 is a relaxation of Xa,b=X0,b.

Fig. 4.

Fig. 4

The relaxation complexity of X0,3 is four. The figure shows the base and the two horizontal cross-sections of the tetrahedron P that relaxes X0,3

Case 0<ab:

We claim that the following tetrahedron is a relaxation of Xa,b:

P=xR3:-x30,-b(x1+x2)-x3-a,bx1+x3b,bx2+x3b

(cf. Fig. 5 for an illustration). For this to hold, we need to prove that zXa,b if and only if zPZ3. The reader may quickly check that Xa,bP, so we focus on establishing sufficiency and let zPZ3. First of all, combining the first and third defining inequality of P, we get bz1b and thus z11. Adding the second and fourth inequality gives -bz1b-a, and thus z1-b-ab>-1. Since moreover z has only integral entries, we have z1{0,1}, and by symmetry, the same holds for z2.

Fig. 5.

Fig. 5

The relaxation complexity of X1,2 is four. The figure presents the base and the two horizontal cross-sections of the tetrahedron P that relaxes X1,2

This leaves us with four possibilities for z: If z=(0,0,z3)PZ3, then z3a by the second inequality, and z3b by the third one. In particular, zXa,b. If z=(1,0,z3)PZ3, then z30 by the first inequality, and z30 by the third one. Hence, we have z=(1,0,0)Xa,b. A similar argument works for the two remaining cases z=(0,1,z3) and z=(1,1,z3), in which we also get that z3=0. In conclusion, we saw that zXa,b in either case, and thus P is indeed a relaxation of Xa,b having four facets.

Case a<0<b: Consider the segment

S=[-ϵ,1],

where ϵ>0 is sufficiently small. We define the tetrahedron

P=convS×{0}horiz. segm.×{a}heighta{0}×Svert. segm.×{b}heightb

and consider the horizontal cross-section of P at height z by introducing

Pz:=(x,y):(x,y,z)P.

For azb, Pz is the rectangle given by

Pz:=z-ab-a·S×b-zb-a·S.

In particular, the cross-section at height 0 is given by

P0=-ab-a·S×bb-a·S.

We now define the two-dimensional lattice Λ:=Zb1+Zb2 spanned by the vectors

b1:=1b-a·(-a,-bϵ)andb2:=1b-a·(aϵ,b),

which are two opposite vertices of the rectangle P0. It turns out that, for a sufficiently small ϵ>0, the set

Xa,b:=P(Λ×Z)

is given as

Xa,b={b1,b2,b1+b2}ze3:z=a,,b.

Consequently, applying the transformation ϕ:R3R3, given by ϕ(b1,0)=e1, ϕ(b2,0)=e2, and ϕ(e3)=e3, satisfying ϕ(Λ×Z)=Z3, we conclude that

Xa,b=ϕ(P)Z3.

As the relaxation ϕ(P) of Xa,b is a tetrahedron as well, we are done (cf. Fig. 6 for an illustration).

Fig. 6.

Fig. 6

An illustration for the construction in the case a<0<b. The figure depicts the lattice Λ (red dots), its generators b1,b2 (red vectors) and cross-sections of the tetrahedron P

Details for Case 2.2.4

If every lattice-width direction of X is of type (2, 2), then, in the notation from before, all the intersections Hu+X and Hu-X, uDX, are two-dimensional. In view of Proposition 4.2, there are only finitely many observers of X in each of these hyperplanes and they can be computed algorithmically. Using Lemma 6.4, we thus find an explicitly computable finite set YZ3\X such that rc(X)=rc(X,XY) and invoking Corollary 4.10 once more shows computablitity of rc(X).

Acknowledgements

We thank Christoph Hunkenschröder for valuable comments on an earlier draft, Christoph Haase for pointers to the literature on Presburger arithmetic, and Georg Loho for discussions related to the topics of this paper. Moreover, we thank the referees for thorough reading and for suggestions that helped to improve the presentation.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Footnotes

The second author was partially supported by the Swiss National Science Foundation (SNSF) within the project Lattice Algorithms and Integer Programming (Nr. 185030).

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Gennadiy Averkov, Email: averkov@b-tu.de.

Matthias Schymura, Email: schymura@b-tu.de.

References

  • 1.Applegate DL, Bixby RE, Chvátal V, William JA. Computational Study, Cook, The Traveling Salesman Problem. Princeton: Princeton University Press; 2006. [Google Scholar]
  • 2.Averkov G. A proof of Lovász’s theorem on maximal lattice-free sets. Beitr. Algebra Geom. 2013;54(1):105–109. doi: 10.1007/s13366-012-0092-8. [DOI] [Google Scholar]
  • 3.Averkov G, Conforti M, Del Pia A, Di Summa M, Faenza Y. On the convergence of the affine hull of the Chvátal-Gomory closures. SIAM J. Discrete Math. 2013;27(3):1492–1502. doi: 10.1137/120898371. [DOI] [Google Scholar]
  • 4.Averkov G, Krümpelmann J, Nill B. Lattice simplices with a fixed positive number of interior lattice points: a nearly optimal. Bound. Int. Math. Res. Not. 2018;3:2–47. [Google Scholar]
  • 5.Averkov G, Krümpelmann J, Weltge S. Notions of maximality for integral lattice-free polyhedra: the case of dimension three. Math. Oper. Res. 2017;42(4):1035–1062. doi: 10.1287/moor.2016.0836. [DOI] [Google Scholar]
  • 6.Averkov G, Wagner C. Inequalities for the lattice width of lattice-free convex sets in the plane. Beitr. Algebra Geom. 2012;53(1):1–23. doi: 10.1007/s13366-011-0028-8. [DOI] [Google Scholar]
  • 7.Averkov G, Wagner C, Weismantel R. Maximal Lattice-Free Polyhedra: Finiteness and an Explicit Description in Dimension Three. Math. Oper. Res. 2011;36(4):721–742. doi: 10.1287/moor.1110.0510. [DOI] [Google Scholar]
  • 8.Basu A, Conforti M, Cornuéjols G, Zambelli G. Maximal lattice-free convex sets in linear subspaces. Math. Oper. Res. 2010;35(3):704–720. doi: 10.1287/moor.1100.0461. [DOI] [Google Scholar]
  • 9.Basu S, Pollack R, Roy M-F. Algorithms in Real Algebraic Geometry. 2. Berlin: Springer; 2006. [Google Scholar]
  • 10.Blanco, M., Haase, C., Hofmann, J., Santos, F.: The Finiteness Threshold Width of Lattice Polytopes. https://arxiv.org/abs/1607.00798 (2016)
  • 11.Blanco M, Santos F. Lattice 3-polytopes with few lattice points. SIAM J. Discrete Math. 2016;30(2):669–686. doi: 10.1137/15M1014450. [DOI] [Google Scholar]
  • 12.Blichfeldt, H.F.: A new principle in the geometry of numbers, with some applications. Trans. Am. Math. Soc. 15(3), 227–235 (1914)
  • 13.Boigelot B, Jodogne S, Wolper P. An effective decision procedure for linear arithmetic over the integers and reals. ACM Trans. Comput. Logic. 2005;6(3):614–633. doi: 10.1145/1071596.1071601. [DOI] [Google Scholar]
  • 14.Borosh I, Treybig LB. Bounds on positive integral solutions of linear Diophantine equations. Proc. Am. Math. Soc. 1976;55(2):299–304. doi: 10.1090/S0002-9939-1976-0396605-3. [DOI] [Google Scholar]
  • 15.Conforti M, Cornuéjols G, Zambelli G. Extended formulations in combinatorial optimization. 4OR. 2010;8(1):1–48. doi: 10.1007/s10288-010-0122-z. [DOI] [Google Scholar]
  • 16.Draisma J, McAllister TB, Nill B. Lattice-width directions and Minkowski’s 3d-theorem. SIAM J. Discrete Math. 2012;26(3):1104–1107. doi: 10.1137/120877635. [DOI] [Google Scholar]
  • 17.Peter M. Gruber, Convex and Discrete Geometry, Grundlehren der Mathematischen Wissenschaften. Berlin: Springer; 2007. [Google Scholar]
  • 18.Haase C. A survival guide to Presburger arithmetic. ACM SIGLOG News. 2018;5(3):67–82. doi: 10.1145/3242953.3242964. [DOI] [Google Scholar]
  • 19.Hammer, P.L., Ibaraki, T., Peled, U.N., Threshold numbers and threshold completions, Studies on graphs and discrete programming, vol. 11. North-Holland, Amsterdam-New York, pp. 125–145 (1979)
  • 20.Hojny, C.: Strong IP Formulations Need Large Coefficients. http://www.optimization-online.org/DB_HTML/2018/11/6934.html, (2018)
  • 21.Hojny C. Polynomial size IP formulations of knapsack may require exponentially large coefficients. Oper. Res. Lett. 2020;48(5):612–618. doi: 10.1016/j.orl.2020.07.013. [DOI] [Google Scholar]
  • 22.Robert GJ. There cannot be any algorithm for integer programming with quadratic constraints. Oper. Res. 1973;21:221–224. doi: 10.1287/opre.21.1.221. [DOI] [Google Scholar]
  • 23.Robert G. Jeroslow, On defining sets of vertices of the hypercube by linear inequalities. Discrete Math. 1975;11:119–124. doi: 10.1016/0012-365X(75)90003-5. [DOI] [Google Scholar]
  • 24.Kaibel V. Extended formulations in combinatorial optimization. Optima. 2011;85:2–7. [Google Scholar]
  • 25.Kaibel V, Weltge S. Lower bounds on the sizes of integer programs without additional variables. Math. Program. Ser. B. 2015;154(1–2):407–425. doi: 10.1007/s10107-014-0855-0. [DOI] [Google Scholar]
  • 26.Liberti L. Undecidability and hardness in mixed-integer nonlinear programming. RAIRO Oper. Res. 2019;53(1):81–109. doi: 10.1051/ro/2018036. [DOI] [Google Scholar]
  • 27.Lovász, László, Geometry of numbers and integer programming, Mathematical programming (Tokyo, : Math. Appl. (Japanese Ser.), vol. 6. SCIPRESS, Tokyo 1989, 177–201 (1988)
  • 28.Megiddo N. On the complexity of polyhedral separability. Discrete Comput. Geom. 1988;3:325–337. doi: 10.1007/BF02187916. [DOI] [Google Scholar]
  • 29.Nill B, Ziegler GM. Projecting lattice polytopes without interior lattice points. Math. Oper. Res. 2011;36(3):462–467. doi: 10.1287/moor.1110.0503. [DOI] [Google Scholar]
  • 30.Pikhurko O. Lattice points in lattice polytopes. Mathematika. 2001;48(1–2):15–24. doi: 10.1112/S0025579300014339. [DOI] [Google Scholar]
  • 31.Presburger, M.: Über die Vollständigkeit eines gewissen Systems der Arithmetik ganzer Zahlen, in welchem die Addition als einzige Operation hervortritt. Comptes Rendus du I congres de Mathematiciens des Pays Slaves, pp. 92–101 (1929)
  • 32.Rogers CA, Shephard GC. The difference body of a convex body. Arch. Math. (Basel) 1957;8:220–233. doi: 10.1007/BF01899997. [DOI] [Google Scholar]
  • 33.Schrijver A. Theory of Linear and Integer Programming. Chichester: Wiley; 1986. [Google Scholar]
  • 34.Sperner E. Ein Satz über Untermengen einer endlichen Menge. Math. Z. 1928;27(1):544–548. doi: 10.1007/BF01171114. [DOI] [Google Scholar]
  • 35.Tarski A. A Decision Method for Elementary Algebra and Geometry. 2. Berkeley: University of California Press; 1951. [Google Scholar]
  • 36.Taylor AD, Zwicker WS. Simple Games: Desirability Relations, Trading, Pseudoweightings. Princeton: Princeton University Press; 1999. [Google Scholar]
  • 37.Weispfenning, V.: Mixed Real-Integer Linear Quantifier Elimination. In: Proceedings of the 1999 International Symposium on Symbolic and Algebraic Computation, ISSAC ’99, (1999), pp. 129–136
  • 38.Weltge, S.: Sizes of Linear Descriptions in Combinatorial Optimization, Ph.D. thesis, Otto-von-Guericke-Universität Magdeburg, (2015)

Articles from Mathematical Programming are provided here courtesy of Springer

RESOURCES