Skip to main content
Springer logoLink to Springer
. 2020 Jul 15;190(1-2):467–482. doi: 10.1007/s10107-020-01542-w

Strengthening convex relaxations of 0/1-sets using Boolean formulas

Samuel Fiorini 1, Tony Huynh 2, Stefan Weltge 3,
PMCID: PMC8550646  PMID: 34776534

Abstract

In convex integer programming, various procedures have been developed to strengthen convex relaxations of sets of integer points. On the one hand, there exist several general-purpose methods that strengthen relaxations without specific knowledge of the set S of feasible integer points, such as popular linear programming or semi-definite programming hierarchies. On the other hand, various methods have been designed for obtaining strengthened relaxations for very specific sets S that arise in combinatorial optimization. We propose a new efficient method that interpolates between these two approaches. Our procedure strengthens any convex set containing a set S{0,1}n by exploiting certain additional information about S. Namely, the required extra information will be in the form of a Boolean formula ϕ defining the target set S. The new relaxation is obtained by “feeding” the convex set into the formula ϕ. We analyze various aspects regarding the strength of our procedure. As one application, interpreting an iterated application of our procedure as a hierarchy, our findings simplify, improve, and extend previous results by Bienstock and Zuckerberg on covering problems.

Mathematics Subject Classification: 90Cxx, 52Bxx, 68Q06

Introduction

In convex integer programming, there exist various procedures to strengthen convex relaxations of sets of integer points. Formally, given a set SZn of integer points and a convex set QRn with QZn=S, these methods aim to construct a new convex set f(Q) satisfying Sf(Q)Q.

On the one hand, there exist several general-purpose methods that strengthen relaxations without specific knowledge of the set S, in a systematic way. The hierarchies of Sherali and Adams [27], Lovász and Schrijver [21], and Lasserre [20], which are tailored to 0/1-sets S{0,1}n, are methods of this type.

On the other hand, various methods have been designed for obtaining strengthened relaxations for specific sets S. Such methods include, as an example, an impressive collection of families of valid inequalities of the traveling salesperson polytope that strengthen the classical subtour elimination formulation. Similar research has been performed for many other polytopes arising in combinatorial optimization, such as stable set polytopes and knapsack polytopes.

In this work, we propose a new method that interpolates between the two approaches described above. We design a procedure to strengthen any convex set QRn containing a set S{0,1}n by exploiting certain additional information about S. Namely, the required extra information will be in the form of a Boolean formula ϕ defining the target set S. Instead of viewing a Boolean formula as taking 0/1-vectors as input, the improved relaxation is obtained by “feeding” the convex set Q into the formula ϕ, and will be denoted by ϕ(Q).

While the formula ϕ has to be provided as a further input, for certain problems, there is an “obvious” candidate for ϕ. For example, suppose that the set S arises from a 0/1-covering problem. That is, it is given by a matrix A{0,1}m×n such that S={x{0,1}n:Ax1}, where 1 is the all-ones vector. Then S can be equivalently specified by the following Boolean formula in conjunctive normal form

ϕ:=i=1mj:Aij=1xj. 1

In addition, there is a vast literature on representing sets of 0/1-points via Boolean formulas, and we are free to use any of these formulas for our procedure.

An important property of ϕ(Q) is that it can be described by an extended formulation whose size is bounded by the size of the formula ϕ times the size of an extended formulation defining the input relaxation Q. Recall that an extended formulation of size m of a polytope P is determined by matrices TRn×d, ARm×d and vectors tRn, bRm such that P={xRn:yRd:Ayb,x=Ty+t}. Therefore, provided that ϕ has polynomial size, our procedure is efficient in the sense that a small extended formulation for Q can be converted into a small extended formulation for ϕ(Q) in polynomial time.

To illustrate, the well-studied procedure [10] that maps a relaxation Q to its Chvátal-Gomory closure f(Q) is not efficient in the above sense (this is to be expected since determining membership in f(Q) is NP-complete [12]). A striking example is given by choosing Q as the fractional matching polytope [26, Section 30.2]. In this case, the Chvátal-Gomory closure f(Q) is the matching polytope. It was recently shown by Rothvoß that all extended formulations of the matching polytope have exponential size [25].

Another property of our procedure is that it is complete in the sense that iterating it a finite number of times (in fact, at most n times) always yields the convex hull of S. Furthermore, our procedure can be applied to any convex set Q[0,1]n that contains the target set S. In particular, the set Q is even allowed to contain 0/1-points that do not belong to S. As an example, we can always apply our method with Q=[0,1]n, and thus finding an initial relaxation for conv(S) is never an issue. Intuitively, this is possible since the information of which points belong to S is stored in ϕ.

By viewing an iterated application of our procedure as a hierarchy, we obtain a significant simplification of the Bienstock-Zuckerberg hierarchy [7]. This is a powerful hierarchy tailored to 0/1-covering problems. However, one of its drawbacks is that the definition of the hierarchy is quite complicated. Prior to this work, it had been simplified by Mastrolilli [22] using a modification of the Sherali-Adams hierarchy that is based on appropriately defined high-degree polynomials. Subsequent to our work, it has also been simplified by Bienstock and Zuckerberg [9] themselves. Despite the simplicity of our method, we obtain extended formulations whose size is often vastly smaller than those of Bienstock and Zuckerberg [7, 8] and Mastrolilli [22]. We discuss this in more detail in Sect. 6.

Another aspect of our work is that it should serve as a bridge between combinatorial optimization and circuit complexity. As a concrete example, our procedure yields a very simple proof that Rothvoß’ result [25] on the extension complexity of the matching polytope implies a seminal result of Raz and Wigderson [23, Theorem 4.1] on the size of monotone formulas required to describe the matching function. In the other direction, constructions of small formulas describing a set S{0,1}n can now be used to obtain small extended formulations for conv(S). We give a few non-trivial examples of this in Sect. 5, but of course there are many more.

For readers familiar with circuit complexity, we mention that our work is inspired by a relatively unknown connection between Karchmer-Wigderson games [18] and nonnegative factorizations, pioneered by Hrubeš [17]. This connection was recently rediscovered by Göös, Jain and Watson [15] and exploited in [3]. As such, the proofs of the main properties of our hierarchy are also very short. Indeed, the proof of our main theorem can be regarded as a “polyhedral” Karchmer-Wigderson game, but no knowledge of communication complexity is required.

Paper Outline We start by describing our procedure to obtain ϕ(Q) in Sect. 2. In Sect. 3, we introduce notions that allow us to quantify the strength of our relaxations. Our main results regarding properties of the set ϕ(Q) are presented in Sect. 4. In Sect. 5, we discuss several applications of our method in detail. In Sect. 6 we compare our procedure to related work of Bienstock and Zuckerberg [7, 8] and Mastrolilli [22], and state some open problems.

Description of the procedure

In order to present the construction of the procedure, let us fix some notation concerning Boolean formulas. We consider formulas that are built out of input variables x1,,xn, conjunctions , disjunctions , and negations ¬ in the standard way. Here, we define the size1 of a Boolean formula as the total number of occurrences of input variables. We denote by |ϕ| the size of ϕ.

Given a Boolean formula ϕ, we can interpret it as a function from {0,1}n{0,1} and for an input x=(x1,,xn){0,1}n we will denote its output by ϕ(x). We say that the set S={x{0,1}n:ϕ(x)=1} is defined by ϕ. Two formulas are said to be equivalent if they define the same set.

We say that a formula is reduced if negations are only applied to input variables. Note that, by De Morgan’s laws, every Boolean formula can be brought into an equivalent reduced formula of the same size. As an example, the formulas

ϕ1=¬x1¬x2¬x1x3ϕ2=¬x1x2x1x3 2

are equivalent and both have size 4, but only the second is in reduced form.

Below, we will repeatedly use the elementary fact that for every reduced formula ϕ of size |ϕ|, one of the following holds:

  • |ϕ|=1 and either ϕ=xi or ϕ=¬xi for some i[n], or

  • |ϕ|2 and ϕ is either the conjuction or the disjunction of two reduced formulas ϕ1,ϕ2 such that |ϕ|=|ϕ1|+|ϕ2|.

This gives a way to represent any reduced Boolean formula as a rooted tree each of whose inner nodes is labeled with or and each of whose leaves is labeled with a non-negated variable xi or a negated variable ¬xi. Note that there may be many trees that represent the same reduced Boolean formula, but this will not matter. Observe that the size of a formula is the number of leaves in any one of its trees.

We are ready to describe our method to strengthen a convex relaxation of a given set of points in {0,1}n.

Definition 1

Let ϕ be a reduced Boolean formula with input variables x1,,xn and let Q[0,1]n be any convex set.

The set ϕ(Q)Rn is recursively constructed from the formula ϕ as follows.

  • Replace any non-negated input variable xi by the set {xQ:xi=1}.

  • Replace any negated input variable ¬xi by the set {xQ:xi=0}.

  • Replace any conjuction of two sets by their intersection.

  • Replace any disjunction of two sets by the convex hull of their union.

As an example, given any convex set Q[0,1]3 and the formula ϕ2 defined in (2), we have

ϕ2(Q)=conv({xQ:x1=0}{xQ:x2=1})conv({xQ:x1=1}{xQ:x3=1}).

In the remainder of this work, we will analyze several properties of ϕ(Q). One simple observation will be that, if S is defined by ϕ and SQ, then Sϕ(Q)Q. Furthermore, ϕ(Q) is strictly contained in Q unless conv(S)=Q. In order to quantify this improvement over Q, we will introduce useful measures in the next section.

Measuring the strength: pitch and notch

We now introduce two quantities that measure the strength of our procedure. To this end, note that for every linear inequality in variables x1,,xn we can partition [n] into sets I+,I-[n] (with I+I-=[n] and I+I-=) such that the inequality can be written as

iI+cixi+iI-ci(1-xi)δ, 3

where c=(c1,,cn)R0n and δR. Since we will only consider the intersection of [0,1]n with the set of points satisfying such an inequality, we are only interested in inequalities where δ0. In this case, we call (3) an inequality in standard form.

The notch of an inequality in standard form is the smallest number ν such that

jJcjδ 4

holds for every J[n] with |J|ν, while its pitch is the smallest number p such that (4) holds for every Jsupp(c) with |J|p. Note that the pitch of an inequality is at most its notch. For instance, the notch of the inequality x1+xn1 is n-1, while its pitch equals 1. Both quantities appear in the study of Chvátal-Gomory closures of polytopes in [0,1]n.

Intuitively, the notch of an inequality is related to how “deep” it cuts the 0/1-cube. For simplicity, assume that I-=, so that the origin minimizes the left-hand side of (3) over the cube. The notch of (3) is then the smallest number ν such that no 0/1-vector of Hamming weight ν or more is cut by the inequality. A similar intuition applies to the pitch.

We extend the definition of notch from inequalities to sets of 0/1-points as follows. The notch of a non-empty set S{0,1}n, denoted ν(S), is the largest notch of any inequality in standard form that is valid for S. It can be shown that ν(S) is equal to the smallest number k such that every k-dimensional face of [0,1]n contains a point from S. This equivalent definition of notch2 was introduced in [6]. The main result of [6] is that if S has bounded notch and conv(S) has bounded facet coefficients, then every polytope Q[0,1]n whose set of 0/1-points is S has bounded Chvátal-Gomory rank.

The term pitch was used by Bienstock and Zuckerberg [7], who defined it for monotone inequalities in standard form, that is, where I-=. Bounded pitch inequalities are related to the Chvátal-Gomory closure as follows. Consider any constants ε>0 and Z1, and any relaxation Q:={x[0,1]n:Axb} of a set S:=Q{0,1}n, with A, b nonnegative. Bienstock and Zuckerberg [8, Lemma 2.1] proved that adding all valid pitch-p inequalities for p/ln(1+ε)=Θ(/ε) to the system defining Q gives a relaxation R that is a (1+ε)-approximation3 of the -th Chvátal-Gomory closure of Q.

Main results

In this section, we prove several properties of the set ϕ(Q). Let us start with the following simple observation.

Proposition 2

For every reduced Boolean formula ϕ and every convex set Q[0,1]n, the set ϕ(Q) is a convex subset of Q. Moreover, ϕ(Q) contains every point x{0,1}n such that xQ and ϕ(x)=1. In other words, ϕ(Q) contains Qϕ-1(1).

Proof

The fact that ϕ(Q) is a convex set contained in Q is clear, since ϕ(Q) is constructed from faces of Q by taking intersections and convex hulls of unions.

We prove the second part by induction on the size of ϕ. If |ϕ|=1, then ϕ is either ϕ=xi or ϕ=¬xi for some i[n]. So either ϕ(Q)={xQ:xi=1} or ϕ(Q)={xQ:xi=0}, respectively. We see immediately that ϕ(Q) contains Qϕ-1(1).

Now if |ϕ|2, then ϕ is the conjunction or disjunction of two formulas of smaller size, say ϕ1 and ϕ2. In the first case, ϕ=ϕ1ϕ2 and we have ϕ(Q)=ϕ1(Q)ϕ2(Q)(Qϕ1-1(1))(Qϕ2-1(1))=Q(ϕ1-1(1)ϕ2-1(1))=Qϕ-1(1), where the inclusion follows from induction. In the second case, ϕ=ϕ1ϕ2 and ϕ(Q)=conv(ϕ1(Q)ϕ2(Q))(Qϕ1-1(1))(Qϕ2-1(1))=Q(ϕ1-1(1)ϕ2-1(1))=Qϕ-1(1).

Next, we argue that we can use ϕ to transform any extended formulation for Q into one for ϕ(Q). To this end, we make use of the extension complexity of a polytope P, which is defined as the smallest size of any extended formulation for P, and is denoted by xc(P). We need the following standard facts about extension complexity. First, if F is a non-empty face of P, then xc(F)xc(P). Second, for any non-empty polytopes P1,P2Rn one has xc(P1P2)xc(P1)+xc(P2). Third, a slight refinement of Balas’ theorem [2] states that xc(conv(P1P2))max{xc(P1),1}+max{xc(P2),1}, see [30, Prop. 3.1.1].

Proposition 3

Let ϕ be a reduced Boolean formula and let Q[0,1]n be a polytope such that ϕ(Q). Then ϕ(Q) is a polytope with extension complexity xc(ϕ(Q))|ϕ|xc(Q).

Proof

First, note that if xc(Q)=0, then Q is a single point and so is ϕ(Q), which implies xc(ϕ(Q))=0 and hence the claimed inequality holds trivially. Thus, we may assume that xc(Q)1 holds.

We prove the claim by induction over the size of ϕ. If |ϕ|=1, then ϕ=xi or ϕ=¬xi for some i[n]. So either ϕ(Q)={xQ:xi=1} or ϕ(Q)={xQ:xi=0}, respectively. In both cases, ϕ(Q) is a face of Q and hence xc(ϕ(Q))xc(Q).

If |ϕ|2, there exist reduced Boolean formulas ϕ1,ϕ2 (of size smaller than |ϕ|) with |ϕ|=|ϕ1|+|ϕ2| such that ϕ=ϕ1ϕ2 or ϕ=ϕ1ϕ2. First, consider the case ϕ=ϕ1ϕ2, in which we have ϕ(Q)=ϕ1(Q)ϕ2(Q). Since ϕ(Q) is non-empty, the same holds for ϕ1(Q) and ϕ2(Q) and hence, by the induction hypothesis, we have xc(ϕi(Q))|ϕi|xc(Q) for i=1,2. Therefore,

xc(ϕ(Q))xc(ϕ1(Q))+xc(ϕ2(Q))|ϕ1|xc(Q)+|ϕ2|xc(Q)=|ϕ|xc(Q).

It remains to consider the case ϕ=ϕ1ϕ2, in which we have ϕ(Q)=conv(ϕ1(Q)ϕ2(Q)).

Note that the claimed inequality holds if ϕ1(Q)= or ϕ2(Q)=. Thus, we may assume that ϕ1(Q) and ϕ2(Q) are both non-empty. By the induction hypothesis, xc(ϕi(Q))|ϕi|xc(Q) for i=1,2. Therefore,

xc(ϕ(Q))max{xc(ϕ1(Q)),1}+max{xc(ϕ2(Q)),1}max{|ϕ1|xc(Q),1}+max{|ϕ2|xc(Q),1}=|ϕ1|xc(Q)+|ϕ2|xc(Q)=|ϕ|xc(Q).

We remark that the upper bound provided by Proposition 3 is quite generous, and can be improved in some cases. For instance, if we let τ denote the number of maximal rooted subtrees of ϕ whose nodes are either input variables or gates, then we have xc(ϕ(Q))τxc(Q). This is due to the well-known fact that any intersection of faces of Q is a face of Q.

A Boolean formula is monotone if it does not contain negations. We are ready to prove our main theorem in the monotone case.

Theorem 4

Let ϕ be a monotone Boolean formula defining a set S{0,1}n and let Q[0,1]n be any convex set containing S. If Q satisfies all monotone inequalities of pitch at most p that are valid for S, then ϕ(Q) satisfies all monotone inequalities of pitch at most p+1 that are valid for S. Moreover, if Q is a polytope defined by an extended formulation of size σ, then ϕ(Q) is a polytope that can be defined by an extended formulation of size |ϕ|σ, where |ϕ| is the size of the formula.

Proof

The second part of the theorem is implied by Proposition 3. For the first part, consider any monotone pitch-(p+1) inequality in standard form that is valid for S={x{0,1}n:ϕ(x)=1},

iI+cixiδ. 5

By the definition of pitch, we may assume ci>0 for all iI+. We also assume δ>0; otherwise, there is nothing to prove. Let a{0,1}n be the characteristic vector of [n]\I+. Thus, ai=1 if i[n]\I+ and ai=0 if iI+. Notice that a violates (5). This implies ϕ(a)=0.

By contradiction, suppose that (5) is not valid for ϕ(Q). That is, there exists a point in ϕ(Q) that violates (5). Let T be a tree that represents the formula ϕ. Each vV(T) has a corresponding formula, which is the formula computed by the subtree of T rooted at v. For notational convenience, we identity each node of T with its corresponding formula.

Our strategy is to find a root-to-leaf path in T such that for every node ψ on this path,

()ψ(a)=0andthere exists a pointx~=x~(ψ)ψ(Q)that violates(5).

This is satisfied at the root node ϕ.

Now consider any non-leaf node ψ in T that satisfies (). Let ψ1 and ψ2 denote the children of ψ, so that ψ=ψ1ψ2 or ψ=ψ1ψ2. We claim that, in both cases, there exists an index k{1,2} such that ψk satisfies ().

First, in case ψ=ψ1ψ2, we let x~(ψ1)=x~(ψ2):=x~(ψ) and choose k{1,2} such that ψk(a)=0. Such an index is guaranteed to exist since ψ(a)=0. Then ψk satisfies ().

Second, in case ψ=ψ1ψ2, we have ψ1(a)=ψ2(a)=ψ(a)=0. We let x~(ψ1) and x~(ψ2) be any points of ψ1(Q) and ψ2(Q) (respectively) such that the segment [x~(ψ1),x~(ψ2)] contains x~. For at least one k{1,2}, the point x~(ψk) violates (5). Thus ψk satisfies () for that choice of k.

By iterating the argument above, starting at the root node ϕ, we reach a leaf node ψ that satisfies (). Note that ψ=xj for some j, since ϕ is monotone. We have aj=ψ(a)=0, so jI+. Moreover, there exists a point x~=x~(ψ)ψ(Q)={xQ:xj=1} that violates (5).

Now consider the monotone inequality

iI+ijcixiδ-cj. 6

This inequality is valid for S since it is the sum of (5) and cj(1-xj)0, which are both valid. Since cj(1-x~j)=0, (6) is also violated by x~ψ(Q)Q. The key observation is that the pitch of (6) is at most p, which contradicts our assumption that Q satisfies all monotone inequalities of pitch at most p.

In the non-monotone case, we now prove a statement analogous to Theorem 4 where the pitch is replaced by the notch.

Theorem 5

Let ϕ be a reduced Boolean formula defining a set S{0,1}n and let Q[0,1]n be any convex set containing S. If Q satisfies all inequalities of notch at most ν that are valid for S, then ϕ(Q) satisfies all inequalities of notch at most ν+1 that are valid for S. Moreover, if Q is a polytope defined by an extended formulation of size σ, then ϕ(Q) is a polytope that can be defined by an extended formulation of size |ϕ|σ.

Proof

The proof is almost identical to that of Theorem 4. Instead of repeating the whole proof, here we only explain the differences. The starting point is a notch-(ν+1) inequality

iI+cixi+iI-ci(1-xi)δ, 7

where I+[n] and I-[n] satisfy I+I-= and I+I-=[n], δ>0, and ci0 for all i[n]. Contrary to the previous proof, here we allow ci=0. Let a{0,1}n be the characteristic vector of I-. Notice that a violates (7). This implies ϕ(a)=0.

Let T be a tree that represents the formula ϕ. Using the same proof strategy, we find a leaf node ψ=xj or ψ=¬xj of T such that ψ(a)=0, and there exists a point x~=x~(ψ)ψ(Q) that violates (7).

If ψ=xj, then jI+ and we consider the valid inequality

iI+ijcixi+iI-ci(1-xi)+δ(1-xj)δ-cj.

Otherwise, ψ=¬xj and thus jI-. In this case, we consider the valid inequality

iI+cixi+iI-ijci(1-xi)+δxjδ-cj.

Since (7) is a notch-(ν+1) inequality, it is easy to check that the notch of both of the above inequalities is at most ν. However, they are violated by the point x~=x~(ψ)Q. As in the proof of Theorem 4, this gives the desired contradiction.

Setting ϕ1(Q):=ϕ(Q) and ϕ+1(Q):=ϕ(ϕ(Q)) for Z1, and using the trivial fact that the notch of a non-trivial inequality is at most n, we immediately obtain the following corollary.

Corollary 6

Let ϕ be a reduced Boolean formula defining a set S{0,1}n and let Q[0,1]n be any convex set containing S. Then we have ϕn(Q)=conv(S).

Another consequence of Theorem 5 is that integer points not belonging to S are already excluded from ϕ(Q).

Corollary 7

Let ϕ be a reduced Boolean formula defining a set S{0,1}n and let Q[0,1]n be any convex set containing S. Then we have ϕ(Q)Zn=S.

Proof

It suffices to show that no point from {0,1}n\S is contained in ϕ(Q). To this end, fix x¯{0,1}n\S and consider the inequality

i[n]:x¯i=0xi+i[n]:x¯i=1(1-xi)1,

which is violated by x¯, but valid for all other points of {0,1}n. Since the inequality has notch 1, by Theorem 5 it is also valid for ϕ(Q) and hence x¯ is not contained in ϕ(Q).

Applications

In this section, we present several applications of our procedure, in which we repeatedly make use of Theorems 4 and 5.

Monotone formulas for matching

As a first application, we demonstrate how our findings together with Rothvoß’ result [25] on the extension complexity of the matching polytope yield a very simple proof of a seminal result of Raz and Wigderson [23, Theorem 4.1], which states4 that any monotone Boolean formula deciding whether a graph on n nodes contains a perfect matching has size 2Ω(n). Before giving any further detail, we point out that Raz and Wigderson’s result extends to the bipartite case [23, Theorem 4.2], which is not the case of the polyhedral approach described below.

The fact that Rothvoß’ theorem implies Raz and Wigderson’s was first discovered by Göös, Jain and Watson [15]. While their arguments are based on connections between nonnegative ranks of certain slack matrices and Karchmer-Wigderson games, which implicitly play an important role in the proofs of Theorems 4 and 5, our results yield a straightforward proof that does not require any further notions.

To this end, let nZ2 be even and let G=(V,E) denote the complete undirected graph on n nodes. The set S considered by Raz and Wigderson is the set

S:={x{0,1}E:supp(x)Econtains a perfect matching}.

Let ϕ be any monotone Boolean formula in variables xe (eE) that defines S. Next, define the polytope

P:={x[0,1]E:x(δ(U))1for everyUVwith|U|odd}.

It is a basic fact that S is contained in P. Furthermore, observe that every non-trivial inequality in the definition of P has pitch 1. Thus, we have conv(S)ϕ([0,1]E)P. Moreover, if we consider the affine subspace

D:={xRE:x(δ({u})=1for everyuV},

it is well-known that both conv(S)D and PD are equal to the perfect matching polytope of G, and hence we obtain that ϕ([0,1]E)D is also equal to the perfect matching polytope of G. By Rothvoß’ result, this implies xc(ϕ([0,1]E))=2Ω(n). On the other hand, by Proposition 3 we also have xc(ϕ([0,1]E))|ϕ|·xc([0,1]E)=|ϕ|·2|E|n2|ϕ| and hence |ϕ| must be exponential in n.

Covering problems: the binary case

In this section, we consider sets S{0,1}n that arise from 0/1-covering problems, in which there is a matrix A{0,1}m×n such that S={x{0,1}n:Ax1}, where 1 is the all-ones vector. As an example, if A is the node-edge incidence matrix of an undirected graph G, then the points of S correspond to vertex covers in G. This shows that, in general, the convex hull of such sets S may not admit polynomial-size (in n) extended formulations, see for example, [4, 14, 15].

Moreover, general 0/1-hierarchies may have difficulties identifying basic inequalities even in simple instances. For example, in [8] it is shown that if Ax1 consists of the inequalities i[n]\{j}xi1 for each j[n], then it takes at least n-2 rounds of the Lovász-Schrijver or Sherali-Adams hierarchy to satisfy the pitch-2 inequality i[n]xi2.

By developing a hierarchy tailored to 0/1-covering problems, Bienstock and Zuckerberg [7] were able to bypass some of these issues. As their main result, for each kN, they construct a polytope fk(Q) containing S satisfying the following two properties. First, every inequality of pitch at most k that is valid for S is also valid for fk(Q). Second, fk(Q) can be described by an extended formulation of size (m+n)g(k), where g(k)=Ω(k2). However, constructing the polytope fk(Q) is quite technical and involved.

In contrast, our procedure directly implies significantly simpler and smaller extended formulations that satisfy all pitch-k inequalities.

Corollary 8

Let A{0,1}m×n, S={x{0,1}n:Ax1}, and kN. Then there is a polyhedral relaxation P of S such that all points of P satisfy all valid inequalities of pitch at most k, and P can be defined by an extended formulation of size at most 2n·(mn)k.

Proof

Let ϕ:=i=1mj:Aij=1xj. Since [0,1]n has 2n facets and ϕ has size at most mn, we may take P=ϕk([0,1]n) by Theorem 4.

Covering problems: bounded coefficients

Next, we consider a more general form of a covering problem in which S={x{0,1}n:Axb} for some non-negative integer matrix AZ0m×n and bZ1m. We first restrict ourselves to the case that all entries in A and b are bounded by some constant ΔZ2.

Based on their results in [7], Bienstock and Zuckerberg [8] provide an extended formulation of size O(m+nΔ)g(k), where g(k)=Ω(k2). Our method yields a significantly smaller extended formulation, via the following lemma.

Lemma 9

For every AZ0m×n and bZ1m with entries bounded by Δ, the set S={x{0,1}n:Axb} can be defined by a monotone formula ϕ of size at most Δ5.3mnlogO(1)(n). Moreover, this formula can be constructed in randomized polynomial time.

Proof

Fix i[m], let n:=j=1nAij and let ψi be a monotone formula defining the set {y{0,1}n:k=1nykbi}. Next, pick any function h:[n][n] such that |h-1(j)|=Aij for all j[n]. In formula ψi, replace every occurrence of yk by xh(k), for k[n]. We obtain a monotone formula ϕi defining the set {x{0,1}n:j=1nAijxjbi}. By using the construction of Hoory, Magen and Pitassi [16] for the initial formula ψi, the resulting formula ϕi has size

|ϕi|=|ψi|Δ4.3nlogO(1)(n/Δ)Δ5.3nlogO(1)(n),

since nnΔ and biΔ. The result follows by taking ϕ:=i=1mϕi.

Corollary 10

Let AZ0m×n, bZ1m with entries bounded by Δ, S={x{0,1}n:Axb}, and kN. Then there is a polyhedral relaxation P of S such that all points of P satisfy all valid inequalities of pitch at most k, and P can be defined by an extended formulation of size at most (Δ5.3mnlogO(1)(n))k.

Proof

By Theorem 4, we may take P=ϕk([0,1]n), where ϕ is the formula from Lemma 9.

Covering problems: the general case

In some cases, especially when m=O(1), the matrix AZ0m×n and vector bZ0m may have coefficients as large as 2Ω(nlogn). For such general instances, we can improve the bound from Corollary 10.

Corollary 11

Let AZ0m×n, bZ1m, S={x{0,1}n:Axb}, and kN. Then there is a polyhedral relaxation P of S such that all points of P satisfy all valid inequalities of pitch at most k, and P can be defined by an extended formulation of size at most mnO(logn)k.

Proof

Beimel and Weinreb [5] show that, for every a1,,an,δR0, the set {x{0,1}n:j=1najxjδ} can be decided by a monotone formula of size nO(logn). Let ϕ be the conjunction of these formulas for each inequality in Axb. By Theorem 4, we may take P=ϕk([0,1]n).

In comparison, for this general case, Bienstock and Zuckerberg [7] have no nontrivial upper bound.

Constant notch 0/1-sets

In this section, we consider non-empty sets S{0,1}n with constant notch ν(S). These sets have several desirable properties. For example, as noted in [3] (and implicitly in [11]), there is an easy polynomial-time algorithm to optimize a linear function over a constant notch set S, provided that we have a polynomial-time membership oracle for S. On the other hand, sets with constant notch do not necessarily admit small extended formulations. Indeed, counting arguments developed in [1, 24] show that even for a “generic” set S{0,1}n with notch ν(S)=1, conv(S) requires extended formulations of size 2Ω(n).

This raises the question of which constant notch sets do admit compact extended formulations. As an immediate corollary to Theorem 5, we have the following nice partial answer.

Corollary 12

If S{0,1}n has constant notch and S can be described by a formula ϕ of size polynomial in n, then conv(S) can be described by a polynomial-size extended formulation.

Notice that every explicit 0/1-set S of constant notch such that xc(conv(S)) is large would thus provide an explicit Boolean function requiring large depth circuits, and solve one of the hardest open problems in circuit complexity.

Comparison and conclusion

In this paper, we propose a new method for strengthening convex relaxations of 0/1-sets. Our approach currently yields the simplest and smallest linear extended formulations expressing inequalities of constant pitch in the monotone case, and constant notch in the general case.

By viewing an iterated application of our procedure as a hierarchy, we obtain a significant simplification of the Bienstock-Zuckerberg hierarchy [7]. Prior to our work, [7] had been simplified by Mastrolilli [22] using a modification of the Sherali-Adams hierarchy. Subsequent to our work, [7] has also been simplified by Bienstock and Zuckerberg [9] themselves for the case of A{0,1}m×n. The way [9] construct their extended formulation is similar to what we do, except that they replace the canonical monotone formula in (1) by a (logically equivalent) non-monotone formula, which might yield a tighter relaxation in some cases.

Although [22] is an important simplification of [7], our approach is from first principles and assumes no knowledge of polynomial optimization. Moreover, despite the simplicity of our approach, our extended formulations (see Corollaries 8, 10, and 11) are significantly smaller than those provided by [7, 22]. This is possible since we allow any monotone formula, and can thus use any known construction from the literature. In contrast, Bienstock and Zuckerberg [7] implicitly only consider formulas in conjunctive normal form. The number of clauses in every formula in conjunctive normal form is at least the number of minimal covers5, which makes it impossible for them to construct small extended formulations in situations where the number of minimal covers is large.

Furthermore, the way in which we derive our extended formulations is conceptually different than [22]. Mastrolilli [22] first writes down a proof of validity of any bounded-pitch inequality that has a certain “polynomial” form (similar to a sum-of-squares proof, except that no square is necessary). He then uses this proof to recursively define a set of polynomials S=S(A,k), and then constructs an extended formulation generalizing the Sherali-Adams hierarchy from S. At the heart of his approach is a lemma due to Bienstock and Zuckerberg [7, Lemma 4.2].

In our paper, we give a direct way to strengthen any given relaxation by “feeding” it in a Boolean formula ϕ defining the set of feasible 0/1 solutions. That is, we first describe how to construct the extended formulation. Then we prove that each iteration (of the same procedure) “gives at least one extra unit of pitch”. At the heart of our analysis lies a new ingredient (coming from a Karchmer-Widgerson game) replacing the lemma from Bienstock and Zuckerberg. This is the reason why we improve the exponential dependence in k from k2 to k in Corollary 8.

Finally, as far as we can tell, our results from Sects. 5.1 and 5.5 are completely independent from [7, 9, 22]. To conclude, we state a few open questions raised by our work.

  1. Do the new extended formulations lead to any new interesting algorithmic application, in particular for covering problems? This appears to be connected to the following question. How good are the lower bounds on the optimum value obtained after performing a few rounds of the Chvátal-Gomory closure? For some problems, such as the vertex cover problem in graphs or more generally in q-uniform hypergraphs with q=O(1), the bounds turn out to be quite poor in the worst case [4, 28]. The situation is less clear for other problems, such as network design problems. Recent work [13] on the tree augmentation problem uses certain inequalities from the first Chvátal-Gomory closure in an essential way. For the related 2-edge connected spanning subgraph problem, our work implies that one can approximately optimize over the -th Chvátal-Gomory closure in quasi-polynomial time, for every =O(1).

  2. For which classes of polytopes in [0,1]n can one approximate a constant number of rounds of the Chvátal-Gomory closure with compact extended formulations? Mastrolilli [22] show that this is possible for packing problems. However, his approach crucially uses positive semi-definite extended formulations. Packing problems are unlikely to admit compact linear extended formulations, although we do not have a proof of this.

  3. Can one find polynomial-size monotone formulas for any nonnegative weighted threshold function, that is, for every min-knapsack {x{0,1}n:i=1naixiβ}? This would improve on the nO(logn) upper bound by Beimel and Weinreb [5]. Klabjan, Nemhauser and Tovey show that separating pitch-1 inequalities for such sets is NP-hard [19]. However, this does not rule out a polynomial-size extended formulation defining a relaxation that would be stronger than that provided by pitch-1 inequalities.

Acknowledgements

Open Access funding provided by Projekt DEAL. This work was done while Samuel Fiorini was visiting the Simons Institute for the Theory of Computing. It was supported in part by the DIMACS/Simons Collaboration on Bridging Continuous and Discrete Optimization through NSF grant #CCF-1740425, and in part by ERC Consolidator Grant 615640-ForEFront. We also thank all the anonymous referees for their constructive comments on the paper.

Footnotes

1

Some sources also count the number of occurrences of , and ¬, which is not necessary for our purposes.

2

To avoid possible confusion, we warn the reader that in a previous version of [6], this notion is called pitch instead of notch.

3

Here, this means that min{cx:xf(Q)}(1+ε)min{cx:xR} for every nonnegative cost vector c, where f(Q) denotes the -th Chvátal-Gomory closure of Q.

4

The original result of Raz and Wigderson states that the depth of any monotone circuit computing the mentioned function is Ω(n), which is equivalent to the mentioned result, see [29].

5

A point x{0,1}n is called a min-term, and its characteristic vector a minimal cover, if x is a minimal element of S with respect to the component-wise order .

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Averkov, G., Kaibel, V., Weltge, S.: Maximum semidefinite and linear extension complexity of families of polytopes. Math. Program. 167(2), Ser. A, pp. 381–394 (2018). MR 3755737
  • 2.Balas, E.: Disjunctive programming. Ann. Discrete Math. 5, 3–51 (1979). Discrete optimization (Proceedings of the Advanced Research Institute on Discrete Optimization and Systems Applications, Banff, Alta., 1977), II. MR 558566
  • 3.Bazzi, A., Fiorini, S., Huang, S., Svensson, O.: Small extended formulation for knapsack cover inequalities from monotone circuits. In: Proceedings of the 28th Annual ACM-SIAM Symposium on Discrete Algorithms 2017, (2017)
  • 4.Bazzi, A., Fiorini, S., Pokutta, S., Svensson, O.: No small linear program approximates vertex cover within a factor 2–e. In: 2015 IEEE 56th Annual Symposium on Foundations of Computer Science (FOCS), pp. 1123–1142 (2015)
  • 5.Beimel A, Weinreb E. Monotone circuits for monotone weighted threshold functions. Inf. Process. Lett. 2006;97(1):12–18. doi: 10.1016/j.ipl.2005.09.008. [DOI] [Google Scholar]
  • 6.Benchetrit, Y., Fiorini, S., Huynh, T., Weltge, S.: Characterizing Polytopes Contained in the 0/1-Cube with Bounded Chvátal-Gomory Rank. Mathematics of Operations Research (2018)
  • 7.Bienstock D, Zuckerberg M. Subset algebra lift operators for 0–1 integer programming. SIAM J. Optim. 2004;15(1):63–95. doi: 10.1137/S1052623402420346. [DOI] [Google Scholar]
  • 8.Bienstock D, Zuckerberg M. Approximate fixed-rank closures of covering problems. Math. Program. Ser. A. 2006;105:9–27. doi: 10.1007/s10107-005-0598-z. [DOI] [Google Scholar]
  • 9.Bienstock, D., Zuckerberg, M.: Simpler derivation of bounded pitch inequalities for set covering, and minimum knapsack sets, arXiv:1806.07435 (2018)
  • 10.Chvátal V. Edmonds polytopes and a hierarchy of combinatorial problems. Discrete Math. 1973;4:305–337. doi: 10.1016/0012-365X(73)90167-2. [DOI] [Google Scholar]
  • 11.Cornuéjols, G., Lee, D.: On some polytopes contained in the 0,1 hypercube that have a small Chvátal rank. In: International Conference on Integer Programming and Combinatorial Optimization, pp. 300–311. Springer (2016)
  • 12.Eisenbrand F. On the membership problem for the elementary closure of a polyhedron. Combinatorica. 1999;19(2):297–300. doi: 10.1007/s004930050057. [DOI] [Google Scholar]
  • 13.Fiorini, S., Groß, M., Könemann, J., Sanità, L.: Approximating weighted tree augmentation via Chvátal-Gomory cuts. In: Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 817–831. SIAM, Philadelphia, PA (2018). MR 3775841
  • 14.Fiorini, S., Massar, S., Pokutta, S., Tiwary, H. R., de Wolf, R.: Exponential lower bounds for polytopes in combinatorial optimization. J. ACM 62(2), 1–23 (2015)
  • 15.Göös, M., Jain, R., Watson, T.: Extension complexity of independent set polytopes. In: Proc. FOCS 2016, (2016)
  • 16.Hoory, S., Magen, A., Pitassi, T.: Monotone circuits for the majority function. In: Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, 9th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, APPROX 2006 and 10th International Workshop on Randomization and Computation, RANDOM 2006, Barcelona, Spain, August 28-30 2006, Proceedings, pp. 410–425 (2006)
  • 17.Hrubeš Pavel. On the nonnegative rank of distance matrices. Inf. Process. Lett. 2012;112(11):457–461. doi: 10.1016/j.ipl.2012.02.009. [DOI] [Google Scholar]
  • 18.Karchmer M, Wigderson A. Monotone circuits for connectivity require super-logarithmic depth. SIAM J. Discrete Math. 1990;3:255–265. doi: 10.1137/0403021. [DOI] [Google Scholar]
  • 19.Klabjan D, Nemhauser GL, Tovey C. The complexity of cover inequality separation. Oper. Res. Lett. 1998;23:35–40. doi: 10.1016/S0167-6377(98)00025-X. [DOI] [Google Scholar]
  • 20.Lasserre, J. B.: An explicit exact SDP relaxation for nonlinear 0-1 programs. Integer programming and combinatorial optimization (Utrecht, 2001), Lecture Notes in Comput. Sci., vol. 2081, pp. 293–303. Springer, Berlin (2001). MR 2041934
  • 21.Lovász L, Schrijver A. Cones of matrices and set-functions and 0-1 optimization. SIAM J. Optim. 1991;1(2):166–190. doi: 10.1137/0801013. [DOI] [Google Scholar]
  • 22.Mastrolilli, M: High degree sum of squares proofs, Bienstock-Zuckerberg hierarchy and CG cuts. In: Integer Programming and Combinatorial Optimization - 19th International Conference, IPCO 2017, Waterloo, ON, Canada, June 26-28, 2017, Proceedings, pp. 405–416 (2017)
  • 23.Raz R, Wigderson Avi. Monotone circuits for matching require linear depth. J. Assoc. Comput. Mach. 1992;39(3):736–744. doi: 10.1145/146637.146684. [DOI] [Google Scholar]
  • 24.Rothvoß T. Some 0/1 polytopes need exponential size extended formulations. Math. Program. A. 2013;142:255–268. doi: 10.1007/s10107-012-0574-3. [DOI] [Google Scholar]
  • 25.Rothvoß, T.: The matching polytope has exponential extension complexity. J. ACM 64, no. 6, Art. 41, 19 (2017). MR 3713797
  • 26.Schrijver, A.: Combinatorial optimization. Polyhedra and efficiency. Vol. A, Algorithms and Combinatorics, vol. 24, Springer-Verlag, Berlin, 2003, Paths, flows, matchings, Chapters 1–38. MR 1956924 (2004b:90004a)
  • 27.Sherali HD, Adams WP. A hierarchy of relaxations between the continuous and convex hull representations for zero-one programming problems. SIAM J. Discrete Math. 1990;3(3):411–430. doi: 10.1137/0403036. [DOI] [Google Scholar]
  • 28.Singh, M., Talwar, K.: Improving integrality gaps via Chvátal-Gomory rounding. In: Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, 13th International Workshop, APPROX 2010, and 14th International Workshop, RANDOM 2010, Barcelona, Spain, September 1-3, 2010. Proceedings, pp. 366–379 (2010)
  • 29.Wegener I. Relating monotone formula size and monotone depth of Boolean functions. Inf. Process. Lett. 1983;16(1):41–42. doi: 10.1016/0020-0190(83)90011-X. [DOI] [Google Scholar]
  • 30.Weltge, S.: Sizes of linear descriptions in combinatorial optimization. dissertation, Otto-von-Guericke-Universität Magdeburg, (2016)

Articles from Mathematical Programming are provided here courtesy of Springer

RESOURCES