Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2020 Jun 26;12178:218–232. doi: 10.1007/978-3-030-51825-7_16

Towards a Better Understanding of (Partial Weighted) MaxSAT Proof Systems

Javier Larrosa , Emma Rollon ‡,
Editors: Luca Pulina8, Martina Seidl9
PMCID: PMC7326545

Abstract

MaxSAT is a very popular language for discrete optimization with many domains of application. While there has been a lot of progress in MaxSAT solvers during the last decade, the theoretical analysis of MaxSAT inference has not followed the pace. Aiming at compensating that lack of balance, in this paper we do a proof complexity approach to MaxSAT resolution-based proof systems. First, we give some basic definitions on completeness and show that refutational completeness makes compleness redundant, as it happens in SAT. Then we take three inference rules such that adding them sequentially allows us to navigate from the weakest to the strongest resolution-based MaxSAT system available (i.e., from standalone MaxSAT resolution to the recently proposed ResE), each rule making the system stronger. Finally, we show that the strongest system captures the recently proposed concept of Circular Proof while being conceptually simpler, since weights, which are intrinsic in MaxSAT, naturally guarantee the flow condition required for the SAT case.

Keywords: MaxSAT, Rule-based proof systems, Circular proofs

Introduction

Proof Complexity is the field aiming to understand the computational cost required to prove or refute statements. Different proof systems may provide different proofs for the same formula and some proof systems are provably more efficient than others. When that happens, proof complexity cares about which elements of the more powerful proof system really make the difference.

In propositional logic, proof systems that work with CNF formulas have attracted the interest of researchers for several decades. One of the reasons is that CNF is the working language of the extremely successful SAT solvers and the search spaces that they traverse can be understood and analyzed as proofs [5].

(Partial Weighted) MaxSAT is the optimization version of SAT. Although discrete optimization problems can be modeled and solved with SAT solvers, many of these problems are more naturally treated as MaxSAT. For this reason the design of MaxSAT solvers has attracted the interest of researchers in the last decade. Interestingly, while some of the first efficient MaxSAT solvers were strongly influenced by MaxSAT inference [9], this influence has diminished along time. The currently most efficient algorithms solve MaxSAT by sophisticated sequences of calls to SAT solvers [1, 4, 11].

We think it is important to understand this scientific trend with a more formal approach and such understanding must go through an analysis of the possibilities and limitations of MaxSAT proof systems (how MaxSAT inference compares with obtaining the same result with a sequence of SAT inferences?). The purpose of this paper is to start contributing in that direction by improving the understanding of MaxSAT proof systems. With that aim we extend some classic proof complexity concepts (i.e, entailment, completeness, etc) to MaxSAT and analyze three proof systems of increasing complexity: from stand-alone MaxSAT resolution (Res) [9] to the recently proposed resolution with extension (ResE) [10]. For the sake of clarity, we break the extension rule of ResE into two atomic rules: split and virtual; and analyze their incremental power. Our results show that each add-on makes a provable stronger system. More precisely, we have observed that: Res is sound and refutationally complete. Adding the split rule (ResS) we get completeness and (unlike what happens in SAT) some exponential speed-up in certain refutations. Further adding the virtual rule (ResSV), which allows to keep negative weights during proofs, we get further speed-up by capturing the concept of circular proofs [3]. We also give the interesting and somehow unexpected result that in some cases rephrasing a MaxSAT refutation as a MaxSAT entailment may transform the problem from exponentially hard to polynomial when using ResSV.

The structure of the paper is as follows. In Sects. 2 and 3 we provide preliminaries on SAT and MaxSAT, respectively. In Sect. 4 we define some variations of the Pigeon Hole Problem that we need for the proofs of the theorems. In Sect. 5 we provide basic definition and properties on MaxSAT proof systems and introduce and analyze the different systems addressed in the paper. In Sect. 6 we show how the strongest proof system ResSV captures the notion of Circular Proof. Finally, in Sect. 7, we give some conclusions.

SAT Preliminaries

A boolean variable x takes values on the set Inline graphic. A literal is a variable x (positive literal) or its negation Inline graphic (negative literal). A clause is a disjunction of literals. A clause C is satisfied by a truth assignment X if X contains at least one of the literals in C. The empty clause is denoted Inline graphic and cannot be satisfied. The negation of a clause Inline graphic is satisfied if all its literals are falsified and this can be trivially expressed in CNF as the set of unit clause Inline graphic.

A CNF formula Inline graphic is a set of clauses (understood as a conjunction). A truth assignment satisfies a formula if it satisfies all its clauses. If such an assignment exists, we say that the assignment is a model and the formula is satisfiable, noted Inline graphic. Determining whether a formula is satisfiable constitutes the well-known SAT Problem.

We say that formula Inline graphic entails formula Inline graphic, noted Inline graphic, if every model of Inline graphic is also a model of Inline graphic. Two formulas Inline graphic and Inline graphic are equivalent, noted Inline graphic, if they entail each other.

An inference rule is given by a set of antecedent clauses and a set of consequent clauses. In SAT, the intended meaning of an inference rule is that if some clauses of a formula match the antecedents, the consequents can be added. The rule is sound if every truth assignment that satisfies the antecedents also satisfies the consequents. The process of applying an inference rule to a formula Inline graphic is noted Inline graphic.

Consider the following two rules [3, 12],

graphic file with name 495779_1_En_16_Equ16_HTML.gif

where A and B are arbitrary (possibly empty) disjunctions of literals and x is an arbitrary variable. In propositional logic it is customary to define rules with just one consequent because one rule with s consequents can be obtained from s one-consequent rules. As we will see, this is not the case in MaxSAT. For this reason, here we prefer to introduce the two-consequents split rule instead of the equivalent weakening rule [3] to keep the parallelism with MaxSAT more evident.

A proof system Inline graphic is a set of inference rules. A proof of length e under a proof system Inline graphic is a finite sequence Inline graphic where Inline graphic is the original formula and each Inline graphic is obtained by applying an inference rule from Inline graphic. We will use Inline graphic to denote an arbitrary number of inference steps. A short proof is a proof whose length can be bounded by a polynomial on Inline graphic. A refutation is a proof such that Inline graphic. Refutations are important because they prove unsatisfiability.

A proof system is sound if all its rules are sound. All the SAT rules and proof systems considered in this paper are sound. A proof system is complete if for every Inline graphic such that Inline graphic, there is a proof Inline graphic with Inline graphic. Although completeness is a natural and elegant property, it has limited practical interest. For that reason a weaker version of completeness has been defined. A proof system is refutationally complete if for every unsatisfiable formula Inline graphic there is a refutation starting in Inline graphic (i.e, completeness is required only for refutations). It is usually believed that refutational completeness is enough for practical purposes. The reason is that Inline graphic if and only if Inline graphic is unsatisfiable, so any implicationally complete proof system can prove the entailment by deriving a refutation from a CNF formula equivalent to Inline graphic.

It is well-known that the proof system made exclusively of resolution is refutationally complete and adding the split rule makes the system complete. The following property says that adding the split rule does not give any advantage to resolution in terms of refutational power.

Property 1

[(see Lemma 7 in [2]]. A proof system with resolution and split as inference rules cannot make shorter refutations than a proof system with only resolution.

MaxSAT Preliminaries

A weight w is a positive number or Inline graphic (i.e, Inline graphic). We extend sum and substraction to weights defining Inline graphic and Inline graphic for all w. Note that Inline graphic is only defined when Inline graphic.

A weighted clause is a pair (Cw) where C is a clause and w is a weight associated to its falsification. If Inline graphic we say that the clause is hard, else it is soft. A weighted MaxSAT CNF formula is a set of weighted clauses Inline graphic. If all the clauses are hard, we say that the formula is hard. We say that Inline graphic if for all Inline graphic there is a Inline graphic with Inline graphic.

Given a formula Inline graphic, we define the cost of a truth assignment X, noted Inline graphic, as the sum of weights over the clauses that are falsified by X. The MaxSAT problem is to find the minimum cost over the set of all truth assignments,

graphic file with name M50.gif

This definition of MaxSAT including weights and hard clauses is sometimes referred to as Partial Weighted MaxSAT [11]. Note that any clause (Cw) can be broken into two clauses (Cu), (Cv) as long as Inline graphic. In the following we will assume that clauses are separated and merged as needed.

We say that formula Inline graphic entails formula Inline graphic, noted Inline graphic, if Inline graphic is a lower bound of Inline graphic. That is, if for all X, Inline graphic. We say that two formulas Inline graphic and Inline graphic are equivalent, noted Inline graphic, if they entail each other. That is, if forall X, Inline graphic.

In the following Sections we will find useful to deal with negated clauses. Hence, the corresponding definitions and useful properties. Let A and B be arbitrary disjunctions of literals. Let Inline graphic mean that falsifying Inline graphic incurs a cost of w. Although Inline graphic is not a clause, the following property shows that it can be efficiently transformed into a CNF equivalent,

Property 2

Inline graphic.

Observe that if we restrict the MaxSAT language to hard formulas we have standard SAT CNF formulas where Inline graphic corresponds to false and 0 corresponds to true. Note that all the previous definitions naturally instantiate to their SAT analogous.

Pigeon Hole Problem and Variations

We define the well-known Pigeon Hole Problem Inline graphic and three MaxSAT versions Inline graphic, Inline graphic and Inline graphic, that we will be using in the proof of our results.

In the Pigeon Hole Problem Inline graphic the goal is to assign Inline graphic pigeons to m holes without any pair of pigeons sharing their hole. In the usual SAT encoding there is a boolean variable Inline graphic (with Inline graphic and Inline graphic) which is true if pigeon i is in hole j. There are two groups of clauses. For each pigeon i, we have the clause,

graphic file with name M76.gif

indicating that pigeon i must be assigned to a hole. For each hole j we have the set of clauses,

graphic file with name M77.gif

indicating that hole j is occupied by at most one pigeon. Let Inline graphic be the union of all these sets of clauses Inline graphic. It is obvious that Inline graphic is an unsatisfiable CNF formula. In MaxSAT notation the pigeon hole problem is,

graphic file with name M81.gif

and clearly Inline graphic.

In the soft Pigeon Hole Problem Inline graphic the goal is to find the assignment that falsifies the minimum number of clauses. In MaxSAT language it is encoded as,

graphic file with name M84.gif

and it is obvious that Inline graphic.

The Inline graphic problem is like the soft pigeon hole problem but augmented with one more clause Inline graphic where m is the number of holes. Note that Inline graphic.

Finally, the Inline graphic problem is like the soft pigeon hole problem but augmented with a set of unit clauses Inline graphic. Note that Inline graphic.

MaxSAT Proof Systems

A MaxSAT inference rule is given by a set of antecedent clauses and a set of consequent clause. In MaxSAT, the application of an inference rule is to replace the antecedents by the consequents. The process of applying an inference rule to a formula Inline graphic is also noted Inline graphic. The rule is sound if it preserves the equivalence of the formula.

As in the SAT case, given a proof system Inline graphic (namely, a set of rules) a proof of length e is a sequence Inline graphic where Inline graphic is the original formula and each Inline graphic is obtained by applying an inference rule from Inline graphic. If Inline graphic, we say that the proof is a proof of Inline graphic from Inline graphic.

A proof system is sound if all its rules are sound. In this paper all MaxSAT rules and proof systems are sound. A proof system is complete if for every Inline graphic, Inline graphic such that Inline graphic, there is a proof of Inline graphic from Inline graphic. A refutation of Inline graphic is a proof of Inline graphic from Inline graphic with Inline graphic. A proof system is refutationally complete if it can derive a refutation of every formula Inline graphic.

Next we show that, similarly to what happens in SAT, refutationally completeness is sufficient for practical purposes. The reason is that it can also be used to proof or disproof general entailment, making completeness somehow redundant. We need first to define the maximum soft cost of a formula as Inline graphic and the negation of a MaxSAT formula as the negation of all its clauses Inline graphic. The following property tells the effect of negating a formula without hard clauses,

Property 3

If Inline graphic is a CNF MaxSAT formula without hard clauses, then

graphic file with name M115.gif

Proof

Let X be a truth assignment, Inline graphic be the set of clauses satisfied by X and Inline graphic be the set of clauses falsified by X. It is clear that Inline graphic while Inline graphic. Since Inline graphic and Inline graphic, then Inline graphic. Therefore, Inline graphic and, as a consequence, Inline graphic.

We can now show that an entailment Inline graphic can be rephrased as a MaxSAT problem,

Theorem 1

Let Inline graphic and Inline graphic be two MaxSAT formulas, possibly with hard clauses. Then,

graphic file with name M128.gif

where Inline graphic is a softened version of Inline graphic in which infinity weights are replaced by Inline graphic.

Proof

Let us prove the if direction. Inline graphic means that Inline graphic. Also, by construction Inline graphic. Therefore, Inline graphic. Because Inline graphic does not contain hard clauses, Inline graphic, which means that, Inline graphic Adding Inline graphic to both sides of the disequality we get, Inline graphic. By Property 3, we have, Inline graphic which clearly means that, Inline graphic.

Let us proof the else if direction. Inline graphic implies that Inline graphic. Moreover, since Inline graphic does not have hard clauses, from Property 3 we know that, Inline graphic so we have, Inline graphic and we need to have, Inline graphic. We reason on cases for truth assignment X:

  1. If Inline graphic, by definition of Inline graphic, Inline graphic. Therefore, Inline graphic, which proofs this case.

  2. If Inline graphic, by definition of Inline graphic, Inline graphic. We show that in this case, Inline graphic.

    • if Inline graphic then Inline graphic. We show that Inline graphic implies that Inline graphic. We proceed by contradiction. Let us suppose that Inline graphic and Inline graphic. The latter means that X satisfies all hard clauses. As a consequence, Inline graphic, which contradicts the hypothesis.
    • if Inline graphic, then there are no X such that Inline graphic. By definition of Inline graphic, forall Inline graphic, Inline graphic. Therefore, either X satisfies all hard clauses in Inline graphic and then Inline graphic or X falsifies at least one hard clause in Inline graphic and then Inline graphic.

which proofs the theorem.

The application of the previous theorem to single clause entailment yields the following corollary.

Corollary 1

Let Inline graphic be a formula and (Cw) be a weighted clause. Then,

graphic file with name M174.gif

where Inline graphic

A useful application of this corollary will be shown in Sect. 5.3.

In the rest of the section we introduce and analyze the incremental impact of the three inference rules.

Resolution

The MaxSAT resolution rule [8] is,

graphic file with name 495779_1_En_16_Equ17_HTML.gif

where A and B are arbitrary (possibly empty) disjunctions of literals and Inline graphic. When A (resp. B) is empty, Inline graphic (resp. Inline graphic) is constant true, so Inline graphic (resp. Inline graphic) is tautological. Note that MaxSAT resolution, when applied to two hard clauses, corresponds to SAT resolution.

It is known that the proof system Res made exclusively of the resolution rule is refutationally complete,

Theorem 2

[6, 9]. Res is refutationally complete.

However, as we show next, it is not complete.

Theorem 3

Res is not complete.

Proof

Consider formula Inline graphic. It is clear that Inline graphic which cannot be derived with Res.

It is well-known that Res cannot compute short refutations for PHP [12] or SPHP [6]. However, it can efficiently refute Inline graphic. We write it as a property and sketch the proof (which is a direct adaptation of what was proved in [7] and [10]) because it will be instrumental in the proof of several results in the rest of this section,

Property 4

There is a short Res refutation of Inline graphic.

Proof

The proof is based on the fact that for each one of the Inline graphic pigeons there is a short refutation

graphic file with name M186.gif

and for each one of the m holes there is a short refutation

graphic file with name M187.gif

Because each derivation is independent of the other we can concatenate them into,

graphic file with name M188.gif

which is a refutation of Inline graphic.

Split

The split rule,

graphic file with name 495779_1_En_16_Equ18_HTML.gif

is the natural extension of its SAT counterpart. Consider the proof system ResS, made of resolution and split. We show that, as it happens in the SAT case, the split rule brings completeness,

Theorem 4

ResS is complete.

Proof

The proof is based on the following facts:

  1. For every formula Inline graphic there is a proof Inline graphic where Inline graphic is made exclusively of splits in which all the clauses of Inline graphic contain all the variables in the formula and there are no repeated clauses. Each clause Inline graphic can be expanded to a new variable not in C using the split rule. This process can be repeated until all clauses in the current formula contain all the variables in the formula. Note that all clauses Inline graphic, Inline graphic can be merged and, as a result, Inline graphic does not contain repeated clauses.

  2. If Inline graphic then Inline graphic. Let Inline graphic be the proof from Inline graphic to Inline graphic. Then, Inline graphic is done resolving the pairs of clauses in Inline graphic that were splitted in the Inline graphic step.

  3. If Inline graphic then there exists a unique clause Inline graphic which is falsified by X

By fact (1), Inline graphic and Inline graphic. Because of soundness, Inline graphic, Inline graphic and Inline graphic. Since Inline graphic, Inline graphic. Therefore, Inline graphic which, by fact (3), means that for each X there exists a unique Inline graphic and Inline graphic which is falsified by X. Separating all Inline graphic into Inline graphic, Inline graphic we have Inline graphic. Therefore, Inline graphic. By fact (2), Inline graphic.

However, unlike what happens in the SAT case (see Property 1), ResS is stronger than Res,

Theorem 5

ResS is stronger than Res.

Proof

On the one hand, it is clear that ResS can simulate any proof of Res since it is a superset of Res. On the other hand, unlike Res, ResS can produce short refutations for Inline graphic, as shown below.

First, let us proof that Res cannot produce short refutations for Inline graphic. Since the resolution rule does not apply to the empty clause Inline graphic, if Res could refute Inline graphic in polynomial time it would also refute Inline graphic in polynomial time, which is impossible [6].

ResS can produce short refutations for Inline graphic because it can transform Inline graphic into Inline graphic and then apply Property 4. The transformation is done by a sequence of splits,

graphic file with name 495779_1_En_16_Equ19_HTML.gif

that move one unit of weight from the empty clause to every variable in the formula and its negation.

Virtual

In a recent paper [10] we proposed a proof system in which clauses with negative weights can appear during the proof. This is equivalent to adding to ResS the virtual rule,

graphic file with name 495779_1_En_16_Equ20_HTML.gif

which allows to introduce a fresh clause (Aw) into the formula. To preserve soundness (i.e, cancel out the effect of the addition) it also adds Inline graphic.

Let ResSV be the proof system made of resolution, split and virtual (note that resolution and split are only defined for antecedents with positive weights). It has been shown that if Inline graphic is a ResSV proof and Inline graphic does not contain any negative weight, then for every Inline graphic we have that Inline graphic.

The following theorem shows that the virtual rule adds further strength to the proof system,

Theorem 6

ResSV is stronger than ResS.

Proof

On the one hand, it is clear that ResSV can simulate any proof of ResS since it is a superset of ResS. On the other hand, ResSV can produce a short refutation of SPHP and ResS cannot.

The short refutation of ResSV, as shown in [10], is obtained by first virtually transforming SPHP into Inline graphic. Then, it uses Property 4 to derive Inline graphic. Finally, it splits one unit of the empty clause cost to each pair Inline graphic to cancel out negative weights. At the end of the process all clauses have positive weight while still having Inline graphic.

It is clear that ResS cannot polynomially refute Inline graphic because otherwise a SAT proof system with resolution and split rules would produce shorter refutations than a SAT proof system with only resolution, which contradicts Property 1.

We will finish this section showing that Theorem 1 has an unexpected application in the context of ResSV. Consider the problem of proving Inline graphic. This can be done with a refutation of PHP. Namely Inline graphic or using Corollary 1, which tells that Inline graphic if and only if Inline graphic. The following two theorems shows that ResSV cannot do efficiently the first approach, but can do efficiently the second.

Theorem 7

There is no short ResSV refutation of PHP.

Proof

Virtual rule cannot introduce hard clauses and resolution and split rules only produce a hard consequence if they have hard antecedents. As a consequence, Inline graphic can only be obtained by resolving or splitting hard clauses in PHP. If ResSV produce a short refutation for PHP, ResS and, as a consequence Res, also produce the same short refutation for PHP, which contradicts Property 1.

Theorem 8

There is a polynomial ResSV proof of Inline graphic from PHP.

Proof

We only need to apply the virtual rule, graphic file with name 495779_1_En_16_Figa_HTML.jpg and then split, graphic file with name 495779_1_En_16_Figb_HTML.jpg for each ij. The resulting problem is similar to Inline graphic but with hard clauses. At this point and adapting the proof of 4 we can derive Inline graphic cancel out the negative weight while still retaining Inline graphic.

MaxSAT Circular Proofs

In this section we study the relation between ResSV and the recently proposed concept of circular proofs [3]. Circular proofs allows the addition of an arbitrary set of clauses to the original formula. It can be seen that conclusions are sound as long as the added clauses are re-derived as many times as they are used. In the original paper this condition is characterized as the existence of a flow in a graphical representation of the proof. Here we show that the ResSV proof system naturally captures the same idea and extends it from SAT to MaxSAT with an arguably simpler notation. In particular, the virtual rule guarantees the existence of the flow.

SAT Circular Proofs

We start reviewing the SAT case, as defined in [3]. Given a CNF formula Inline graphic a circular pre-proof of Inline graphic from Inline graphic is a sequence,

graphic file with name M254.gif

such that Inline graphic, Inline graphic is an arbitrary set of clauses and each Inline graphic (Inline graphic) is obtained from previous clauses by applying an inference rule in the proof system. Note that the same clause can be both derived and used several times during the proof.

A circular pre-proof Inline graphic can be associated with a directed bi-partite graph Inline graphic such that there is one node in J for each element of the sequence (called clause nodes) and one node in I for each inference step (called inference nodes). There is an arc from Inline graphic to Inline graphic if u is an antecedent clause in the inference step of v. There is an arc from Inline graphic to Inline graphic if v is a consequent clause in the inference step of u. The graph is compacted by merging nodes whose associated clause is identical to one in Inline graphic. Note that before the compactation the graph is acyclic, but the compactation may introduce cycles. The set of in-neighbors and out-neighbors of node Inline graphic are denoted Inline graphic and Inline graphic, respectively.

A flow assignment for a circular pre-proof is an assignment Inline graphic of positive reals to inference nodes. The balance of node Inline graphic is the inflow minus the outflow,

graphic file with name M271.gif

Definition 1

A SAT circular proof of clause A from CNF formula Inline graphic is a pre-proof Inline graphic whose proof-graph Inline graphic admits a flow in which all clauses not in Inline graphic have non-negative balance and Inline graphic has a strictly positive balance.

Theorem 9 (soundness)

Assuming a sound SAT proof system, if there is a SAT circular proof of A from formula Inline graphic then Inline graphic.

Property 5

Using the proof system with the following two rules,

graphic file with name 495779_1_En_16_Equ21_HTML.gif

there is a short circular refutation of Inline graphic.

ResSV and MaxSAT Circular Proofs

Now we show that the MaxSAT ResSV proof system is a true extension of circular proofs from SAT to MaxSAT. The following two theorems show that, when restricted to hard formulas, ResSV and SAT circular proofs can simulate each other. Recall that specializing Corollary 1 to hard formulas, Inline graphic and Inline graphic is equivalent. Therefore, one can show Inline graphic with a proof Inline graphic.

Theorem 10

Let Inline graphic be a SAT circular proof of clause A from formula Inline graphic using the proof system symmetric resolution and split. There is a MaxSAT ResSV proof of Inline graphic from Inline graphic. The length of the proof is Inline graphic.

Proof

Let Inline graphic be the proof graph and and Inline graphic the flow of Inline graphic. By definition of SAT circular proof, Inline graphic and Inline graphic.

The ResSV proof starts with Inline graphic and consists in 3 phases. In the first phase, the virtual rule is applied for each Inline graphic not in Inline graphic, introducing Inline graphic with Inline graphic. In the second phase, there is an inference step for each Inline graphic. If u is a SAT split, the inference step is a MaxSAT split generating two clauses with weight f(u). If u is a SAT symmetric resolution, the inference step is a MaxSAT resolution generating one clause with weight f(u). Note that this phase never creates new clauses because all of them have been virtually added at the first phase. It only moves weights around the existing ones. Note as well that we guarantee by construction that at each step of the proof the antecedents are available no matter in which order the proof is done because the first phase has given enough weight to each added clause to guarantee it and original clauses are hard, so their weight never decreases. At the end of the second phase we have Inline graphic with Inline graphic with b(C) being the balance of C. Therefore (Ab(A)) is in Inline graphic. The third phase is a final sequence of q steps in which Inline graphic is derived from Inline graphic which completes the proof. Note that the size of the proof is Inline graphic.

Theorem 11

Consider a hard formula Inline graphic and a ResSV proof Inline graphic with e inference steps. There is a SAT circular proof Inline graphic of A from Inline graphic with proof system symmetric resolution and split. Besides, Inline graphic.

Proof

We need to build a graph Inline graphic with Inline graphic and Inline graphic, and a flow Inline graphic that satisfies the balance conditions and with which A has strictly positive balance.

Because the virtual rule does not have antecedents all its applications can be done at the beginning of the proof and all the cancellation of all the virtual clauses can be done at the end. Therefore, we can omit all those inference steps and assume without loss of generality that the proof is a ResS (that is, without virtual),

graphic file with name M315.gif

where Inline graphic is the set of virtually added clauses. Note as well that any application of MaxSAT resolution between Inline graphic and Inline graphic can be simulated by a short sequence of splits to both clauses until their scope is the same and then one resolution step between Inline graphic and Inline graphic. So, again without loss of generality we can assume that the proof only contains splits and symmetric resolutions.

Our proof contains three phases. First, we are going to build an acyclic graph Inline graphic which is an unfolded version of Inline graphic and a flow function Inline graphic that may have Inline graphic flows. Second we will compute Inline graphic traversing the graph Inline graphic bottom-up and replacing any infinite flow in Inline graphic by a finite one that still guarantees the flow condition. In the third and final phase, we will compact the graph which will constitute the circular proof.

Phase 1:

We build Inline graphic by following the proof step by step. Let Inline graphic be the graph associated to proof step i. We define the front of Inline graphic as the set of clause nodes in Inline graphic with strictly positive balance. By construction of Inline graphic we will guarantee a connection between the current formula Inline graphic and the front of the current graph Inline graphic

graphic file with name M335.gif

where we define Inline graphic.

Inline graphic contains one clause node for each clause in Inline graphic, Inline graphic and Inline graphic, respectively. For each clause node there is one dummy inference node pointing to it. The flow Inline graphic of the inference node is the weight of the clause it points to. This set of dummy inference nodes will be removed at step three. Then we proceed through the proof. At inference step i, we add a new inference node i to I. Its in-neighbors will be nodes from the front (that must exist because of the invariant) and its out-neighbors will be newly added clause nodes. Its flow Inline graphic is the weight moved by the inference rule (which may be infinite). If the inference rule is split we add two clause nodes, one for each consequent and add the corresponding arcs. If the inference rule is a resolution we add one clause node for its consequent and add the corresponding arcs. Note that, the out-neighbors of node i have a positive balance and in-neighbors of i have their out-flow decreased, but cannot turn negative. Finally, we merge any pair of nodes in the front of Inline graphic whose associated clause is the same (which preserves the property of balances being non-negative). Graph Inline graphic is obtained after processing the last inference step. Note that the invariant guarantees that Inline graphic is in Inline graphic and its balance is 1.

Phase 2:

Now we traverse the inference nodes of Inline graphic in the reverse order of how they were added transforming infinite flows into finite. When considering node i, because of the traversing order, we know that every Inline graphic has finite out-going flow. We compute the flow value f(i) as follows: if Inline graphic is finite, then Inline graphic, else f(i) is the minimum value that guarantees that the balance of every Inline graphic is non-negative.

Phase 3:

We obtain G by doing some final arrangements to Inline graphic. First, we remove dummy inference nodes pointing to clauses in Inline graphic, Inline graphic and Inline graphic added in Phase 1. As a result, the balance of these nodes is negative. In particular, the balance of nodes representing Inline graphic and Inline graphic is its negative weight.

Since Inline graphic, we know that all nodes representing Inline graphic are included in the front of Inline graphic with balance greater than or equal to its weight. We compact these nodes with the ones in Inline graphic and, as a result, its balance is positive.

Finally, we add some split nodes with flow 1 from node Inline graphic (recall that Inline graphic) in order to generate A and Inline graphic, and we compact the latter ones with the ones in Inline graphic. As a result, the balance of A is 1 and the balance of Inline graphic nodes is positive.

Conclusions

This paper constitutes a first attempt towards MaxSAT resolution-based proof complexity analysis. We have provided some basic definitions and results emphasizing the similarities and differences with respect to SAT. In particular, we have shown that MaxSAT entailment can be rephrased as a MaxSAT refutation problem and, as a consequence, refutation completeness is sufficient for practical purposes. Interestingly, when such rephrasing is applied to hard formulas it transforms a SAT query into a MaxSAT one, and such transformation turns out to be relevant in our analysis of SAT circular proofs.

We have also provided three basic inference MaxSAT rules used in resolution-based proof systems (e.g. resolution, split and virtual) and have analysed their incremental effect in terms of refutation power. Finally, we have related ResSV, the strongest of the proof systems considered, with the recently proposed concept of circular proofs. We have shown that ResSV generalizes SAT circular proofs as defined in [3].

An additional contribution of the paper is to put together under a formal framework and common notation some ideas spread around in different recent papers such as [3, 7, 10].

Footnotes

Projects TIN2015-69175-C4-3-R and RTI2018-094403-B-C33, funded by: FEDER/Ministerio de Ciencia e Innovación − Agencia Estatal de Investigación, Spain.

Contributor Information

Luca Pulina, Email: lpulina@uniss.it.

Martina Seidl, Email: martina.seidl@jku.at.

Javier Larrosa, Email: larrosa@cs.upc.edu, http://www.cs.upc.edu.

Emma Rollon, Email: erollon@cs.upc.edu.

References

  • 1.Ansótegui C, Bonet ML, Levy J. SAT-based maxsat algorithms. Artif. Intell. 2013;196:77–105. doi: 10.1016/j.artint.2013.01.002. [DOI] [Google Scholar]
  • 2.Atserias A. On sufficient conditions for unsatisfiability of random formulas. J. ACM. 2004;51(2):281–311. doi: 10.1145/972639.972645. [DOI] [Google Scholar]
  • 3.Atserias A, Lauria M. Circular (yet sound) proofs. In: Janota M, Lynce I, editors. Theory and Applications of Satisfiability Testing – SAT 2019; Cham: Springer; 2019. pp. 1–18. [Google Scholar]
  • 4.Bacchus, F., Hyttinen, A., Järvisalo, M., Saikko, P.: Reduced cost fixing for maximum satisfiability. In: Lang, J. (ed.) Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI 2018), 13–19 July (2018), Stockholm, Sweden, pp. 5209–5213 (2018). ijcai.org. 10.24963/ijcai.2018/723
  • 5.Biere A, Heule M, van Maaren H, Walsh T, editors. Handbook of Satisfiability, Frontiers in Artificial Intelligence and Applications. Amsterdam: IOS Press; 2009. [Google Scholar]
  • 6.Bonet ML, Levy J, Manyà F. Resolution for Max-Sat. Artif. Intell. 2007;171(8–9):606–618. doi: 10.1016/j.artint.2007.03.001. [DOI] [Google Scholar]
  • 7.Ignatiev A, Morgado A, Marques-Silva J. On tackling the limits of resolution in SAT solving. In: Gaspers S, Walsh T, editors. Theory and Applications of Satisfiability Testing – SAT 2017; Cham: Springer; 2017. pp. 164–183. [Google Scholar]
  • 8.Larrosa, J., Heras, F.: Resolution in Max-Sat and its relation to local consistency in weighted CSPs. In: Kaelbling, L.P., Saffiotti, A. (eds.) IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, 30 July–5 August 2005, pp. 193–198. Professional Book Center (2005). http://ijcai.org/Proceedings/05/Papers/0360.pdf
  • 9.Larrosa J, Heras F, de Givry S. A logical approach to efficient Max-SAT solving. Artif. Intell. 2008;172(2–3):204–233. doi: 10.1016/j.artint.2007.05.006. [DOI] [Google Scholar]
  • 10.Larrosa, J., Rollon, E.: Augmenting the power of (partial) maxsat resolution with extension. In: Proceedings of the Thirty-third AAAI Conference on Artificial Intelligence (AAAI 2020), New York, New York, USA, 7–12 February 2020
  • 11.Morgado A, Heras F, Liffiton MH, Planes J, Marques-Silva J. Iterative and core-guided MaxSAT solving: a survey and assessment. Constraints. 2013;18(4):478–534. doi: 10.1007/s10601-013-9146-2. [DOI] [Google Scholar]
  • 12.Robinson JA. A machine-oriented logic based on the resolution principle. J. ACM. 1965;12(1):23–41. doi: 10.1145/321250.321253. [DOI] [Google Scholar]

Articles from Theory and Applications of Satisfiability Testing – SAT 2020 are provided here courtesy of Nature Publishing Group

RESOURCES