Strong Extension-Free Proof Systems

Marijn J H Heule; Benjamin Kiesl; Armin Biere

doi:10.1007/s10817-019-09516-0

. 2019 Feb 22;64(3):533–554. doi: 10.1007/s10817-019-09516-0

Strong Extension-Free Proof Systems

Marijn J H Heule ¹, Benjamin Kiesl ^2,³, Armin Biere ^4,^✉

PMCID: PMC7089731 PMID: 32226181

Abstract

We introduce proof systems for propositional logic that admit short proofs of hard formulas as well as the succinct expression of most techniques used by modern SAT solvers. Our proof systems allow the derivation of clauses that are not necessarily implied, but which are redundant in the sense that their addition preserves satisfiability. To guarantee that these added clauses are redundant, we consider various efficiently decidable redundancy criteria which we obtain by first characterizing clause redundancy in terms of a semantic implication relationship and then restricting this relationship so that it becomes decidable in polynomial time. As the restricted implication relation is based on unit propagation—a core technique of SAT solvers—it allows efficient proof checking too. The resulting proof systems are surprisingly strong, even without the introduction of new variables—a key feature of short proofs presented in the proof-complexity literature. We demonstrate the strength of our proof systems on the famous pigeon hole formulas by providing short clausal proofs without new variables.

Keywords: SAT, Propositional proof systems, Proof complexity, Proof checking, Pigeon hole problem, Extended resolution, Clause redundancy

Introduction

Satisfiability (SAT) solvers are used to determine the correctness of hardware and software systems [4, 16]. It is therefore crucial that these solvers justify their claims by providing proofs that can be independently verified. This holds also for various other applications that use SAT solvers. Just recently, long-standing mathematical problems were solved using SAT, including the Erdős Discrepancy Problem [21], the Pythagorean Triples Problem [13], and the computation of the fifth Schur number [10]. Especially in such cases, proofs are at the center of attention and without them the result of a solver is almost worthless. What the mathematical problems and the industrial applications have in common is that proofs are often of considerable size—about 200 terabytes in the case of the Pythagorean Triples Problem and even two petabytes for the fifth Schur number. As the size of proofs is influenced by the strength of their underlying proof system, the search for shorter proofs goes hand in hand with the search for stronger proof systems.

In this article, we introduce highly expressive clausal proof systems that can capture most of the techniques used by modern SAT solvers. Informally, a clausal proof system allows the addition of redundant clauses to a formula in conjunctive normal form (CNF). Here, a clause is considered redundant if its addition preserves satisfiability. If the repeated addition of clauses allows us to eventually add the empty clause—which is, by definition, unsatisfiable—then the unsatisfiability of the original formula has been established.

Since the redundancy of clauses is not efficiently decidable in general, practical proof systems only allow the addition of a clause if it fulfills some efficiently decidable criterion that ensures redundancy. For instance, the popular $DRAT$ proof system [30], which is the de-facto standard in practical SAT solving, only allows the addition of so-called resolution asymmetric tautologies [18]. Given a formula and a clause, it can be decided in polynomial time whether the clause is a resolution asymmetric tautology with respect to the formula and therefore the soundness of $DRAT$ proofs can be checked efficiently.

We present various new redundancy criteria by introducing a characterization of clause redundancy based on a semantic implication relationship between formulas. By replacing the implication relation in this characterization with restricted notions of implication that are computable in polynomial time, we then obtain powerful redundancy criteria that are still efficiently decidable. These redundancy criteria not only generalize earlier ones such as resolution asymmetric tautologies [18] or set-blocked clauses [19], but they are also closely related to other concepts from the literature, including autarkies [24], safe assignments [29], variable instantiation [1], and symmetry breaking [6].

Proof systems based on our new redundancy criteria turn out to be highly expressive, even without allowing the introduction of new variables. This is in contrast to resolution, which is considered relatively weak as long as the introduction of new variables via definitions—as in the stronger proof system of extended resolution [9, 26]—is not allowed. The introduction of new variables, however, has a major drawback—the search space of variables and clauses we could possibly add to a proof is clearly exponential. Finding useful clauses with new variables is therefore hard in practice and resulted only in limited success in the past [2, 23].

We illustrate the strength of our strongest proof system by providing short clausal proofs for the famous pigeon hole formulas without introducing new variables. The size of the proofs is linear in the size of the formulas and the clauses added in the proof contain at most two literals. In these proofs, we add redundant clauses that are similar in nature to symmetry-breaking predicates [6, 7]. To verify the correctness of proofs in our new system, we implemented a proof checker. The checker is built on top of DRAT-trim [30], the checker used to validate the unsatisfiability results of the recent SAT competitions [3]. We compare our proofs with existing proofs of the pigeon hole formulas in other proof systems and show that our new proofs are much smaller and cheaper to validate.

This invited article is an extended version of our CADE’17 best paper [14]. Apart from several small improvements throughout the article, we extended the conference version by adding Sect. 7, which describes further interesting properties of the redundant clauses introduced in this article. We also included a new discussion of open problems in Sect. 9.

Preliminaries

We consider propositional formulas in conjunctive normal form (CNF), which are defined as follows. A literal is either a variable x (a positive literal) or the negation $\bar{x}$ of a variable x (a negative literal). The complement $\bar{l}$ of a literal l is defined as $\bar{l} = \bar{x}$ if $l = x$ and $\bar{l} = x$ if $l = \bar{x}$ . For a literal l, we denote the variable of l by $v a r (l)$ . A clause is a disjunction of literals. If not stated otherwise, we assume that clauses do not contain complementary literals, i.e., a literal and its complement. A formula is a conjunction of clauses. Clauses can be viewed as sets of literals and formulas as sets of clauses. For a set L of literals and a formula F, we define $F_{L} = {C \in F ∣ C \cap L \neq \emptyset}$ . We sometimes write $F_{l}$ to denote $F_{{l}}$ .

An assignment is a function from a set of variables to the truth values 1 (true) and 0 (false). An assignment is total with respect to a formula if it assigns a truth value to all variables occurring in the formula, otherwise it is partial. We often denote assignments by the sequences of literals they satisfy. For instance, $x \bar{y}$ denotes the assignment that makes x true and y false. We denote the domain of an assignment $α$ by $v a r (α)$ . A literal l is satisfied by an assignment $α$ if l is positive and $α (v a r (l)) = 1$ or if it is negative and $α (v a r (l)) = 0$ . A literal is falsified by an assignment if its complement is satisfied by the assignment. A clause is satisfied by an assignment $α$ if it contains a literal that is satisfied by $α$ . Finally, a formula is satisfied by an assignment $α$ if all its clauses are satisfied by $α$ . A formula is satisfiable if there exists an assignment that satisfies it. Two formulas are logically equivalent if they are satisfied by the same total assignments; they are satisfiability equivalent if they are either both satisfiable or both unsatisfiable.

We denote the empty clause by $⊥$ and the satisfied clause by $⊤$ . Given an assignment $α$ and a clause C, we define $C | α = ⊤$ if $α$ satisfies C, otherwise $C | α$ denotes the result of removing from C all the literals falsified by $α$ . Moreover, for a formula F, we define $F | α = {C | α ∣ C \in F and C | α \neq ⊤}$ . We say that a clause Cblocks an assignment $α$ if $C = {x ∣ α (x) = 0} \cup {\bar{x} ∣ α (x) = 1}$ . A unit clause is a clause that contains only one literal. The result of applying the unit-clause rule to a formula F is the formula $F | α$ with $α$ being an assignment that satisfies a unit clause in F. The iterated application of the unit-clause rule to a formula, until no unit clauses are left, is called unit propagation. If unit propagation on a formula F yields the empty clause $⊥$ , we say that it derived a conflict on F. For example, unit propagation derives a conflict on $F = (\bar{x} \lor y) \land (\bar{y}) \land (x)$ since $F | x = (y) \land (\bar{y})$ and $F | x y = ⊥$ .

By $F ⊨ F^{'}$ , we denote that F implies $F^{'}$ , i.e., every assignment that satisfies F and assigns all variables in $v a r (F^{'})$ also satisfies $F^{'}$ . Furthermore, by $F ⊢_{^{_{1}}} F^{'}$ we denote that for every clause $(l_{1} \lor \dots \lor l_{k}) \in F^{'}$ , unit propagation derives a conflict on $F \land ({\bar{l}}_{1}) \land \dots \land ({\bar{l}}_{k})$ . If $F ⊢_{^{_{1}}} F^{'}$ , we say that Fimplies $F^{'}$ via unit propagation. As an example, $(x) \land (y) ⊢_{^{_{1}}} (x \lor z) \land (y)$ , since unit propagation derives a conflict on both $(x) \land (y) \land (\bar{x}) \land (z)$ and $(x) \land (y) \land (\bar{y})$ . Similarly, $F ⊢_{^{_{0}}} F^{'}$ denotes that every clause in $F^{'}$ is subsumed by (i.e., is a superset of) a clause in F. Observe that $F \supseteq F^{'}$ implies $F ⊢_{^{_{0}}} F^{'}$ , $F ⊢_{^{_{0}}} F^{'}$ implies $F ⊢_{^{_{1}}} F^{'}$ , and $F ⊢_{^{_{1}}} F^{'}$ implies $F ⊨ F^{'}$ .

Clause Redundancy and Clausal Proofs

In the following, we introduce a formal notion of clause redundancy and demonstrate how it provides the basis for clausal proof systems. We start by introducing clause redundancy [19]:

Definition 1

A clause C is redundant with respect to a formula F if F and $F \land C$ are satisfiability equivalent.

For instance, the clause $C = x \lor y$ is redundant with respect to the formula $F = (\bar{x} \lor \bar{y})$ since F and $F \land C$ are satisfiability equivalent (although they are not logically equivalent). This redundancy notion allows us to add redundant clauses to a formula without affecting its satisfiability and so it provides the basis for so-called clausal proof systems.

In general, given a formula $F = {C_{1}, \dots, C_{m}}$ , a clausal derivation of a clause $C_{n}$ from F is a sequence $(C_{m + 1}, ω_{m + 1}), \dots, (C_{n}, ω_{n})$ of pairs where $C_{i}$ is a clause and $ω_{i}$ , called the witness, is a string (for all $i > m$ ). Such a sequence gives rise to formulas $F_{m}, F_{m + 1}, \dots, F_{n}$ , where $F_{i} = {C_{1}, \dots, C_{i}}$ . We call $F_{i}$ the accumulated formula corresponding to the ith proof step. A clausal derivation is correct if every clause $C_{i}$ ( $i > m$ ) is redundant with respect to the formula $F_{i - 1}$ and if this redundancy can be checked in polynomial time (with respect to the size of the proof) using the witness $ω_{i}$ . A clausal derivation is a (refutation) proof of a formula F if it derives the empty clause, i.e., if $C_{n} = ⊥$ . Clearly, since every clause-addition step preserves satisfiability, and since the empty clause is unsatisfiable, a refutation proof of F certifies the unsatisfiability of F.

By specifying in detail what kind of redundant clauses—and corresponding witnesses—can be added to a clausal derivation, we obtain concrete proof systems. This is usually done by defining an efficiently checkable syntactic criterion that guarantees that clauses fulfilling this criterion are redundant. A popular example for a clausal proof system is $DRAT$ [30], the de-facto standard for unsatisfiability proofs in practical SAT solving. $DRAT$ allows the addition of a clause if it is a so-called resolution asymmetric tautology [18] ( $RAT$ , defined in the next section). As it can be efficiently checked whether a clause is a $RAT$ with respect to a formula, and since $RAT$ s cover many types of redundant clauses, the $DRAT$ proof system is very powerful.

The strength of a clausal proof system depends on the generality of the underlying redundancy criterion. We say that a redundancy criterion $R_{1}$ is more general than a redundancy criterion $R_{2}$ if, whenever $R_{2}$ identifies a clause C as redundant with respect to a formula F, then $R_{1}$ also identifies C as redundant with respect to F. For instance, whenever a clause is subsumed in some formula, it is a $RAT$ with respect to that formula. Therefore, the $RAT$ redundancy criterion is more general than the subsumption criterion. In the next section, we develop redundancy criteria that are even more general than $RAT$ , thus giving rise to proof systems that are stronger than $DRAT$ .

Clause Redundancy via Implication

In the following, we introduce a characterization of clause redundancy that reduces the question whether a clause is redundant with respect to a certain formula to a simple question of implication. The advantage of this is that we can replace the logical implication relation by polynomially decidable implication relations to derive powerful redundancy criteria that are still efficiently checkable. These redundancy criteria can then be used to obtain highly expressive clausal proof systems.

Our characterization is based on the observation that a clause in a CNF formula can be seen as a constraint that blocks those assignments that falsify the clause. Therefore, a clause can be safely added to a formula if it does not constrain the formula too much. What we mean by this is that after adding the clause, there should still exist other assignments (i.e., assignments not blocked by the clause) under which the formula is at least as satisfiable as under the assignments blocked by the clause. Consider the following example:

Example 1

Let $F = (x \lor y) \land (x \lor z) \land (\bar{x} \lor y \lor z)$ and consider the (unit) clause $C = x$ which blocks all assignments that falsify x. The addition of C to F does not affect satisfiability: Let $α = \bar{x}$ and $ω = x$ . Then, $F | α = (y) \land (z)$ while $F | ω = (y \lor z)$ . Clearly, every satisfying assignment of $F | α$ is also a satisfying assignment of $F | ω$ , i.e., $F | α ⊨ F | ω$ . Thus, F is at least as satisfiable under $ω$ as it is under $α$ . Moreover, $ω$ satisfies C. The addition of C does therefore not affect the satisfiability of F. $□$

This motivates our new characterization of clause redundancy presented next. The characterization requires the existence of an assignment that satisfies the clause and so it is only applicable to non-empty clauses. Note that for a given clause C, “the assignment $α$ blocked by C”, as defined above in Sect. 2, is in general a partial assignment and thus C actually rules out all assignments that extend $α$ :

Theorem 1

Let F be a formula, C a non-empty clause, and $α$ the assignment blocked by C. Then, C is redundant with respect to F if and only if there exists an assignment $ω$ such that $ω$ satisfies C and $F | α ⊨ F | ω$ .

Proof

For the “only if” direction, assume that F and $F \land C$ are satisfiability equivalent. If $F | α$ is unsatisfiable, then $F | α ⊨ F | ω$ for every $ω$ , hence the statement trivially holds. Assume now that $F | α$ is satisfiable, implying that F is satisfiable. Then, since F and $F \land C$ are satisfiability equivalent, there exists an assignment $ω$ that satisfies both F and C. Thus, since $ω$ satisfies F, it holds that $F | ω = \emptyset$ and so $F | α ⊨ F | ω$ .

For the “if” direction, assume that there exists an assignment $ω$ such that $ω$ satisfies C and $F | α ⊨ F | ω$ . Now, let $γ$ be a (total) assignment that satisfies F and falsifies C. We show how $γ$ can be turned into a satisfying assignment $γ^{'}$ of $F \land C$ . As $γ$ falsifies C, it coincides with $α$ on $v a r (α)$ . Therefore, since $γ$ satisfies F, it must satisfy $F | α$ and since $F | α ⊨ F | ω$ it must also satisfy $F | ω$ . Now, consider the following assignment:

\begin{matrix} γ^{'} (x) = \{\begin{matrix} ω (x) & if x \in v a r (ω), \\ γ (x) & otherwise. \end{matrix}) \end{matrix}

Clearly, since $ω$ satisfies C, $γ^{'}$ also satisfies C. Moreover, as $γ$ satisfies $F | ω$ and $v a r (F | ω) \subseteq v a r (γ) \ v a r (ω)$ , $γ^{'}$ satisfies F. Hence, $γ^{'}$ satisfies $F \land C$ . $□$

This alternative characterization of redundancy allows us to replace the logical implication relation by restricted implication relations that are polynomially decidable. For instance, we can replace the condition $F | α ⊨ F | ω$ by the restricted condition $F | α ⊢_{^{_{1}}} F | ω$ (likewise, we could also use relations such as “ $⊢_{^{_{0}}}$ ” or “ $\supseteq$ ” instead of “ $⊢_{^{_{1}}}$ ”). Now, if we are given a clause C—which implicitly gives us the blocked assignment $α$ —and a witnessing assignment $ω$ , then we can check in polynomial time whether $F | α ⊢_{^{_{1}}} F | ω$ , which is a sufficient condition for the redundancy of C with respect to F. We can therefore use this implication-based redundancy notion to define proof systems. The witnessing assignments can then be used as witnesses in the proof.

In the following, we use the propagation-implication relation “ $⊢_{^{_{1}}}$ ” to define the redundancy criteria of graphic file with name 10817_2019_9516_Figa_HTML.jpg Basically, the three notions differ in the way we allow the witnessing assignment $ω$ to differ from the assignment $α$ blocked by a clause. The more freedom we give to $ω$ , the more general the redundancy notion we obtain. We show that $LPR$ clauses—the least general of the three—coincide with $RAT$ . For the more general $SPR$ clauses, we show that they generalize set-blocked clauses ( $SBC$ ) [19], which is not the case for $LPR$ clauses. Finally, $PR$ clauses are the most general ones. They give rise to an extremely powerful proof system. The new landscape of redundancy notions we thereby obtain is illustrated in Fig. 1. In the figure, $RUP$ stands for the redundancy notion based on reverse unit propagation [8, 28], $S$ stands for subsumed clauses, $RS$ for clauses with subsumed resolvents [18], and $BC$ for blocked clauses [17, 22].

Fig. 1 — Landscape of redundancy notions of non-empty clauses. $R$ denotes all redundant clauses and $IMP$ stands for implied clauses. A path from X to Y indicates that X is more general than Y. The asterisk ( $^{*}$ ) denotes that the exact characterization implies the shown one, e.g., for every set-blocked clause, the property $F | α \supseteq F | α_{L}$ holds, but not vice versa

As we will see, when defining proof systems based on $LPR$ (e.g., the $DRAT$ system) or $SPR$ clauses, we do not need to explicitly add the redundancy witnesses (i.e., the witnessing assignments $ω$ ) to a proof. Thus, $LPR$ and $SPR$ proofs can just be seen as a sequence of clauses. In particular, a proof system based on $SPR$ clauses can have the same syntax as $DRAT$ proofs, which makes it “downwards compatible”. This is in contrast to proof systems based on $PR$ clauses, where in general witnessing assignments have to be added to a proof. Otherwise redundancy of a clause can not be checked in polynomial time.

We start by introducing $LPR$ clauses. In the following, given a (partial) assignment $α$ and a set L of literals, we denote by $α_{L}$ the assignment obtained from $α$ by making all literals in L true. If L contains only a single literal, we sometimes write $α_{l}$ to denote $α_{{l}}$ . In the conference paper [14], we used a slightly different definition, saying that $α_{L}$ is obtained from $α$ by flipping the truth values of all literals in L. Since we only defined $α_{L}$ for assignments $α$ that falsify all the literals in L, nothing changes. We do, however, believe that the new notion is more intuitive.

Definition 2

Let F be a formula, C a clause, and $α$ the assignment blocked by C. Then, C is literal-propagation redundant ( $LPR$ ) with respect to F if there exists a literal $l \in C$ such that $F | α ⊢_{^{_{1}}} F | α_{l}$ .

Example 2

Let $F = (x \lor y) \land (x \lor \bar{y} \lor z) \land (\bar{x} \lor z)$ and let C be the unit clause x. Then, $α = \bar{x}$ is the assignment blocked by C, and $α_{x} = x$ . Now, consider $F | α = (y) \land (\bar{y} \lor z)$ and $F | α_{x} = (z)$ . Clearly, $F | α ⊢_{^{_{1}}} F | α_{x}$ and therefore C is literal-propagation redundant with respect to F. $□$

The $LPR$ definition is quite restrictive since it requires the witnessing assignment $α_{l}$ to disagree with $α$ on exactly one variable. Nevertheless, this already suffices for $LPR$ clauses to coincide with $RAT$ s [18]:

Definition 3

Let F be a formula and C a clause. Then, C is a resolution asymmetric tautology ( $RAT$ ) with respect to F if there exists a literal $l \in C$ such that, for every clause $D \in F_{\bar{l}}$ , $F ⊢_{^{_{1}}} C \cup (D \ {\bar{l}})$ .

Theorem 2

A clause C is $LPR$ with respect to a formula F if and only if it is a $RAT$ with respect to F.

Proof

For the “only if” direction, assume that C is $LPR$ with respect to F, i.e., C contains a literal l such that $F | α ⊢_{^{_{1}}} F | α_{l}$ . Now, let $D \in F_{\bar{l}}$ . We have to show that $F ⊢_{^{_{1}}} C \cup (D \ {\bar{l}})$ . First, note that $F | α$ is exactly the result of propagating the negated literals of C on F, i.e., applying the unit-clause rule with the negated literals of C but not performing further propagations. Moreover, since $α_{l}$ falsifies $\bar{l}$ , it follows that $D | α_{l} \subseteq (D \ {\bar{l}})$ . But then, since $F | α ⊢_{^{_{1}}} D | α_{l}$ , it must hold that $F ⊢_{^{_{1}}} C \cup (D \ {\bar{l}})$ , hence C is a $RAT$ with respect to F.

For the “if” direction, assume that C is a $RAT$ with respect to F, i.e., C contains a literal l such that, for every clause $D \in F_{\bar{l}}$ , $F ⊢_{^{_{1}}} C \cup (D \ {\bar{l}})$ . Now, let $D | α_{l} \in F | α_{l}$ for $D \in F$ . We have to show that $F | α ⊢_{^{_{1}}} D | α_{l}$ . Since $α_{l}$ satisfies l and $α$ falsifies C, D does neither contain l nor any negations of literals in C except for possibly $\bar{l}$ . If D does not contain $\bar{l}$ , then $D | α = D | α_{l}$ is contained in $F | α$ and hence the claim immediately follows.

Assume therefore that $\bar{l} \in D$ . As argued for the other direction, propagating the negated literals of C (and no other literals) on F yields $F | α$ . Therefore, since $F ⊢_{^{_{1}}} C \cup (D \ {\bar{l}})$ and $D \ {\bar{l}}$ does not contain any negations of literals in C (which could otherwise be the reason for a unit propagation conflict that only happens because of C containing a literal whose negation is contained in $D \ {\bar{l}}$ ), it must be the case that $F | α ⊢_{^{_{1}}} D \ {\bar{l}}$ . Now, the only literals of $D \ {\bar{l}}$ that are not contained in $D | α_{l}$ are the ones falsified by $α$ , but those are anyhow not contained in $F | α$ . Hence, $F | α ⊢_{^{_{1}}} D | α_{l}$ and thus C is $LPR$ with respect to F. $□$

By allowing the witnessing assignments to disagree with $α$ on more than only one literal, we obtain the more general notion of set-propagation-redundant clauses, which we introduce next. In the following, for a set L of literals, we define $\bar{L} = {\bar{l} ∣ l \in L}$ .

Definition 4

Let F be a formula, C a clause, and $α$ the assignment blocked by C. Then, C is set-propagation redundant ( $SPR$ ) with respect to F if there exists a non-empty set $L \subseteq C$ of literals such that $F | α ⊢_{^{_{1}}} F | α_{L}$ .

Example 3

Let $F = (x \lor y) \land (x \lor \bar{y} \lor z) \land (\bar{x} \lor z) \land (\bar{x} \lor u) \land (\bar{u} \lor x)$ , $C = x \lor u$ , and $L = {x, u}$ . Then, $α = \bar{x} \bar{u}$ is the assignment blocked by C, and $α_{L} = x u$ . Now, consider $F | α = (y) \land (\bar{y} \lor z)$ and $F | α_{L} = (z)$ . Clearly, $F | α ⊢_{^{_{1}}} F | α_{L}$ and so C is set-propagation redundant with respect to F. Note also that C is not literal-propagation redundant with respect to F.

Since L is a subset of C, we do not need to add it (or the assignment $α_{L}$ ) explicitly to an $SPR$ proof. By requiring that L must consist of the first literals of C when adding C to a proof (viewing a clause as a sequence of literals), we can ensure that the $SPR$ property is efficiently decidable. For instance, when a proof contains the clause $l_{1} \lor \dots \lor l_{k}$ , we first check whether the $SPR$ property holds under the assumption that $L = {l_{1}}$ . If not, we proceed by assuming that $L = {l_{1}, l_{2}}$ , and so on until $L = {l_{1}, \dots, l_{k}}$ . Thereby, only linearly many candidates for L need to be checked. In contrast to $LPR$ clauses and $RAT$ s, the notion of $SPR$ clauses generalizes set-blocked clauses [19]:

Definition 5

A clause C is set-blocked ( $SBC$ ) by a non-empty set $L \subseteq C$ in a formula F if, for every clause $D \in F_{\bar{L}}$ , the clause $(C \ L) \cup \bar{L} \cup D$ contains two complementary literals.

To show that set-propagation-redundant clauses generalize set-blocked clauses, we first characterize them as follows:

Lemma 3

Let F be a clause, C a formula, $L \subseteq C$ a non-empty set of literals, and $α$ the assignment blocked by C. Then, C is set-blocked by L in F if and only if, for every $D \in F$ , $D | α = ⊤$ implies $D | α_{L} = ⊤$ .

Proof

For the “only if” direction, assume that there exists a clause $D \in F$ such that $D | α = ⊤$ but $D | α_{L} \neq ⊤$ . Then, since $α$ and $α_{L}$ disagree only on literals in L, it follows that D contains a literal $l \in \bar{L}$ and thus $D \in F_{\bar{L}}$ . Now, $α_{L}$ falsifies exactly the literals in $(C \ L) \cup \bar{L}$ and since it does not satisfy any of the literals in D, it follows that there exists no literal $l \in D$ such that its complement $\bar{l}$ is contained in $(C \ L) \cup \bar{L}$ . Therefore, C is not $SBC$ by L in F.

For the “if” direction, assume that C is not $SBC$ by L in F, i.e., there exists a clause $D \in F_{\bar{L}}$ such that $(C \ L) \cup \bar{L} \cup D$ does not contain complementary literals. Now, $D | α = ⊤$ since $α$ falsifies L and $D \cap \bar{L} \neq \emptyset$ . Since D contains no literal l such that $\bar{l} \in (C \ L) \cup \bar{L}$ and since $α_{L}$ falsifies exactly the literals in $(C \ L) \cup \bar{L}$ , it follows that $α_{L}$ does not satisfy D, hence $D | α_{L} \neq ⊤$ . $□$

Theorem 4

If a clause C is set-blocked by a set L in a formula F, it is set-propagation redundant with respect to F.

Proof

Assume that C is set-blocked by L in F. We show that $F | α \supseteq F | α_{L}$ , which implies that $F | α ⊢_{^{_{1}}} F | α_{L}$ , and therefore that C is set-propagation redundant with respect to F. Let $D | α_{L} \in F | α_{L}$ . First, note that D cannot be contained in $F_{L}$ , for otherwise $D | α_{L} = ⊤$ and thus $D | α_{L} \notin F | α_{L}$ . Second, observe that D can also not be contained in $F_{\bar{L}}$ , since that would imply that $D | α = ⊤$ and thus, by Lemma 3, $D | α_{L} = ⊤$ . Therefore, $D \notin F_{L} \cup F_{\bar{L}}$ and so $D | α = D | α_{L}$ . But then, $D | α_{L} \in F | α$ . It follows that $F | α \supseteq F | α_{L}$ . $□$

We thus know that set-propagation-redundant clauses generalize both resolution asymmetric tautologies and set-blocked clauses. As there are resolution asymmetric tautologies that are not set-blocked (and vice versa) [19], it follows that set-propagation-redundant clauses are actually a strict generalization of these two kinds of clauses.

Note that $F | α ⊢_{^{_{1}}} F | α_{L}$ is equivalent to $F | α ⊢_{^{_{1}}} F_{\bar{L}} | α_{L}$ . To see this, observe that if a clause $D | α_{L} \in F | α_{L}$ contains no literals from $\bar{L}$ , then $α_{L}$ does not assign any of its literals, in which case $D | α_{L}$ is also contained in $F | α$ . We therefore do not need to check for every $D | α_{L} \in F | α_{L}$ whether $F | α ⊢_{^{_{1}}} D | α$ .

By giving practically full freedom to the witnessing assignments, i.e., by only requiring them to satisfy C, we finally arrive at propagation-redundant clauses, the most general of the three redundancy notions:

Definition 6

Let F be a formula, C a clause, and $α$ the assignment blocked by C. Then, C is propagation redundant ( $PR$ ) with respect to F if there exists an assignment $ω$ such that $ω$ satisfies C and $F | α ⊢_{^{_{1}}} F | ω$ .

Example 4

Let $F = (x \lor y) \land (\bar{x} \lor y) \land (\bar{x} \lor z)$ , $C = x$ , and let $ω = x z$ be the witnessing assignment. Then, $α = \bar{x}$ is the assignment blocked by C. Now, consider $F | α = (y)$ and $F | ω = (y)$ . Clearly, unit propagation with the negated literal $\bar{y}$ of the unit clause $y \in F | ω$ derives a conflict on $F | α$ . Therefore, $F | α ⊢_{^{_{1}}} F | ω$ and so C is $PR$ with respect to F. Note that C is not set-propagation redundant because for $L = {x}$ , we have $α_{L} = x$ and so $F | α_{L}$ contains the two unit clauses y and z, but it does not hold that $F | α ⊢_{^{_{1}}} z$ . The fact that $ω$ satisfies z is crucial for ensuring propagation redundancy. $□$

Since the witnessing assignments $ω$ are allowed to assign variables that are not contained in C, we need—at least in general—to add them to a proof to guarantee that redundancy can be efficiently checked.

We can now explicitly define the $PR$ proof system as an instance of a clausal proof system as defined on page 4:

Definition 7

Given a formula $F = {C_{1}, \dots, C_{m}}$ , a $PR$ derivation of a clause $C_{n}$ from F is a sequence $(C_{m + 1}, ω_{m + 1}), \dots, (C_{n}, ω_{n})$ where for every pair $(C_{i}, ω_{i})$ , one of the following holds: (1) $ω_{i}$ is an assignment that satisfies $C_{i}$ and $F_{i - 1} | α_{i} ⊢_{^{_{1}}} F_{i - 1} | ω_{i}$ with $α_{i}$ being the assignment blocked by $C_{i}$ , or (2) $C_{n} = ⊥$ and $F_{n - 1} ⊢_{^{_{1}}} ⊥$ . A $PR$ derivation of $⊥$ from F is a $PR$ proof of F.

The $LPR$ proof system and the $SPR$ proof system are defined accordingly. Note that in the definition above we treat the empty clause separately because only non-empty clauses can be propagation redundant. If we allow the mentioned proof systems to delete arbitrary clauses, we obtain the proof systems $D LPR$ , $D SPR$ , and $D PR$ . We will not consider deletion in the rest of the article.

Short Proofs of the Pigeon Hole Principle

In a landmark article, Haken [9] showed that pigeon hole formulas cannot be refuted by resolution proofs that are of polynomial size with respect to the size of the formulas. In contrast, Cook [5] proved that there are actually polynomial-size refutations of the pigeon hole formulas in the stronger proof system of extended resolution. What distinguishes extended resolution from general resolution is that it allows the introduction of new variables via definitions. Cook showed how the introduction of such definitions helps to reduce a pigeon hole formula of size n to a pigeon hole formula of size $n - 1$ over new variables. The problem with the introduction of new variables, however, is that the search space of possible variables—and therefore clauses—that could be added to a proof is exponential.

In the following, we illustrate how the $PR$ proof system admits short proofs of pigeon hole formulas without the need for introducing new variables. This shows that the $PR$ system is strictly stronger than the resolution calculus, even when we forbid the introduction of new variables. A pigeon hole formula $P H P_{n}$ intuitively encodes that $n + 1$ pigeons have to be assigned to n holes such that no hole contains more than one pigeon.1 In the encoding, a variable $x_{p, h}$ intuitively denotes that pigeon p is assigned to hole h:

\begin{matrix} P H P_{n} : = \underset{1 \leq p \leq n + 1}{⋀} (x_{p, 1} \lor \dots \lor x_{p, n}) \land \underset{1 \leq p < q \leq n + 1}{⋀} \underset{1 \leq h \leq n}{⋀} ({\bar{x}}_{p, h} \lor {\bar{x}}_{q, h}) \end{matrix}

Clearly, pigeon hole formulas are unsatisfiable. The main idea behind our approach is similar to that of Cook, namely to reduce a pigeon hole formula $P H P_{n}$ to the smaller $P H P_{n - 1}$ . The difference is that in our case $P H P_{n - 1}$ is still defined on the same variables as $P H P_{n}$ . Therefore, reducing $P H P_{n}$ to $P H P_{n - 1}$ boils down to deriving the clauses $x_{p, 1} \lor \dots \lor x_{p, n - 1}$ for $1 \leq p \leq n$ .

Following Haken [9], we use array notation for clauses: Every clause is represented by an array of $n + 1$ columns and n rows. An array contains a “ Inline graphic ” (“”) in the pth column and hth row if and only if the variable $x_{p, h}$ occurs positively (negatively, respectively) in the corresponding clause. Representing $P H P_{n}$ in array notation, we have for every clause $x_{p, 1} \lor \dots \lor x_{p, n}$ , an array in which the pth column is filled with “ Inline graphic ”. Moreover, for every clause ${\bar{x}}_{p, h} \lor {\bar{x}}_{q, h}$ , we have an array that contains two “” in row h—one in column p and the other in column q. For instance, $P H P_{3}$ is given in array notation as follows: graphic file with name 10817_2019_9516_Figb_HTML.jpg We illustrate the general idea for reducing a pigeon hole formula $P H P_{n}$ to the smaller $P H P_{n - 1}$ on the concrete formula $P H P_{3}$ . It should, however, become clear from our explanation that the procedure works for every $n > 1$ . If we want to reduce $P H P_{3}$ to $P H P_{2}$ , we have to derive the following three clauses: graphic file with name 10817_2019_9516_Figc_HTML.jpg We can do so by removing the “ Inline graphic ” from the last row of every column full of “”, except for the last column, which can be ignored as it is not contained in $P H P_{2}$ . The key observation is that a “” in the last row of the pth column can be removed with the help of so-called “diagonal clauses” of the form ${\bar{x}}_{p, n} \lor {\bar{x}}_{n + 1, h}$ ( $1 \leq h \leq n - 1$ ). We are allowed to add these diagonal clauses since they are, as we will show, propagation redundant with respect to $P H P_{n}$ . The arrays below represent the diagonal clauses to remove the “ Inline graphic ” from the last row of the first (left), second (middle), and third column (right): graphic file with name 10817_2019_9516_Figd_HTML.jpg We next show how exactly these diagonal clauses allow us to remove the bottom “” from a column full of “”, or, in other words, how they help us to remove the literal $x_{p, n}$ from a clause $x_{p, 1} \lor \dots \lor x_{p, n}$ ( $1 \leq p \leq n$ ). Consider, for instance, the clause $x_{2, 1} \lor x_{2, 2} \lor x_{2, 3}$ in $P H P_{3}$ . Our aim is to remove the literal $x_{2, 3}$ from this clause. Before we explain the procedure, we like to remark that proof systems based on propagation redundancy can easily simulate resolution: Since every resolvent of clauses in a formula F is implied by F, the assignment $α$ blocked by the resolvent must falsify F and thus $F | α ⊢_{^{_{1}}} ⊥$ . We explain our procedure textually before we illustrate it in array notation:

First, we add the diagonal clauses $D_{1} = {\bar{x}}_{2, 3} \lor {\bar{x}}_{4, 1}$ and $D_{2} = {\bar{x}}_{2, 3} \lor {\bar{x}}_{4, 2}$ to $P H P_{3}$ . Now, we can derive the unit clause ${\bar{x}}_{2, 3}$ by resolving the two diagonal clauses $D_{1}$ and $D_{2}$ with the original pigeon hole clauses $P_{1} = {\bar{x}}_{2, 3} \lor {\bar{x}}_{4, 3}$ and $P_{2} = x_{4, 1} \lor x_{4, 2} \lor x_{4, 3}$ as follows: We obtain ${\bar{x}}_{2, 3} \lor x_{4, 2} \lor x_{4, 3}$ by resolving $D_{1}$ with $P_{2}$ . Then, we resolve this clause with $D_{2}$ to obtain ${\bar{x}}_{2, 3} \lor x_{4, 3}$ , which we resolve with $P_{1}$ to obtain ${\bar{x}}_{2, 3}$ . Note that our proof system actually allows us to add ${\bar{x}}_{2, 3}$ immediately without carrying out all the resolution steps explicitly. Finally, we resolve ${\bar{x}}_{2, 3}$ with $x_{2, 1} \lor x_{2, 2} \lor x_{2, 3}$ to obtain the desired clause $x_{2, 1} \lor x_{2, 2}$ .

We next illustrate this procedure in array notation. We start by visualizing the clauses $D_{1}$ , $D_{2}$ , $P_{1}$ , and $P_{2}$ that can be resolved to yield the clause ${\bar{x}}_{2, 3}$ . The clauses are given in array notation as follows: graphic file with name 10817_2019_9516_Fige_HTML.jpg

We can then resolve ${\bar{x}}_{2, 3}$ with $x_{2, 1} \lor x_{2, 2} \lor x_{2, 3}$ to obtain $x_{2, 1} \lor x_{2, 2}$ : graphic file with name 10817_2019_9516_Figf_HTML.jpg

This should illustrate how a clause of the form $x_{p, 1} \lor \dots \lor x_{p, n}$ ( $1 \leq p \leq n$ ) can be reduced to a clause $x_{p, 1} \lor \dots \lor x_{p, n - 1}$ . By repeating this procedure for every column p with $1 \leq p \leq n$ , we can thus reduce a pigeon hole formula $P H P_{n}$ to a pigeon hole formula $P H P_{n - 1}$ without introducing new variables. Note that the last step, in which we resolve the derived unit clause ${\bar{x}}_{2, 3}$ with the clause $x_{2, 1} \lor x_{2, 2} \lor x_{2, 3}$ , is actually not necessary for a valid $PR$ proof of a pigeon hole formula, but we added it to simplify the presentation.

It remains to show that the diagonal clauses are indeed propagation redundant with respect to the pigeon hole formula. To do so, we show that for every assignment $α = x_{p, n} x_{n + 1, h}$ that is blocked by a diagonal clause ${\bar{x}}_{p, n} \lor {\bar{x}}_{n + 1, h}$ , it holds that for the assignment $ω = {\bar{x}}_{p, n} {\bar{x}}_{n + 1, h} x_{p, h} x_{n + 1, n}$ , $P H P_{n} | α = P H P_{n} | ω$ , implying that $P H P_{n} | α ⊢_{^{_{1}}} P H P_{n} | ω$ . We also argue why other diagonal and unit clauses can be ignored when checking whether a new diagonal clause is propagation redundant.

We again illustrate the idea on $P H P_{3}$ . We now use array notation also for assignments, i.e., a “ Inline graphic ” (“”) in column p and row h denotes that the assignment makes variable $x_{p, h}$ true (false, respectively). Consider, for instance, the diagonal clause $D_{2} = {\bar{x}}_{2, 3} \lor {\bar{x}}_{4, 2}$ that blocks $α = x_{2, 3} x_{4, 2}$ . The corresponding witnessing assignment $ω = {\bar{x}}_{2, 3} {\bar{x}}_{4, 2} x_{2, 2} x_{4, 3}$ can be seen as a “rectangle” with two “ Inline graphic ” in the corners of one diagonal and two “” in the other corners: graphic file with name 10817_2019_9516_Figg_HTML.jpg To see that $P H P_{3} | α$ and $P H P_{3} | ω$ coincide on clauses $x_{p, 1} \lor \dots \lor x_{p, n}$ , consider that whenever $α$ and $ω$ assign a variable of such a clause, they both satisfy the clause (since they both have a “ Inline graphic ” in every column in which they assign a variable) and so they both remove it from $P H P_{3}$ . For instance, in the following example, both $α$ and $ω$ satisfy $x_{2, 1} \lor x_{2, 2} \lor x_{2, 3}$ while both do not assign a variable of the clause $x_{3, 1} \lor x_{3, 2} \lor x_{3, 3}$ : graphic file with name 10817_2019_9516_Figh_HTML.jpg To see that $P H P_{3} | α$ and $P H P_{3} | ω$ coincide on clauses of the form ${\bar{x}}_{p, h} \lor {\bar{x}}_{q, h}$ , consider the following: If $α$ falsifies a literal of ${\bar{x}}_{p, h} \lor {\bar{x}}_{q, h}$ , then the resulting clause is a unit clause for which one of the two literals is not assigned by $α$ (since $α$ does not assign two variables in the same row). Now, one can show that the same unit clause is also contained in $P H P_{3} | ω$ , where it is obtained from another clause: Consider, for example, again the assignment $α = x_{2, 3} x_{4, 2}$ and the corresponding witnessing assignment $ω = {\bar{x}}_{2, 3} {\bar{x}}_{4, 2} x_{2, 2} x_{4, 3}$ from above. The assignment $α$ turns the clause $C = {\bar{x}}_{3, 2} \lor {\bar{x}}_{4, 2}$ into the unit $C | α = {\bar{x}}_{3, 2}$ . The same clause is contained in $P H P_{3} | ω$ , as it is obtained from $C^{'} = {\bar{x}}_{2, 2} \lor {\bar{x}}_{3, 2}$ since $C^{'} | ω = C | α = {\bar{x}}_{3, 2}$ : graphic file with name 10817_2019_9516_Figi_HTML.jpg Note that diagonal clauses and unit clauses that have been derived earlier can be ignored when checking whether the current one is propagation redundant. For instance, assume we are currently reducing $P H P_{n}$ to $P H P_{n - 1}$ . Then, the assignments $α$ and $ω$ under consideration only assign variables in $P H P_{n}$ . In contrast, the unit and diagonal clauses used for reducing $P H P_{n + 1}$ to $P H P_{n}$ (or earlier ones) are only defined on variables outside of $P H P_{n}$ . They are therefore contained in both $P H P_{n} | α$ and $P H P_{n} | ω$ . We can also ignore earlier unit and diagonal clauses over variables in $P H P_{n}$ , i.e., clauses used for reducing an earlier column or other diagonal clauses for the current column: If $α$ assigns one of their variables, then $ω$ satisfies them and so they are not in $P H P_{n} | ω$ .

Finally, we want to mention that short $SPR$ proofs (without new variables) of the pigeon hole formulas can be constructed by first adding $SPR$ clauses of the form ${\bar{x}}_{p, n} \lor {\bar{x}}_{n + 1, h} \lor x_{p, h} \lor x_{n + 1, n}$ and then turning them into diagonal clauses using resolution. We left these proofs out since they are twice as large as the $PR$ proofs and their explanation is less intuitive. A recent result shows that the conversion of a $PR$ proof into a $DRAT$ proof requires only one auxiliary variable [11]. We can thus construct short $DRAT$ proofs without new variables from short $PR$ proofs without new variables. To do so, we first eliminate a single variable from the original formula and then reuse that variable in the conversion algorithm. This is only possible because $DRAT$ allows clause deletion. We consider it unlikely that there exist short proofs for the pigeon hole formulas in the $RAT$ / $LPR$ proof system, where no deletions are allowed.

Evaluation

We implemented a $PR$ proof checker on top of DRAT-trim [30]. The tool, formulas, and proofs are available at https://www.cs.utexas.edu/~marijn/pr. Figure 2 shows the pseudo code of the checking algorithm. The first “if” statement is not necessary but significantly improves the efficiency of the algorithm. The worst-case complexity of the algorithm is $O (n^{3})$ , where n is the size of the final formula. The reason for this is that there are $n - m$ iterations of the outer for-loop and for each of these iterations, the inner for-loop is performed $| F_{i} |$ times, i.e., once for every clause in $F_{i}$ . Given that $F_{i}$ contains i clauses, we know that the size of F is bounded by n. It follows that the inner for-loop is performed mn times. Now, there is a unit propagation test in the inner if-statement: If k is the maximal clause size and n is an upper bound for the size of the formula, then the complexity of unit propagation is known to be at most kn. Hence, the overall worst-case complexity of the algorithm is bounded by $m k n^{2} = O (n^{3})$ .

Fig. 2 — Pseudo code of the $PR$ -proof checking algorithm

This complexity is the same as for $RAT$ -proof checking. In fact, the pseudo-code for $RAT$ -proof checking and $PR$ -proof checking is the same apart from the first if-statement, which is always true in the worst case, both for $RAT$ and $PR$ . Although the theoretical worst-case complexity makes proof checking seem very expensive, it can be done quite efficiently in practice: For the $RAT$ proofs produced by solvers in the SAT competitions, we observed that the runtime of proof checking is close to linear with respect to the sizes of the proofs.

Moreover, we want to highlight that verifying the $PR$ property of a clause is relatively easy as long as a witnessing assignment is given. For an arbitrary clause without a witnessing assignment, however, it is an $NP$ -complete problem to decide if the clause is $PR$ [15]. We therefore believe that in general, the verification of $PR$ proofs is simpler than the actual solving/proving.

The format of $PR$ proofs is an extension of $DRAT$ proofs: the first numbers of line i denote the literals in $C_{i}$ . Positive numbers refer to positive literals, and negative numbers refer to negative literals. In case a witness $ω_{i}$ is provided, the first literal in the clause is repeated to denote the start of the witness. Recall that the witness always has to satisfy the clause. It is therefore guaranteed that the witness and the clause have at least one literal in common. Our format requires that such a literal occurs at the first position of the clause and of the witness. Finally, 0 marks the end of a line. Figure 3 shows the formula and the $PR$ proof of our running example $P H P_{3}$ .

Fig. 3 — Left, ten clauses of $P H P_{3}$ using the notation as elsewhere in this article and next to it the equivalent representation of these clauses in the DIMACS format used by SAT solvers. Right, the full $PR$ refutation consisting of clause-witness pairs. A repetition of the first literal indicates the start of the optional witness

Table 1 compares our $PR$ proofs with existing $DRAT$ proofs of the pigeon hole formulas and of formulas from another challenging benchmark suite of the SAT competition that allows two pigeons per hole. For the latter suite, $PR$ proofs can be constructed in a similar way as those of the classical pigeon hole formulas. Notice that the $PR$ proofs do not introduce new variables and that they contain fewer clauses than their corresponding formulas. The $DRAT$ proof of $P H P_{n}$ contains a copy of the formula $P H P_{k}$ for each $k < n$ . Checking $PR$ proofs is also more efficient, as they are more compact.

Table 1.

The sizes (in terms of the number of variables and clauses) of pigeon hole formulas (top) and two-pigeons-per-hole formulas (bottom) as well as the sizes and validation times (in seconds) for their $PR$ proofs (as described in Sect. 5) and their $DRAT$ proofs (based on symmetry breaking [12])

Formula	Input		$PR$ proofs			$DRAT$ proofs
	#var	#cls	#var	#cls	Time	#var	#cls	Time
hole10.cnf	110	561	110	385	0.17	440	3685	0.22
hole11.cnf	132	738	132	506	0.18	572	5236	0.23
hole12.cnf	156	949	156	650	0.19	728	7228	0.27
hole13.cnf	182	1197	182	819	0.21	910	9737	0.34
hole20.cnf	420	4221	420	2870	0.40	3080	49,420	2.90
hole30.cnf	930	13,981	930	9455	2.57	9920	234,205	61.83
hole40.cnf	1640	32,841	1640	22,140	13.54	22,960	715,040	623.29
hole50.cnf	2550	63,801	2550	42,925	71.72	44,200	1,708,925	3158.17
tph8.cnf	136	5457	136	680	0.32	3520	834,963	5.47
tph12.cnf	300	27,625	300	2300	1.81	11,376	28,183,301	1396.92
tph16.cnf	528	87,329	528	5456	11.16	Not available, too large
tph20.cnf	820	213,241	820	10,660	61.69	Not available, too large

Open in a new tab

Properties of Propagation Redundancy

In the following, we discuss some properties of propagation redundancy and its restricted variants of literal-propagation redundancy and set-propagation redundancy. We first prove that if a clause C is either $LPR$ , $SPR$ , or $PR$ with respect to a formula F, then every superclause of C (i.e., every clause D such that $C \subseteq D$ ) is also $LPR$ , $SPR$ , or $PR$ (respectively) with respect to F. After this, we show how propagation-redundant clauses can be shortened based on unit propagation. Finally, we present an observation that clarifies the relationship between propagation-redundant clauses and their corresponding witnessing assignments.

Our strategy for showing that the superclauses of $LPR$ , $SPR$ , and $PR$ clauses are also $LPR$ , $SPR$ , and $PR$ is as follows: Assume, for instance, that C is a $PR$ clause with respect to F and let $α$ be the assignment blocked by C. We then know that there exists an assignment $ω$ such that $F | α ⊢_{^{_{1}}} F | ω$ . To show that every superclause $C^{'}$ of C is also propagation redundant with respect to F, we extend $α$ so that it becomes the assignment $α^{'}$ blocked by $C^{'}$ . After this, we extend $ω$ to an assignment $ω^{'}$ such that $F | α^{'} ⊢_{^{_{1}}} F | ω^{'}$ . The following example shows that we cannot simply extend $α$ without extending $ω$ :

Example 5

Let $F = (x \lor y) \land (\bar{x} \lor y)$ and let $α = x$ and $ω = \bar{x}$ . Then, $F | α$ and $F | ω$ both contain only the unit clause y and so it holds that $F | α ⊢_{^{_{1}}} F | ω$ . If we only extend $α$ to $α y$ , then $F | α y$ contains no clauses and thus $F | α y ⊬_{^{_{1}}} F | ω$ . However, if we also extend $ω$ to $ω y$ , then we again have $F | α y ⊢_{^{_{1}}} F | ω y$ . $□$

In the following, we present several statements that help us extend $ω$ in the right way. We start with a simple observation about the relation between unit propagation and variable assignments. If a formula is unsatisfiable, then the formula remains unsatisfiable after we assign some of its variables. A similar property holds when unsatisfiability can be shown by unit propagation: If unit propagation derives a conflict on a formula, then we can assign truth values to arbitrary variables of the formula and unit propagation will still derive a conflict. This property will be useful below when we prove other properties about propagation redundancy:

Proposition 5

If unit propagation derives a conflict on a formula F, then it derives a conflict on $F | x$ for every literal x.

Proof

Assume unit propagation derives a conflict on F. Then, there must exist a sequence $C_{1}, \dots, C_{k}$ of clauses from F such that $C_{1}$ is the unit clause $a_{1}$ , $C_{2} | a_{1}$ is the unit clause $a_{2}$ , and so on until $C_{k - 1} | a_{1} \dots a_{n - 2} = a_{k - 1}$ , and finally $C_{k} | a_{1} \dots a_{k - 1} = ⊥$ . Now, if the variable $v a r (x)$ does not occur in any of the clauses $C_{1}, \dots, C_{k}$ , then $C_{i} | x = C_{i}$ for each $i \in 1, \dots, n$ and thus unit propagation derives a conflict on $F | x$ . Assume now that $v a r (x)$ occurs in $C_{1}, \dots, C_{k}$ and let $C_{i}$ be the clause with the smallest i such that $v a r (x) \in C_{i}$ . Then, $C_{1}, \dots, C_{i - 1} \in F | x$ and so unit propagation derives the unit clauses $a_{1}, \dots, a_{i - 1}$ on $F | x$ . We proceed by a case distinction.

$x \in C_{i}$ : In this case, $C_{i} \notin F | x$ , but we know that $C_{i} | a_{1} \dots a_{i - 1} = a_{i} = x$ since $v a r (x)$ cannot occur in $a_{1}, \dots, a_{i - 1}$ . But then the assignment $a_{1} \dots a_{i - 1} a_{i}$ , derived by unit propagation on F, is the assignment $a_{1} \dots a_{i - 1} x$ , derived by unit propagation on $F | x$ . Hence, unit propagation on $F | x$ derives a conflict using the clauses $C_{1} | x, \dots, C_{i - 1} | x, C_{i + 1} | x, \dots C_{k} | x$ .

$\bar{x} \in C_{i}$ : In that case, $C_{i} | a_{1} \dots a_{i - 1}$ must be the unit clause $\bar{x}$ . It follows that $C_{i} \subseteq {\bar{a}}_{1} \lor \dots \lor {\bar{a}}_{i - 1} \lor \bar{x}$ and thus $C_{i} | x \subseteq {\bar{a}}_{1} \lor \dots \lor {\bar{a}}_{i - 1}$ . Hence, unit propagation on $F | x$ derives a conflict with the clauses $C_{1}, C_{2}, \dots, C_{i} | x$ since it derives all the unit clauses $a_{1}, \dots, a_{i - 1}$ . $□$

In Example 5, we presented a formula F with two assignments $α$ and $ω$ such that $F | α ⊢_{^{_{1}}} F | ω$ . After extending $α$ to $α x$ , however, we could observe that $F | α x ⊬_{^{_{1}}} F | ω$ . The problem in the example is that in contrast to $F | α$ , the formula $F | α x$ does not contain the unit clause x anymore while $F | ω$ still contains x as a clause. Hence, $F | α x$ does not imply $F | ω$ . However, as the next statement tells us, it is guaranteed that $F | α x$ implies all those clauses of $F | ω$ that contain neither x nor $\bar{x}$ .

In the rest of this section, given a clause $C = (c_{1} \lor \dots \lor c_{n})$ , we write $\neg C$ for the conjunction ${\bar{c}}_{1} \land \dots \land {\bar{c}}_{n}$ of unit clauses. In the following statement, the requirement that $α$ must not falsify x makes sure that $α x$ is well-defined:

Lemma 6

Let F be formula, let $α, ω$ be assignments such that $F | α ⊢_{^{_{1}}} F | ω$ , and let x be a literal that is not falsified by $α$ . Then, $F | α x ⊢_{^{_{1}}} D | ω$ for every clause $D | ω \in F | ω$ such that $v a r (x) \notin v a r (D | ω)$ .

Proof

Assume that $F | α ⊢_{^{_{1}}} F | ω$ and let $D | ω \in F | ω$ be a clause such that $v a r (x) \notin D | ω$ . Since $F | α ⊢_{^{_{1}}} D | ω$ , we know that unit propagation derives a conflict on $F | α \land \neg (D | ω)$ , with $\neg (D | ω)$ being the conjunction of the negated literals of $D | ω$ . Since $v a r (x) \notin v a r (D | ω)$ , it follows that $D | ω = D | ω x$ . Thus, $(F | α \land \neg (D | ω)) | x = F | α x \land \neg (D | ω)$ . But then, since unit propagation derives a conflict on $F | α \land \neg (D | ω)$ , we know, by Proposition 5, that unit propagation derives a conflict on $F | α x \land \neg (D | ω)$ . It follows that $F | α x ⊢_{^{_{1}}} D | ω$ . $□$

Using Lemma 6, we can show that every literal that is neither falsified by $α$ nor by $ω$ can just be appended to both $α$ and $ω$ :

Lemma 7

Let F be formula, $α$ and $ω$ assignments, and x a literal that is neither falsified by $α$ nor by $ω$ . Then, $F | α ⊢_{^{_{1}}} F | ω$ implies $F | α x ⊢_{^{_{1}}} F | ω x$ .

Proof

Suppose $F | α ⊢_{^{_{1}}} F | ω$ and let $D | ω x \in F | ω x$ for $D \in F$ . We show that $F | α x ⊢_{^{_{1}}} D | ω x$ . If $\bar{x} \in D | ω$ , then $D | ω = D | ω x \lor \bar{x}$ . Hence, unit propagation derives a conflict on $F | α \land \neg (D | ω x) \land x$ , with $\neg (D | ω x)$ being the conjunction of the negated literals of $D | ω x$ . But then unit propagation derives a conflict on $F | α x \land \neg (D | ω x)$ and thus $F | α x ⊢_{^{_{1}}} D | ω x$ . If $\bar{x} \notin D$ , then $D | ω x = D | ω$ . Now, we know that $F | α ⊢_{^{_{1}}} D | ω$ and hence $F | α ⊢_{^{_{1}}} D | ω x$ . Thus, by Lemma 6, it follows that $F | α x ⊢_{^{_{1}}} D | ω x$ . $□$

Assume that a clause C is $LPR$ with respect to a formula F. Let D be a superclause of C, $α$ be the assignment blocked by C, and $α x_{1} \dots x_{k}$ the assignment blocked by D. Then, we know that there exists a literal $l \in C$ such that $F | α ⊢_{^{_{1}}} F | α_{l}$ . But then Lemma 7 tells us that $F | α x_{1} \dots x_{k} ⊢_{^{_{1}}} F | α_{l} x_{1} \dots x_{k}$ and thus D is $LPR$ with respect to F. The same argument applies to $SPR$ clauses (but not to $PR$ clauses in general) and thus we get:

Theorem 8

If a clause C is $LPR$ ( $SPR$ ) with respect to F, then every superclause of C is $LPR$ ( $SPR$ , respectively) with respect to F.

To show that the corresponding statement also holds for $PR$ clauses that are not $SPR$ clauses, we need to show some additional properties of $PR$ clauses. The next statement, which is a simple consequence of Lemma 6, tells us that the extension of $α$ to $α x$ is harmless if $ω$ already falsifies x:

Lemma 9

Let F be formula, let $α$ and $ω$ be assignments, and let x be a literal that is not falsified by $α$ . Then, $F | α ⊢_{^{_{1}}} F | ω \bar{x}$ implies $F | α x ⊢_{^{_{1}}} F | ω \bar{x}$ .

Proof

Assume that $v a r (x)$ occurs in a clause $D \in F$ . If $x \in D$ , then $D | ω \bar{x}$ does not contain x. If $\bar{x} \in D$ , then D is satisfied by $ω \bar{x}$ and thus $D | ω \bar{x} \notin F | ω \bar{x}$ . Thus, by Lemma 6, $F | α x ⊢_{^{_{1}}} F | ω \bar{x}$ . $□$

Putting everything together, we can now show that we can always extend $α$ if we just extend $ω$ accordingly:

Lemma 10

Let F be a formula, let $α$ and $ω$ be two assignments such that $F | α ⊢_{^{_{1}}} F | ω$ , and let $α^{'}$ be an assignment such that $α \subseteq α^{'}$ . Then, there exists an assignment $ω^{'}$ such that $ω \subseteq ω^{'}$ and $F | α^{'} ⊢_{^{_{1}}} F | ω^{'}$ .

Proof

Suppose $α^{'}$ is an assignment such that $α \subseteq α^{'}$ . Then, $α^{'}$ is of the form $α x_{1} \dots x_{k}$ where $n \in 0, \dots, n$ . Now, starting with $x_{1}$ , we stepwise extend $α$ with $x_{1}, \dots, x_{k}$ to finally obtain $α x_{1} \dots x_{k}$ . We just have to extend $ω$ to an assignment $ω_{k}$ accordingly to ensure that $F | α x_{1} \dots x_{k} ⊢_{^{_{1}}} F | ω_{k}$ . We start with $ω_{0} = ω$ and proceed as follows for $i \in 1, \dots, n$ : If $v a r (x_{i}) \in v a r (ω)$ , define $ω_{i} = ω_{i - 1}$ . In contrast, if $v a r (x_{i}) \notin v a r (ω)$ , define $ω_{i} = ω_{i - 1} x_{i}$ .

By a simple induction on i we can now show $F | α x_{1} \dots x_{i} ⊢_{^{_{1}}} F | ω_{i}$ for every $i \in 0, \dots, k$ : The base case, $F | α ⊢_{^{_{1}}} F | ω$ , holds by assumption. The induction hypothesis states that $F | α x_{1} \dots x_{i - 1} ⊢_{^{_{1}}} F | ω_{i - 1}$ . Now, for the induction step, if $ω (x_{i}) = 0$ , then $ω_{i}$ is of the form $ω_{i - 1} {\bar{x}}_{i}$ . In this case, by Lemma 9, $F | α x_{1} \dots x_{i} ⊢_{^{_{1}}} F | ω_{i - 1} {\bar{x}}_{i}$ . If $v a r (x_{i}) \notin v a r (ω)$ or $ω (x_{i}) = 1$ , then $ω_{i}$ is of the form $ω_{i - 1} x_{i}$ . In that case, by Lemma 7, $F | α x_{1} \dots x_{i} ⊢_{^{_{1}}} F | ω_{i - 1} x_{i}$ . $□$

As immediate consequence we obtain one of our main statements:

Theorem 11

If a clause C is $PR$ with respect to F, then every superclause of C is $PR$ with respect to F.

Next, we show that we can remove certain literals from propagation-redundant clauses without violating their property of being propagation redundant. The idea is as follows: Let C be a clause that is propagation redundant with respect to a formula F and let $α x_{1} \dots x_{k}$ be the assignment blocked by C. Now, if unit propagation on $F | α$ derives the unit clauses $x_{1}, \dots, x_{k}$ , then the clause that blocks only $α$ —and not the whole assignment $α x_{1} \dots x_{k}$ —is propagation redundant with respect to F too. To show this, we first introduce the notion of propagation extensions. Note that in the following definition, by a consistent set of unit clauses, we mean a set of unit clauses that does not contain two complementary unit clauses x and $\bar{x}$ :

Definition 8

Let F be a formula, let $α$ be an assignment, and let ${x_{1}, \dots, x_{k}}$ be a consistent set of unit clauses derived by unit propagation on $F | α$ . Then, $α x_{1} \dots x_{k}$ is a propagation extension of $α$ on F.

Example 6

Let $F = (x \lor y) \land (\bar{y} \lor z)$ and let $α = \bar{x}$ . Unit propagation on $F | α$ derives the unit clauses y and z. Hence, the assignments $α y$ , $α z$ , and $α y z$ are propagation extensions of $α$ on F. Now consider the formula $F \land \bar{z}$ . Then, unit propagation on $(F \land \bar{z}) | α$ derives the unit clauses y, z, and $\bar{z}$ . Thus, the propagation extensions of $α$ on $F \land (\bar{z})$ are the assignments $α \bar{z}$ and $α y \bar{z}$ as well as all the propagation extensions of $α$ on F. $□$

If $F | α^{+} ⊢_{^{_{1}}} F | ω$ for some propagation extension $α^{+}$ of an assignment $α$ , we can simply shorten $α^{+}$ to $α$ without modifying $ω$ and it will still hold that $F | α ⊢_{^{_{1}}} F | ω$ :

Lemma 12

Let F be a formula, $α$ an assignment, and $α^{+}$ a propagation extension of $α$ on F. Then, $F | α ⊢_{^{_{1}}} F | ω$ if and only if $F | α^{+} ⊢_{^{_{1}}} F | ω$ .

Proof

The “only if” direction is an immediate consequence of Lemma 10. For the “if” direction, assume that $F | α^{+} ⊢_{^{_{1}}} F | ω$ and let $D | ω \in F | ω$ . We know that unit propagation derives a conflict on $F | α^{+} \land \neg (D | ω)$ where $\neg (D | ω)$ is the conjunction of the negated literals of $D | ω$ . Since unit propagation derives $F | α^{+}$ from $F | α$ , it follows that unit propagation derives a conflict on $F | α \land \neg (D | ω)$ . Hence, $F | α ⊢_{^{_{1}}} D | ω$ and thus $F | α ⊢_{^{_{1}}} F | ω$ . $□$

Using Lemma 12, we can now show that the removal of propagated literals from $PR$ clauses is harmless:

Theorem 13

Let C be a clause that is $PR$ with respect to a formula F and let $α^{+}$ be the assignment blocked by C. If $α^{+}$ is a propagation extension of an assignment $α$ on F, then the clause that blocks $α$ is $PR$ with respect to F.

Proof

Assume that $α^{+}$ is a propagation extension of an assignment $α$ on F and let $C^{-}$ be the clause that blocks $α$ . Then, $α^{+}$ is of the form $α x_{1} \dots x_{k}$ where $x_{1}, \dots x_{k}$ are all the literals derived by unit propagation on $F | α$ . Since C is propagation redundant with respect to F, we know that there exists some assignment $ω$ such that $ω$ satisfies C and $F | α x_{1} \dots x_{k} ⊢_{^{_{1}}} F | ω$ . Hence, by Lemma 12, $F | α ⊢_{^{_{1}}} F | ω$ . Now, if $ω$ satisfies $C^{-}$ , then $C^{-}$ is propagation redundant with respect to F.

Assume thus that $ω$ does not satisfy $C^{-}$ . Then, since $ω$ satisfies C, it must falsify a literal in $x_{1}, \dots, x_{k}$ . Let $x_{i}$ be the first literal (i.e., the one with the smallest index) of $x_{1}, \dots, x_{k}$ that is falsified by $ω$ . Then, there exists a clause $D \in F$ such that $D | α x_{1} \dots x_{i - 1}$ is the unit clause $x_{i}$ and thus $ω$ falsifies D. Hence, $F | α ⊢_{^{_{1}}} ⊥$ . But then, $C^{-}$ is trivially $PR$ with respect to F since it holds for every assignment $τ$ (and in particular for every $τ$ that satisfies C) that $F | α ⊢_{^{_{1}}} F | τ$ . $□$

We can prove a corresponding result about $LPR$ clauses: If we remove propagated literals from an $LPR$ clause, then the resulting clause is also an $LPR$ clause. As we will see later, this is not the case for $SPR$ clauses. We start with two lemmas:

Lemma 14

Let F be a formula, $α$ and $ω$ assignments, and x a literal such that $F | α ⊢_{^{_{1}}} x$ . Then, $F | α ⊢_{^{_{1}}} F | ω x$ implies $F | α ⊢_{^{_{1}}} F | ω$ .

Proof

Suppose to the contrary that there exists a clause $D | ω \in F | ω$ such that $F | α ⊬_{^{_{1}}} D | ω$ . Then, D must contain x, for otherwise $ω x$ would not satisfy D, which would in turn imply $F | α ⊬_{^{_{1}}} F | ω x$ . Therefore, the clause $\neg (D | ω)$ , being the conjunction of the negated literals of $D | ω$ , must contain $\bar{x}$ . But then, since unit propagation on $F | α$ derives x, it follows that unit propagation derives a conflict on $F | α \land \neg (D | ω)$ . Hence, $F | α ⊢_{^{_{1}}} D | ω$ and thus $F | α ⊢_{^{_{1}}} F | ω$ . $□$

Lemma 15

Let F be a formula, $α$ an assignment, $α^{+}$ a propagation extension of $α$ on F, and l a literal. Then, $F | α^{+} ⊢_{^{_{1}}} F | α_{l}^{+}$ implies $F | α ⊢_{^{_{1}}} F | α_{l}$ .

Proof

Assume that $α^{+}$ is a propagation extension of $α$ . Then, $α^{+}$ is of the form $α τ$ where $τ$ make the literals true that have been derived by unit propagation on $F | α$ . To show that $F | α ⊢_{^{_{1}}} F | α_{l}$ , we distinguish two cases:

$v a r (l) \in v a r (α)$ : In this case, $F | α τ ⊢_{^{_{1}}} F | α_{l} τ$ and thus $F | α ⊢_{^{_{1}}} F | α_{l} τ$ . Now, since $F | α ⊢_{^{_{1}}} x$ for every literal x satisfied by $τ$ , we use Lemma 14 to repeatedly remove from $α_{l} τ$ all assignments made by $τ$ to obtain $F | α ⊢_{^{_{1}}} F | α_{l}$ .

$v a r (l) \in v a r (τ)$ : In that case, $F | α τ ⊢_{^{_{1}}} F | α τ_{l}$ . Since unit propagation on $F | α$ derives $\bar{l}$ , there exists a clause $D \in F$ such that $α τ_{l}$ —which satisfies l—falsifies D. Hence, $D | α τ_{l} = ⊥$ , and since $F | α τ ⊢_{^{_{1}}} D | α τ_{l}$ , it follows that unit propagation derives a conflict on $F | α τ$ . But then unit propagation must derive a conflict on $F | α$ and thus $F | α$ implies every clause via unit propagation. We thus conclude that $F | α ⊢_{^{_{1}}} F | α_{l}$ . $□$

The following Theorem is now an immediate consequence of Lemma 15:

Theorem 16

Let C be a clause that is $LPR$ with respect to a formula F and let $α^{+}$ be the assignment blocked by C. If $α^{+}$ is a propagation extension of an assignment $α$ on F, then the clause that blocks $α$ is $LPR$ with respect to F.

The corresponding property does not hold for $SPR$ clauses, as illustrated by the following example:

Example 7

Let $F = (\bar{x} \lor y) \land (x \lor \bar{y})$ , $C = x \lor y$ , and $L = {x, y}$ . Then, $α^{+} = \bar{x} \bar{y}$ , which is the assignment blocked by C, is a propagation extension of the assignment $α = \bar{x}$ . Moreover, since $F | α_{L}^{+}$ contains no clauses, we have $F | α^{+} ⊢_{^{_{1}}} F | α_{L}^{+}$ and thus C is set-propagation redundant with respect to F. However, $F | α ⊬_{^{_{1}}} F | α_{x}$ and thus the subclause x of C is not set-propagation redundant with respect to F. $□$

We conclude this section with an observation about the witnessing assignments of propagation-redundant clauses. In the case of propagation redundancy, the domain of the witnessing assignment $ω$ is not constrained to a particular set of variables. Thus, if we are given a clause C and we want to find a corresponding assignment $ω$ that witnesses the propagation redundancy of C, we would have to consider assignments over all possible sets of variables. It turns out, however, that there is always a witnessing assignment that assigns all variables occurring in C, and possibly more. Thus, if $α$ is the assignment blocked by C, we only need to consider assignments $ω$ such that $v a r (α) \subseteq v a r (ω)$ . The reason for this is that we can extend every witness $ω$ to the variables of $v a r (α)$ :

Proposition 17

Let F be a formula, $α$ and $ω$ assignments, and x a literal such that $v a r (x) \in v a r (α) \ v a r (ω)$ . Then, $F | α ⊢_{^{_{1}}} F | ω$ implies $F | α ⊢_{^{_{1}}} F | ω x$ .

Proof

Let $D | ω x \in F | ω x$ . We show that $F | α ⊢_{^{_{1}}} D | ω x$ . Clearly, x is not contained in D for otherwise $D | ω x = ⊤$ . Therefore, the only possible difference between $D | ω$ and $D | ω x$ is that $\bar{x}$ is contained in $D | ω$ but not in $D | ω x$ . Now, since $v a r (x) \in v a r (α)$ , we know that $v a r (x) \notin F | α$ . But then, $F | α ⊢_{^{_{1}}} D | ω x$ if and only if $F | α ⊢_{^{_{1}}} D | ω$ . It thus follows that $F | α ⊢_{^{_{1}}} F | ω x$ . $□$

Related Work

Here, we discuss how the concepts in this article are related to variable instantiation [1], autarkies [24], safe assignments [29], and symmetry breaking [6]. If $F | \bar{x} ⊨ F | x$ holds for some literal x, then variable instantiation, as described by Andersson et al. [1], allows to make the literal x true in the formula F. Analogously, our redundancy notion identifies the clause x as redundant.

As presented by Monien and Speckenmeyer [24], an assignment $ω$ is an autarky for a formula F if it satisfies all clauses of F that contain a literal to which $ω$ assigns a truth value. If an assignment $ω$ is an autarky for a formula F, then F is satisfiability equivalent to $F | ω$ . Similarly, propagation redundancy $PR$ allows us to add all the unit clauses satisfied by an autarky, with the autarky serving as a witness:2 Let $ω$ be an autarky for some formula F, $C = x$ for a literal x satisfied by $ω$ , and $α = \bar{x}$ the assignment blocked by C. Notice that $F | α \supseteq F | ω$ and thus C is $PR$ with respect to F.

According to Weaver and Franco [29], an assignment $ω$ is considered safe if, for every assignment $α$ with $v a r (α) = v a r (ω)$ , it holds that $F | α ⊨ F | ω$ . If an assignment $ω$ is safe, then $F | ω$ is satisfiability equivalent to F. In a similar fashion, our approach allows us to block all the above-mentioned assignments $α \neq ω$ . Through this, we obtain a formula that is logically equivalent to $F | ω$ . Note that safe assignments generalize autarkies and variable instantiation. Moreover, while safe assignments only allow the application of an assignment $ω$ to a formula F if $F | α ⊨ F | ω$ holds for all assignments $α \neq ω$ , our approach enables us to block an assignment $α$ as soon as $F | α ⊨ F | ω$ .

Finally, symmetry breaking [6] can be expressed in the $DRAT$ proof system [12] but existing methods introduce many new variables and duplicate the input formula multiple times. It might be possible to express symmetry breaking without new variables in the $PR$ proof system. For one important symmetry, row-interchangeability [7], the symmetry breaking using $PR$ without new variables appears similar to the method we presented for the pigeon hole formulas.

Open Problems and Future Work

In a more recent paper, we showed that there exists a polynomial-time procedure that translates $PR$ proofs to $DRAT$ proofs by introducing one new variable [11]. Moreover, we proved that extended resolution polynomially simulates the $DRAT$ proof system [20]. The combination of these two results demonstrates that extended resolution polynomially simulates the $PR$ proof system and therefore also its restricted variants. An open question is how the $PR$ proof system without new variables relates to other strong proof systems for propositional logic that do not introduce new variables, such as Frege systems. Other open questions are related to the space and width bounds of the smallest $PR$ proofs, again without new variables, for well-known other hard problems such as Tseitin formulas [26, 27] or pebbling games [25].

On the practical side, we want to pursue some ideas to improve SAT solving by learning short $PR$ clauses. Our first approach, called satisfaction-driven clause learning, generalizes the well-known conflict-driven clause learning paradigm by checking whether certain assignments—encountered during solving—can be pruned from the search space by adding $PR$ clauses [15]. Our current implementation can find short proofs of pigeon hole formulas, although the solver can only find a subset of all possible $PR$ clauses. Moreover, we are still searching for efficient heuristics that help solvers with finding short $PR$ clauses in general formulas. Another problem we are currently exploring is the minimization of conflict clauses by checking if a subset of a conflict clause is propagation redundant with respect to the formula under consideration. Finally, we want to implement a formally-verified proof checker for $PR$ proofs.

Conclusion

We presented a clean and simple characterization of clause redundancy that is based on an implication relation between a formula and itself under different partial assignments. Replacing the implication relation by efficiently decidable notions of implication, e.g., the superset relation or implication via unit propagation, gives then rise to various polynomially-checkable redundancy criteria. One variant yields a proof system that turns out to coincide with $RAT$ , which together with deletion is the de-facto standard in SAT solving. We conjecture the proof systems based on the other two variants to be stronger if the introduction of new variables is not allowed. We showed that these more general proof systems admit short clausal proofs without new variables for the famous pigeon hole formulas. Experiments show that our proofs are much smaller than existing clausal proofs and that they are also much faster to check. Our new proof systems concisely simulate many other concepts from the literature such as autarkies, variable instantiation, safe assignments, and certain kinds of symmetry reasoning.

Acknowledgements

Open access funding provided by Austrian Science Fund (FWF).

Footnotes

We changed the definition of $P H P_{n}$ from putting n pigeons into $n - 1$ holes to putting $n + 1$ pigeons into n holes in order to fix a discrepancy with the formulas in the evaluation.

In the conference version [14] of this article, we wrongly stated that we can add all falsified unit clauses instead of the satisfied ones.

Disclaimer: The authors strongly disagree with Springer’s typesetting practices. A version with the intended content and typesetting is available on the authors’ websites.

This work has been supported by the National Science Foundation under Grant CCF-1526760 and by the Austrian Science Fund (FWF) under Project W1255-N23.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Andersson, G., Bjesse, P., Cook, B., Hanna, Z.: A proof engine approach to solving combinational design automation problems. In: Proceedings of the 39th Annual Design Automation Conference (DAC 2002), pp. 725–730. ACM (2002)
2.Audemard, G., Katsirelos, G., Simon, L.: A restriction of extended resolution for clause learning sat solvers. In: Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI 2010). AAAI Press (2010)
3.Balyo, T., Heule, M.J.H., Järvisalo, M.: SAT competition 2016: recent developments. To appear in: Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI 2017). AAAI Press (2017)
4.Clarke EM, Biere A, Raimi R, Zhu Y. Bounded model checking using satisfiability solving. Form. Methods Syst. Des. 2001;19(1):7–34. doi: 10.1023/A:1011276507260. [DOI] [Google Scholar]
5.Cook SA. A short proof of the pigeon hole principle using extended resolution. SIGACT News. 1976;8(4):28–32. doi: 10.1145/1008335.1008338. [DOI] [Google Scholar]
6.Crawford, J., Ginsberg, M., Luks, E., Roy, A.: Symmetry-breaking predicates for search problems. In: Proceedings of the 5th International Conference on Principles of Knowledge Representation and Reasoning (KR 1996), pp. 148–159. Morgan Kaufmann (1996)
7.Devriendt Jo, Bogaerts Bart, Bruynooghe Maurice, Denecker Marc. Theory and Applications of Satisfiability Testing – SAT 2016. Cham: Springer International Publishing; 2016. Improved Static Symmetry Breaking for SAT; pp. 104–122. [Google Scholar]
8.Goldberg, E.I., Novikov, Y.: Verification of proofs of unsatisfiability for CNF formulas. In: Proceedings of the Conference on Design, Automation and Test in Europe (DATE 2003), pp. 10,886–10,891. IEEE Computer Society (2003)
9.Haken A. The intractability of resolution. Theor. Comput. Sci. 1985;39:297–308. doi: 10.1016/0304-3975(85)90144-6. [DOI] [Google Scholar]
10.Heule, M.J.H.: Schur number five. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI 2018), pp. 6598–6606. AAAI Press (2018)
11.Heule Marijn J. H., Biere Armin. Tools and Algorithms for the Construction and Analysis of Systems. Cham: Springer International Publishing; 2018. What a Difference a Variable Makes; pp. 75–92. [Google Scholar]
12.Heule Marijn J. H., Hunt Warren A., Wetzler Nathan. Automated Deduction - CADE-25. Cham: Springer International Publishing; 2015. Expressing Symmetry Breaking in DRAT Proofs; pp. 591–606. [Google Scholar]
13.Heule Marijn J. H., Kullmann Oliver, Marek Victor W. Theory and Applications of Satisfiability Testing – SAT 2016. Cham: Springer International Publishing; 2016. Solving and Verifying the Boolean Pythagorean Triples Problem via Cube-and-Conquer; pp. 228–245. [Google Scholar]
14.Heule Marijn J. H., Kiesl Benjamin, Biere Armin. Automated Deduction – CADE 26. Cham: Springer International Publishing; 2017. Short Proofs Without New Variables; pp. 130–147. [Google Scholar]
15.Heule Marijn J. H., Kiesl Benjamin, Seidl Martina, Biere Armin. Hardware and Software: Verification and Testing. Cham: Springer International Publishing; 2017. PRuning Through Satisfaction; pp. 179–194. [Google Scholar]
16.Ivančić F, Yang Z, Ganai MK, Gupta A, Ashar P. Efficient SAT-based bounded model checking for software verification. Theor. Comput. Sci. 2008;404(3):256–274. doi: 10.1016/j.tcs.2008.03.013. [DOI] [Google Scholar]
17.Järvisalo M, Biere A, Heule MJH. Simulating circuit-level simplifications on CNF. J. Autom. Reason. 2012;49(4):583–619. doi: 10.1007/s10817-011-9239-9. [DOI] [Google Scholar]
18.Järvisalo Matti, Heule Marijn J. H., Biere Armin. Automated Reasoning. Berlin, Heidelberg: Springer Berlin Heidelberg; 2012. Inprocessing Rules; pp. 355–370. [Google Scholar]
19.Kiesl Benjamin, Seidl Martina, Tompits Hans, Biere Armin. Automated Reasoning. Cham: Springer International Publishing; 2016. Super-Blocked Clauses; pp. 45–61. [Google Scholar]
20.Kiesl Benjamin, Rebola-Pardo Adrián, Heule Marijn J. H. Automated Reasoning. Cham: Springer International Publishing; 2018. Extended Resolution Simulates DRAT; pp. 516–531. [Google Scholar]
21.Konev B, Lisitsa A. Computer-aided proof of Erdős discrepancy properties. Artif. Intell. 2015;224(C):103–118. doi: 10.1016/j.artint.2015.03.004. [DOI] [Google Scholar]
22.Kullmann O. On a generalization of extended resolution. Discrete Appl. Math. 1999;96–97:149–176. doi: 10.1016/S0166-218X(99)00037-2. [DOI] [Google Scholar]
23.Manthey Norbert, Heule Marijn J. H., Biere Armin. Hardware and Software: Verification and Testing. Berlin, Heidelberg: Springer Berlin Heidelberg; 2013. Automated Reencoding of Boolean Formulas; pp. 102–117. [Google Scholar]
24.Monien B, Speckenmeyer E. Solving satisfiability in less than $2^{n}$ steps. Discrete Appl. Math. 1985;10(3):287–295. doi: 10.1016/0166-218X(85)90050-2. [DOI] [Google Scholar]
25.Nordström J. A simplified way of proving trade-off results for resolution. Inf. Process. Lett. 2009;109(18):1030–1035. doi: 10.1016/j.ipl.2009.06.006. [DOI] [Google Scholar]
26.Tseitin GS. On the complexity of derivation in propositional calculus. Stud. Math. Math. Log. 1968;2:115–125. [Google Scholar]
27.Urquhart A. The complexity of propositional proofs. Bull. Symb. Log. 1995;1(4):425–467. doi: 10.2178/bsl/1203350879. [DOI] [Google Scholar]
28.Van Gelder A. Producing and verifying extremely large propositional refutations. Ann. Math. Artif. Intell. 2012;65(4):329–372. doi: 10.1007/s10472-012-9322-x. [DOI] [Google Scholar]
29.Weaver S, Franco JV, Schlipf JS. Extending existential quantification in conjunctions of BDDs. JSAT. 2006;1(2):89–110. [Google Scholar]
30.Wetzler Nathan, Heule Marijn J. H., Hunt Warren A. Lecture Notes in Computer Science. Cham: Springer International Publishing; 2014. DRAT-trim: Efficient Checking and Trimming Using Expressive Clausal Proofs; pp. 422–429. [Google Scholar]

[CR1] 1.Andersson, G., Bjesse, P., Cook, B., Hanna, Z.: A proof engine approach to solving combinational design automation problems. In: Proceedings of the 39th Annual Design Automation Conference (DAC 2002), pp. 725–730. ACM (2002)

[CR2] 2.Audemard, G., Katsirelos, G., Simon, L.: A restriction of extended resolution for clause learning sat solvers. In: Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI 2010). AAAI Press (2010)

[CR3] 3.Balyo, T., Heule, M.J.H., Järvisalo, M.: SAT competition 2016: recent developments. To appear in: Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI 2017). AAAI Press (2017)

[CR4] 4.Clarke EM, Biere A, Raimi R, Zhu Y. Bounded model checking using satisfiability solving. Form. Methods Syst. Des. 2001;19(1):7–34. doi: 10.1023/A:1011276507260. [DOI] [Google Scholar]

[CR5] 5.Cook SA. A short proof of the pigeon hole principle using extended resolution. SIGACT News. 1976;8(4):28–32. doi: 10.1145/1008335.1008338. [DOI] [Google Scholar]

[CR6] 6.Crawford, J., Ginsberg, M., Luks, E., Roy, A.: Symmetry-breaking predicates for search problems. In: Proceedings of the 5th International Conference on Principles of Knowledge Representation and Reasoning (KR 1996), pp. 148–159. Morgan Kaufmann (1996)

[CR7] 7.Devriendt Jo, Bogaerts Bart, Bruynooghe Maurice, Denecker Marc. Theory and Applications of Satisfiability Testing – SAT 2016. Cham: Springer International Publishing; 2016. Improved Static Symmetry Breaking for SAT; pp. 104–122. [Google Scholar]

[CR8] 8.Goldberg, E.I., Novikov, Y.: Verification of proofs of unsatisfiability for CNF formulas. In: Proceedings of the Conference on Design, Automation and Test in Europe (DATE 2003), pp. 10,886–10,891. IEEE Computer Society (2003)

[CR9] 9.Haken A. The intractability of resolution. Theor. Comput. Sci. 1985;39:297–308. doi: 10.1016/0304-3975(85)90144-6. [DOI] [Google Scholar]

[CR10] 10.Heule, M.J.H.: Schur number five. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI 2018), pp. 6598–6606. AAAI Press (2018)

[CR11] 11.Heule Marijn J. H., Biere Armin. Tools and Algorithms for the Construction and Analysis of Systems. Cham: Springer International Publishing; 2018. What a Difference a Variable Makes; pp. 75–92. [Google Scholar]

[CR12] 12.Heule Marijn J. H., Hunt Warren A., Wetzler Nathan. Automated Deduction - CADE-25. Cham: Springer International Publishing; 2015. Expressing Symmetry Breaking in DRAT Proofs; pp. 591–606. [Google Scholar]

[CR13] 13.Heule Marijn J. H., Kullmann Oliver, Marek Victor W. Theory and Applications of Satisfiability Testing – SAT 2016. Cham: Springer International Publishing; 2016. Solving and Verifying the Boolean Pythagorean Triples Problem via Cube-and-Conquer; pp. 228–245. [Google Scholar]

[CR14] 14.Heule Marijn J. H., Kiesl Benjamin, Biere Armin. Automated Deduction – CADE 26. Cham: Springer International Publishing; 2017. Short Proofs Without New Variables; pp. 130–147. [Google Scholar]

[CR15] 15.Heule Marijn J. H., Kiesl Benjamin, Seidl Martina, Biere Armin. Hardware and Software: Verification and Testing. Cham: Springer International Publishing; 2017. PRuning Through Satisfaction; pp. 179–194. [Google Scholar]

[CR16] 16.Ivančić F, Yang Z, Ganai MK, Gupta A, Ashar P. Efficient SAT-based bounded model checking for software verification. Theor. Comput. Sci. 2008;404(3):256–274. doi: 10.1016/j.tcs.2008.03.013. [DOI] [Google Scholar]

[CR17] 17.Järvisalo M, Biere A, Heule MJH. Simulating circuit-level simplifications on CNF. J. Autom. Reason. 2012;49(4):583–619. doi: 10.1007/s10817-011-9239-9. [DOI] [Google Scholar]

[CR18] 18.Järvisalo Matti, Heule Marijn J. H., Biere Armin. Automated Reasoning. Berlin, Heidelberg: Springer Berlin Heidelberg; 2012. Inprocessing Rules; pp. 355–370. [Google Scholar]

[CR19] 19.Kiesl Benjamin, Seidl Martina, Tompits Hans, Biere Armin. Automated Reasoning. Cham: Springer International Publishing; 2016. Super-Blocked Clauses; pp. 45–61. [Google Scholar]

[CR20] 20.Kiesl Benjamin, Rebola-Pardo Adrián, Heule Marijn J. H. Automated Reasoning. Cham: Springer International Publishing; 2018. Extended Resolution Simulates DRAT; pp. 516–531. [Google Scholar]

[CR21] 21.Konev B, Lisitsa A. Computer-aided proof of Erdős discrepancy properties. Artif. Intell. 2015;224(C):103–118. doi: 10.1016/j.artint.2015.03.004. [DOI] [Google Scholar]

[CR22] 22.Kullmann O. On a generalization of extended resolution. Discrete Appl. Math. 1999;96–97:149–176. doi: 10.1016/S0166-218X(99)00037-2. [DOI] [Google Scholar]

[CR23] 23.Manthey Norbert, Heule Marijn J. H., Biere Armin. Hardware and Software: Verification and Testing. Berlin, Heidelberg: Springer Berlin Heidelberg; 2013. Automated Reencoding of Boolean Formulas; pp. 102–117. [Google Scholar]

[CR24] 24.Monien B, Speckenmeyer E. Solving satisfiability in less than $2^{n}$ steps. Discrete Appl. Math. 1985;10(3):287–295. doi: 10.1016/0166-218X(85)90050-2. [DOI] [Google Scholar]

[CR25] 25.Nordström J. A simplified way of proving trade-off results for resolution. Inf. Process. Lett. 2009;109(18):1030–1035. doi: 10.1016/j.ipl.2009.06.006. [DOI] [Google Scholar]

[CR26] 26.Tseitin GS. On the complexity of derivation in propositional calculus. Stud. Math. Math. Log. 1968;2:115–125. [Google Scholar]

[CR27] 27.Urquhart A. The complexity of propositional proofs. Bull. Symb. Log. 1995;1(4):425–467. doi: 10.2178/bsl/1203350879. [DOI] [Google Scholar]

[CR28] 28.Van Gelder A. Producing and verifying extremely large propositional refutations. Ann. Math. Artif. Intell. 2012;65(4):329–372. doi: 10.1007/s10472-012-9322-x. [DOI] [Google Scholar]

[CR29] 29.Weaver S, Franco JV, Schlipf JS. Extending existential quantification in conjunctions of BDDs. JSAT. 2006;1(2):89–110. [Google Scholar]

[CR30] 30.Wetzler Nathan, Heule Marijn J. H., Hunt Warren A. Lecture Notes in Computer Science. Cham: Springer International Publishing; 2014. DRAT-trim: Efficient Checking and Trimming Using Expressive Clausal Proofs; pp. 422–429. [Google Scholar]

PERMALINK

Strong Extension-Free Proof Systems

Marijn J H Heule

Benjamin Kiesl

Armin Biere

Abstract

Introduction

Preliminaries

Clause Redundancy and Clausal Proofs

Definition 1

Clause Redundancy via Implication

Example 1

Theorem 1

Proof

Fig. 1.

Definition 2

Example 2

Definition 3

Theorem 2

Proof

Definition 4

Example 3

Definition 5

Lemma 3

Proof

Theorem 4

Proof

Definition 6

Example 4

Definition 7

Short Proofs of the Pigeon Hole Principle

Evaluation

Fig. 2.

Fig. 3.

Table 1.

Properties of Propagation Redundancy

Example 5

Proposition 5

Proof

Lemma 6

Proof

Lemma 7

Proof

Theorem 8

Lemma 9

Proof

Lemma 10

Proof

Theorem 11

Definition 8

Example 6

Lemma 12

Proof

Theorem 13

Proof

Lemma 14

Proof

Lemma 15

Proof

Theorem 16

Example 7

Proposition 17

Proof

Related Work

Open Problems and Future Work

Conclusion

Acknowledgements

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases