Uniquely Satisfiable d-Regular (k,s)-SAT Instances

Zufeng Fu; Daoyun Xu

doi:10.3390/e22050569

2020 May 19;22(5):569.

PMCID: PMC7517085 PMID: 33286341

Abstract

Unique k-SAT is the promise version of k-SAT in which the given formula has 0 or 1 solution; it has been proved to be as difficult as general k-SAT. For any $k \geq 3$ , $s \geq f (k, d) + 1$ and $(s + d) / 2 > k - 1$ , a parsimonious reduction from k-CNF to d-regular (k,s)-CNF is given. Here regular (k,s)-CNF is a subclass of CNF, where each clause of the formula has exactly k distinct variables, and each variable occurs in exactly s clauses. A d-regular (k,s)-CNF formula is a regular (k,s)-CNF formula in which the absolute value of the difference between positive and negative occurrences of every variable is at most a nonnegative integer d. We prove that for all $k \geq 3$ , $f (k, d) \leq u (k, d) + 1$ and $f (k, d + 1) \leq u (k, d)$ . The critical function $f (k, d)$ is the maximal value of s such that every d-regular (k,s)-CNF formula is satisfiable. In this study, $u (k, d)$ denotes the minimal value of s such that there exists a uniquely satisfiable d-regular (k,s)-CNF formula. We further show that for $s \geq f (k, d) + 1$ and $(s + d) / 2 > k - 1$ , there exists a uniquely satisfiable d-regular $(k, s + 1)$ -CNF formula. Moreover, for $k \geq 7$ , we have that $u (k, d) \leq f (k, d) + 1$ .

Keywords: d-regular (k,s)-CNF; SAT-problem; uniquely satisfiable

1. Introduction

The Satisfiability Problem (SAT), a central problem in theoretical computer science, asks whether a given formula in conjunctive normal form (CNF) is satisfiable. The k-SAT problem is the satisfiability problem in which every clause has exactly k distinct variables; it was proved to be NP-complete for $k \geq 3$ in [1]. That is, SAT is expected to be computationally hard. However, modern SAT solvers such as MiniSat [2], Glucose [3] and Maple [4] are able to solve some formulas with millions of variables efficiently. The conflict-driven clause learning (CDCL) technique is an important ingredient in the efficiency of these SAT solvers. Yet why these solvers are so successful has remained elusive. In order to analyze and improve SAT solvers, some random SAT models were proposed.

A natural measure of the solution space is the number of solutions. Unique k-SAT denotes the promise search problem of k-SAT where the number of solutions is either 0 or 1. One might expect harder instances to have fewer solutions. However, Calabro and Paturi in [5] proved that the exponential complexity of deciding whether a k-CNF formula has a solution is the same as that of deciding whether it has exactly one solution, both when it is promised and when it is not promised that the input formula has a solution. Thus, the study of uniquely satisfiable SAT instances is significant.

The ( $k, s$ )-SAT problem denotes the family of satisfiability problems restricted to CNF formulas with exactly k distinct variables per clause and at most s occurrences of each variable. Regular ( $k, s$ )-SAT is the special case of ( $k, s$ )-SAT in which each variable occurs in exactly s clauses. Polynomial-time reductions have shown that some SAT problems with regular structures are NP-complete, such as the (3,4)-SAT problem in [6] and the regular (3,4)-SAT problem in [7]. Experimental results and theoretical analysis of the random k-SAT problem in [8,9,10,11] showed that the constrained density $α$ of a CNF formula is an important parameter affecting the satisfiability of the formula and the difficulty of solving it. There is a phase transition point $α (k)$ for the random k-SAT problem such that

(i)
all random k-CNF instances with $α < α (k)$ are satisfiable with high probability;
(ii)
all random k-CNF instances with $α > α (k)$ are unsatisfiable with high probability.

However, every regular ( $k, s$ )-CNF formula has a fixed constrained density $α$ (the clause-to-variable ratio); for example, a regular (3,4)-CNF formula has density 4/3. The constrained density of the regular (3,4)-CNF formula is much smaller than the SAT-UNSAT phase transition point of the random 3-SAT problem, $α (3) \approx 4.267$ in [12]. Judged by density alone, a random regular (3,4)-CNF formula should be satisfiable with high probability, yet the regular (3,4)-SAT problem is NP-complete. Obviously, the constrained density $α$ alone is not enough to describe the structural features of a CNF formula.
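The fixed density can be made explicit by counting literal occurrences in two ways. For a regular ( $k, s$ )-CNF formula with N variables and M clauses (the symbols N and M are introduced here only for this calculation), each variable contributes s occurrences and each clause contributes k, so

$N s = M k \Longrightarrow α = M / N = s / k .$

For the regular (3,4)-CNF formula this gives $α = 4 / 3 \approx 1.33$ , well below $α (3) \approx 4.267$ .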

In [13,14], M. Wahlström presented a definition of ( $a, b$ )-variable to classify all variables in a CNF formula, and designed two algorithms for solving a CNF formula with at most d occurrences per variable. Here, an ( $a, b$ )-variable is a variable which occurs positively in $a$ clauses and negatively in $b$ clauses. In [15], Johannsen, Razgon and Wahlström presented an algorithm for solving a CNF formula in which the number of occurrences of each literal is at most d. Their results demonstrated that CNF formulas with restrictions on the number of (positive or negative) occurrences of each variable have their own characteristics.

In order to further study SAT problems with regular structures, we introduced the d-regular ( $k, s$ )-CNF formula in [16,17]. The regular ( $k, s$ )-CNF formula requires that each clause contains exactly k variables and each variable occurs in exactly s clauses. The d-regular ( $k, s$ )-CNF formula additionally requires that the absolute value of the difference between positive and negative occurrences of each variable is no more than a nonnegative integer d. In this paper, we investigate the existence conditions of uniquely satisfiable d-regular ( $k, s$ )-SAT instances, and present a method to construct a uniquely satisfiable d-regular ( $k, s$ )-CNF formula. We also give a parsimonious reduction from k-CNF to d-regular ( $k, s$ )-CNF, and further explain why the constrained density is not enough to describe the structural features of a CNF formula.

2. Related Works

Unique SAT is the promise version of SAT, where a given CNF formula has 0 or 1 solution. Valiant and Vazirani in [18] gave a randomized polynomial time reduction from SAT to Unique SAT, and showed that deciding whether a CNF formula has zero or one solution is essentially as difficult as SAT in general. Calabro et al. in [19] proved that Unique k-SAT is no easier than k-SAT, not just for polynomial time algorithms but also for super-polynomial time algorithms. Calabro and Paturi in [5] pointed out that it does not matter whether there is a promise that a formula has a solution. Matthews in [20] studied the complexity of UNIQUE-( $k, s$ )-SAT and proved that $f (k) \leq u (k) \leq f (k) + 2$ for $k \geq 3$ , where $u (k)$ is the minimal value of s such that uniquely satisfiable ( $k, s$ )-CNF formulas exist and $f (k)$ represents the maximal value of s such that all ( $k, s$ )-CNF formulas are satisfiable. The exact values of $f (k)$ are only known for $k = 3$ and $k = 4$ : $f (3) = 3$ and $f (4) = 4$ were shown in [21]. For $k = 5, 6, \dots, 9$ , the following lower and upper bounds on $f (k)$ were shown in [22,23,24,25]:

$5 \leq f (5) \leq 7, \quad 7 \leq f (6) \leq 11, \quad 13 \leq f (7) \leq 17, \quad 24 \leq f (8) \leq 29, \quad 41 \leq f (9) \leq 51 .$

Encoding into a CNF formula is a common way to solve a practical problem. These CNF formulas often have special structures and properties. It is therefore important to design random SAT models that resemble real instances. Markström in [26] proposed a method for constructing SAT instances based on Eulerian graphs, and discussed how a solver can try to avoid at least some of the pitfalls presented by these instances. Giraldez-Cru and Levy in [27] proposed a new model for generating random SAT instances with community structure, and showed that modern solvers do actually exploit this community structure. In [28], they presented a random SAT instance generator based on the notion of locality, and showed that CDCL SAT solvers take advantage of both popularity and similarity. In [29,30], it was shown that SAT instances with fewer solutions tend to be harder for stochastic local search methods. In [31], Žnidarič gave an experimental evaluation of uniquely satisfiable 3-SAT instances obtained by simply filtering randomly generated formulas.

In this paper, we investigate uniquely satisfiable d-regular ( $k, s$ )-SAT instances, and show that $f (k, d) \leq u (k, d) + 1$ , $f (k, d + 1) \leq u (k, d)$ for $k \geq 3$ , and $u (k, d) \leq f (k, d) + 1$ for $k \geq 7$ . Here $u (k, d)$ denotes the minimal value of s such that uniquely satisfiable d-regular ( $k, s$ )-CNF formulas exist, and $f (k, d)$ denotes the maximal value of s such that all d-regular ( $k, s$ )-CNF formulas are satisfiable. We demonstrate that for $s \geq f (k, d) + 1$ and $(s + d) / 2 > k - 1$ , there is a uniquely satisfiable d-regular ( $k, s$ )-CNF formula. We also show that for $k \geq 7$ , if a d-regular ( $k, s$ )-CNF formula is unsatisfiable, then $(s + d) / 2 > k - 1$ . Finally, for $k \geq 3$ , $s \geq f (k, d) + 1$ and $(s + d) / 2 > k - 1$ , we give a parsimonious reduction from a k-CNF formula to a d-regular ( $k, s$ )-CNF formula. Constructing uniquely satisfiable d-regular ( $k, s + 1$ )-CNF formulas from an unsatisfiable d-regular ( $k, s$ )-CNF formula is a key component of our reduction.

3. Notations

A literal is a boolean variable x or a negated boolean variable $\neg x$ ; x is called a positive literal, and $\neg x$ is called a negative literal. A clause C is a disjunction of literals, $C = L_{1} \lor L_{2} \lor \dots \lor L_{k}$ , also written as the set $C = \{L_{1}, L_{2}, \dots, L_{k}\}$ . A formula F in conjunctive normal form is a conjunction of clauses, $F = C_{1} \land C_{2} \land \dots \land C_{m}$ , also written as $F = \{C_{1}, C_{2}, \dots, C_{m}\}$ . $var (F)$ denotes the set of boolean variables occurring in a formula F, and $\# var (F)$ refers to the number of variables occurring in F. $\# cl (F)$ denotes the number of clauses of F, and $pos (F, x)$ ( $neg (F, x)$ ) refers to the number of positive (negative) occurrences of a variable x in F. $pos (F)$ ( $neg (F)$ ) denotes the number of positive (negative) literals in F, and $pos (F, X)$ ( $neg (F, X)$ ) refers to the number of positive (negative) occurrences of all variables of the variable set X in F.
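As a concrete illustration of this notation, the following minimal Python sketch computes var(F), pos(F, x) and neg(F, x), and checks the d-regular ( k, s ) conditions described in the Introduction. The signed-integer encoding of literals (i for x_i, -i for its negation) and all function names are assumptions made here for illustration; they are not part of the paper.

```python
def variables(F):
    """var(F): the set of variables occurring in F."""
    return {abs(lit) for clause in F for lit in clause}

def pos(F, x):
    """pos(F, x): number of positive occurrences of variable x in F."""
    return sum(lit == x for clause in F for lit in clause)

def neg(F, x):
    """neg(F, x): number of negative occurrences of variable x in F."""
    return sum(lit == -x for clause in F for lit in clause)

def is_d_regular_ks(F, k, s, d):
    """Check: k distinct variables per clause, s occurrences per variable,
    and |pos - neg| <= d for every variable."""
    for clause in F:
        if len(clause) != k or len({abs(lit) for lit in clause}) != k:
            return False
    for x in variables(F):
        p, n = pos(F, x), neg(F, x)
        if p + n != s or abs(p - n) > d:
            return False
    return True

# Example: the eight clauses over x1, x2, x3 with every sign pattern.
F = [(a * 1, b * 2, c * 3) for a in (1, -1) for b in (1, -1) for c in (1, -1)]
print(len(variables(F)), len(F), pos(F, 1), neg(F, 1))  # 3 8 4 4
print(is_d_regular_ks(F, 3, 8, 0))                      # True
```

In this toy formula every variable occurs four times positively and four times negatively, so it is a 0-regular (3,8)-CNF formula.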

A truth assignment $τ$ is a function which assigns to each boolean variable v a unique value $τ (v) \in \{0, 1\}$ . A CNF formula F is satisfiable if a truth assignment $τ$ with $τ (F) = 1$ exists. Such a truth assignment is called a satisfying assignment. We divide the boolean variables of a formula into forced variables and unforced variables. If every satisfying assignment of a formula sets a variable to the same value, we call it a forced variable. Otherwise, the variable is regarded as an unforced variable.

If the formulas $Φ$ and $Ψ$ are either both satisfiable or both unsatisfiable, they are called SAT-equivalent. That is, $Φ$ is satisfiable if and only if $Ψ$ is satisfiable. A formula $F^{'}$ is called a disjoint copy of a CNF formula F if $F^{'}$ is a copy of F and their variable sets are disjoint. A uniquely satisfiable d-regular ( $k, s$ )-CNF formula is a d-regular ( $k, s$ )-CNF formula with only one solution. A CNF formula F is a minimal unsatisfiable formula (MU) if F is unsatisfiable and $F - \{C\}$ is satisfiable for any clause $C \in F$ . For a given unsatisfiable formula F, a minimal unsatisfiable formula can be obtained by removing some clauses from F.
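For small formulas, the notions of forced variable, unique satisfiability and minimal unsatisfiability can be checked by exhaustive enumeration. The sketch below is only a brute-force illustration (exponential in the number of variables); the signed-integer clause encoding and the function names are assumptions, not the paper's notation.

```python
from itertools import product

def satisfying_assignments(F):
    """Enumerate all satisfying assignments of F as dicts variable -> bool."""
    vs = sorted({abs(lit) for clause in F for lit in clause})
    sols = []
    for values in product([False, True], repeat=len(vs)):
        tau = dict(zip(vs, values))
        if all(any(tau[abs(lit)] == (lit > 0) for lit in clause) for clause in F):
            sols.append(tau)
    return sols

def forced_variables(F):
    """Variables that take the same value in every satisfying assignment."""
    sols = satisfying_assignments(F)
    if not sols:
        return {}
    first = sols[0]
    return {v: first[v] for v in first if all(s[v] == first[v] for s in sols)}

def is_minimal_unsatisfiable(F):
    """F is unsatisfiable and removing any single clause makes it satisfiable."""
    if satisfying_assignments(F):
        return False
    return all(satisfying_assignments(F[:i] + F[i + 1:]) for i in range(len(F)))

F = [(1, 2), (1, -2), (-1, 2)]
print(len(satisfying_assignments(F)))            # 1
print(forced_variables(F))                       # {1: True, 2: True}
print(is_minimal_unsatisfiable(F + [(-1, -2)]))  # True
```

The example F is uniquely satisfiable, and adding the one clause violated by its unique solution yields a minimal unsatisfiable formula.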

Definition 1.

For each $k \geq 3$ , $f (k)$ is defined as the maximal value of s such that all ( $k, s$ )-CNF formulas are satisfiable, $f (k, d)$ is defined as the maximal value of s such that all d-regular ( $k, s$ )-CNF formulas are satisfiable, $u (k)$ is defined as the minimal value of s such that uniquely satisfiable ( $k, s$ )-CNF formulas exist, and $u (k, d)$ is defined as the minimal value of s such that uniquely satisfiable d-regular ( $k, s$ )-CNF formulas exist.

Definition 2.

A k-CNF formula F is called a k-forced-once d-regular ( $k, s$ )-CNF formula if

(i)
there exist k variables $x_{1}, x_{2}, \dots, x_{k}$ that only occur once;

(ii)
except for the k variables, every variable occurs in exactly s clauses, and the absolute value of the difference between positive and negative occurrences of every variable is no more than the nonnegative integer d.

(iii)
F is satisfiable and for any truth assignment τ satisfying F, it holds that
$τ (x_{1}) = τ (x_{2}) = \dots = τ (x_{k}) = t r u e .$

We can represent a CNF formula as a matrix. Each variable $x_{i}$ corresponds to a row of the matrix and each clause $C_{j}$ corresponds to a column of the matrix. For each variable $x_{i}$ , if its positive (resp., negative) literal is in the clause $C_{j}$ , then $a_{i, j} = +$ (resp., $a_{i, j} = -$ ); otherwise, $a_{i, j} = 0$ .

Let F be a CNF formula with 15 variables $x_{1}, x_{2}, \dots, x_{15}$ and 25 clauses $C_{1}, C_{2}, \dots, C_{25}$ . The representation matrix of the formula F is shown below; only the signs of the nonzero entries of each row are listed, and the zero entries (and hence the column positions) are omitted.

$\begin{matrix} x_{1} \\ x_{2} \\ x_{3} \\ x_{4} \\ x_{5} \\ x_{6} \\ x_{7} \\ x_{8} \\ x_{9} \\ x_{10} \\ x_{11} \\ x_{12} \\ x_{13} \\ x_{14} \\ x_{15} \end{matrix} \left( \begin{matrix} + & - & + & + & - & - \\ + & - & - & - & + & + \\ + & - & - & - & + & + \\ + & - & + & - & + & - \\ + & - & - & - & + & + \\ + & - & + & + & - & - \\ + & - & - & - & + & + \\ + & - & - & - & + & + \\ + & - & + & - & + & - \\ + & - & - & - & + & + \\ - & - & - & + & + & + \\ + & + & + & - & - & - \\ + \\ + \\ + \end{matrix} \right) .$

Clearly, F is a 3-forced-once 0-regular (3,6)-CNF formula. Each of the three variables $x_{13}, x_{14}, x_{15}$ occurs in exactly one clause of F and is forced to be $true$ .

Definition 3.

In the context of SAT, a reduction M is said to be parsimonious if x and $M (x)$ have the same number of satisfying assignments for every formula x.

Lemma 1

([32]). If all ( $k, s$ )-CNF formulas are satisfiable, then all ( $k + r, s + r [s / k]$ )-CNF formulas are satisfiable for any nonnegative integer r ( $[x]$ denotes the integral part of x).

Lemma 2

([17]). If the representation matrix of a formula F is

$\begin{matrix} x_{1} \\ x_{2} \\ \vdots \\ \vdots \\ x_{n - 1} \\ x_{n} \end{matrix} \left( \begin{matrix} + & & & & - \\ - & + & & & \\ & - & & & \\ & & \ddots & & \\ & & & + & \\ & & & - & + \end{matrix} \right),$

then the formula is satisfiable and every satisfying assignment assigns the same value to all variables.
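Lemma 2 can be checked by enumeration for small n. The sketch below assumes that the columns of the matrix are the clauses ( x_j or not x_{j+1} ) for j = 1, ..., n, with x_{n+1} read as x_1, i.e., a cycle of implications; this reading of the matrix, the signed-integer encoding and the function names are assumptions made for illustration.

```python
from itertools import product

def cycle_formula(n):
    """Clauses (x_j or not x_{j+1}) for j = 1..n, with x_{n+1} read as x_1."""
    return [(j, -(j % n + 1)) for j in range(1, n + 1)]

def satisfying_assignments(F, n):
    """All satisfying assignments as tuples; values[i - 1] is the value of x_i."""
    sols = []
    for values in product([False, True], repeat=n):
        if all(any(values[abs(lit) - 1] == (lit > 0) for lit in clause) for clause in F):
            sols.append(values)
    return sols

sols = satisfying_assignments(cycle_formula(5), 5)
print(len(sols))                           # 2
print(all(len(set(s)) == 1 for s in sols)) # True: all variables share one value
```

Under this reading the formula has exactly two satisfying assignments, all false and all true, so fixing the value of any single variable fixes all the others.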

4. Uniquely Satisfiable d-Regular ( $k, s$ )-CNF Formula

The d-regular ( $k, s$ )-CNF formula has stronger regular constraints than the regular ( $k, s$ )-CNF formula. It limits the absolute value of the difference between positive and negative occurrences of each variable. The uniquely satisfiable d-regular ( $k, s$ )-CNF formula refers to a d-regular ( $k, s$ )-CNF formula with only one solution. We investigate the existence conditions of the uniquely satisfiable d-regular ( $k, s$ )-CNF formula.

Theorem 1.

For all $k \geq 3$ , $f (k, d) \leq u (k, d) + 1$ and $f (k, d + 1) \leq u (k, d)$ .

Proof.

Because $f (k, d)$ denotes the maximal value of s such that all d-regular ( $k, s$ )-CNF formulas are satisfiable, we usually construct an unsatisfiable d-regular ( $k, s$ )-CNF formula to find the upper bound of $f (k, d)$ .

Let $s = u (k, d)$ . Because $u (k, d)$ denotes the minimal value of s such that uniquely satisfiable d-regular ( $k, s$ )-CNF formulas exist, there must be a uniquely satisfiable d-regular ( $k, s$ )-CNF formula F. Obviously, by adding a clause to F which is violated by the unique satisfying assignment, the formula F can become an unsatisfiable formula. Suppose the formula F has n variables. We give two methods to construct unsatisfiable instances.

Method 1: We introduce $⌈ n / k ⌉ k - n$ new variables and add $⌈ n / k ⌉ (s + 2) - n s / k$ new clauses to F, at least one of which is violated by the unique satisfying assignment. Let each original variable occur twice in the new clauses (once positively and once negatively), and let each new variable occur $s + 2$ times in the new clauses (with the numbers of positive and negative occurrences of every new variable nearly equal). Then each variable occurs $s + 2$ times in F and the absolute value of the difference between positive and negative occurrences of each variable is no more than d. Therefore, F is turned into an unsatisfiable d-regular ( $k, s + 2$ )-CNF formula. It follows that $f (k, d) \leq s + 1 = u (k, d) + 1$ .

Method 2: We introduce $⌈ n / k ⌉ k - n$ new variables and add $⌈ n / k ⌉ (s + 1) - n s / k$ new clauses to F, at least one of which is violated by the unique satisfying assignment. Let each original variable occur once in the new clauses, and let each new variable occur $s + 1$ times in the new clauses (with the numbers of positive and negative occurrences of every new variable nearly equal). Then each variable occurs $s + 1$ times in F and the absolute value of the difference between positive and negative occurrences of each variable is no more than $d + 1$ . Therefore, F is turned into an unsatisfiable ( $d + 1$ )-regular ( $k, s + 1$ )-CNF formula. It follows that $f (k, d + 1) \leq s = u (k, d)$ . □
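The counts used in both methods can be sanity-checked numerically. The following sketch computes the number of new variables and new clauses and verifies that the literal slots of the new clauses are exactly filled by the required occurrences. The function name and the parameter `extra` (2 for Method 1, 1 for Method 2) are hypothetical names introduced here.

```python
def padding_counts(n, k, s, extra):
    """New variables and new clauses added to a regular (k,s)-CNF formula with
    n variables so that every variable ends up with s + extra occurrences."""
    if (n * s) % k != 0:
        raise ValueError("n*s must be divisible by k in a regular (k,s)-CNF formula")
    blocks = -(-n // k)                        # ceil(n / k)
    new_vars = blocks * k - n
    new_clauses = blocks * (s + extra) - n * s // k
    return new_vars, new_clauses

n, k, s = 8, 3, 6
for extra in (2, 1):                           # Method 1, then Method 2
    new_vars, new_clauses = padding_counts(n, k, s, extra)
    slots = k * new_clauses                    # literal positions in the new clauses
    needed = n * extra + new_vars * (s + extra)
    print(extra, new_vars, new_clauses, slots == needed)
# 2 1 8 True
# 1 1 5 True
```

The equality of `slots` and `needed` holds for all admissible n, k and s, which is why the stated clause counts are consistent with the required occurrence numbers.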

Lemma 3.

If $k \geq 3$ and s are two nonnegative integers such that an unsatisfiable d-regular ( $k, s$ )-CNF formula exists, there exists a k-forced-once d-regular ( $k, s$ )-CNF formula.

Proof.

Let $Φ$ be an unsatisfiable d-regular ( $k, s$ )-CNF formula. Obviously, the numbers of positive and of negative occurrences of every variable in $Φ$ are each no more than $(s + d) / 2$ . By removing some clauses of $Φ$ , a minimal unsatisfiable ( $k, s$ )-CNF formula $Φ_{1}$ can be obtained. Since removing clauses cannot increase the number of occurrences, the numbers of positive and of negative occurrences of every variable in $Φ_{1}$ are also each no more than $(s + d) / 2$ .
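The bound $(s + d) / 2$ follows in one line from the two defining constraints. Writing $p = pos (Φ, x)$ and $q = neg (Φ, x)$ for a variable x (shorthand introduced here only for this calculation),

$p + q = s, \quad | p - q | \leq d \quad \Longrightarrow \quad \max (p, q) = \frac{(p + q) + | p - q |}{2} \leq \frac{s + d}{2} .$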

Let $Φ = Φ_{1} \land Φ_{2}$ , where $Φ_{1}$ is the minimal unsatisfiable ( $k, s$ )-CNF formula obtained by removing some clauses of $Φ$ , and $Φ_{2}$ is the conjunction of the removed clauses. Suppose $Φ_{2}$ contains $m \geq 0$ clauses and $m k$ literals. Let $C_{1}$ be the clause set of $Φ_{1}$ and $C_{2}$ be the clause set of $Φ_{2}$ . Choose an arbitrary variable y of $var (Φ_{1})$ and a clause c of $Φ_{1}$ containing $\neg y$ . Define $C = (C_{1} \setminus \{c\}) \cup \{\tilde{c}\}$ , with $\tilde{c} = (c \setminus \{\neg y\}) \cup \{x\}$ , where x is a new extra variable that does not occur in $Φ$ . Define $Φ_{1}^{'} = \land_{c \in C} c$ . Clearly, $Φ_{1}^{'}$ is satisfiable and the variable x is forced to be $true$ : since $Φ_{1}$ is minimal unsatisfiable, $C_{1} \setminus \{c\}$ is satisfiable and every assignment satisfying it falsifies all literals of c, so $\tilde{c}$ can only be satisfied through x.
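The forcing effect of this literal replacement can be observed on a toy instance. The sketch below uses the four 2-literal clauses over two variables as a minimal unsatisfiable formula; this is a 2-CNF example chosen only for brevity (the lemma itself concerns k of at least 3), and the encoding and function names are assumptions.

```python
from itertools import product

def satisfying_assignments(F):
    """Enumerate all satisfying assignments of F as dicts variable -> bool."""
    vs = sorted({abs(lit) for clause in F for lit in clause})
    sols = []
    for values in product([False, True], repeat=len(vs)):
        tau = dict(zip(vs, values))
        if all(any(tau[abs(lit)] == (lit > 0) for lit in clause) for clause in F):
            sols.append(tau)
    return sols

def replace_negated_literal(phi1, c, y, x):
    """Return (C_1 minus {c}) plus c~, where c~ is c with the literal
    'not y' replaced by the fresh variable x."""
    assert c in phi1 and -y in c
    c_tilde = tuple(x if lit == -y else lit for lit in c)
    return [clause for clause in phi1 if clause != c] + [c_tilde]

phi1 = [(1, 2), (1, -2), (-1, 2), (-1, -2)]   # minimal unsatisfiable
phi1_prime = replace_negated_literal(phi1, c=(-1, -2), y=1, x=3)
sols = satisfying_assignments(phi1_prime)
print(sols)   # [{1: True, 2: True, 3: True}]
```

Every satisfying assignment of the modified formula sets the fresh variable 3 to true, as the argument in the text predicts.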

Let $Φ_{1 i}$ be disjoint copies of the formula $Φ_{1}^{'}$ with the variables $x, y$ of $Φ_{1}^{'}$ renamed as $x_{i}, y_{i}$ in $Φ_{1 i}$ , and let $Φ_{2 i}$ be disjoint copies of the formula $Φ_{2}$ , for $1 \leq i \leq k$ . In addition, we ensure that every variable occurring in both $Φ_{1}^{'}$ and $Φ_{2}$ is renamed to the same new variable in $Φ_{1 i}$ and $Φ_{2 i}$ , for $1 \leq i \leq k$ .

Introduce a new boolean variable set $Z = \{z_{1}, z_{2}, \dots, z_{t k}\}$ whose variables do not occur in $Φ$ , where $t > 2 m k / s$ . The k-CNF formula $Φ_{3}$ is constructed using $\neg y_{i}$ , the literals of $Φ_{2 i}$ and the variables of Z, for $1 \leq i \leq k$ , and must satisfy the following conditions.

(i)
Every variable of Z occurs positively in $⌈ s / 2 ⌉$ clauses and negatively in $⌊ s / 2 ⌋$ clauses;

(ii)
All literals of $Φ_{2 i}$ and $\neg y_{i}$ occur exactly once in $Φ_{3}$ , $1 \leq i \leq k$ ;

(iii)
Every clause of $Φ_{3}$ contains at least one positive occurrence of a variable of Z.

Define $Φ^{'} = Φ_{3} \land Φ_{11} \land Φ_{12} \land \dots \land Φ_{1 k}$ .

Obviously, conditions (i) and (ii) of Definition 2 hold in $Φ^{'}$ (note $s > 3$ from the unsatisfiability of $Φ$ ). $Φ_{1 i}$ is satisfiable and forces the variable $x_{i}$ to be $true$ . Because no variable of Z occurs in $Φ_{1}^{'}$ , $Φ_{3}$ can be satisfied (by setting every variable of Z to $true$ ) without affecting $Φ_{11}, Φ_{12}, \dots, Φ_{1 k}$ . So it can be concluded that $Φ^{'}$ is satisfiable and forces $x_{1}, x_{2}, \dots, x_{k}$ to be $true$ . $Φ_{1}^{'}$ , $Φ_{2}$ and $\neg y$ together contain only x and all literals of $Φ$ . Except for x, every variable of $Φ_{1}^{'}$ , $Φ_{2}$ and $\neg y$ occurs in s clauses and meets the d-regularity ( $Φ$ is a d-regular ( $k, s$ )-CNF formula). Hence, $Φ_{1 i}$ , $Φ_{2 i}$ and $\neg y_{i}$ meet these requirements (by the definition of disjoint copy). Every variable of Z occurs positively in $⌈ s / 2 ⌉$ clauses and negatively in $⌊ s / 2 ⌋$ clauses. Thus, $x_{1}, x_{2}, \dots, x_{k}$ occur only once in $Φ^{'}$ . Except for these k variables, every variable occurs in exactly s clauses, and the absolute value of the difference between positive and negative occurrences of every variable is no more than d. Therefore, we claim that $Φ^{'}$ is a k-forced-once d-regular ( $k, s$ )-CNF formula.

Next, we assess the feasibility of the construction of $Φ^{'}$ . If an unsatisfiable d-regular ( $k, s$ )-CNF formula $Φ$ exists, $Φ_{1}^{'}$ can easily be constructed. The number of literals of $Φ_{3}$ is $m k^{2} + t k s + k$ , and the number of positive occurrences of the variables of Z in $Φ_{3}$ is $k t ⌈ s / 2 ⌉$ . The number of clauses of $Φ_{3}$ is $m k + t s + 1$ . For $t > 2 m k / s$ , we obtain $m k < t s / 2$ and hence $m k + t s < 3 t s / 2$ . For $k \geq 3$ , we obtain $m k + t s + 1 \leq k t s / 2 \leq k t ⌈ s / 2 ⌉$ . As a result, the number of positive occurrences of Z in $Φ_{3}$ is no less than the number of clauses of $Φ_{3}$ . The construction of $Φ_{3}$ is almost random (first let each clause get a positive literal of Z, then randomly arrange the other literals). Therefore, $Φ_{3}$ can be constructed in polynomial time. □
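The counting in the feasibility argument can be checked numerically over a range of parameters. The sketch below evaluates the stated literal, clause and positive-occurrence counts for the smallest integer t with t > 2mk/s; the function names are hypothetical and the parameter ranges are arbitrary choices.

```python
def phi3_counts(k, s, m, t):
    """Literal, clause and positive-Z-occurrence counts for Phi_3,
    using the expressions given in the feasibility argument."""
    literals = m * k * k + t * k * s + k
    clauses = m * k + t * s + 1
    z_positive = k * t * ((s + 1) // 2)        # k * t * ceil(s / 2)
    return literals, clauses, z_positive

def smallest_t(k, s, m):
    """Smallest integer t with t > 2*m*k/s."""
    return (2 * m * k) // s + 1

k, s, m = 3, 4, 2
t = smallest_t(k, s, m)
print(t, phi3_counts(k, s, m, t))   # 4 (69, 23, 24)
```

In this instance 69 literals fill 23 clauses of size 3, and the 24 positive occurrences of Z suffice to give every clause one positive literal of Z.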

Lemma 4.

For $k \geq 3$ , $(s + d) / 2 > k - 1$ and $m \geq 1$ , we can transform a k-forced-once d-regular ( $k, s$ )-CNF formula with n unforced variables into an $(m + 1) k$ -forced-once d-regular ( $k, s$ )-CNF formula with n unforced variables.

Proof.

Let $Φ$ be a k-forced-once d-regular ( $k, s$ )-CNF formula with n unforced variables, and $x_{1}, x_{2}, \dots, x_{k}$ denote k forced variables that only occur once. That is, $x_{1}, x_{2}, \dots, x_{k}$ are forced to be $t r u e$ . Let

$\begin{matrix} H_{0} & = \land_{i = 1}^{k} (\neg x_{1} \lor \neg x_{2} \lor \dots \lor \neg x_{k - 1} \lor y_{1, i}), \\ H_{j} & = \land_{i = 1}^{k} (\neg y_{j, 1} \lor \neg y_{j, 2} \lor \dots \lor \neg y_{j, k - 1} \lor y_{j + 1, i}), j = 1, 2, \dots, m k - 1, \end{matrix}$

where every $y_{j, i}$ is a fresh variable.
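The propagation through the chain $H_{0}, H_{1}, \dots, H_{m k - 1}$ can be observed by brute force for small parameters. In the sketch below, unit clauses stand in for the formula $Φ$ that forces $x_{1}, \dots, x_{k}$ to be true; this stand-in, the integer encoding of the variables and the function names are assumptions made for illustration.

```python
from itertools import product
from collections import Counter

def chain_clauses(k, m):
    """H_0, ..., H_{mk-1}; x_i is encoded as i and y_{j,i} as k + (j-1)*k + i."""
    def y(j, i):
        return k + (j - 1) * k + i
    # H_0: (not x_1 or ... or not x_{k-1} or y_{1,i}) for i = 1..k
    clauses = [tuple(-a for a in range(1, k)) + (y(1, i),) for i in range(1, k + 1)]
    # H_j: (not y_{j,1} or ... or not y_{j,k-1} or y_{j+1,i}) for i = 1..k
    for j in range(1, m * k):
        negs = tuple(-y(j, a) for a in range(1, k))
        clauses += [negs + (y(j + 1, i),) for i in range(1, k + 1)]
    return clauses

def satisfying_assignments(F, n):
    """All satisfying assignments as tuples; values[i - 1] is the value of variable i."""
    return [values for values in product([False, True], repeat=n)
            if all(any(values[abs(lit) - 1] == (lit > 0) for lit in c) for c in F)]

k, m = 3, 1
H = chain_clauses(k, m)
n = k + m * k * k
units = [(i,) for i in range(1, k + 1)]   # stand-in for Phi forcing x_1..x_k
sols = satisfying_assignments(H + units, n)
once = [v for v, cnt in Counter(abs(l) for c in H for l in c).items() if cnt == 1]
print(len(sols), all(sols[0]), len(once))  # 1 True 5
```

With k = 3 and m = 1, five of the fresh variables occur exactly once in the chain; together with $x_{k}$ , which occurs once in $Φ$ and not at all in the chain, this gives the $(m + 1) k = 6$ variables that occur only once.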

We construct a k-CNF formula $Ψ$ over the variable set $X = \{x_{i}\}$ and the variable set $Y = \{y_{j, i}\}$ , for $i = 1, 2, \dots, k - 1$ , $j = 1, 2, \dots, m k - 1$ , which meets the following restrictions.

(i)
every variable of X and Y occurs in exactly $s - k - 1$ clauses of $Ψ$ ,
$\text{if } s \leq 2 k : \quad pos (Ψ, x_{i}) = s - k - 1, \quad pos (Ψ, y_{j, i}) = s - k - 1;$

$\text{if } s > 2 k : \quad pos (Ψ, x_{i}) = ⌈ s / 2 ⌉ - 1, \quad pos (Ψ, y_{j, i}) = ⌈ s / 2 ⌉ - 1 .$

(ii)
Every clause of $Ψ$ contains at least one positive occurrence of one of these variables.

Define $Φ^{'} = Φ \land H_{0} \land H_{1} \land \dots \land H_{m k - 1} \land Ψ$ .

Obviously, $x_{i}$ and $y_{j, i}$ are forced to be $true$ for $i = 1, 2, \dots, k, j = 1, 2, \dots, m k$ (this ensures that $Ψ$ is satisfiable). Among these forced variables, $x_{k}, y_{1, k}, y_{2, k}, \dots, y_{m k - 1, k}$ and $y_{m k, 1}, y_{m k, 2}, \dots, y_{m k, k}$ each occur in exactly one clause of $Φ^{'}$ . Except for these $(m + 1) k$ variables, every variable occurs in exactly s clauses, and the absolute value of the difference between positive and negative occurrences of every variable is at most d. The number of unforced variables in $Φ^{'}$ is still n. So $Φ^{'}$ is an $(m + 1) k$ -forced-once d-regular ( $k, s$ )-CNF formula with n unforced variables.

Next, we will prove that the construction of $Ψ$ is feasible. We focus on the satisfiability of condition (ii).

The variables of $Ψ$ consist of two parts: X and Y. The variable set Y has $(m k - 1) (k - 1)$ variables. The variable set X has $k - 1$ variables. Every variable of X and Y occurs in exactly $s - k - 1$ clauses of $Ψ$ . Obviously, the number of literals of $Ψ$ is $(m k - 1) (k - 1) (s - k - 1) + (k - 1) (s - k - 1)$ . The number of clauses of $Ψ$ is

$\begin{matrix} \# cl (Ψ) & = \frac{(m k - 1) (k - 1) (s - k - 1) + (k - 1) (s - k - 1)}{k} \\ & = m (k - 1) (s - k - 1) . \end{matrix}$

When $s \leq 2 k$ , all literals in $Ψ$ are positive literals, so condition (ii) is satisfied. When $s > 2 k$ , the number of positive occurrences of the variables in $Ψ$ is

$\begin{matrix} pos (Ψ) & = (m k - 1) (k - 1) (⌈s / 2⌉ - 1) + (k - 1) (⌈s / 2⌉ - 1) \\ & = m k (k - 1) (⌈s / 2⌉ - 1) . \end{matrix}$

For $k \geq 3$ ,

$\begin{matrix} m k (k - 1) (⌈s / 2⌉ - 1) & = m (k - 1) (k ⌈s / 2⌉ - k) \geq m (k - 1) (3 ⌈s / 2⌉ - k) \\ & \geq m (k - 1) (3 s / 2 - k) > m (k - 1) (s - k) \\ & > m (k - 1) (s - k - 1) . \end{matrix}$

So $pos (Ψ) > \# cl (Ψ)$ . That indicates that the number of positive literals is more than that of clauses. That is, we can arrange a positive literal for every clause of $Ψ$ , then randomly arrange the other literals. Hence, $Ψ$ can be constructed. □
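The inequality between positive literals and clauses of $Ψ$ in the case $s > 2 k$ can also be confirmed numerically. The sketch below evaluates both counts from the expressions in the proof; the function name and the tested parameter ranges are illustrative assumptions.

```python
def psi_counts(k, s, m):
    """Clause count and positive-literal count of Psi for the case s > 2k."""
    num_vars = (m * k - 1) * (k - 1) + (k - 1)    # |Y| + |X| = m*k*(k-1)
    clauses = num_vars * (s - k - 1) // k         # = m*(k-1)*(s-k-1)
    positives = num_vars * ((s + 1) // 2 - 1)     # each variable: ceil(s/2) - 1
    return clauses, positives

print(psi_counts(3, 8, 1))   # (8, 18)
```

For k = 3, s = 8 and m = 1 there are 8 clauses and 18 positive literals, so every clause can receive a positive literal with room to spare.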

Theorem 2.

For $k \geq 3$ and $s \geq f (k, d) + 1$ and $(s + d) / 2 > k - 1$ , there exists a uniquely satisfiable d-regular ( $k, s$ )-CNF formula.

Proof.

We will show a way to construct a uniquely satisfiable d-regular ( $k, s$ )-CNF formula.

By Lemma 3 and Lemma 4, for $k \geq 3$ , $s \geq f (k, d) + 1$ , $(s + d) / 2 > k - 1$ and $m \geq 1$ , we can construct an $(m + 1) k$ -forced-once d-regular ( $k, s$ )-CNF formula $Ψ$ . Let the $(m + 1) k$ forced variables which occur only once be $X = \{x_{1}, x_{2}, \dots, x_{(m + 1) k}\}$ . Without loss of generality, we assume that forcing the n unforced variables to be $true$ turns $Ψ$ into a uniquely satisfiable formula. Let $Y = \{y_{1}, y_{2}, \dots, y_{n}\}$ denote the n unforced variables. Let $t = ⌈n / (k - 1)⌉$ . The construction of a uniquely satisfiable d-regular ( $k, s$ )-CNF formula proceeds in four steps, which are described as follows.

Step 1 Divide the variables $y_{1}, y_{2}, \dots, y_{n}$ arbitrarily into t variable sets $Y_{1}, Y_{2}, \dots, Y_{t}$ of size at most $k - 1$ . Where necessary, some variables of $Ψ$ forced to be $true$ are added, so that every variable set contains exactly $k - 1$ variables (a variable forced to be $false$ can be transformed into a variable forced to be $true$ by flipping all occurrences of the variable). The variables $x_{1}, x_{2}, \dots, x_{(m + 1) k}$ are arbitrarily divided into $4 t + 1$ variable sets $X_{1}, X_{2}, \dots, X_{4 t + 1}$ such that each of $X_{1}, X_{2}, \dots, X_{3 t}$ has $k - 2$ variables, each of $X_{3 t + 1}, \dots, X_{4 t}$ has $k - 1$ variables and $X_{4 t + 1}$ contains the rest. When m is appropriately chosen, the partition is feasible. Now assume $X_{4 t + 1}$ contains r variables.

Step 2 For each $1 \leq i \leq t$ , we will construct a formula $H_{i}$ using the variable sets $Y_{i}, X_{i}, X_{t + i}, X_{2 t + i}$ and $X_{3 t + i}$ .

For simplicity, let $Y_{i} = \{y_{1}, \dots, y_{k - 1}\}$ , $X_{i} = \{x_{1}, \dots, x_{k - 2}\}$ , $X_{t + i} = \{x_{t + 1}, \dots, x_{t + k - 2}\}$ , $X_{2 t + i} = \{x_{2 t + 1}, \dots, x_{2 t + k - 2}\}$ and $X_{3 t + i} = \{x_{3 t + 1}, \dots, x_{3 t + k - 1}\}$ . For each $1 \leq i \leq t$ , we introduce a new boolean variable set $Z_{i} = \{z_{1, 0}, z_{1, 1}, z_{2, 0}, z_{2, 1}, \dots, z_{k - 1, 0}, z_{k - 1, 1}\}$ whose variables do not occur in $Ψ$ and perform the following steps to construct $H_{i}$ .

(i)
Let $z_{j, 0}$ replace any one of the positive occurrences of $y_{j}$ , and let $\neg z_{j, 1}$ replace any one of the negative occurrences of $y_{j}$ in $Ψ$ , for $j = 1, 2, \dots, k - 1$ . If $y_{j}$ does not occur as a positive literal, then we let $z_{j, 0}$ replace one of the other negative occurrences of $y_{j}$ in $Ψ$ and flip all occurrences of $z_{j, 0}$ in the following formulas $H_{i 1}, H_{i 2}, H_{i 3}, H_{i 4}$ . If $y_{j}$ does not occur as a negative literal, then we perform similar operations.

(ii)
Let
$\begin{matrix} H_{i 1} & = \land_{j = 1}^{k - 1} (y_{j} \lor \neg z_{j, 0} \lor \neg x_{1} \lor \dots \lor \neg x_{k - 2}), \\ H_{i 2} & = \land_{j = 1}^{k - 1} (z_{j, 0} \lor \neg z_{j, 1} \lor \neg x_{t + 1} \lor \dots \lor \neg x_{t + k - 2}), \\ H_{i 3} & = \land_{j = 1}^{k - 1} (z_{j, 1} \lor \neg y_{j} \lor \neg x_{2 t + 1} \lor \dots \lor \neg x_{2 t + k - 2}), \\ H_{i 4} & = \land_{j = 1}^{k - 1} (z_{j, 1} \lor \neg x_{3 t + 1} \lor \dots \lor \neg x_{3 t + k - 1}) . \end{matrix}$

Define $H_{i} = H_{i 1} \land H_{i 2} \land H_{i 3} \land H_{i 4}$ . The new formula with all substitutions performed on $Ψ$ is denoted as $Ψ_{1}$ .
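The forcing behaviour of one gadget $H_{i}$ can be checked by enumeration for k = 3. In the sketch below, unit clauses stand in for the fact that the variables of $X_{i}, X_{t + i}, X_{2 t + i}, X_{3 t + i}$ are forced to be true by the rest of the construction; this stand-in, the integer ids and the function names are assumptions introduced for illustration.

```python
from itertools import product

def gadget(k):
    """Clauses of H_i1, H_i2, H_i3, H_i4 for one block, plus the forced x
    variables and the total variable count (hypothetical integer ids)."""
    ids = {}
    def var(name):
        return ids.setdefault(name, len(ids) + 1)
    xa = [var(("X_i", a)) for a in range(1, k - 1)]       # k - 2 variables
    xb = [var(("X_t+i", a)) for a in range(1, k - 1)]     # k - 2 variables
    xc = [var(("X_2t+i", a)) for a in range(1, k - 1)]    # k - 2 variables
    xd = [var(("X_3t+i", a)) for a in range(1, k)]        # k - 1 variables
    clauses = []
    for j in range(1, k):
        y, z0, z1 = var(("y", j)), var(("z0", j)), var(("z1", j))
        clauses.append((y, -z0) + tuple(-x for x in xa))   # H_i1
        clauses.append((z0, -z1) + tuple(-x for x in xb))  # H_i2
        clauses.append((z1, -y) + tuple(-x for x in xc))   # H_i3
        clauses.append((z1,) + tuple(-x for x in xd))      # H_i4
    return clauses, xa + xb + xc + xd, len(ids)

def satisfying_assignments(F, n):
    """All satisfying assignments as tuples; values[i - 1] is the value of variable i."""
    return [values for values in product([False, True], repeat=n)
            if all(any(values[abs(lit) - 1] == (lit > 0) for lit in c) for c in F)]

clauses, forced_x, n = gadget(3)
sols = satisfying_assignments(clauses + [(x,) for x in forced_x], n)
print(len(sols), all(sols[0]))   # 1 True
```

Once the x variables are true, $H_{i 4}$ reduces to the unit clause $z_{j, 1}$ and the remaining three clauses form the cycle of Lemma 2, so $y_{j}$ , $z_{j, 0}$ and $z_{j, 1}$ are all forced to be true.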

Step 3 We now make up the shortfall in the number of occurrences of every variable. Using the variables in the sets X and $Z = \{Z_{i}, i = 1, 2, \dots, t\}$ , we construct a formula $Ψ_{2}$ that satisfies the following conditions.

(i)
For $i = 1, \dots, t$ , each $z_{j, 0}, j = 1, 2, \dots, k - 1$ in the variable set $Z_{i}$ occurs in exactly $s - 3$ clauses of $Ψ_{2}$ and $pos (Ψ_{2}, z_{j, 0}) + 1 - neg (Ψ_{2}, z_{j, 0}) = \min (d, 1)$ .

(ii)
For $i = 1, \dots, t$ , each $z_{j, 1}, j = 1, 2, \dots, k - 1$ in the variable set $Z_{i}$ occurs in exactly $s - 4$ clauses of $Ψ_{2}$ and $pos (Ψ_{2}, z_{j, 1}) - neg (Ψ_{2}, z_{j, 1}) = \min (d, 1)$ .

(iii)
Each variable x in $X_{1}, X_{2}, \dots, X_{4 t}$ occurs in exactly $s - k$ clauses of $Ψ_{2}$ ,
$pos (Ψ_{2}, x) = s - k \text{ for } s < 2 k \text{ or } pos (Ψ_{2}, x) = ⌈s / 2⌉ - 1 \text{ for } s \geq 2 k .$

(iv)
Each variable x in $X_{4 t + 1}$ occurs in exactly $s - 1$ clauses of $Ψ_{2}$ and
$pos (Ψ_{2}, x) + 1 - neg (Ψ_{2}, x) = \min (d, 1) .$

(v)
Every clause of $Ψ_{2}$ contains at least one positive occurrence of one of these variables.
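The occurrence budgets in conditions (i) to (iv) can be recovered by tallying where each variable already occurs outside $Ψ_{2}$ . The tally below is a reading aid derived from Steps 1 and 2, not an additional claim of the paper:

$\begin{matrix} z_{j, 0} : & 1 \ (Ψ_{1}) + 1 \ (H_{i 1}) + 1 \ (H_{i 2}) = 3 , & \text{leaving } s - 3 ; \\ z_{j, 1} : & 1 \ (Ψ_{1}) + 1 \ (H_{i 2}) + 1 \ (H_{i 3}) + 1 \ (H_{i 4}) = 4 , & \text{leaving } s - 4 ; \\ x \in X_{1}, \dots, X_{4 t} : & 1 \ (Ψ_{1}) + (k - 1) \ (H_{i}) = k , & \text{leaving } s - k ; \\ x \in X_{4 t + 1} : & 1 \ (Ψ_{1}) , & \text{leaving } s - 1 . \end{matrix}$

Assuming no flipping was needed in Step 2 (i), $z_{j, 0}$ has two positive occurrences and one negative occurrence outside $Ψ_{2}$ , while $z_{j, 1}$ has two of each, which accounts for the offset $+ 1$ in condition (i) and its absence in condition (ii).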

Step 4 Let $Φ = Ψ_{1} \land (\land_{i = 1}^{t} H_{i}) \land Ψ_{2}$ .

Clearly, $Φ$ is a d-regular ( $k, s$ )-CNF formula. All variables in the set X are forced to be $true$ . Hence, each $z_{j, 1}, j = 1, 2, \dots, k - 1$ , is forced to be $true$ by $H_{i 4}$ . By Lemma 2, $z_{j, 0}, z_{j, 1}$ and $y_{j}$ are forced to take the same value. Consequently, every variable in Y and Z is forced to be $true$ , too. Because all variables in X and Z are forced to be $true$ , $Ψ_{2}$ is satisfied. Thus, it can be concluded that $Φ$ has only forced variables and a unique solution. That is, $Φ$ is a uniquely satisfiable d-regular ( $k, s$ )-CNF formula.

Next, we will discuss the feasibility of constructing $Φ$ . We focus on the formula $Ψ_{2}$ . For $Ψ_{2}$ , the number of positive literals should be more than that of clauses.

The variable set Z generates $t (k - 1) (s - 3) + t (k - 1) (s - 4)$ literals in $Ψ_{2}$ . The variable set X generates $3 t (k - 2) (s - k) + t (k - 1) (s - k) + r (s - 1)$ literals in $Ψ_{2}$ . The number of clauses of $Ψ_{2}$ is

$\# cl (Ψ_{2}) = \frac{t (k - 1) (2 s - 7) + 3 t (k - 2) (s - k) + t (k - 1) (s - k) + r (s - 1)}{k} .$

Every variable of Z generates $⌈s / 2⌉ - 2$ positive literals in $Ψ_{2}$ , and every variable of $X_{4 t + 1}$ generates $⌈s / 2⌉ - 1$ positive literals in $Ψ_{2}$ . For the number of positive literals of $Ψ_{2}$ , there are two cases. When $s < 2 k$ , the number of positive literals of $Ψ_{2}$ is

$pos (Ψ_{2}) = t (k - 1) (2 ⌈s / 2⌉ - 4) + 3 t (k - 2) (s - k) + t (k - 1) (s - k) + r (⌈s / 2⌉ - 1) .$

When $s \geq 2 k$ , the number of positive literals of $Ψ_{2}$ is

$p o s (Ψ_{2}) = t (k - 1) (2 ⌈s / 2⌉ - 4) + 3 t (k - 2) (⌈s / 2⌉ - 1) + t (k - 1) (⌈s / 2⌉ - 1) + r (⌈s / 2⌉ - 1) .$

Since $k \geq 3$ and $s > k$ , we get $pos (Ψ_{2}) > \# cl (Ψ_{2})$ . To construct $Ψ_{2}$ , we first arrange a positive literal for every clause, then randomly arrange the other literals. That is, $Ψ_{2}$ can be constructed in polynomial time.

$Ψ_{1}$ and $H_{i}$ can obviously be constructed in polynomial time. Therefore, we can construct a uniquely satisfiable d-regular ( $k, s$ )-CNF formula $Φ$ in polynomial time. □
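The counting argument for $Ψ_{2}$ can be spot-checked numerically. The following is a minimal sketch, not part of the original proof: it evaluates the two formulas for $pos (Ψ_{2})$ and the formula for $\# cl (Ψ_{2})$ exactly as stated above, treating t and r simply as integer parameters with $t \geq 1$ and $r \geq 0$ (an assumption made here; their precise meaning is fixed earlier in the construction). The function names are hypothetical.

```python
from fractions import Fraction


def half_up(s):
    """Return the ceiling of s / 2 for an integer s."""
    return -(-s // 2)


def clauses_psi2(k, s, t, r):
    """Number of clauses of Psi_2: total literal count divided by k."""
    total = (t * (k - 1) * (2 * s - 7)
             + 3 * t * (k - 2) * (s - k)
             + t * (k - 1) * (s - k)
             + r * (s - 1))
    return Fraction(total, k)


def positives_psi2(k, s, t, r):
    """Number of positive literals of Psi_2, using the case split s < 2k / s >= 2k."""
    h = half_up(s)
    z_part = t * (k - 1) * (2 * h - 4)
    r_part = r * (h - 1)
    if s < 2 * k:
        x_part = 3 * t * (k - 2) * (s - k) + t * (k - 1) * (s - k - 1)
    else:
        x_part = 3 * t * (k - 2) * (h - 1) + t * (k - 1) * (h - 1)
    return z_part + x_part + r_part


# Check pos(Psi_2) > #cl(Psi_2) on a small grid of parameters (assumed ranges).
checked = all(
    positives_psi2(k, s, t, r) > clauses_psi2(k, s, t, r)
    for k in range(3, 9)
    for s in range(k + 1, 3 * k + 1)
    for t in range(1, 4)
    for r in range(0, 4)
)
```

A grid check of this kind is only a sanity test of the stated formulas over the sampled ranges; it does not replace the general argument.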

In the previous proof, we constructed a uniquely satisfiable d-regular ( $k, s$ )-CNF formula $Φ$ from an ( $m + 1$ )k-forced-once d-regular ( $k, s$ )-CNF formula $Ψ$. The parameter m determines the number of forced variables of $Ψ$ that occur only once. If m is chosen 1 larger than required, then $Φ$ retains k forced variables that occur only once. Therefore, we get the following lemma.

Lemma 5.

For $k \geq 3$ , $s \geq f (k, d) + 1$ and $(s + d) / 2 > k - 1$ , there exists a k-forced-once d-regular ( $k, s$ )-CNF formula Ψ where every variable is forced.

Lemma 6.

For $k \geq 7$ , if a d-regular ( $k, s$ )-CNF formula is unsatisfiable then $(s + d) / 2 > k - 1$ .

Proof.

By $13 \leq f (7) \leq 17$ in [22], if a (7,s)-CNF formula is unsatisfiable, then $s \geq 14$ . That is, for any integer $d \geq 0$ , if a d-regular (7,s)-CNF formula is unsatisfiable, then $s \geq 14$ . It implies that for $k = 7$ , if a d-regular ( $k, s$ )-CNF formula is unsatisfiable, we can obtain that $(s + d) / 2 \geq 14 / 2 > k - 1$ .

By $24 \leq f (8) \leq 29$ in [22], all (8,24)-CNF formulas are satisfiable. That is, for any integer $d \geq 0$ , if a d-regular (8,s)-CNF formula is unsatisfiable, then $s > 24$ . As for $k = 8$ , if a d-regular ( $k, s$ )-CNF formula is unsatisfiable, we get $(s + d) / 2 > k - 1$ again.

Using Lemma 1, all ( $8 + r, 24 + 3 \times r$ )-CNF formulas are satisfiable for any nonnegative integer r. That is to say, if an ( $8 + r, s$ )-CNF formula is unsatisfiable, then $s > 24 + 3 \times r$. Since $(24 + 3 \times r) / 2 > 8 + r - 1$, we obtain that for $k \geq 7$, if a d-regular ( $k, s$ )-CNF formula is unsatisfiable, then $(s + d) / 2 > k - 1$. □
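The inequality used in the last step can be verified directly. The short derivation below is only a worked restatement of that step, with r the nonnegative integer from Lemma 1:

```latex
\frac{24 + 3r}{2} - (8 + r - 1)
  = \frac{24 + 3r - 14 - 2r}{2}
  = \frac{10 + r}{2} > 0
  \quad \text{for all } r \geq 0 .
```

Hence $s > 24 + 3 \times r$ already gives $s / 2 > k - 1$ for $k = 8 + r$, and adding $d \geq 0$ preserves the inequality.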

Theorem 3.

For all $k \geq 7$ and $s \geq f (k, d) + 1$ , there exist uniquely satisfiable d-regular ( $k, s$ )-CNF formulas.

Proof.

By the definition of $f (k, d)$ , if $k \geq 7$ and $s \geq f (k, d) + 1$ , there exists an unsatisfiable d-regular ( $k, s$ )-CNF formula. Using Lemma 6, we get $(s + d) / 2 > k - 1$ . By Theorem 2, we obtain that there exist uniquely satisfiable d-regular ( $k, s$ )-CNF formulas. □

By Theorem 3, for $k \geq 7$, we get $u (k, d) \leq f (k, d) + 1$. Matthews and Paturi [20] showed that $f (k) \leq u (k) \leq f (k) + 2$. Using Theorem 2, this bound can be tightened to $f (k) \leq u (k) \leq f (k) + 1$.

Theorem 4.

For all $k \geq 3$ , $f (k) \leq u (k) \leq f (k) + 1$ .

Proof.

Let d be sufficiently large, for instance $d \geq s$. Then every ( $k, s$ )-CNF formula is a d-regular ( $k, s$ )-CNF formula, and every d-regular ( $k, s$ )-CNF formula is a ( $k, s$ )-CNF formula. It follows that $f (k) = f (k, d)$ and $(s + d) / 2 > k - 1$. Using Theorem 2, for such d, $k \geq 3$ and $s \geq f (k) + 1$, there exists a uniquely satisfiable d-regular ( $k, s$ )-CNF formula. Obviously, a uniquely satisfiable d-regular ( $k, s$ )-CNF formula is also a uniquely satisfiable ( $k, s$ )-CNF formula. In other words, for $k \geq 3$ and $s \geq f (k) + 1$, there exists a uniquely satisfiable ( $k, s$ )-CNF formula. Together with $f (k) \leq u (k) \leq f (k) + 2$ from [20], we obtain $f (k) \leq u (k) \leq f (k) + 1$ for all $k \geq 3$. □

Corollary 1.

For $k \geq 7$ and $s \geq f (k, d) + 1$ , there exists a k-forced-once d-regular ( $k, s$ )-CNF formula Ψ that has exactly one satisfying assignment.

Proof.

The statement follows directly from Lemmas 5 and 6. □

5. A Parsimonious Polynomial Time Reduction

In [20], Matthews and Paturi presented a parsimonious reduction from SAT to ( $k, s$ )-SAT for any $k \geq 3$ and $s \geq f (k) + 2$. Here we parsimoniously transform a k-CNF formula into a d-regular ( $k, s$ )-CNF formula.

Theorem 5.

For any constants $k \geq 3$ , $s \geq f (k) + 1$ and $(s + d) / 2 > k - 1$ , there exists a parsimonious polynomial time reduction from k-CNF to d-regular ( $k, s$ )-CNF.

Proof.

Let $Ψ$ be an arbitrary k-CNF formula with m clauses. Then $Ψ$ contains $m k$ literals $L_{1, 1}, L_{1, 2}, \dots, L_{m, k}$. We will construct a d-regular ( $k, s$ )-CNF formula $Ψ^{'}$ that is SAT-equivalent to $Ψ$ and has the same number of solutions. Based on Lemma 5, we first construct a k-forced-once d-regular ( $k, s$ )-CNF formula $Φ$ in which every variable is forced. Let $x_{1}, x_{2}, \dots, x_{k}$ denote the k forced variables that occur only once.

The reduction method has five steps, which are described as follows.

Step 1 We introduce a new boolean variable set $Z = \{z_{i, j} : 1 \leq i \leq m, 1 \leq j \leq k\}$ to replace the $m k$ literals in $Ψ$ in order to construct a new formula $Ψ_{1}$.

$Ψ_{1} = \bigwedge_{1 \leq i \leq m} \bigvee_{1 \leq j \leq k} L^{'}_{i, j}, \qquad L^{'}_{i, j} = \begin{cases} z_{i, j}, & \text{if } L_{i, j} = v \\ \neg z_{i, j}, & \text{if } L_{i, j} = \neg v \end{cases}, \quad v \in var (Ψ) .$

Here, $L_{i, j}$ is the jth literal of the ith clause of $Ψ$ .

Step 2 Let $Φ_{i}, 1 \leq i \leq m (k - 1)$ be disjoint copies of the formula $Φ$ with the variables $x_{j}, 1 \leq j \leq k$ of $Φ$ renamed as $x_{i, j}$ in $Φ_{i}$. All $x_{i, j}$ are renumbered to form the variable set $X = \{x_{i} : 1 \leq i \leq m k (k - 1)\}$. Let $Ψ_{2} = \land_{1 \leq i \leq m (k - 1)} Φ_{i}$.

Step 3 Let $Ψ_{3} = \land_{1 \leq i \leq m, 1 \leq j \leq k} d_{i, j}$, where $d_{i, j} = z_{i, j} \lor \neg z^{'}_{i, j} \lor \bigvee_{l = 1}^{k - 2} \neg x_{((i - 1) k + j - 1) (k - 2) + l}$, so that the indices of the x-variables run over $1, \dots, m k (k - 2)$ and each is used exactly once. Here $z_{i, j}, z^{'}_{i, j} \in Z$, and if $z_{i, j}$ replaces a variable v in $Ψ$, then $z^{'}_{i, j}$ denotes the next variable in Z that replaces v (if $z_{i, j}$ is the last variable in Z that replaces v, then $z^{'}_{i, j}$ denotes the first variable in Z that replaces v). The variables in Z are sorted by their subscripts.

Step 4 We construct a k-CNF formula $Ψ_{4}$ with two variable sets X and Z, satisfying the following conditions.

(i) Every variable $z_{i, j}$ of the variable set Z occurs in exactly $s - 3$ clauses of $Ψ_{4}$. If $z_{i, j}$ occurs negatively in $Ψ_{1}$, then
$pos (Ψ_{4}, z_{i, j}) - neg (Ψ_{4}, z_{i, j}) = \min (d, 1) ;$

otherwise
$neg (Ψ_{4}, z_{i, j}) - pos (Ψ_{4}, z_{i, j}) = \min (d, 1) .$

(ii) For $1 \leq i \leq m k (k - 2)$, every variable $x_{i}$ of X occurs in exactly $s - 2$ clauses of the formula $Ψ_{4}$, and
$pos (Ψ_{4}, x_{i}) - neg (Ψ_{4}, x_{i}) = \min (d, 1) .$

(iii) For $m k (k - 2) + 1 \leq i \leq m k (k - 1)$, every variable $x_{i}$ of the variable set X occurs in exactly $s - 1$ clauses of the formula $Ψ_{4}$, and
$pos (Ψ_{4}, x_{i}) + 1 - neg (Ψ_{4}, x_{i}) = \min (d, 1) .$

(iv) Every clause of $Ψ_{4}$ contains at least one positive occurrence of some variable in X.

Step 5 We construct the formula $Ψ^{'} = Ψ_{1} \land Ψ_{2} \land Ψ_{3} \land Ψ_{4}$.
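Steps 1 and 3 can be sketched in code. This is a minimal illustration under stated assumptions, not the authors' implementation: clauses of $Ψ$ are given as lists of nonzero integers (positive for $v$, negative for $\neg v$), the tuples `("z", i, j)` and `("x", n)` are hypothetical encodings of $z_{i, j}$ and $x_{n}$, and the x-index of clause $d_{i, j}$ is assumed to be $((i - 1) k + j - 1) (k - 2) + l$, that is, the pairs $(i, j)$ are enumerated in lexicographic order. $Ψ_{2}$ and $Ψ_{4}$ are not built here.

```python
def replace_literals(clauses):
    """Step 1: position j of clause i gets the fresh variable z[i, j], keeping the sign."""
    return [
        [(("z", i, j), lit > 0) for j, lit in enumerate(clause, start=1)]
        for i, clause in enumerate(clauses, start=1)
    ]


def successor_map(clauses):
    """Map each position (i, j) to the next position holding the same original variable."""
    occurrences = {}
    for i, clause in enumerate(clauses, start=1):
        for j, lit in enumerate(clause, start=1):
            occurrences.setdefault(abs(lit), []).append((i, j))
    succ = {}
    for positions in occurrences.values():  # positions are already sorted by (i, j)
        for idx, pos in enumerate(positions):
            succ[pos] = positions[(idx + 1) % len(positions)]  # the last wraps to the first
    return succ


def link_clauses(clauses, k):
    """Step 3: build d[i, j] = z[i, j] or not z'[i, j] or not x[...] (k - 2 x-literals)."""
    psi3 = []
    for (i, j), (i2, j2) in sorted(successor_map(clauses).items()):
        base = ((i - 1) * k + (j - 1)) * (k - 2)  # assumed index scheme, see lead-in
        clause = [(("z", i, j), True), (("z", i2, j2), False)]
        clause += [(("x", base + l), False) for l in range(1, k - 1)]
        psi3.append(clause)
    return psi3


example = [[1, -2, 3], [-1, 2, 3]]  # a small 3-CNF formula with m = 2 clauses
psi1 = replace_literals(example)
psi3 = link_clauses(example, k=3)
```

In this example every original variable occurs twice, so each $z_{i, j}$ appears once positively and once negatively in $Ψ_{3}$, and the x-indices $1, \dots, 6$ are each used once. A variable that occurs only once in $Ψ$ would be linked to itself; the sketch does not treat that case specially.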

Obviously, every variable of $Ψ^{'}$ occurs in exactly s clauses, and the absolute value of the difference between positive and negative occurrences of every variable of $Ψ^{'}$ is at most d. Therefore, $Ψ^{'}$ is a d-regular ( $k, s$ )-CNF formula. Next, we examine the feasibility of constructing $Ψ^{'}$, the SAT-equivalence of $Ψ^{'}$ and $Ψ$, and the parsimony of the reduction.

First, we focus on the feasibility of $Ψ^{'}$. The formulas $Ψ_{1}, Ψ_{2}, Ψ_{3}$ can clearly be constructed in polynomial time. With respect to the formula $Ψ_{4}$, we need to consider condition (iv): the number of positive occurrences of X in $Ψ_{4}$ must be at least the number of clauses of $Ψ_{4}$.

The variables of $Ψ_{4}$ consist of two parts: X and Z. The variable set X generates $m k (k - 2) (s - 2) + m k (s - 1)$ literals in $Ψ_{4}$. The variable set Z generates $m k (s - 3)$ literals in $Ψ_{4}$. The number of clauses of $Ψ_{4}$ is

$\begin{aligned} \# cl (Ψ_{4}) & = \frac{m k (s - 3) + m k (k - 2) (s - 2) + m k (s - 1)}{k} \\ & = m (s - 3) + m (k - 2) (s - 2) + m (s - 1) \\ & = m (k - 2) (s - 2) + m (2 s - 4) . \end{aligned}$

For $1 \leq i \leq m k (k - 2)$, $pos (Ψ_{4}, x_{i}) - neg (Ψ_{4}, x_{i}) = \min (d, 1)$ and $pos (Ψ_{4}, x_{i}) + neg (Ψ_{4}, x_{i}) = s - 2$. So,

$pos (Ψ_{4}, x_{i}) = ⌈ (s - 2) / 2 ⌉ = ⌈ s / 2 ⌉ - 1 .$

For $m k (k - 2) + 1 \leq i \leq m k (k - 1)$, $pos (Ψ_{4}, x_{i}) + 1 - neg (Ψ_{4}, x_{i}) = \min (d, 1)$ and $pos (Ψ_{4}, x_{i}) + neg (Ψ_{4}, x_{i}) = s - 1$. So, $pos (Ψ_{4}, x_{i}) + 1 = ⌈ s / 2 ⌉$, that is, $pos (Ψ_{4}, x_{i}) = ⌈ s / 2 ⌉ - 1$. The number of positive occurrences of X in $Ψ_{4}$ is

$pos (Ψ_{4}, X) = m k (k - 2) (⌈s / 2⌉ - 1) + m k (⌈s / 2⌉ - 1) .$

For $k \geq 3$ and $s > k$, we get

$\begin{aligned} pos (Ψ_{4}, X) & \geq 3 m (k - 2) (⌈s / 2⌉ - 1) + 3 m (⌈s / 2⌉ - 1) \\ & \geq m (k - 2) (s - 2) + m (k - 2) (⌈s / 2⌉ - 1) + 3 m (⌈s / 2⌉ - 1) \\ & \geq m (k - 2) (s - 2) + 4 m (⌈s / 2⌉ - 1) \\ & \geq m (k - 2) (s - 2) + m (2 s - 4) = \# cl (Ψ_{4}) . \end{aligned}$

The steps use $k \geq 3$, $2 (⌈s / 2⌉ - 1) \geq s - 2$ and $k - 2 \geq 1$; equality throughout occurs only for $k = 3$ and even s. Hence the number of positive literals of X is at least the number of clauses in $Ψ_{4}$, which is all that condition (iv) requires. To construct $Ψ_{4}$, we first arrange a positive literal of X for every clause, then randomly arrange the other literals. That is, the formula $Ψ_{4}$ can be constructed in polynomial time.
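The comparison between $pos (Ψ_{4}, X)$ and $\# cl (Ψ_{4})$ can also be checked numerically. The sketch below is an illustration only, with hypothetical function names; it evaluates the two closed forms given above on a small grid of parameters. Under these formulas the two counts coincide when $k = 3$ and s is even, so the check tests "at least" rather than "strictly more", which still leaves one positive X-literal available for every clause.

```python
def half_up(s):
    """Return the ceiling of s / 2 for an integer s."""
    return -(-s // 2)


def clauses_psi4(k, s, m):
    """#cl(Psi_4) = m (k - 2)(s - 2) + m (2s - 4)."""
    return m * (k - 2) * (s - 2) + m * (2 * s - 4)


def positive_x_psi4(k, s, m):
    """pos(Psi_4, X) = m k (k - 2)(ceil(s/2) - 1) + m k (ceil(s/2) - 1)."""
    h = half_up(s)
    return m * k * (k - 2) * (h - 1) + m * k * (h - 1)


# Grid of assumed sample parameters with k >= 3 and s > k.
grid = [(k, s) for k in range(3, 9) for s in range(k + 1, 3 * k + 1)]

# Enough positive X-literals for every clause on the whole grid.
all_ok = all(
    positive_x_psi4(k, s, m) >= clauses_psi4(k, s, m)
    for k, s in grid
    for m in range(1, 4)
)

# Parameter pairs where the two counts are exactly equal (taking m = 1).
tight = [(k, s) for k, s in grid if positive_x_psi4(k, s, 1) == clauses_psi4(k, s, 1)]
```

Both closed forms are multiples of $m k$ ($\# cl (Ψ_{4}) = m k (s - 2)$ and $pos (Ψ_{4}, X) = m k (k - 1) (⌈s / 2⌉ - 1)$), which is why the comparison does not depend on m.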

Second, we will prove that the formula $Ψ$ is satisfiable if and only if the formula $Ψ^{'}$ is satisfiable.

Assume that $Ψ$ is satisfied by a truth assignment $τ$ on $var (Ψ)$ and that $Φ_{i}$ is satisfied by a truth assignment $τ_{i}$ on $var (Φ_{i})$ for $1 \leq i \leq m (k - 1)$. Because $Φ_{i}$ forces the variable $x_{i, j}$ to be $true$, $τ_{i} (x_{i, j})$ must be $true$. A truth assignment $τ^{'}$ is defined by

$τ^{'} (v) = \begin{cases} τ (z), & \text{if } v \in var (Ψ_{1}) \text{ and the variable } z \in var (Ψ) \text{ is replaced with } v \\ τ_{i} (v), & \text{if } v \in var (Φ_{i}) \end{cases} .$

Obviously, the truth assignment $τ^{'}$ satisfies the formulas $Ψ_{1}, Ψ_{2}, Ψ_{3}$. Every clause of $Ψ_{4}$ contains at least one positive occurrence of some variable in X, so $τ^{'}$ also satisfies the formula $Ψ_{4}$. The formula $Ψ^{'}$ is the conjunction of $Ψ_{1}, Ψ_{2}, Ψ_{3}, Ψ_{4}$. Thus, $τ^{'}$ satisfies the formula $Ψ^{'}$.

Conversely, assume that $Ψ^{'}$ is satisfied by a truth assignment $τ$ over $var (Ψ^{'})$. Then $τ$ satisfies the formulas $Ψ_{1}, Ψ_{2}, Ψ_{3}, Ψ_{4}$. Since $Ψ_{2} = \land_{1 \leq i \leq m (k - 1)} Φ_{i}$, the truth assignment $τ$ satisfies every formula $Φ_{i}, 1 \leq i \leq m (k - 1)$. Because $Φ_{i}$ forces the variable $x_{i, j}$ to be $true$,

$τ (\neg x_{i, j}) = false, \quad 1 \leq i \leq m k, \ 1 \leq j \leq k - 2 .$ (1)

We substitute Equation (1) into $Ψ_{3}$ and simplify. The simplified $Ψ_{3}$ consists of structures of the kind described in Lemma 2. According to Lemma 2, if $z_{i}$ and $z_{j}$ replace the same variable of $Ψ$, then $τ (z_{i}) = τ (z_{j})$. Therefore, we define a truth assignment $τ^{'}$ on $var (Ψ)$ by

$τ^{'} (v) = τ (z), \quad \text{if the variable } v \text{ of } Ψ \text{ is replaced with a variable } z \text{ in } Ψ_{1} .$

Obviously, the truth assignment $τ^{'}$ can satisfy the formula $Ψ$ , and the formula $Ψ$ is satisfiable.

Therefore, $Ψ^{'}$ is SAT-equivalent with $Ψ$ .

Finally, we explain why the polynomial-time reduction is parsimonious. If $Ψ^{'}$ is satisfiable, all variables in X are forced to be $true$. Due to the formula $Ψ_{3}$, all variables of Z that replace the same variable of $Ψ$ are forced to take the same value in every satisfying assignment. Thus, introducing the new variable set Z does not change the number of satisfying assignments. Because $Φ$ has exactly one solution, $Ψ_{2}$ does not influence the number of satisfying assignments either. Therefore, $Ψ$ has as many satisfying assignments as the formula $Ψ^{'}$. □
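The parsimony argument can be illustrated by brute force on a tiny instance. The following sketch is an illustration under simplifying assumptions, not the full reduction: the variables of X are taken as already fixed to $true$, so each clause of $Ψ_{3}$ shrinks to $z \lor \neg z^{'}$, and $Ψ_{2}$ and $Ψ_{4}$ are omitted because they are then satisfied and contribute exactly one assignment. The function names and the integer encoding of literals are hypothetical.

```python
from itertools import product


def count_models(clauses, variables):
    """Count assignments satisfying all clauses; a literal is a (variable, sign) pair."""
    count = 0
    for bits in product([False, True], repeat=len(variables)):
        value = dict(zip(variables, bits))
        if all(any(value[v] == sign for v, sign in clause) for clause in clauses):
            count += 1
    return count


def reduce_simplified(clauses):
    """Build Psi_1 plus the simplified link clauses z or not z' (X assumed true)."""
    psi1, occurrences = [], {}
    for i, clause in enumerate(clauses, start=1):
        new_clause = []
        for j, lit in enumerate(clause, start=1):
            new_clause.append(((i, j), lit > 0))  # z[i, j] keeps the sign of the literal
            occurrences.setdefault(abs(lit), []).append((i, j))
        psi1.append(new_clause)
    links = []
    for positions in occurrences.values():
        for idx, z in enumerate(positions):
            z_next = positions[(idx + 1) % len(positions)]  # cyclic successor
            links.append([(z, True), (z_next, False)])
    return psi1 + links


psi = [[1, -2, 3], [-1, 2, 3], [1, 2, -3]]  # small 3-CNF example
original = [[(abs(lit), lit > 0) for lit in clause] for clause in psi]
n_original = count_models(original, [1, 2, 3])

reduced = reduce_simplified(psi)
z_vars = sorted({v for clause in reduced for v, _ in clause})
n_reduced = count_models(reduced, z_vars)
```

The link clauses form a cycle of implications among the copies of each original variable, which forces them to agree; the nine z-variables therefore behave like three free variables and the two model counts are equal.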

6. Conclusions

For $k \geq 3$, the k-SAT problem is NP-complete. Theorem 5 shows that there exists a polynomial time reduction from k-SAT to d-regular ( $k, s$ )-SAT for any constants $k \geq 3$, $s \geq f (k) + 1$ and $(s + d) / 2 > k - 1$. That is to say, the d-regular ( $k, s$ )-SAT problem is NP-complete in this case. For example, the 2-regular (3,4)-SAT problem is NP-complete; in other words, there exists a parsimonious polynomial time reduction from 3-CNF to 2-regular (3,4)-CNF. Although the parsimonious reduction does not increase the number of solutions, it adds numerous new variables to the original formula, so the new formula has a larger space of candidate assignments than the original one. It seems that the parsimonious reduction dilutes the solutions and makes the SAT problem harder to solve. This may explain why a random regular (3,4)-SAT instance is satisfiable with high probability and can be easily solved, although the 2-regular (3,4)-SAT problem is NP-complete.

Lemma 6 implies that for $k \geq 7$ and $(s + d) / 2 \leq k - 1$, all d-regular ( $k, s$ )-CNF formulas are satisfiable. Consider a regular ( $k, s$ )-CNF formula F in which the numbers of positive and of negative occurrences of every variable do not exceed $k - 1$. Then $s \leq 2 k - 2$ and the formula F is a ( $2 k - s - 2$ )-regular ( $k, s$ )-CNF formula. Since $(s + d) / 2 = (s + 2 k - s - 2) / 2 = k - 1$, the formula F must be satisfiable for $k \geq 7$. That is, all regular ( $k, s$ )-CNF formulas in which every variable has fewer than k positive and fewer than k negative occurrences are satisfiable for $k \geq 7$. However, it is unknown whether this also holds for $k < 7$.
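The claim that F is ( $2 k - s - 2$ )-regular follows from a one-line computation. In the worked step below, p and n are names introduced here for the numbers of positive and negative occurrences of a fixed variable of F, so that $p + n = s$ and $p, n \leq k - 1$:

```latex
|p - n| = 2\max(p, n) - (p + n) = 2\max(p, n) - s \leq 2(k - 1) - s = 2k - s - 2 .
```

The same constraints give $s = p + n \leq 2 (k - 1) = 2 k - 2$, so the bound $2 k - s - 2$ is nonnegative.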

We presented a construction method for uniquely satisfiable d-regular ( $k, s$ )-CNF formulas. Uniquely satisfiable d-regular ( $k, s$ )-SAT instances have their own characteristics. How to use uniquely satisfiable SAT instances to evaluate, analyze and improve SAT solvers will be considered in future work.

Author Contributions

Formal analysis, Z.F.; Investigation, Z.F. and D.X.; Methodology, Z.F. and D.X.; Writing—Original Draft, Z.F.; Writing—Review & Editing, Z.F. and D.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China under grant numbers 61762019 and 61862051.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Cook S.A. The complexity of theorem-proving procedures; Proceedings of the Third Annual ACM Symposium on Theory of Computing; Shaker Heights, OH, USA. 3–5 May 1971; pp. 151–158.
2. Eén N., Sorenssön N. Theory and Applications of Satisfiability Testing. Springer; Berlin/Heidelberg, Germany: 2003. An Extensible SAT-solver.
3. Audemard G., Simon L. GLUCOSE2.1: Aggressive-but Reactive-Clause Database Management, Dynamic Restarts; Proceedings of the International Workshop of Pragmatics of SAT; Trento, Italy. 16 June 2012.
4. Luo M., Minli M., Xiao F., Manyá F., Zhipeng L. An Effective Learnt Clause Minimization Approach for CDCL SAT Solvers; Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence; Melbourne, Australia. 19–25 August 2017.
5. Calabro C., Paturi R. Computer Science Symposium in Russia. Springer; Berlin/Heidelberg, Germany: 2009. k-SAT Is No Harder Than Decision-Unique-k-SAT.
6. Tovey C.A. A simplified NP-complete satisfiability problem. Discret. Appl. Math. 1984;8:85–89. doi: 10.1016/0166-218X(84)90081-7.
7. Daoyun X., Xiaofeng W. A Regular NP-Complete Problem and Its Inapproximability. J. Front. Comput. Sci. Technol. 2013;7:691–697. doi: 10.3778/j.issn.1673-9418.1305025.
8. Crawford J.M., Auton L.D. Experimental Results on the Crossover Point in Satisfiability Problems. Artif. Intell. 1996;81:31–57. doi: 10.1016/0004-3702(95)00046-1.
9. Kirkpatrick S., Selman B. Critical behavior in the satisfiability of random boolean expressions. Science. 1994;264:1297–1301. doi: 10.1126/science.264.5163.1297.
10. Jincheng Z., Daoyun X., Youjun L. Satisfiability Threshold of the Regular Random (k,r)-SAT Problem. J. Softw. 2016;27:2985–2993. doi: 10.13328/j.cnki.jos.005129.
11. Jincheng Z., Daoyun X., Youjun L. Satisfiability threshold of regular (k,r)-SAT problem via 1RSB theory. J. Huazhong Univ. Sci. Technol. 2017;45:7–13. doi: 10.13245/j.hust.171202.
12. Mézard M., Parisi G., Zecchina R. Analytic and algorithmic solution of random satisfiability problems. Science. 2002;297:812–815. doi: 10.1126/science.1073287.
13. Wahlström M. Theory and Applications of Satisfiability Testing (SAT-2005) Springer; Berlin/Heidelberg, Germany: 2005. Faster exact solving of SAT formulae with a low number of occurrences per variable.
14. Wahlström M. European Conference on Algorithms. Springer; Berlin/Heidelberg, Germany: 2005. An algorithm for the SAT problem for formulae of linear length.
15. Johannsen D., Razgon I., Wahlström M. Theory and Applications of Satisfiability Testing. Springer; Berlin/Heidelberg, Germany: 2009. Solving SAT for CNF Formulas with a One-Sided Restriction on Variable Occurrences.
16. Fu Z., Xu D. The NP-completeness of d-regular (k,s)-SAT problem. J. Softw. 2020;31:1113–1123. doi: 10.13328/j.cnki.jos.005896.
17. Fu Z., Xu D. (1,0)-Super Solutions of (k,s)-CNF Formula. Entropy. 2020;22:253. doi: 10.3390/e22020253.
18. Valiant L., Vazirani V. NP is as easy as detecting unique solutions. Theor. Comput. Sci. 1986;47:85–93. doi: 10.1016/0304-3975(86)90135-0.
19. Calabro C., Impagliazzo R., Kabanets V., Paturi R. The complexity of unique k-SAT: An isolation lemma for k-CNFs. Comput. Syst. Sci. 2008;74:386–393. doi: 10.1016/j.jcss.2007.06.015.
20. Matthews W., Paturi R. Theory and Applications of Satisfiability Testing. Springer; Berlin/Heidelberg, Germany: 2010. Uniquely Satisfiable k-SAT Instances with Almost Minimal Occurrences of Each Variable.
21. Kratochvíl J., Savický P., Tuza Z. One more occurrence of variables makes satisfiability jump from trivial to NP-complete. Acta Inform. 1993;22:203–210. doi: 10.1137/0222015.
22. Hoory S., Szeider S. Computing unsatisfiable k-SAT instances with few occurrences per variable. Theor. Comput. Sci. 2004;337:347–359. doi: 10.1016/j.tcs.2005.02.004.
23. Hoory S., Szeider S. Families of unsatisfiable k-CNF formulas with few occurrences per variable. SIAM J. Discret. Math. 2006;20:523–528. doi: 10.1137/S0895480104445745.
24. Savický P., Sgall J. DNF tautologies with a limited number of occurrences of every variable. Theor. Comput. Sci. 2007;238:495–498. doi: 10.1016/S0304-3975(00)00036-0.
25. Gebauer H., Szabo T., Tardos G. The Local Lemma is asymptotically tight for SAT. ACM. 2016;63:664–674. doi: 10.1145/2975386.
26. Markström K. Locality and Hard SAT-Instances. J. Satisf. Boolean Modeling Comput. 2006;2:221–227. doi: 10.3233/SAT190024.
27. Giráldez-cru J., Levy J. Generating SAT instances with community structure. Artif. Intell. 2016;238:119–134. doi: 10.1016/j.artint.2016.06.001.
28. Giráldez-cru J., Levy J. Locality in Random SAT Instances; Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence; Melbourne, Australia. 19–25 August 2017.
29. Clark D., Frank J., Gent I., MacIntyre E., Tomov N., Walsh T. The Principles and Practices of Contraint Programming (CP96) Springer; Berlin/Heidelberg, Germany: 1996. Local search and the number of solutions.
30. Singer J., Gent I.P., Smaill A. Backbone fragility and the local search cost peak. J. Artif. Intell. Res. 2000;12:235–270. doi: 10.1613/jair.711.
31. Znidaric M. Single-solution Random 3-SAT Instances. arXiv. 2005. cs/0504101.
32. Dubois O. On the r,s-SAT satisfiability problem and a conjecture of Tovey. Discret. Appl. Math. 1990;26:51–60. doi: 10.1016/0166-218X(90)90020-D.
