
Separations in Proof Complexity and TFNP

Published: 01 August 2024

Abstract

It is well-known that Resolution proofs can be efficiently simulated by Sherali–Adams (SA) proofs. We show, however, that any such simulation needs to exploit huge coefficients: Resolution cannot be efficiently simulated by SA when the coefficients are written in unary. We also show that Reversible Resolution (a variant of MaxSAT Resolution) cannot be efficiently simulated by Nullstellensatz (NS).
These results have consequences for total NP search problems. First, we characterise the classes PPADS, PPAD, SOPL by unary-SA, unary-NS, and Reversible Resolution, respectively. Second, we show that, relative to an oracle, \({\text{ PLS}} \not\subseteq {\text{ PPP}}\) , \({\text{ SOPL}} \not\subseteq {\text{ PPA}}\) , and \({\text{ EOPL}} \not\subseteq {\text{ UEOPL}}\) . In particular, together with prior work, this gives a complete picture of the black-box relationships between all classical TFNP classes introduced in the 1990s.

1 Separations in Proof Complexity

The main results of this work are two separations between standard propositional proof systems, as summarised in Figure 1. Moreover, these results can be further interpreted as black-box separations in the theory of total NP search problems (TFNP), as we explain later in Section 2. This connection between TFNP and proof complexity, which has proved fruitful in past works and which we further explore here, also yields a new type of result in proof complexity, which we call intersection theorems; see Section 2.4.

1.1 Resolution versus Sherali–Adams

Our first separation is between the most basic and well-studied proof system Resolution (see the textbooks [48, 51] for an introduction) and the semi-algebraic proof system Sherali–Adams [23, 68] (see the monograph [37] for an introduction). Let us briefly recall these systems. Each system aims to refute a given conjunctive normal form (CNF) contradiction (unsatisfiable CNF formula) \(F:= C_1\wedge \cdots \wedge C_m\) over the n Boolean variables \(x=(x_1,\ldots , x_n)\) .
Resolution (Res). A Resolution refutation of F starts with the set of clauses of F and repeatedly applies the resolution rule \(C\vee x_i, D\vee \bar{x}_i \vdash C\vee D\) . That is, if we have already deduced premise clauses \(C\vee x_i\) and \(D\vee \bar{x}_i\) for some i, then we can further deduce the clause \(C\vee D\) . Once this rule has been applied enough times to produce the empty clause \(\bot\) , the refutation is complete. The size of the refutation is the number of deduction steps, and its width is the maximum width \(|C|\) (number of literals) of any clause C appearing in the refutation.
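As a toy illustration (not from the article), the resolution rule and a brute-force saturation search can be sketched as follows; the clause representation and function names are our own:

```python
from itertools import permutations

# A clause is a frozenset of literals; a literal is a pair (var, sign), with
# sign True for x_i and False for its negation.
def resolve(c, d, var):
    """The resolution rule: from C ∨ x_i and D ∨ ¬x_i deduce C ∨ D."""
    return (c - {(var, True)}) | (d - {(var, False)})

def refutable(clauses):
    """Saturate under resolution; report whether the empty clause is derived.
    Brute force, so exponential in general -- for illustration only."""
    derived = set(clauses)
    while True:
        new = {resolve(c, d, v)
               for c, d in permutations(derived, 2)
               for (v, s) in c
               if s and (v, False) in d}
        if new <= derived:
            return frozenset() in derived
        derived |= new

# F = (x1 ∨ x2) ∧ (x1 ∨ ¬x2) ∧ (¬x1) is contradictory:
F = [frozenset({(1, True), (2, True)}),
     frozenset({(1, True), (2, False)}),
     frozenset({(1, False)})]
assert refutable(F) and not refutable([frozenset({(1, True)})])
```

The saturation loop keeps no record of the deduction order, so it decides refutability but does not measure size; a size-conscious implementation would track the individual deduction steps.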
Sherali–Adams (SA). Sherali–Adams refutes unsatisfiable sets of polynomial equations \(\lbrace a_i(x)=0: i\in [m]\rbrace\) with real coefficients, \(a_i\in \mathbb {R}[x]\) . A CNF contradiction F can be translated into this language by encoding each clause, say, \(C:= (x_1\vee \overline{x}_2 \vee x_3)\) , as the equation \((1-x_1)x_2(1-x_3)=0\) , and by enforcing each variable \(x_i\) to take Boolean values by the equation \(x_i^2-x_i=0\) . An SA refutation of \(\lbrace a_i(x)=0\rbrace\) is a polynomial identity of the form
\begin{equation} \sum _{i\in [m]} p_i(x)\cdot a_i(x) ~=~ 1 + J(x), \end{equation}
(1)
where \(p_i\in \mathbb {R}[x]\) are polynomials and J is a conical junta: a nonnegative linear combination of terms, that is, \(J(x)=\sum _j \alpha _j \cdot t_j(x)\) where \(\alpha _j\in \mathbb {R}_{\ge 0}\) are nonnegative coefficients and each \(t_j\) is a conjunction of literals; for example, \(t_j(x)=x_1\overline{x}_2x_3 = x_1(1-x_2)x_3\) . The size of the refutation is the combined total number of monomials in \(p_i\) , \(a_i\) , and \(t_j\) (viewed as a polynomial) and its degree is the maximum of \(\deg (p_i)+\deg (a_i)\) and of \(\deg (t_j)\) over all \(i,j\) .
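For concreteness, a certificate of the form in Equation (1) can be checked numerically on the Boolean cube, which verifies the identity modulo the booleanity axioms \(x_i^2 - x_i = 0\). A minimal sketch, with a toy contradiction and certificate of our own devising:

```python
from itertools import product

# Toy contradiction (x1 ∨ x2) ∧ (¬x1) ∧ (¬x2), encoded as the equations
# a1 = (1-x1)(1-x2) = 0,  a2 = x1 = 0,  a3 = x2 = 0.
a = [lambda x: (1 - x[0]) * (1 - x[1]), lambda x: x[0], lambda x: x[1]]
p = [lambda x: 1] * 3          # multipliers p_i = 1
J = lambda x: x[0] * x[1]      # conical junta: the single term x1·x2

# On every Boolean point, sum_i p_i·a_i must equal 1 + J, and J must be >= 0.
for x in product((0, 1), repeat=2):
    lhs = sum(pi(x) * ai(x) for pi, ai in zip(p, a))
    assert lhs == 1 + J(x) and J(x) >= 0
```

Here the identity \((1-x_1)(1-x_2) + x_1 + x_2 = 1 + x_1x_2\) even holds formally, so the booleanity axioms are not needed for this particular example.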
It is a basic fact that SA is strictly more powerful than Resolution. First, Resolution is p-simulated by SA, that is, with only polynomial overhead in proof width/degree and size. Indeed, if F can be refuted by width-w Resolution, then F can be refuted by SA in degree \(w+1\) [24]. Moreover, if one allows twin variables in SA, the simulation can also be made efficient relative to size [3]. Second, SA is not p-simulated by Resolution: there are n-variate CNF contradictions F (e.g., graph pigeonhole principles) that can be refuted by SA in constant degree but such that any Resolution refutation of F requires width \(\Omega (n)\) and size \(\exp (\Omega (n))\) [2].
Our first result highlights a previously overlooked inefficiency in the way that SA simulates Resolution. We show that any low-degree simulation needs to exploit huge coefficients.
Theorem 1.
There are n-variate CNF formulas F that can be refuted by constant-width Resolution, but such that any SA refutation of F in degree \(n^{o(1)}\) requires coefficients of magnitude \(\exp (n^{\Omega (1)})\) .
Theorem 1 is qualitatively tight in that the singly exponential lower bound \(\exp (n^{\Omega (1)})\) cannot be improved much. Namely, if a CNF formula can be refuted by a degree-d SA proof, then there also exists a degree-d SA proof with integer coefficients of magnitude \(\exp (n^{O(d)})\) (see Appendix A). To our knowledge, Theorem 1 is the first exponential coefficient lower bound for a constant-width CNF formula in any semi-algebraic proof system. (Examples of systems of polynomial equations—not coming from CNFs—requiring even doubly exponential coefficients were known previously [41, 61, 65].)
We also note that the conclusion of Theorem 1 can be slightly strengthened using standard lifting/xorification techniques [7, Section 4] to show that any SA refutation of F must either use exponentially many monomials or exponentially large coefficients. This trade-off has consequences for the unary Sherali–Adams (uSA) system where we restrict the coefficients to be integers written in unary and where their magnitude counts toward proof size (more precisely, the size of a uSA proof is the sum of the magnitudes of all coefficients appearing in the proof). Thus, we conclude that Resolution is not p-simulated by uSA. In particular, this answers a question raised in a concurrent work by Bonacina and Bonet [8]. For comparison, proving a similar lower bound for the Cutting Planes (CP) system (separating CP from unary-CP) is a long-standing open problem.
Fig. 1.
Fig. 1. Our new separations of proof systems. An arrow \({\rm A}\rightarrow {\rm B}\) means that \({\rm A}\) is p-simulated by \({\rm B}\) , that is, with polynomial overhead in width/degree and size (when allowing twin variables). A dashed arrow \({\rm A}\dashrightarrow {\rm B}\) means that \({\rm A}\) is not p-simulated by \({\rm B}\) .

1.2 Reversible Resolution versus Nullstellensatz

Our second separation is between the standard algebraic proof system Nullstellensatz [5] and a subsystem of Resolution that we call Reversible Resolution. The latter is closely related to fragments of Resolution that have been introduced to model the reasoning used by MaxSAT solvers (which find an assignment that satisfies as many clauses as possible). Prior work has defined several distinct such MaxSAT Resolution systems [10, 34, 54]. Our variant is yet slightly different (see Section 6.3 for a comparison to prior systems). Ultimately, our definition is motivated by results that will be discussed in Section 2: Reversible Resolution captures an important TFNP class, and, moreover, it equals the “intersection” of Resolution and uSA.
Reversible Resolution (RevRes). In this restricted fragment of Resolution, we only allow the symmetric resolution rule \(C\vee x_i, C\vee \bar{x}_i \vdash C\) and its inverse \(C \vdash C\vee x_i, C\vee \bar{x}_i\) . Moreover, we stipulate that an application of either rule consumes its premises in the following sense. The refutation begins with a multiset of clauses of F—we may choose the multiplicity of each clause freely at start—and a single application of a deduction rule removes a single occurrence of each premise clause from the multiset and then adds the concluded clauses back to the multiset. Once we produce at least one empty clause, the refutation is complete. The size and width of the refutation are defined as before.
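A minimal executable sketch of this multiset discipline (the representation and function names are our own, not from the article):

```python
from collections import Counter

# Clauses as frozensets of literals (var, sign); a Counter tracks multiplicities.
def split(state, clause, var):
    """Inverse rule: consume one copy of C, add C ∨ x_i and C ∨ ¬x_i."""
    assert state[clause] > 0
    state[clause] -= 1
    state[clause | {(var, True)}] += 1
    state[clause | {(var, False)}] += 1

def merge(state, clause, var):
    """Symmetric resolution: consume one copy each of C ∨ x_i and C ∨ ¬x_i, add C."""
    cpos, cneg = clause | {(var, True)}, clause | {(var, False)}
    assert state[cpos] > 0 and state[cneg] > 0
    state[cpos] -= 1
    state[cneg] -= 1
    state[clause] += 1

# Refute F = (x1) ∧ (¬x1): one merge with side clause C = ∅ yields ⊥.
state = Counter({frozenset({(1, True)}): 1, frozenset({(1, False)}): 1})
merge(state, frozenset(), 1)
assert state[frozenset()] == 1   # the empty clause has been produced
```

The asserts inside `split` and `merge` enforce the consumption discipline: a rule application fails unless its premises are currently present in the multiset.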
Nullstellensatz ( \(\mathbb {F}\) -NS). Let \(\mathbb {F}\) be a field. An \(\mathbb {F}\) -Nullstellensatz refutation of a set of polynomial equations \(\lbrace a_i(x)=0:i\in [m]\rbrace\) over \(\mathbb {F}\) is given by a set of polynomials \(\lbrace p_i(x)\rbrace \subseteq \mathbb {F}[x]\) such that
\begin{equation} \sum _{i\in [m]} p_i(x)\cdot a_i(x) ~=~ 1. \end{equation}
(2)
The size of the refutation is the combined total number of monomials in \(p_i\) and \(a_i\) and its degree is the maximum of \(\deg (p_i)+\deg (a_i)\) over all i.
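As a sanity check, an \(\mathbb {F}_2\)-NS certificate can be verified by exact polynomial arithmetic. The sketch below works with multilinear polynomials over \(\mathbb {F}_2\) (reducing \(x_i^2 = x_i\), as the booleanity axioms permit), on a toy contradiction and certificate of our own devising:

```python
# Multilinear polynomials over F_2: a polynomial is a set of monomials,
# each monomial a frozenset of variable indices; set XOR is addition mod 2.
def mul(f, g):
    h = set()
    for s in f:
        for t in g:
            h ^= {s | t}   # x^S * x^T = x^(S∪T), reducing x_i^2 = x_i
    return h

def add(*polys):
    h = set()
    for f in polys:
        h ^= f
    return h

ONE, X1, X2 = {frozenset()}, {frozenset({1})}, {frozenset({2})}

# Axioms for (x1 ∨ x2) ∧ (¬x1) ∧ (¬x2) over F_2 (note 1 - x = 1 + x):
a1 = mul(add(ONE, X1), add(ONE, X2))   # (1+x1)(1+x2) = 0
a2, a3 = X1, X2                        # x1 = 0, x2 = 0
# Certificate p1 = 1, p2 = 1 + x2, p3 = 1:
lhs = add(mul(ONE, a1), mul(add(ONE, X2), a2), mul(ONE, a3))
assert lhs == ONE   # the identity sum_i p_i·a_i = 1 holds
```

The same representation extends to any \(\mathbb {F}_p\) by replacing XOR with coefficient arithmetic modulo p.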
Reversible Resolution is p-simulated by uSA. Indeed, the usual simulations of Resolution by SA [3, 24] have the neat property that if they are applied to a RevRes proof instead, the resulting coefficients become bounded by the size of the RevRes proof (see also [34] for a simulation in a closely related MaxSAT system). This also means that RevRes is strictly less powerful than Resolution, as per our first separation result.
It is a classic result that Resolution is not p-simulated by \(\mathbb {F}\) -NS over any field \(\mathbb {F}\) . This is witnessed by CNF formulas expressing the sink-of-dag (SoD) principle [13, 20] or the pebbling principle [11, 29]. Our second result strengthens these classical separations by showing that RevRes cannot be simulated by low-degree \(\mathbb {F}\) -NS.
Theorem 2.
There are n-variate CNF formulas F that can be refuted by constant-width polynomial-size RevRes, but such that any \(\mathbb {F}\) -NS refutation (over any \(\mathbb {F}\) ) of F requires degree \(n^{\Omega (1)}\) .
Again, we note that standard lifting techniques can be used to strengthen the degree lower bound in Theorem 2 to an exponential size lower bound. We conclude that RevRes is not p-simulated by \(\mathbb {F}\) -NS. In particular, this strengthens a previous result by Filmus et al. [34] who showed that RevRes (actually, their closely related MaxSAT system) is not p-simulated by tree-like Resolution.

1.3 Techniques

Our separation between Resolution and uSA (Section 5) builds on the separation between RevRes and \(\mathbb {F}\) -NS (Section 4). We prove the latter separation for \(\mathbb {F}=\mathbb {R}\) in a particularly robust form, namely, we show that it holds even if we allow some small amount of “error” in the NS proof. We introduce what we call \(\boldsymbol \epsilon\) -approximate Nullstellensatz ( \(\epsilon\) -NS) refutations, where we relax the polynomial identity Equation (2) over \(\mathbb {F}=\mathbb {R}\) to hold only approximately:
\begin{equation} \sum _{i\in [m]} p_i(x)\cdot a_i(x) ~=~ 1\pm \epsilon , \quad \qquad \forall x\in \lbrace 0,1\rbrace ^n. \end{equation}
(3)
In the above expression and for the remainder of the article, “ \(= 1\pm \epsilon\) ” stands for “ \(\in [1 - \epsilon , 1 + \epsilon ]\) ”, meaning that the left-hand side (LHS) is a polynomial that takes values in \([1 - \epsilon , 1 + \epsilon ]\) when evaluated on Boolean inputs. For example, an SA refutation where \(J(x) \le \epsilon\) for all Boolean inputs x is also an \(\epsilon\) -NS refutation (since \(J(x) \ge 0\) trivially holds).
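The relaxed identity (3) is straightforward to check pointwise. A small sketch with a toy contradiction and multipliers of our own devising, whose left-hand side stays within \([1, 2]\) on the cube, so it is a valid \(\epsilon\)-NS refutation for \(\epsilon = 1\) but not for \(\epsilon = 1/2\):

```python
from itertools import product

def is_eps_ns(p, a, eps, n):
    """Check sum_i p_i·a_i ∈ [1-eps, 1+eps] at every point of {0,1}^n."""
    return all(
        abs(sum(pi(x) * ai(x) for pi, ai in zip(p, a)) - 1) <= eps
        for x in product((0, 1), repeat=n)
    )

# Toy contradiction (x1 ∨ x2) ∧ (¬x1) ∧ (¬x2) with multipliers p_i = 1:
# the left-hand side equals 1 + x1·x2 on the cube, i.e., it lies in [1, 2].
a = [lambda x: (1 - x[0]) * (1 - x[1]), lambda x: x[0], lambda x: x[1]]
p = [lambda x: 1] * 3
assert is_eps_ns(p, a, 1, 2) and not is_eps_ns(p, a, 0.5, 2)
```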
We show that there is no low-degree approximate NS proof for the formulas that encode the so-called sink-of-potential-line (SoPL) principle. These formulas are easy for RevRes, and in fact, we later show they are complete for RevRes (see Theorem 3). Naturally, our lower-bound proof borrows techniques from polynomial approximation theory. We give a randomised decision-to-search reduction, showing how a low-degree \(\epsilon\) -NS refutation of SoPL would imply a low-degree approximating polynomial for the Or function. It is well-known, however, that the n-bit Or requires large approximate polynomial degree, namely, \(\Omega (\sqrt {n})\) . This proof idea is inspired by previous works [40, 44, 45, 66] that followed a similar strategy in the context of communication complexity: they studied randomised reductions from set-disjointness (communication analogue of Or) to various communication search problems. Finally, we also give a separate (non-robust) proof that SoPL is hard for \(\mathbb {F}\) -NS over any field \(\mathbb {F}\) using the intersection theorem (Theorem 6).
Our lower bound for \(\epsilon\) -NS, say with \(\epsilon := 1/2\) , now helps us prove Theorem 1. We consider an SA refutation Equation (1) of the SoD principle (which is a stronger principle than SoPL). The non-existence of a low-degree \(\epsilon\) -NS refutation for SoPL immediately implies that in any SA refutation of SoD, the conical junta J has to assume a value at least \(\epsilon\) on some input: The right-hand side (RHS) equals \(1+J(x)\ge 1+\epsilon = 1.5\) for some x. Our idea is to now iterate the \(\epsilon\) -NS lower-bound argument by combining several SoPL instances inside SoD with the aim of finding large values on the RHS. After i iterations, we show the RHS equals \(1+J(x_i)\ge 1.5^{\Omega (i)}\) for some carefully constructed input \(x_i\) that embeds i copies of SoPL. Setting \(i=\mbox{poly}(n)\) concludes the proof.

2 Separations in TFNP

A major motivation for our proof complexity separations in Section 1 is that they have consequences in terms of black-box separations between subclasses of TFNP. Together with prior work, our new separations resolve all the black-box relationships between classes depicted in Figure 2. To explain this connection in detail, we start with a short introduction to TFNP.
Fig. 2.
Fig. 2. Class inclusion diagram for TFNP. An arrow \({\text{ A}} \rightarrow {\text{ B}}\) means \({\text{ A}} \subseteq {\text{ B}}\) relative to all oracles. A dashed arrow \({\text{ A}} \dashrightarrow {\text{ B}}\) means \({\text{ A}} \not\subseteq {\text{ B}}\) relative to some oracle. We have only drawn new separations proved in this article. Together with prior oracle separations [4, 12, 59], this resolves all black-box relationships between the classes featured in the diagram. In the black-box model, some classes can be captured using propositional proof systems, as indicated in blue.

2.1 Introduction to TFNP

The class TFNP consists of all total NP search problems, that is, search problems where a solution is guaranteed to exist, and where it can be efficiently checked whether a given candidate solution is feasible. Some very important problems lie in TFNP, for example, Factoring (given a number, compute a prime factor) or Nash (given a bimatrix game, compute a Nash equilibrium).
A crucial observation is that no TFNP problem can be NP-hard, unless \({\text{ NP}} = {\text{ coNP}}\) [57]. Furthermore, it is believed that TFNP is unlikely to have complete problems [64]. As a result, to understand the complexity of important TFNP problems, researchers have defined syntactic subclasses of TFNP, such as PLS [47], PPAD, PPADS, PPA, and PPP [62]. These subclasses are defined using canonical complete problems that correspond to very simple existence principles.
PLS: Every directed acyclic graph has a sink.
PPAD: Every directed graph with an unbalanced node (outdegree \(\ne\) indegree) must have another unbalanced node.
PPADS: Every directed graph with a positively unbalanced node (outdegree \(\gt\) indegree) must have a negatively unbalanced node (outdegree \(\lt\) indegree).
PPA: Every undirected graph with an odd-degree node must have another odd-degree node.
PPP: Every function mapping \([n+1]\) to \([n]\) must have a collision. (Pigeonhole Principle)
These existence principles naturally give rise to corresponding total search problems. For example, for PPAD that would be: given a directed graph and an unbalanced node in that graph, find another unbalanced node. These problems are defined so that the search space (the set of nodes) has size exponential in the size of the input. Otherwise, it would be trivial to find a solution in polynomial time. In more detail, this is achieved by having the input of the problem consist of a Boolean circuit that can be used to compute the neighbours of any given node.
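To make the oracle access concrete, here is a brute-force sketch (our own toy encoding, not from the article) of the canonical PPAD-style problem on graphs with in- and outdegree at most one, where the successor and predecessor maps are exposed only as query oracles:

```python
# Node 0 is unbalanced (it has a successor but no predecessor); find another
# unbalanced node. The oracles succ/pred return the neighbour or None.
# This solver makes one query per node -- exponentially many in the bit-length
# of a node name, far from the efficient algorithms the class asks for.
def end_of_line(succ, pred, num_nodes):
    for v in range(1, num_nodes):
        s, p = succ(v), pred(v)
        if (s is None) != (p is None):   # exactly one neighbour: unbalanced
            return v

# A toy line 0 -> 1 -> 2 (hypothetical instance): node 2 is the solution.
succ = {0: 1, 1: 2, 2: None}.get
pred = {0: None, 1: 0, 2: 1}.get
assert end_of_line(succ, pred, 3) == 2
```

In the white-box setting described above, `succ` and `pred` would instead be computed by a given Boolean circuit.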
The theory of TFNP classes has been successful in capturing the complexity of many important natural problems. Indeed, in a celebrated result [17, 26], it was shown that Nash is complete for PPAD. Following this breakthrough, other problems from game theory [18, 30, 58] and economics [16, 19, 21] were also proved PPAD-complete. Similarly, PLS has been found to capture the complexity of various interesting problems, mainly ones where a local optimum of some sort is sought [31, 52, 53, 67]. Finally, various problems in fair division are PPA-complete [35], while some problems related to cryptography have been shown PPP-complete [69].
New classes and collapses. More recently, newer classes CLS [27], EOPL [33, 42], SOPL [39] were defined, motivated chiefly by problems that were unlikely to be complete for any of the classical classes discussed above. Indeed, it was noted that many interesting problems lie in both PLS and PPAD, but are unlikely to be complete for PLS \(\cap\) PPAD, a seemingly completely artificial class. To remedy this situation, CLS, and later EOPL, were defined as more natural subclasses of PLS \(\cap\) PPAD. However, in a surprising turn of events, it was discovered that \({\text{ CLS}} = {\text{ PLS}} \cap {\text{ PPAD}}\) [32] and also that \({\text{ EOPL}} = {\text{ PLS}} \cap {\text{ PPAD}}\) and \({\text{ SOPL}} = {\text{ PLS}} \cap {\text{ PPADS}}\) [38]. In other words, the new classes can be completely defined in terms of the classical ones.
To rule out further surprising collapses in the future, it would thus make sense, whenever one defines a new subclass, to also provide some kind of evidence that the new class is indeed new, and does not collapse to existing classes. Clearly, any unconditional separation is completely out of reach, since it would immediately imply that \({\text{ P}} \ne {\text{ NP}}\) . However, it turns out that one can indeed prove separations relative to oracles by proving unconditional separations between black-box versions of the classes.
The black-box model. Recall that TFNP subclasses are defined in terms of very simple existence principles that are turned into (white-box) total search problems by having the input be implicitly described by a Boolean circuit. Another—sometimes more natural—choice is to have the input be described by a black box, instead of a white box. For example, in the case of PPAD, instead of being given the description of a circuit that can be used to compute neighbours, we can consider the model where we can query an oracle (black-box) to ask for the neighbours of a node.
More formally, a total query search problem is a sequence of relations \(R_n \subseteq \lbrace 0,1\rbrace ^n \times O_n\) , one for each size \(n \in \mathbb {N}\) , such that for all inputs \(x \in \lbrace 0,1\rbrace ^n\) there is an output \(o \in O_n\) such that \((x,o) \in R_n\) . Here, \(O_n\) is a finite set of outputs, and we say that o is a solution to instance x, when \((x,o) \in R_n\) . We think of an instance \(x \in \lbrace 0,1\rbrace ^n\) as a very long bitstring that can only be accessed through queries to individual bits. In this context, an efficient algorithm is a deterministic algorithm that, for any \(x \in \lbrace 0,1\rbrace ^n\) , finds a solution o to x by performing a small number of queries to x, namely, at most \(\mbox{poly}(\log n)\) queries. Thus, efficient algorithms correspond to decision trees (with leaves labelled by elements of \(O_n\) ) of depth at most \(\mbox{poly}(\log n)\) . Note that this model is non-uniform: the problem admits an efficient algorithm, if for each \(n \in \mathbb {N}\) , there exists a shallow decision tree solving \(R_n\) .
The notion of total search problems as defined above does not quite correspond to TFNP yet, because it is missing the requirement for efficient verification of solutions. We enforce this in the following natural way. A total search problem \({\rm\small R} = (R_n)_n\) is in \({\text{ TFNP}} ^{dt}\) , if for each \(o \in O_n\) there is a decision tree \(T_o\) with depth \(\mbox{poly}(\log n)\) such that for every \(x \in \lbrace 0,1\rbrace ^n\) , \(T_o(x) = 1\) if and only if \((x,o) \in R_n\) . We define the class \({\text{ PPAD}} ^{dt}\) as the set of all \({\text{ TFNP}} ^{dt}\) problems that have an efficient decision-tree reduction to (the query version of) the canonical complete problem for PPAD. We denote by \({\text{ PPAD}} ^{dt}(R_n)\) the decision tree complexity of a reduction from \(R_n\) to the canonical \({\text{ PPAD}} ^{dt}\) -complete problem (see Section 3 for a precise definition). Thus, problem \({\rm\small R}=(R_n)_n\) lies in \({\text{ PPAD}} ^{dt}\) if and only if \({\text{ PPAD}} ^{dt}(R_n) = \mbox{poly}(\log n)\) . The decision-tree analogues of the other classes are defined in the same way.
Black-box separations. In the black-box model, it is now possible to prove unconditional separations, e.g., that \({\text{ PPAD}} ^{dt} \not\subseteq {\text{ PLS}} ^{dt}\) by showing that there is no shallow decision-tree reduction from some problem in \({\text{ PPAD}} ^{dt}\) to a complete problem for \({\text{ PLS}} ^{dt}\) . Importantly, a black-box separation also provides some evidence that the separation might hold in the white-box setting too, in the following sense: any black-box separation implies a corresponding separation in the white-box model relative to some oracle [4]. Moreover, all existing containment results (including the recent collapses [32, 38]) also hold in the black-box setting. Thus, a black-box separation is quite significant, since it rules out any collapse using existing techniques.
Previously, Beame et al. [4] proved all possible separations between the classes \({\text{ PPA}} ^{dt}\) , \({\text{ PPAD}} ^{dt}\) , \({\text{ PPADS}} ^{dt}\) , and \({\text{ PPP}} ^{dt}\) . Subsequently, Morioka [59] extended these results by proving that \({\text{ PPAD}} ^{dt}\) is not reducible to \({\text{ PLS}} ^{dt}\) . This implies that none of \({\text{ PPA}} ^{dt}\) , \({\text{ PPAD}} ^{dt}\) , \({\text{ PPADS}} ^{dt}\) , and \({\text{ PPP}} ^{dt}\) are contained in \({\text{ PLS}} ^{dt}\) . Buresh-Oppenheim and Morioka [12] further proved that \({\text{ PLS}} ^{dt}\) is not contained in \({\text{ PPA}} ^{dt}\) . It has so far remained open whether \({\text{ PLS}} ^{dt} \subseteq {\text{ PPADS}} ^{dt}\) or \({\text{ PLS}} ^{dt} \subseteq {\text{ PPP}} ^{dt}\) .
Connection to proof complexity. Propositional proof complexity is a major tool for proving black-box separations. There is a natural correspondence between total query search problems and CNF contradictions. In one direction, a CNF contradiction \(F := C_1 \wedge \cdots \wedge C_m\) over the variables \(x=(x_1, \ldots , x_n)\) naturally gives rise to a corresponding total search problem \(S(F)\) : given an assignment \(x\in \lbrace 0,1\rbrace ^n\) , find an unsatisfied clause of F. Formally, we define \(S(F) \subseteq \lbrace 0,1\rbrace ^n \times [m]\) by \((x,i) \in S(F)\) if and only if \(C_i(x) = 0\) . Thus, a sequence of unsatisfiable CNF formulas \({\rm\small F} = (F_n)\) , where \(F_n\) has n variables, defines the total search problem \(S({\rm\small F}) = (S(F_n))\) . Note that \(S({\rm\small F}) \in {\text{ TFNP}} ^{dt}\) if \(F_n\) has width \(\mbox{poly}(\log n)\) .
In the other direction, a problem \({\rm\small R} = (R_n)\) in \({\text{ TFNP}} ^{dt}\) can be written equivalently as \(S({\rm\small F})\) for some sequence of CNF contradictions \({\rm\small F}=(F_n)\) . Specifically, for \(R_n \subseteq \lbrace 0,1\rbrace ^n \times O_n\) , we define the formula \(F_n:= \bigwedge _{o \in O_n} \lnot T_o(x)\) , where we note that \(T_o(x)\) can naturally be written as a DNF formula of width at most \(\mbox{poly}(\log n)\) (with one term per accepting leaf of \(T_o\) ), and thus \(\lnot T_o(x)\) can be written as a CNF formula of the same width.
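The first direction of this correspondence can be sketched directly (the clause encoding and names are ours):

```python
# The search problem S(F): given an assignment x, return the index of a
# falsified clause. Clauses are lists of literals (var, sign); a clause is
# falsified when every one of its literals evaluates to False under x.
def S(F, x):
    for i, clause in enumerate(F):
        if all(x[v] != s for (v, s) in clause):
            return i   # clause i is falsified, so (x, i) is a solution

# The contradiction (x0 ∨ x1) ∧ (¬x0) ∧ (¬x1): every x falsifies some clause.
F = [[(0, True), (1, True)], [(0, False)], [(1, False)]]
assert S(F, (0, 0)) == 0 and S(F, (1, 1)) == 1 and S(F, (0, 1)) == 2
```

Totality of `S(F, ·)` is exactly the unsatisfiability of F, and each solution is verified by reading only the \(|C_i|\) bits mentioned in clause \(C_i\), matching the shallow verifier decision trees \(T_o\).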

2.2 New Characterisations

The above connection to proof complexity opens up the possibility to characterise search problem classes by propositional proof systems, in the following sense: the problem \((S(F_n))_n\) lies in class X if and only if the CNF formulas \((F_n)_n\) have small refutations in proof system Y. To make this more precise, for any proof system \({\rm\small P}\) and a CNF formula F, we define
\begin{equation*} {\rm\small P}(F) ~:=~ \min _{\text{${\rm\small P}$-proof $\Pi $ of $F$}} \big [\log \mbox{size}(\Pi) + \deg (\Pi)\big ]. \end{equation*}
Here, \(\deg (\Pi)\) should be understood as width when \(\text{P}\) is Resolution (or RevRes) and as depth when \(\text{P}\) is tree-like Resolution. Prior work has established the following characterisations.
\({\text{ FP}} ^{dt}(S(F)) = \Theta (\mbox{TreeRes}(F))\) [56].
\({\text{ PLS}} ^{dt}(S(F)) = \Theta (\mbox{Res}(F))\) [15].
\({\text{ PPA}} ^{dt}(S(F)) = \Theta (\mathbb {F}_2\mbox{-NS}(F))\) [39].
\({\text{ PPA}} _p^{dt}(S(F)) = \Theta (\mathbb {F}_p\mbox{-NS}(F))\) for every prime p [49].
We contribute the following new characterisations. For one of them, we need to introduce one more proof system, Reversible Resolution with Terminals (RevResT), defined in Section 6.3.
Theorem 3.
For any unsatisfiable CNF formula F, we have:
\({\text{ PPAD}} ^{dt}(S(F)) = \Theta (\mbox{uNS}(F))\) .
\({\text{ PPADS}} ^{dt}(S(F)) = \Theta (\mbox{uSA}(F))\) .
\({\text{ SOPL}} ^{dt}(S(F)) = \Theta (\mbox{RevRes}(F))\) .
\({\text{ EOPL}} ^{dt}(S(F)) = \Theta (\mbox{RevResT}(F))\) .
Together with our proof complexity separations from Section 1, we immediately obtain the following black-box separations (which yield white-box oracle separations as discussed above).
Corollary 1.
\({\text{ PLS}} ^{dt} \not\subseteq {\text{ PPADS}} ^{dt}\) .
Corollary 2.
\({\text{ SOPL}} ^{dt} \not\subseteq {\text{ PPA}} ^{dt}\) .
Additional characterisations, as well as separation results, were obtained in subsequent works [43, 55].

2.3 Two Further Separations

We show two more black-box separations involving classes \({\text{ PPP}} ^{dt}\) and \({\text{ UEOPL}} ^{dt}\) , which currently lack elegant proof system characterisations. The first separation strengthens Corollary 1.
Theorem 4.
\({\text{ PLS}} ^{dt} \not\subseteq {\text{ PPP}} ^{dt}\) .
Theorem 5.
\({\text{ EOPL}} ^{dt} \not\subseteq {\text{ UEOPL}} ^{dt}\) .
(An early preprint of this work did not include the above theorems. In an independent work, Bonacina and Thapen [9] also proved Theorem 4, deriving it from Corollary 1 using essentially the same proof as we do.)
Theorem 4 settles the last open oracle separation question between the five original TFNP classes introduced in References [47, 62]. This question was re-asked recently by Daskalakis in his Nevanlinna Prize lecture [25, Open Question 6]. Previously, Buresh-Oppenheim and Morioka [12] showed a partial result in the direction of Theorem 4, namely, that there is no reduction from \({\text{ PLS}} ^{dt}\) to \({\text{ PPP}} ^{dt}\) that preserves the number of solutions in each instance. Finally, Theorem 5 answers a question of Reference [33] who introduced the class UEOPL. They conjectured that \({\text{ EOPL}} \not\subseteq {\text{ UEOPL}}\) and asked whether this could be shown relative to an oracle.

2.4 Intersection Theorems in Proof Complexity

Our new characterisations can be combined with the collapses \({\text{ SOPL}} = {\text{ PLS}} \cap {\text{ PPADS}}\) and \({\text{ EOPL}} = {\text{ PLS}} \cap {\text{ PPAD}}\) [38] (which hold in the black-box model) to produce completely new types of results in propositional proof complexity that we call intersection theorems.
Stated plainly, the first of these results says that a CNF formula F admits an efficient (small degree and size) Reversible Resolution refutation if and only if it admits an efficient Resolution refutation and an efficient unary Sherali–Adams refutation. In other words, Reversible Resolution is the “intersection” of Resolution and unary Sherali–Adams. We can similarly show that Reversible Resolution with Terminals is the “intersection” of Resolution and unary Nullstellensatz.
Theorem 6.
For any unsatisfiable CNF formula F, we have:
\(\mbox{RevRes}(F) = \Theta (\mbox{Res}(F) + \mbox{uSA}(F))\) .
\(\mbox{RevResT}(F) = \Theta (\mbox{Res}(F) + \mbox{uNS}(F))\) .
To our knowledge, these are the first theorems of their type, that is, showing that efficient proofs exist in one system \({\rm\small P}_0\) if and only if efficient proofs exist in two other systems \({\rm\small P}_1\) and \({\rm\small P}_2\) . This is all the more striking given that all three of these proof systems are quite natural, being motivated from Boolean logic and SAT-solving ( \(\mbox{Res}\) ), linear programming ( \(\mbox{uSA}\) ), and MaxSAT solving ( \(\mbox{RevRes}\) ). Moreover, the proof of this theorem (Section 7) crucially uses both perspectives of proof systems and total search problems. Starting with propositional proofs in Resolution and unary Sherali–Adams, we convert them to efficient formulations of \(S(F)\) in \({\text{ PLS}} ^{dt}\) and \({\text{ PPADS}} ^{dt}\) , respectively. We then apply the collapse theorem to argue there is an efficient formulation of \(S(F)\) in \({\text{ SOPL}} ^{dt}\) , which we can finally convert back to a \(\mbox{RevRes}\) proof. We see no apparent way to prove this theorem directly using classic proof complexity techniques.

2.5 Open Problems

In our opinion, exploring the interplay between TFNP and propositional proof complexity holds untapped potential. The results in this work arose from our core belief that a natural concept introduced in one theory should have a natural counterpart in another theory. This philosophy suggests many further directions for research and serves as a guiding principle for formulating new beautiful connections between the two theories. For example:
(1)
Can Theorem 1 be strengthened to show that the Sum-of-Squares system needs huge coefficients to simulate Resolution in low degree?
(2)
Can we characterise the class PPP by a proof system?
(3)
Does unary-NS p-simulate \(\mathbb {Z}\) -NS for refuting CNF formulas?
(4)
Can we prove other intersection theorems in propositional proof complexity?
(5)
Do Sum-of-Squares and Polynomial Calculus characterise some TFNP classes?
(6)
Are there communication complexity analogues of our results? The recent column [28] surveys the connections between total search problems and characterisations of various circuit models in the language of communication complexity (via Karchmer–Wigderson games).
We note here that Buss, Fleming, and Impagliazzo [14] have recently provided an answer to question (5) by giving a TFNP characterisation of Polynomial Calculus. In fact, they show a more general connection: every well-behaved proof system that can prove its own soundness is characterised by a TFNP problem, and vice versa. This also answers question (2), although ideally we would like to characterise PPP by a more natural proof system than the one obtained through this generic connection.

3 Definitions

In this section, we give formal definitions of the total search problems that we consider in this work. We emphasise that unlike the standard uniform setting of TFNP, we will be interested in non-uniform variants of TFNP classes defined by decision trees.

3.1 Decision Tree TFNP

Definition 1.
A total (query) search problem is a sequence of relations \({\rm\small R} = \lbrace R_n \subseteq \lbrace 0,1\rbrace ^{n} \times O_n\rbrace\) , where \(O_n\) are finite sets, such that for all \(x \in \lbrace 0,1\rbrace ^n\) there is an \(o \in O_n\) such that \((x, o) \in R_n\) . A total search problem \({\rm\small R}\) is in \({\text{ TFNP}} ^{dt}\) if for each \(o \in O_n\) there is a decision tree \(T_o\) with depth \(\mbox{poly}(\log n)\) such that for every \(x \in \lbrace 0,1\rbrace ^n\) , \(T_o(x) = 1\) iff \((x, o) \in R_n\) .
While total search problems are formally defined as sequences \({\rm\small R} = (R_n)\) , it will often make sense to speak of an individual search problem \(R_n\) in the sequence. We will therefore slightly abuse notation and also call \(R_n\) a total search problem. It will also be convenient to encode total search problems with inputs and outputs chosen from domains other than \(\lbrace 0,1\rbrace ^n\) . One common example will be total search problems where the inputs are chosen from \([n]^n\) . We can simulate this simply by encoding all elements of the non-Boolean domain in binary in the usual way. In all examples in this article, performing this encoding will change the complexities of the involved problems by no more than a \(O(\log n)\) factor. We also allow the nth problem \(R_n\) in a sequence to have \(\mbox{poly}(n)\) input bits (instead of n) for notational convenience.
The canonical examples of total search problems in \({\text{ TFNP}} ^{dt}\) are the search problems associated with an unsatisfiable CNF formula F.
Definition 2.
For any unsatisfiable CNF formula \(F := C_1 \wedge \cdots \wedge C_m\) over n variables, define \(S(F) \subseteq \lbrace 0,1\rbrace ^n \times [m]\) by \((x, i) \in S(F)\) if and only if \(C_i(x) = 0\) .
Therefore, given any sequence of unsatisfiable CNF formulas \({\rm\small F} = \lbrace F_1, F_2, \ldots \rbrace\) , we get a total search problem \(S({\rm\small F}) = \lbrace S(F_1), S(F_2), \dots \rbrace\) in the natural way. Observe that \(S({\rm\small F}) \in {\text{ TFNP}} ^{dt}\) if each unsatisfiable CNF formula has width \(\mbox{poly}(\log n)\) . Conversely, these examples are also complete, in the sense that any search problem in \({\text{ TFNP}} ^{dt}\) can be re-encoded as unsatisfiable CNF formulas.
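To make the search problem \(S(F)\) concrete, here is a small illustrative sketch of ours (not from the article); the clause encoding and the helper name `falsified_clause` are our own choices. Totality is immediate: since F is unsatisfiable, every assignment falsifies some clause.

```python
# Toy illustration of S(F): given an unsatisfiable CNF F and an assignment x,
# find a falsified clause. A clause is a list of literals; literal +i means
# x_i and -i means NOT x_i (1-indexed variables).

def falsified_clause(clauses, x):
    """Return the index i of some clause C_i with C_i(x) = 0."""
    for i, clause in enumerate(clauses):
        if not any((x[abs(l) - 1] == 1) if l > 0 else (x[abs(l) - 1] == 0)
                   for l in clause):
            return i  # every literal of clause i is false under x
    raise AssertionError("F was satisfiable: no falsified clause")

# F = (x1) AND (NOT x1 OR x2) AND (NOT x2): unsatisfiable over x1, x2.
F = [[1], [-1, 2], [-2]]
print(falsified_clause(F, [0, 0]))  # 0: clause (x1) is falsified
print(falsified_clause(F, [1, 1]))  # 2: clause (NOT x2) is falsified
```

Note that each output's verifier only reads the variables of one clause, which is the decision-tree structure that Definition 1 formalises.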
Definition 3.
For any total search problem \(R\subseteq \lbrace 0,1\rbrace ^n\times O\) with solution verifiers \(T_o\) , \(o\in O\) , its encoding as an unsatisfiable CNF formula is given by \(F:= \bigwedge _{o \in O} \lnot T_o(x)\) , where we think of \(\lnot T_o(x)\) written as a CNF formula (of width determined by the decision tree depth of \(T_o\) ).
Fig. 3.
Fig. 3. Examples of total search problems. The distinguished source node is drawn as a yellow square. Red nodes are associated with solutions. (For visual clarity, we highlight the actual sink nodes for SoD rather than their predecessors.) Nodes circled in green would be solutions for EoL and EoPL, respectively.

3.2 Search Problem Zoo

We now define several search problems that will be of interest to us. See also Figure 3 for helpful illustrations of some of them. We start with the problems that are complete for the classical classes introduced in References [47, 62].
PPP:
Pigeon ( \({\boldsymbol{\rm{P}{\rm\small{IGEON}}}_{n}}\) ). This problem features n pigeons, denoted by \([n]\) , and as input we are given, for each pigeon \(u\in [n]\) , a hole \(s_u \in [n-1]\) . The goal is to output
(1)
\(u,v \in [n]\) , if \(u \ne v\) and \(s_u = s_v\) . (pigeon collision)
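The totality of Pigeon is exactly the pigeonhole principle: n pigeons cannot map injectively into \(n-1\) holes. A minimal illustrative solver (ours, not from the article):

```python
# Illustrative brute-force solver for the Pigeon problem: n pigeons mapped
# into n-1 holes must collide, so a solution always exists.

def pigeon_collision(s):
    """s[u] in {0,...,n-2} is the hole of pigeon u; return a colliding pair (u, v)."""
    seen = {}  # hole -> first pigeon placed there
    for u, hole in enumerate(s):
        if hole in seen:
            return seen[hole], u  # the pigeonhole principle guarantees this fires
        seen[hole] = u
    raise AssertionError("unreachable: n pigeons cannot injectively map to n-1 holes")

print(pigeon_collision([0, 1, 2, 1]))  # (1, 3): pigeons 1 and 3 share hole 1
```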
PPADS:
\({\boldsymbol{\rm{S}{\rm\small{INK-OF-}}\rm{L}{\rm\small{INE}}(SoL)}_{n}}\) . This problem is defined on a set of n nodes, denoted by \([n]\) , where the node 1 is “distinguished.” For input, we are given a successor \(s_u \in [n]\) for each node \(u \in [n]\) and a predecessor \(p_u \in [n]\) for each node \(u \ne 1\) . Given this list of successor/predecessor pointers, we create a directed graph G, where we add an edge \((u, v)\) if and only if \(s_u = v\) and \(p_v = u\) . We say u is a proper sink if it has in-degree 1 and out-degree 0, and it is a proper source if it has in-degree 0 and out-degree 1. The goal of the search problem is to output any of the following:
(1)
1, if 1 is not a proper source node in G, or (no distinguished source)
(2)
\(i \ne 1\) , if i is a proper sink node in G. (proper sink)
PPAD:
\({\boldsymbol{\rm{E}{\rm\small{ND-OF-}}\rm{L}{\rm\small{INE}}(EoL)}_{n}}\) . Same as SoL, except we add the following feasible solution.
(1)
\(i \ne 1\) , if i is a proper source node in G. (proper source)
PLS:
\({\boldsymbol{\rm{S}{\rm\small{INK-OF-}}\rm{D}{\rm\small{AG}}(SoD)}_{n}}\) . This problem is defined on the \([n] \times [n]\) grid, where the node \((1,1)\) is “distinguished.” As input, for each grid node \(u=(i,j) \in [n] \times [n]\) , we are given a successor \(s_u \in [n] \cup \lbrace {\textsf {null}}\rbrace\) , interpreted as naming a node \((i+1,s_u)\) on the next row. We say a node u is active if \(s_u \ne {\textsf {null}}\) , otherwise it is inactive. A node u is a proper sink if u is inactive but some active node has u as a successor. The goal of the search problem is to output any of the following:
(1)
\((1,1)\) , if \((1,1)\) is inactive, (inactive distinguished source)
(2)
\((n, j)\) , if \((n,j)\) is active, (active sink)
(3)
\((i, j)\) for \(i \le n-1\) , if \((i,j)\) is active and its successor is a proper sink. (proper sink)
For SoD, it is helpful to think of the successors \(s_u\) as describing a fan-out 1 dag on an \(n \times n\) grid of nodes such that all edges are between adjacent rows. Active nodes are those nodes that have some edge leaving them. If we require that \((1, 1)\) is active and all nodes on row n are inactive, then the goal is to find a proper sink, that is, an active node with an inactive successor node.
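The path-following intuition can be sketched as a toy solver (an illustrative sketch of ours; the name `solve_sod` and the dictionary encoding are hypothetical choices, not from the article). Starting at the distinguished node and following successor pointers must, within n steps, produce one of the three solution types:

```python
def solve_sod(n, s):
    """s[(i, j)] is the successor column of node (i, j), or None (inactive).
    Follow successors from (1, 1); some solution is reached within n steps."""
    if s[(1, 1)] is None:
        return ("inactive-source", (1, 1))
    i, j = 1, 1
    while True:
        if i == n:                       # active node on the last row
            return ("active-sink", (i, j))
        nxt = (i + 1, s[(i, j)])
        if s[nxt] is None:               # (i, j) is active with a proper-sink successor
            return ("proper-sink", (i, j))
        i, j = nxt

# A 3x3 instance: path (1,1) -> (2,2) -> (3,1), where (3,1) is inactive.
n = 3
s = {(i, j): None for i in range(1, n + 1) for j in range(1, n + 1)}
s[(1, 1)], s[(2, 2)] = 2, 1
print(solve_sod(n, s))  # ('proper-sink', (2, 2))
```

This linear-time path following is what places SoD in PLS; the lower bounds discussed later show that no low-degree algebraic refutation can match it.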
We next define complete problems for the more modern classes introduced in References [33, 39, 42]. They are variations of the SoD problem where all nodes in the grid have predecessor pointers, and we only add an edge if the successor and predecessor pointers agree. In particular, this implies that every node has fan-out and fan-in 1.
SOPL:
\({\boldsymbol{\rm{S}{\rm\small{INK-OF-}}\rm{P}{\rm\small{OTENTIAL-}}{\rm{L}{\rm\small{INE}}}(SoPL)}_{n}}\) . As input, we are given a successor \(s_{u} \in [n] \cup \lbrace {\textsf {null}}\rbrace\) for each \(u \in [n] \times [n]\) and a predecessor \(p_u \in [n] \cup \lbrace {\textsf {null}}\rbrace\) for each \(u \in \lbrace 2, \dots , n\rbrace \times [n]\) . A node \((i,j) \in [n-1] \times [n]\) is active if \(s_{(i,j)} = k \ne {\textsf {null}}\) and \(p_{(i+1,k)} = j\) , otherwise it is inactive; a node \((i,j) \in \lbrace n\rbrace \times [n]\) is active if \(s_{(i,j)} \ne {\textsf {null}}\) and inactive otherwise. A node u is a proper sink if u is inactive but some active node has u as a successor. The goal is to output any of the following:
(1)
\((1,1)\) , if \((1,1)\) is inactive, (inactive distinguished source)
(2)
\((n, j)\) , if \((n, j)\) is active, (active sink)
(3)
\((i, j)\) , if \((i,j)\) is a proper sink. (proper sink)
EOPL:
\({\boldsymbol{\rm{E}{\rm\small{ND-OF-}}\rm{P}{\rm\small{OTENTIAL-}}{\rm{L}{\rm\small{INE}}}(EoPL)}_{n}}\) . Add the following feasible solution to SoPL. A node \((i,j)\) is a proper source if \((i,j)\) is active and either \(i = 1\) , or \(1 \lt i \lt n\) and there is no active node with \((i, j)\) as a successor.
(1)
\((i,j)\) , if \((i, j)\ne (1,1)\) and \((i,j)\) is a proper source. (proper source)
UEOPL:
\({\boldsymbol{\rm{U}{\rm\small{NIQUE-}}EoPL\ (UEoPL)}_{n}}\) . Add the following feasible solution to EoPL.
(1)
\((i, j)\) and \((i, j^{\prime })\) , if \(j \ne j^{\prime }\) and both nodes are active. (two parallel lines)

3.3 Reductions and Formulations

Given any problem defined above, we can consider complexity classes of total search problems obtained by taking reductions to these problems. In this work, we are particularly interested in the case where the reduction is defined by a low-depth decision tree.
Definition 4.
Let \(R \subseteq \lbrace 0,1\rbrace ^n \times O\) and \(S \subseteq \lbrace 0,1\rbrace ^m \times O^{\prime }\) be total search problems. An S-formulation of R is a decision-tree reduction \((f_i, g_o)_{i \in [m], o \in O^{\prime }}\) from R to S. Formally, for each \(i \in [m]\) and \(o \in O^{\prime }\) there are functions \(f_i:\lbrace 0,1\rbrace ^n \rightarrow \lbrace 0,1\rbrace\) and \(g_o:\lbrace 0,1\rbrace ^n \rightarrow O\) such that
\begin{equation*} (x, g_o(x)) \in R \Leftarrow (f(x), o) \in S, \end{equation*}
where \(f(x) \in \lbrace 0,1\rbrace ^m\) is the string whose ith bit is \(f_i(x)\) . The depth of the reduction is
\begin{equation*} d ~:=~ \max \big (\lbrace D(f_i) : i \in [m]\rbrace \cup \lbrace D(g_o) : o \in O^{\prime }\rbrace \big), \end{equation*}
where \(D(h)\) denotes the decision-tree depth of h. The size of the reduction is m, the number of input bits to S. The complexity of the reduction is \(\log m + d\) . We write \(S^{dt}(R)\) to denote the minimum complexity of an S-formulation of R.
We extend these notations to sequences in the natural way. If R is a single search problem and \({\rm\small S} = (S_m)\) is a sequence of search problems, then we denote by \({\rm\small S}^{dt}(R)\) the minimum of \(S^{dt}_m(R)\) over all m. If \({\rm\small R} = (R_n)\) is also a sequence, then we denote by \({\rm\small S}^{dt}({\rm\small R})\) the function \(n\mapsto {\rm\small S}^{dt}(R_n)\) .
Using the previous definition, we can now define complexity classes of total search problems via reductions. For total search problems \({\rm\small R} = (R_n), {\rm\small S} = (S_n)\) , we write
\begin{equation*} {\rm\small S}^{dt} ~:=~ \lbrace {\rm\small R} : {\rm\small S}^{dt}({\rm\small R}) = \mbox{poly}(\log n)\rbrace . \end{equation*}
We can now define the decision-tree variants of the standard classes: \({\text{ PPP}} ^{dt} = {\text{ Pigeon}} ^{dt}\) , \({\text{ PPADS}} ^{dt} = {\text{ SoL}} ^{dt}\) , and so on, according to the problems defined in Section 3.2.

4 Reversible Resolution versus Nullstellensatz

In this section, we prove Theorem 2, restated below.
Theorem 2.
There are n-variate CNF formulas F that can be refuted by constant-width polynomial-size RevRes, but such that any \(\mathbb {F}\) -NS refutation (over any \(\mathbb {F}\) ) of F requires degree \(n^{\Omega (1)}\) .
We prove Theorem 2 in two ways. First, in Sections 4.1 to 4.3, we give a particularly robust proof in the special case \(\mathbb {F}=\mathbb {R}\) , which will be useful in Section 5, when we prove our other separation result. Second, in Section 4.4, we give a (non-robust) proof for all \(\mathbb {F}\) using the intersection theorem. In both proofs, we consider the SoPL principle and show that it does not admit a low-degree NS proof, and that it can be refuted in low-width small-size RevRes.

4.1 Approximate Nullstellensatz

We define a generalisation of \(\mathbb {R}\) -NS that we call \(\epsilon\) -approximate Nullstellensatz ( \(\epsilon\) -NS) where \(\epsilon \in (0,1)\) is an error parameter. An \(\epsilon\) -NS refutation of a set of real polynomial equations \(\lbrace a_i(x)=0:i\in [m]\rbrace\) is a set of polynomials \(\lbrace p_i(x)\rbrace\) such that
\begin{equation} \sum _{i\in [m]} p_i(x)\cdot a_i(x) ~=~ 1\pm \epsilon , \quad \qquad \forall x\in \lbrace 0,1\rbrace ^n, \end{equation}
(4)
where we recall that “ \(= 1\pm \epsilon\) ” stands for “ \(\in [1 - \epsilon , 1 + \epsilon ]\) ,” meaning that the LHS is a polynomial that takes values in \([1 - \epsilon , 1 + \epsilon ]\) when evaluated on Boolean inputs. The \(\epsilon\) -NS system is not a standard proof system in the sense of Cook and Reckhow [22]. In particular, it is not hard to show (using the PCP theorem) that testing the condition in Equation (4) is in fact coNP-complete. Another feature of the new system is that the error parameter can be efficiently reduced using standard error reduction techniques for polynomial approximation. For example, if we compose any \(\epsilon\) -NS proof \(\sum _ip_ia_i=1\pm \epsilon\) with the univariate polynomial \(q(z):= z(2-z)\) , then we obtain an \(\epsilon ^2\) -NS proof \(q(\sum _ip_ia_i)=1\pm \epsilon ^2\) .
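The error-reduction step is elementary to verify numerically: since \(q(z) = z(2-z) = 1-(1-z)^2\) , any value \(z = 1\pm \epsilon\) is mapped to \(1\pm \epsilon ^2\) (at the cost of doubling the degree of the composed proof). A quick sanity check of ours:

```python
# Numeric check of the error-reduction polynomial q(z) = z*(2 - z).
# Since q(z) = 1 - (1 - z)^2, if z = 1 +/- eps then q(z) = 1 +/- eps^2.

def q(z):
    return z * (2 - z)

eps = 0.3
worst = 0.0
for k in range(1001):  # sample z over the interval [1 - eps, 1 + eps]
    z = 1 - eps + 2 * eps * k / 1000
    worst = max(worst, abs(1 - q(z)))
print(worst <= eps ** 2 + 1e-12)  # True: the error drops from eps to eps^2
```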

4.2 Lower Bound for ε-NS

Recall that the input to \({\text{ SoPL}} _n\) consists of successor pointers \(s_u\in [n]\cup \lbrace {\textsf {null}}\rbrace\) and predecessor pointers \(p_u\in [n]\cup \lbrace {\textsf {null}}\rbrace\) for each grid node \(u\in [n]\times [n]\) . For the purposes of NS, we encode this input in binary by a string \(y\in \lbrace 0,1\rbrace ^{n^{\prime }}\) over \(n^{\prime } =O(n^2\log n)\) variables. Moreover, we can think of \({\text{ SoPL}} _n\) as an unsatisfiable set of polynomial equations \(\lbrace a_i(y)=0\rbrace\) each of degree \(O(\log n)\) . These equations can be obtained by taking the unsatisfiable CNF encoding of \({\text{ SoPL}} _n\) (Definition 7) and encoding each clause as the corresponding polynomial equation in the usual way.
Our goal is to prove the following lemma.
Lemma 1.
Every \(\frac{1}{2}\) -NS refutation of \({\text{ SoPL}} _n\) requires degree \(n^{\Omega (1)}\) .
It suffices to prove the lemma for error \(\epsilon := 0.01\) , because of efficient error reduction. Fix any \(\epsilon\) -NS refutation \(\sum _i p_i(y) a_i(y) = 1 \pm \epsilon\) of degree k for \({\text{ SoPL}} _n\) . Our goal is to show a lower bound on k. We will give a randomised decision-to-search reduction, in the style of References [40, 44, 45, 66], showing that a low-degree \(\epsilon\) -NS refutation would imply a low-degree approximating polynomial for the \((n-1)\) -bit Or function. The following well-known fact then concludes the proof.
Fact 1([60]).
Suppose that p is an n-variate real polynomial such that \(p(x)={\text{ Or}} _n(x)\pm 1/3\) for all \(x\in \lbrace 0,1\rbrace ^n\) . Then \(\deg (p)\ge \Omega (\sqrt {n})\) .
Definition of reduction. We define a depth-d deterministic reduction as a pair \((f,u)\) such that
(1)
\(f:\lbrace 0,1\rbrace ^{n-1}\rightarrow \lbrace 0,1\rbrace ^{n^{\prime }}\) is a function that maps an input x of \({\text{ Or}} _{n-1}\) to an input \(y=f(x)\) of \({\text{ SoPL}} _n\) . Moreover, each output bit \(f_i(x)\in \lbrace 0,1\rbrace\) is a depth-d decision tree function of x.
(2)
For any input x, the only solutions of \(y=f(x)\) are active sinks on the last row \(\lbrace n\rbrace \times [n]\) . We write \(\mbox{Sol}(y)\subseteq \lbrace n\rbrace \times [n]\) for the set of solutions in y. Moreover, \(u\in \mbox{Sol}(y)\) is a solution called the planted solution. (Note that u does not depend on x.)
(3)
If \({\text{ Or}} (x)=0\) , then \(y=f(x)\) contains a unique solution, namely, \(\mbox{Sol}(y)=\lbrace u\rbrace\) .
(4)
If \({\text{ Or}} (x)=1\) , then \(y=f(x)\) contains at least two solutions, \(|\mbox{Sol}(y)|\ge 2\) .
We then define a depth-d randomised reduction \(\mathcal {R}\) as a probability distribution over depth-d deterministic reductions \((\boldsymbol {f},\boldsymbol {u})\sim \mathcal {R}\) . For every x, we write \(\mathcal {R}_x\) for the distribution of \((\boldsymbol {f}(x),\boldsymbol {u}) = (\boldsymbol {y},\boldsymbol {u})\) . We say that a pair \((\boldsymbol {y},\boldsymbol {u})\) is ideal if it satisfies the following.
Ideal \((\boldsymbol {y},\boldsymbol {u})\) : Let y be any outcome of \(\boldsymbol {y}\) and consider \(\boldsymbol {u}\) conditioned on \(\boldsymbol {y}=y\) , namely, \(\boldsymbol {u}^{\prime }:=(\boldsymbol {u}\mid \boldsymbol {y}=y)\) . Then \(\boldsymbol {u}^{\prime }\) is uniformly distributed over \(\mbox{Sol}(y)\) ; in short, \(\boldsymbol {u}^{\prime }\sim \mbox{Sol}(y)\) .
We say \(\mathcal {R}\) is ideal if \(\mathcal {R}_x\) is ideal for every x.
Ideal reduction \(\Rightarrow\) Approximation to Or. Next, we show that if we had an ideal reduction, then we could construct an approximating polynomial for Or. We write \(i_u\) for the unique i such that the polynomial equation \(a_i(y)=0\) encodes the \({\text{ SoPL}} _n\) constraint that u is not an active sink. Namely, this corresponds to the equation \(s_u = 0\) , where the bit \(s_u \in \lbrace 0,1\rbrace\) of the input y encodes whether or not u is active (see Definition 7). If we think of \(u\in \lbrace n\rbrace \times [n]\) as encoded by an \(O(\log n)\) -bit string, then we can define an \([n^{\prime }+O(\log n)]\) -variate polynomial
\begin{equation} \textstyle q(y,u) ~:=~ p_{i_u}(y) a_{i_u}(y) ~=~ \sum _i{1}[i=i_u] p_i(y)a_i(y). \end{equation}
(5)
Here, for every i, the indicator function \({1}[i=i_{u}]\in \lbrace 0,1\rbrace\) is computed by an \(O(\log n)\) -degree polynomial. This means q has degree \(\deg (q)\le O(k\log n)\) . If \((\boldsymbol {y},\boldsymbol {u})\) is ideal, then
\begin{align} {\mathbb {E}}[q(\boldsymbol {y},\boldsymbol {u})] &~=~ {\mathbb {E}}_{y\sim \boldsymbol {y}}\big [{\mathbb {E}}_{u^{\prime }\sim (\boldsymbol {u}\mid \boldsymbol {y}=y)}[p_{i_{u^{\prime }}}(y) a_{i_{u^{\prime }}}(y)]\big ] \nonumber \\ &~=~ {\mathbb {E}}_{y\sim \boldsymbol {y}}\big [{\mathbb {E}}_{u^{\prime }\sim \mbox{Sol}(y)}[p_{i_{u^{\prime }}}(y) a_{i_{u^{\prime }}}(y)]\big ] \nonumber \\ &~=~ {\mathbb {E}}_{y\sim \boldsymbol {y}}\Big [|\mbox{Sol}(y)|^{-1}\sum \limits _{u^{\prime }\in \mbox{Sol}(y)}p_{i_{u^{\prime }}}(y) a_{i_{u^{\prime }}}(y)\Big ] \nonumber \\ &~=~ {\mathbb {E}}_{y\sim \boldsymbol {y}}\Big [|\mbox{Sol}(y)|^{-1}\sum _i p_i(y) a_i(y)\Big ] \nonumber \\ &~=~ {\mathbb {E}}_{y\sim \boldsymbol {y}}\big [|\mbox{Sol}(y)|^{-1}\big ]\cdot (1\pm \epsilon) \nonumber \\ &~=~ (1\pm \epsilon)\cdot {\mathbb {E}}\big [|\mbox{Sol}(\boldsymbol {y})|^{-1}\big ], \end{align}
(6)
where we used the fact that \(\sum _{u^{\prime }\in \mbox{Sol}(y)}p_{i_{u^{\prime }}}(y) a_{i_{u^{\prime }}}(y) = \sum _i p_i(y) a_i(y)\) , because \(a_i(y) = 0\) for all \(i \notin \lbrace i_{u^{\prime }}: u^{\prime } \in \mbox{Sol}(y)\rbrace\) , given that y satisfies all the \({\text{ SoPL}} _n\) constraints, except the equations requiring that \(u^{\prime }\) not be an active sink, for \(u^{\prime } \in \mbox{Sol}(y)\) .
Suppose for a moment that we had an ideal depth-d randomised reduction \(\mathcal {R}\) . Then, we could construct the polynomial
\begin{equation*} \textstyle r(x) ~:=~ {\mathbb {E}}_{\mathcal {R}_x}[q(\boldsymbol {y},\boldsymbol {u})] ~=~ \sum _{f,u}\Pr _{\mathcal {R}}[(\boldsymbol {f},\boldsymbol {u})=(f,u)]\cdot q(f(x),u). \end{equation*}
We have \(\deg (r)\le O(dk\log n)\) . Moreover, if \({\text{ Or}} (x)=0\) , then \(r(x)=1\pm \epsilon\) ; and if \({\text{ Or}} (x)=1\) , then \(r(x)\in [0,(1+\epsilon)/2]\) , since \({\mathbb {E}}[|\mbox{Sol}(\boldsymbol {y})|^{-1}] \in [0,1/2]\) . Thus, for \(\epsilon =0.01\) , if we consider \(t(x) := 1-r^2(x)\) , then we get that t approximates Or to within error \(1/3\) . Using Fact 1, we deduce that \(k\ge \Omega (\sqrt {n}/(d\log n))\) .
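The stated intervals can be sanity-checked numerically (our own check, using endpoint values, which suffice since \(1-r^2\) is monotone in r on \([0,\infty)\) ):

```python
# Checking the approximation bounds for t(x) = 1 - r(x)^2 with eps = 0.01.
eps = 0.01

# Or(x) = 0: r in [1 - eps, 1 + eps], so t should be within 1/3 of 0.
t_or0 = [1 - r * r for r in (1 - eps, 1 + eps)]
print(all(abs(t) <= 1 / 3 for t in t_or0))          # True

# Or(x) = 1: r in [0, (1 + eps)/2], so t should be within 1/3 of 1.
t_or1 = [1 - r * r for r in (0.0, (1 + eps) / 2)]
print(all(abs(t - 1) <= 1 / 3 for t in t_or1))      # True
```

For instance, at the worst endpoint \(r=(1+\epsilon)/2=0.505\) we get \(t=0.745\) , still within \(1/3\) of 1.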
In summary, all that remains is to find an ideal reduction of shallow depth. Unfortunately, we do not know how to design an ideal reduction for SoPL. We instead give a reduction that is locally indistinguishable from an ideal one, which will suffice for us.
Fig. 4.
Fig. 4. Randomised reduction \(\mathcal {R}\) . First, we compute \(y(x)\) deterministically from x. This input always contains a path down the left-most column, which terminates at the active sink u (planted solution). Moreover, for every i with \(x_i=1\) there is a path down the \((i+1)\) st column. The number of active sinks is \(|\mbox{Sol}(y)|=1+|x|\) , where \(|x|\) denotes the Hamming weight. In the second step, we randomly permute every row of nodes, except the first one. This yields the random output \((\boldsymbol {y},\boldsymbol {u})\) of the reduction.
A locally ideal reduction. Consider the following depth-1 randomised reduction \(\mathcal {R}\) ; see Figure 4.
(1)
Let \(y=y(x)\) be the input to \({\text{ SoPL}} _n\) that has a directed path running down the first column of nodes, starting at distinguished node \((1,1)\) and terminating at the active sink \(u:= (n,1)\) (say u is made active by being assigned 1 as successor). Moreover, we activate a path in y down column \(i\ge 2\) iff \(x_{i-1}=1\) . Note that y is a depth-1 decision tree function of x, and u does not depend on x at all.
(2)
Let \(\boldsymbol {y}=\boldsymbol {y}(x)\) be obtained from y so that, for each row except the first, \(i\in [n]\setminus \lbrace 1\rbrace\) , randomly permute the nodes \(\lbrace i\rbrace \times [n]\) on that row (updating the successor/predecessor pointers). Let \(\boldsymbol {u}\) be the sink node that u is mapped to.
(3)
Output \((\boldsymbol {f},\boldsymbol {u})\) where \(\boldsymbol {f}(x):= \boldsymbol {y}(x)\) .
It is easy to check that \(\mathcal {R}\) satisfies items (1)–(4) for every outcome of randomness. In particular, we have \(|\mbox{Sol}(\boldsymbol {y})|=1+|x|\) . Unfortunately, \(\mathcal {R}\) is not ideal: \(\boldsymbol {u}\) is always the active sink at the end of the path starting at the distinguished node. What we would really like instead is for \(\mathcal {R}_x\) to be distributed as the ideal pair \((\boldsymbol {y},\boldsymbol {u})\sim \mathcal {I}_x\) defined by the following procedure: Sample \((\boldsymbol {y},\boldsymbol {u}^{\prime })\sim \mathcal {R}_x\) ; define \(\boldsymbol {u}\) such that for every outcome y, \((\boldsymbol {u}\mid \boldsymbol {y}=y)\sim \mbox{Sol}(y)\) ; and output \((\boldsymbol {y},\boldsymbol {u})\) .
Define two functions \(\lbrace 0,1\rbrace ^{n-1}\rightarrow \mathbb {R}\) by
\begin{align} r(x) ~:=~&\textstyle {\mathbb {E}}_{\mathcal {R}_x}[q(\boldsymbol {y},\boldsymbol {u})], \end{align}
(7)
\begin{align} r^{\prime }(x) ~:=~&\textstyle {\mathbb {E}}_{\mathcal {I}_x}[q(\boldsymbol {y},\boldsymbol {u})]. \end{align}
(8)
We know that r has low degree as a polynomial, \(\deg (r)\le O(k\log n)\) , and \(r^{\prime }\) has the ideal output behaviour, \(r^{\prime }(x)= (1\pm \epsilon)\cdot {\mathbb {E}}\big [|\mbox{Sol}(\boldsymbol {y}(x))|^{-1}\big ]\) by Equation (6). The following claim shows that, in fact, \(r=r^{\prime }\) , and hence we can get the best of both worlds. By the discussion above, we are then able to construct an \(O(k\log n)\) -degree approximating polynomial for Or, which concludes the proof of Lemma 1.
Claim 1.
We have \(r(x)=r^{\prime }(x)\) for all \(x\in \lbrace 0,1\rbrace ^{n-1}\) .
Proof.
By linearity of expectation, it suffices to show \({\mathbb {E}}_{\mathcal {R}_x}[m(\boldsymbol {y},\boldsymbol {u})]={\mathbb {E}}_{\mathcal {I}_x}[m(\boldsymbol {y},\boldsymbol {u})]\) for any monomial m of q and every x. Fix a monomial m. We claim that \(\mathcal {R}_x\) and \(\mathcal {I}_x\) have the same marginal distribution over the variables read by m, which would prove the claim. We may assume that \(\deg (m)\le O(k\log n)\le o(n)\) , because otherwise Lemma 1 is proved. Hence, there exist two consecutive rows \(i,i+1\in [n/3,2n/3]\) such that m does not read any variables associated with either row. Starting with a sample \((\boldsymbol {y},\boldsymbol {u})\sim \mathcal {R}_x\) , we can generate a sample from \(\mathcal {I}_x\) as follows: Consider active nodes \(A\subseteq \lbrace i\rbrace \times [n]\) and \(B\subseteq \lbrace i+1\rbrace \times [n]\) on rows i and \(i+1\) in \(\boldsymbol {y}\) and the \(|A|=|B|=1+|x|\) many directed edges joining them (defined by successor pointers for row i and predecessor pointers for row \(i+1\) ). Reroute these edges by choosing a random bijection \(A\rightarrow B\) , and denote the resulting input by \(\boldsymbol {y}^{\prime }\) . Then \((\boldsymbol {y}^{\prime },\boldsymbol {u})\sim \mathcal {I}_x\) . This proves our claim about the marginals, since our modification to the input \(\boldsymbol {y}\) was done outside the variables read by m. □
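The rerouting step can be illustrated by a small simulation (ours; the parameters k = 4 and the trial count are arbitrary, and this only illustrates the uniform-sink behaviour, not the full marginal-equality argument of Claim 1): replacing the edges between two consecutive rows by a random bijection sends the path from the distinguished source to a uniformly random sink.

```python
import random

# Illustrative simulation: k parallel paths cross a "reroute" row where the
# k edges between rows i and i+1 are replaced by a random bijection A -> B.
# The sink reached from the distinguished source is then uniform over k sinks.

random.seed(0)
k, trials = 4, 20000
counts = [0] * k
for _ in range(trials):
    bijection = list(range(k))
    random.shuffle(bijection)      # uniformly random bijection between the rows
    counts[bijection[0]] += 1      # column 0 carries the distinguished path

expected = trials / k
print(all(abs(c - expected) < 0.05 * trials for c in counts))  # True
```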

4.3 Upper Bound for RevRes

Our characterisation of \({\text{ SOPL}} ^{dt}\) by RevRes in Section 6.3 involves proving that \({\text{ SoPL}} _n\) (understood as an \(O(\log n)\) -width CNF contradiction) admits an \(O(\log n)\) -width polynomial-size RevRes refutation (Theorem 10). If we want to further optimise this down to a constant-width polynomial-size RevRes refutation, as claimed by Theorem 2, then we can consider instead a sparse constant-width variant of \({\text{ SoPL}} _n\) . Indeed, the following sparsifying construction is standard, and so we only sketch it.
We start by defining a bounded-degree dag G that models the connectivity structure of the \([n]\times [n]\) grid with successor/predecessor pointers. The nodes of G include all the grid nodes \([n]\times [n]\) . Moreover, for each \(u\in [n-1]\times [n]\) , we include in G a successor tree \(S_u\) that is a full binary tree with n leaves, and has edges directed from the root toward the leaves. Similarly, for each \(u\in ([n]\setminus \lbrace 1\rbrace)\times [n]\) , we include in G a predecessor tree \(P_u\) whose edges are directed from leaves toward the root. We identify the root nodes of \(S_u\) and \(P_u\) with u. Moreover, for grid nodes \((i,j)\) and \((i+1,k)\) appearing on consecutive rows, we identify the kth leaf of \(S_{(i,j)}\) and the jth leaf of \(P_{(i+1,k)}\) . This completes the description of G. Note that the in/out-degree of every node is at most 2.
We can now define a search problem \({\text{ SoPL}} _G\) relative to G. As input, each node u in G gets a successor \(s_u\) and a predecessor \(p_u\) picked from \(\lbrace 0,1\rbrace \cup \lbrace {\textsf {null}}\rbrace\) . For example, \(s_u=0\) ( \(s_u=1\) ) means that u’s successor is the left (right) child of u in G. The constraints of \({\text{ SoPL}} _G\) can now be written in constant width. The RevRes upper bound in Theorem 10 can be adapted to yield a constant-width polynomial-size refutation of \({\text{ SoPL}} _G\) . Moreover, the original grid version \({\text{ SoPL}} _n\) can be reduced to the graph version \({\text{ SoPL}} _G\) using an \(O(\log n)\) -depth decision tree reduction; see, for example, Reference [36, Section 4.2] for details (but for SoD instead of SoPL). The existence of this reduction implies that \({\text{ SoPL}} _G\) needs large \(\epsilon\) -NS degree, because we showed that \({\text{ SoPL}} _n\) does.
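The tree-based encoding behind this sparsification can be sketched as follows (our sketch; the helper names `encode` and `decode` are hypothetical): an n-way successor pointer \(s_u\in [n]\) is replaced by \(\lceil \log _2 n\rceil\) one-bit choices along the root-to-leaf path of the successor tree, so each constraint of \({\text{ SoPL}} _G\) mentions only O(1) bits per tree node.

```python
import math

# Sketch of the pointer encoding: a full binary tree with n leaves turns a
# successor choice j in [n] into ceil(log2 n) single-bit left/right choices.

def encode(j, n):
    """Bits of the root-to-leaf path for leaf j (0-indexed), n leaves."""
    depth = math.ceil(math.log2(n))
    return [(j >> (depth - 1 - b)) & 1 for b in range(depth)]

def decode(bits):
    j = 0
    for b in bits:
        j = 2 * j + b  # 0 = go to the left child, 1 = go to the right child
    return j

n = 8
print(all(decode(encode(j, n)) == j for j in range(n)))  # True
```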
This concludes the proof of Theorem 2 in case \(\mathbb {F}=\mathbb {R}\) .

4.4 Lower Bound for \(\mathbb {F}\) -NS

We now prove the lower bound in Theorem 2 for any field \(\mathbb {F}\) .
Lemma 2.
\(\mathbb {F}\mbox{-NS}({\text{ SoPL}} _n)\ge n^{\Omega (1)}\) .
Proof.
Prior work has shown that \({\text{ SoD}} _n\) (understood as an \(O(\log n)\) -CNF) requires \(n^{\Omega (1)}\) -degree \(\mathbb {F}\) -NS refutations [11, 13, 29], and similarly that \({\text{ SoL}} _n\) (understood as an \(O(\log n)\) -CNF) requires \(n^{\Omega (1)}\) -degree \(\mathbb {F}\) -NS refutations [4, 6]. Define the CNF formula
\begin{equation*} F_n ~:=~ {\text{ SoD}} _n\wedge {\text{ SoL}} _n, \end{equation*}
where \({\text{ SoD}} _n\) and \({\text{ SoL}} _n\) are defined on disjoint sets of variables. The following claim (proved below) states that \(F_n\) requires \(n^{\Omega (1)}\) -degree \(\mathbb {F}\) -NS refutations, or, in other words, \(\mathbb {F}\mbox{-NS}(F_n) \ge n^{\Omega (1)}\) .
Claim 2.
Let F and G be two CNF contradictions over disjoint sets of variables. If F and G require \(\mathbb {F}\) -NS refutations of degree \(\ge d\) , then \(F\wedge G\) requires \(\mathbb {F}\) -NS refutations of degree \(\ge d\) .
By the definition of \(F_n\) , we have
\begin{align*} \Theta (\mbox{Res}(F_n))~=~{\text{ PLS}} ^{dt}(S(F_n)) ~=~& {\text{ SoD}} ^{dt}(S(F_n)) ~\le ~ O(\log n),\\ \Theta (\mbox{uSA}(F_n))~=~{\text{ PPADS}} ^{dt}(S(F_n)) ~=~& {\text{ SoL}} ^{dt}(S(F_n)) ~\le ~ O(\log n). \end{align*}
By the intersection theorem (Theorem 6) corresponding to \({\text{ SOPL}} = {\text{ PLS}} \cap {\text{ PPADS}}\) , we conclude that \(S(F_n)\) has an efficient SoPL-formulation:
\begin{equation*} \Theta (\mbox{RevRes}(F_n)) ~=~ {\text{ SOPL}} ^{dt}(S(F_n)) ~=~ {\text{ SoPL}} ^{dt}(S(F_n)) ~\le ~ O(\log n). \end{equation*}
If we had \(\mathbb {F}\mbox{-NS}({\text{ SoPL}} _n)\le n^{o(1)}\) , then because \(S(F_n)\) reduces to \({\text{ SoPL}} _{n^{O(1)}}\) via an \(O(\log n)\) -depth decision tree reduction, we would have \(\mathbb {F}\mbox{-NS}(F_n)\le n^{o(1)}\) , which is a contradiction. □
Proof of Claim 2.
The least degree of an \(\mathbb {F}\) -NS refutation of a set of polynomial equations \(\mathcal {F}:= \lbrace a_i(x)=0\rbrace\) can be characterised by the maximum d such that \(\mathcal {F}\) admits a d-design [13, Section 2], that is, an \(\mathbb {F}\) -linear map \(\varphi :\mathbb {F}[x]\rightarrow \mathbb {F}\) satisfying (i) \(\varphi (1)=1\) , and (ii) \(\varphi (q(x)\cdot a_i(x))=0\) for all \(a_i\) and \(q\in \mathbb {F}[x]\) such that \(\deg (q)+\deg (a_i)\lt d\) . Let \(\varphi\) and \(\varphi ^{\prime }\) be d-designs for F and G (encoded as sets of polynomial equations) over variables x and y, respectively. For each monomial \(m(x)m^{\prime }(y)\) in variables \(x,y\) , we define \(\Phi (m(x)m^{\prime }(y)):=\varphi (m(x))\cdot \varphi ^{\prime }(m^{\prime }(y))\) . We can extend this definition linearly into a map \(\Phi :\mathbb {F}[x,y]\rightarrow \mathbb {F}\) . We claim that \(\Phi\) is a d-design for \(F\wedge G\) . Indeed, for (i), we have \(\Phi (1)=\varphi (1)\varphi ^{\prime }(1)=1\) . For (ii), it suffices to check the condition for each monomial \(q(x,y)=m(x)m^{\prime }(y)\) and an axiom \(a_i(x)\) of F (the case of G is analogous) with \(\deg (q)+\deg (a_i)\lt d\) . We have \(\Phi (m(x)m^{\prime }(y)\cdot a_i(x))=\varphi (m(x)\cdot a_i(x))\varphi ^{\prime }(m^{\prime }(y))=0\cdot \varphi ^{\prime }(m^{\prime }(y))=0\) . □
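The product functional \(\Phi\) can be made concrete in a toy sketch (ours; the numeric values for \(\varphi ,\varphi ^{\prime }\) below are hypothetical placeholders, not actual d-designs): represent a monomial as a frozenset of variable names and a functional as a dictionary of monomial values.

```python
# Toy sketch of the product functional Phi(m(x)*m'(y)) = phi(m) * phi'(m').
# The designs phi, phi' here carry placeholder values for illustration only.

def product_functional(phi, phi_prime, x_vars, y_vars):
    def Phi(monomial):
        mx = frozenset(v for v in monomial if v in x_vars)  # the x-part m(x)
        my = frozenset(v for v in monomial if v in y_vars)  # the y-part m'(y)
        return phi[mx] * phi_prime[my]
    return Phi

phi = {frozenset(): 1, frozenset({"x1"}): 0.5}       # phi(1) = 1
phi_p = {frozenset(): 1, frozenset({"y1"}): 0.25}    # phi'(1) = 1
Phi = product_functional(phi, phi_p, {"x1"}, {"y1"})
print(Phi(frozenset()))              # 1: condition (i), Phi(1) = 1
print(Phi(frozenset({"x1", "y1"})))  # 0.125 = phi(x1) * phi'(y1)
```

Since the variable sets are disjoint, every monomial over \(x,y\) splits uniquely into its x-part and y-part, which is what makes \(\Phi\) well defined.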

5 Resolution versus Sherali–Adams

In this section, we prove Theorem 1, restated below.
Theorem 1.
There are n-variate CNF formulas F that can be refuted by constant-width Resolution, but such that any SA refutation of F in degree \(n^{o(1)}\) requires coefficients of magnitude \(\exp (n^{\Omega (1)})\) .
We consider the SoD principle. We first show that it requires large coefficients to refute in low-degree SA, and then we recall why it has low-width Resolution refutations.

5.1 Lower Bound for SA

We consider the \({\text{ SoD}} _{n^2}\) search problem on the grid \([n^2]\times [n^2]\) . We think of this large grid as being further subdivided into \(n^2\) many subgrids, each of size \(n\times n\) . The \((i,j)\) -subgrid consists of nodes
\begin{equation*} ((i-1)n,(j-1)n)+[n]\times [n] ~:=~\big \lbrace ((i-1)n+i^{\prime },(j-1)n+j^{\prime }):(i^{\prime },j^{\prime })\in [n]\times [n]\big \rbrace . \end{equation*}
Recall that the input to this search problem consists of a successor \(s_u\in [n^2]\cup \lbrace {\textsf {null}}\rbrace\) for each grid node u. For the purposes of SA, we encode this input by a string \(x\in \lbrace 0,1\rbrace ^{n^{\prime }}\) over \(n^{\prime } =O(n^4\log n)\) variables. Moreover, we can think of \({\text{ SoD}} _{n^2}\) as a set of unsatisfiable polynomial equations \(\lbrace a_i(x)=0\rbrace\) each of degree \(O(\log n)\) . Our goal is to prove the following lemma.
Lemma 3.
Any degree- \(n^{o(1)}\) SA proof of \({\text{ SoD}} _{n^2}\) requires coefficients of magnitude \(\exp (\Omega (n))\) .
Suppose we are given a degree- \(n^{o(1)}\) SA refutation of \({\text{ SoD}} _{n^2}\) over the reals,
\begin{equation} \sum _{i\in [m]} p_i(x)a_i(x) ~=~ 1 + J(x). \end{equation}
(9)
Our idea is to apply Lemma 1 iteratively in stages to find a sequence of inputs \(x_1, \ldots , x_n\) with a RHS value \(1+J(x_i)\ge 2^{\Omega (i)}\) . Hence, Lemma 3 follows at stage \(i=n\) , since there are at most \(\exp (n^{o(1)})\) many monomials, and so one of them must have a coefficient of exponential magnitude.
We start by preprocessing the SA refutation Equation (9) for technical convenience. We may assume wlog that each term t appearing in \(J=\sum _t \alpha _t t\) satisfies the following.
(1)
t is node-aligned: if t reads some variable associated with a node u, then it reads all the \(O(\log n)\) variables associated with u. To ensure this, we may replace a term t with an equivalent sum of two terms, \(t=tx_i+t\bar{x}_i\) , which reads one more variable. Adding more literals to terms like this will only increase the degree of the proof by an \(O(\log n)\) factor.
(2)
t is curious: if t reads a node u that lies on the last row of a subgrid, that is, \(u\in \lbrace in\rbrace \times [n^2]\) for some \(i\in [n]\) , then t also reads the successor \(s_u\) of u (if any) on the next row. Similarly as above, this can be ensured by at most doubling the degree of the proof.
(3)
t is non-witnessing: it does not witness a solution to the search problem. Formally, t witnesses a violation \(a_i\ne 0\) if for all x, \(t(x)=1\Rightarrow a_i(x)\ne 0\) (or contrapositively, \(a_i(x)=0\Rightarrow t(x)=0\) ). To ensure this, if t is witnessing, we can factor \(t=p^{\prime }_i a_i\) and move t to the LHS of the proof.
First stage. Let \(y_1\) be an input to \({\text{ SoPL}} _n\) defined on nodes \([n]\times [n]\) . We can embed \(y_1\) inside an input to \({\text{ SoD}} _{n^2}\) as follows. We write \(({\textsf {null}}^*\leftarrow y_1)\) for the input to \({\text{ SoD}} _{n^2}\) , where we start with an assignment of \({\textsf {null}}\) to all nodes \([n^2]\times [n^2]\) (denoted \({\textsf {null}}^*\) ), and then overwrite the top-left \((1,1)\) -subgrid with the successor pointers in \(y_1\) (aligning the distinguished nodes of \({\text{ SoD}} _{n^2}\) and \({\text{ SoPL}} _n\) ). In this reduction, we can forget the predecessor pointers, as they are not part of the input to SoD. Now every solution of \(({\textsf {null}}^*\leftarrow y_1)\) for \({\text{ SoD}} _{n^2}\) corresponds naturally to a solution of \(y_1\) for \({\text{ SoPL}} _n\) . (A minor detail is that the active sinks in \(y_1\) correspond to proper sinks in \(({\textsf {null}}^*\leftarrow y_1)\) .) Using this reduction, we can view our SA refutation of \({\text{ SoD}} _{n^2}\) also as a refutation of \({\text{ SoPL}} _n\) .
We claim that there is some input \(y_1\) to \({\text{ SoPL}} _n\) such that for \(x^{\prime }_1:=({\textsf {null}}^*\leftarrow y_1)\) , we have an RHS value \(1+J(x^{\prime }_1)\ge 1.5\) . Suppose not: then the RHS always lies in \([1,1.5)\) for all \(y_1\) , which means we have a low-degree \(\frac{1}{2}\) -NS proof of \({\text{ SoPL}} _n\) . But this contradicts Lemma 1.
We have now found an input \(x^{\prime }_1=({\textsf {null}}^*\leftarrow y_1)\) with RHS at least 1.5. Before we iterate this argument in the second stage, we have to clean up \(x^{\prime }_1\) slightly.
Fig. 5. Illustration of the proof of Lemma 3. (a) In the first stage, we construct an input \(x_1\) to \({\text{ SoD}} _{n^2}\) that embeds an input \(y_1\) to \({\text{ SoPL}} _n\) in the top-left \((1,1)\) -subgrid, and moreover, all the active sinks of \(y_1\) are assigned as successor the top-left corner of some \((2,j)\) -subgrid. (b) The completed construction after n stages.
First stage: Clean-up. Recall that the instances considered in the proof of Lemma 1 consist of some number of directed paths that terminate at sinks \(\mbox{Sol}(y_1)\subseteq \lbrace n\rbrace \times [n]\) . We will modify \(x^{\prime }_1\) by making the nodes \(\mbox{Sol}(y_1)\) point to the same top-left corner of a \((2,j)\) -subgrid for some \(j\in [n]\) . Indeed, let \(\rho _j:\mbox{Sol}(y_1)\rightarrow [n^2]\) be the partial assignment that assigns \((n,(j-1)n)+(1,1)\) (top-left corner of the \((2,j)\) -subgrid) as the successor of all nodes in \(\mbox{Sol}(y_1)\) . Let \((x^{\prime }_1\leftarrow \rho _j)\) be the input obtained from \(x^{\prime }_1\) by applying \(\rho _j\) . (We actually have \(x^{\prime }_1=(x^{\prime }_1\leftarrow \rho _1)\) , as this is how we decided to make every node in \(\mbox{Sol}(y_1)\) an active sink in \(y_1\) .) By defining \(x_1:= (x^{\prime }_1\leftarrow \rho _j)\) for a carefully chosen \(j\in [n]\) (see Figure 5(a)), we establish the following properties for the start of the next stage.
(1a)
The only solutions in \(x_1\) are proper sinks pointing to the corner of the \((2,j)\) -subgrid.
(1b)
We have \(1+J(x_1\leftarrow y_2) \ge 1.4\) for any partial assignment \(y_2\) to nodes in the \((2,j)\) -subgrid.
Property (1a) is true by construction, and we prove Property (1b) below.
Claim 3.
There exists a \(j\in [n]\) such that Property (1b) holds.
Proof.
Let us first prove that for every term t appearing in \(J=\sum _t\alpha _t t\) , we have
\begin{equation} t(x^{\prime }_1) ~=~ t(x^{\prime }_1\leftarrow \rho _j)\qquad \forall j. \end{equation}
(10)
It suffices to show that any term t in J with \(t(x^{\prime }_1)=1\) (or \(t(x^{\prime }_1\leftarrow \rho _j)=1\) ) does not read any nodes in \(\mbox{Sol}(y_1)\) . Assume for contradiction that such a t reads a node \(u\in \mbox{Sol}(y_1)\) . Then, because t is curious, it also reads u’s successor node (note that \(s_u\ne {\textsf {null}}\) in both \(x^{\prime }_1\) and \(x^{\prime }_1\leftarrow \rho _j\) ) on the next row. This successor node is set to \({\textsf {null}}\) in \(x^{\prime }_1\) (and \((x^{\prime }_1\leftarrow \rho _j)\) ) and hence t witnesses that u is a solution (proper sink). But this contradicts our assumption that t is non-witnessing. This proves Equation (10).
Define \(J_j := \sum _{\smash{t\in T_j}}\alpha _t t\) where \(T_j\) is the set of terms t in J that do not read any node from the \((2,j)\) -subgrid. Note that each t can read from at most \(\deg (t)\le n^{o(1)}\) many different subgrids, and hence if we choose \(\boldsymbol {j}\sim [n]\) at random, then \(\Pr [t\in T_{\boldsymbol {j}}]\ge 99\%\) . We now have
\begin{equation*} \textstyle {\mathbb {E}}[1+J_{\boldsymbol {j}}(x^{\prime }_1)] ~=~ 1+\sum _t\Pr [t\in T_{\boldsymbol {j}}]\alpha _t t(x^{\prime }_1) ~\ge ~ 99\%\cdot (1+J(x^{\prime }_1)) ~\ge ~ 99\%\cdot 1.5 ~\ge ~ 1.4. \end{equation*}
(For the first inequality, we use \(\Pr [t\in T_{\boldsymbol {j}}]\ge 99\%\) on the constant 1 and on the terms with \(\alpha _t t(x^{\prime }_1)\gt 0\) , and \(\Pr [t\in T_{\boldsymbol {j}}]\le 1\) on the terms with \(\alpha _t t(x^{\prime }_1)\lt 0\) .)
By averaging, there is some fixed \(j\in [n]\) such that \(1+J_j(x^{\prime }_1)\ge 1.4\) . Defining \(x_1:= (x^{\prime }_1\leftarrow \rho _j)\) for this particular j, we have, for every assignment \(y_2\) to the \((2,j)\) -subgrid,
\begin{equation*} 1+J(x_1\leftarrow y_2) ~\ge ~ 1+J_j(x_1\leftarrow y_2) ~=~ 1+J_j(x_1) ~\overset{Equation~(10)}{=}~ 1+J_j(x^{\prime }_1) ~\ge ~ 1.4. \end{equation*}
 □
Second stage. Here, we start with the input \(x_1\) satisfying Properties (1a) and (1b) for some \(j\in [n]\) . Let \(y_2\) be any input to \({\text{ SoPL}} _n\) . We think of \(y_2\) (ignoring predecessor pointers) as embedded in the \((2,j)\) -subgrid. Consider the input \((x_1\leftarrow y_2)\) where the distinguished node of \(y_2\) is aligned with the corner of the \((2,j)\) -subgrid, which is the only sink in \(x_1\) by Property (1a). Then every solution of \((x_1\leftarrow y_2)\) for \({\text{ SoD}} _{n^2}\) corresponds to a solution of \(y_2\) for \({\text{ SoPL}} _n\) . Hence, we can view our SA refutation of \({\text{ SoD}} _{n^2}\) as a refutation of \({\text{ SoPL}} _n\) (this time in the \((2,j)\) -subgrid). Moreover, we have from Property (1b) that the RHS of the proof evaluates to \(1+J(x_1\leftarrow y_2)\ge 1.4\) for all \(y_2\) . If we scale our original SA proof by a factor \(1/1.4\) , then we get another polynomial identity,
\begin{equation} \frac{1}{1.4}\sum _{i\in [m]} p_i(x)a_i(x) ~=~ \frac{1}{1.4}(1 + J(x)), \end{equation}
(11)
where the RHS evaluates to at least 1 on any input of the form \(x=(x_1\leftarrow y_2)\) . Using Lemma 1, we can now conclude that there must exist an input \(x_2^{\prime }=(x_1\leftarrow y_2)\) such that \(\frac{1}{1.4}(1 + J(x^{\prime }_2))\ge 1.5\) , or equivalently, \(1 + J(x_2^{\prime }) \ge 1.5\cdot 1.4\) .
Second stage: Clean-up. Using exactly the same argument as in the first clean-up stage, we conclude that \(x^{\prime }_2\) can be cleaned up into \(x_2\) such that for some \(j\in [n]\) (a different j than in the first stage):
(2a)
The only solutions in \(x_2\) are proper sinks pointing to the corner of the \((3,j)\) -subgrid.
(2b)
We have \(1+J(x_2\leftarrow y_3) \ge 1.4^2\) for any partial assignment \(y_3\) to nodes in the \((3,j)\) -subgrid.
By continuing this argument in the same fashion, we can eventually, at stage n, find an input \(x_n\) with \(1+J(x_n) \ge 1.4^n\) (see Figure 5(b)). This concludes the proof of Lemma 3.
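In summary (our compact restatement of the induction, with indices as in the text), the stages maintain the invariants

```latex
1 + J(x_i \leftarrow y_{i+1}) \;\ge\; 1.4^{\,i}
\quad\text{(Property ($i$b), after the $i$-th clean-up)},
\qquad
1 + J(x_{i+1}^{\prime}) \;\ge\; 1.5 \cdot 1.4^{\,i}
\quad\text{(after stage $i+1$)},
```

so after \(n\) stages we indeed obtain \(1+J(x_n)\ge 1.4^{\,n} = 2^{\Omega (n)}\) .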

5.2 Upper Bound for Resolution

It is well-known that \({\text{ SoD}} _n\) (understood as an \(O(\log n)\) -width CNF contradiction) admits an \(O(\log n)\) -width Resolution refutation (e.g., Reference [49, Theorem 8.18]). If we want to further optimise this down to a constant-width refutation, as claimed by Theorem 1, then we can consider a sparse variant of \({\text{ SoD}} _n\) similarly as we did in Section 4.3. We omit the details.

6 Proofs of Characterisations

In this section, we prove Theorem 3, restated below.
Theorem 3.
For any unsatisfiable CNF formula F, we have:
\({\text{ PPAD}} ^{dt}(S(F)) = \Theta (\mbox{uNS}(F))\) .
\({\text{ PPADS}} ^{dt}(S(F)) = \Theta (\mbox{uSA}(F))\) .
\({\text{ SOPL}} ^{dt}(S(F)) = \Theta (\mbox{RevRes}(F))\) .
\({\text{ EOPL}} ^{dt}(S(F)) = \Theta (\mbox{RevResT}(F))\) .
Recall that the notation \({\rm\small A}^{dt}({\rm\small B})\) for total search problems \({\rm\small A}, {\rm\small B}\) is the minimum complexity (namely, \(\log \text{size} + \text{depth}\) ) of an \({\rm\small A}\) -formulation of \({\rm\small B}\) . Similarly, for a proof system \(\mbox{P}\) and CNF formula F the notation \(\mbox{P}(F)\) is the minimum of \(\log \text{size}(\Pi) + \deg (\Pi)\) where \(\Pi\) is a \(\mbox{P}\) -proof of F.

6.1 Unary Nullstellensatz and PPAD

We first argue that unary Nullstellensatz corresponds to the decision tree class \({\text{ PPAD}} ^{dt}\) .
Theorem 7.
Let F be an unsatisfiable CNF formula. Then,
If F has a degree-d size-L uNS proof, then \(S(F)\) has a depth- \(O(d)\) \({\text{ EoL}} _{O(L)}\) -formulation.
If \(S(F)\) has a depth-d \({\text{ EoL}} _{L}\) -formulation, then F has a degree- \(O(d)\) size- \(L2^{O(d)}\) uNS proof.
In particular, \({\text{ PPAD}} ^{dt}(S(F)) = \Theta (\mbox{uNS}(F))\) .
Corollary 3.
For any sequence \(F_n\) of \(\mbox{poly}(\log n)\) -width CNF formulas, \(F_n\) has a degree- \(\mbox{poly}(\log n)\) , size- \(n^{\mbox{poly}(\log n)}\) unary Nullstellensatz proof if and only if \(S(F_n) \in {\text{ PPAD}} ^{dt}\) .
We prove Theorem 7 in the next two lemmas. The proof of this theorem is itself modelled on a similar characterisation of PPA-formulations by \(\mathbb {F}_2\) -Nullstellensatz, proved by References [4, 39]. It turns out to be easier to show that EoL-formulations imply Nullstellensatz proofs, so we do that first. Furthermore, we will assume that all of our Nullstellensatz proofs are multilinearized: that is, we work modulo the \(x_i^2 - x_i = 0\) equations, and so the individual degree of any variable in the proof is at most 1. It is well-known that making this assumption will not change the degree or size of the proof by more than a constant factor [13].
Lemma 4.
Let F be an unsatisfiable CNF formula. If there is a depth-d \({\text{ EoL}} _L\) -formulation of \(S(F)\) , then there is a unary Nullstellensatz refutation of F with degree \(O(d)\) and size \(L2^{O(d)}\) .
Proof.
Suppose \(F := C_1 \wedge \cdots \wedge C_m\) is on n variables \(x_1, \dots , x_n\) , and let \(\overline{C}_i\) be the negation of \(C_i\) represented as a polynomial. Assume that there is a depth-d \({\text{ EoL}} _L\) -formulation of \(S(F)\) . Let \(V := [L]\) be the set of nodes in the EoL formulation and let \(v^* = 1\) denote the distinguished source node. Each node \(v \in V\) is equipped with successor and predecessor functions \(s_v, p_v : \lbrace 0,1\rbrace ^n \rightarrow V\) , respectively, each computed by decision trees of depth at most d, as well as a solution decision tree \(g_v : \lbrace 0,1\rbrace ^n \rightarrow [m]\) that outputs a corresponding solution of \(S(F)\) . For any input assignment \(x \in \lbrace 0,1\rbrace ^n\) let \(G_x\) denote the directed graph obtained by evaluating all the successor and predecessor decision trees on input x and adding an edge \((u, v)\) iff \(s_u(x) = v\) and \(p_v(x) = u\) .
For each \(v \in V\) define the function \(S_v : \lbrace 0,1\rbrace ^n \rightarrow \lbrace -1, 0, 1\rbrace\) by
\begin{equation*} S_v(x) := {\left\lbrace \begin{array}{ll}-1 & \text{if } v \ne v^* \text{ is a source in } G_x, \\ 1 & \text{if } v \ne v^* \text{ is a proper sink in } G_x \text{ or } v = v^* \text{ and } v^* \text{ is not a source,} \\ 0 & \text{otherwise}. \end{array}\right.} \end{equation*}
We compute \(S_v\) for each node v by a decision tree of depth at most \(5d\) as follows. First, we compute \(s_v(x) = u\) and \(p_v(x) = w\) , and then compute \(p_u(x)\) and \(s_w(x)\) . From this information, we can determine the output value of \(S_v\) , and we have used at most \(4d\) queries. If \(S_v = 0\) , then the leaf of the decision tree is labelled with 0. Otherwise, if \(S_v \ne 0\) , then v is a solution to EoL, and so in this case we will also run the decision tree for \(g_v\) and label each leaf with either 1 or \(-1\) according to the output value of \(S_v\) . Overall this requires at most \(5d\) queries.
Now, for any leaf \(\ell\) in the decision tree for \(S_v\) let \(D_\ell\) denote the polynomial representation of the conjunction of literals on the path from the root of the tree to \(\ell\) . Observe that we can represent
\begin{equation*} S_v = \sum _{(-1)\text{-leaf } \ell } - D_\ell + \sum _{\text{1-leaf } \ell } D_\ell , \end{equation*}
where the first sum is over leaves of \(S_v\) labelled with \(-1\) and the second is over leaves of \(S_v\) labelled with 1. If \(\ell\) is a non-zero leaf, then v is a solution to the EoL instance, so let \(C_\ell\) denote the solution of \(S(F)\) output by the decision tree \(g_v\) at this leaf. Observe that at every non-zero leaf \(\ell\) , the clause \(C_\ell\) must be falsified by the assignment on the path to \(\ell\) , since \(C_\ell\) is a solution to \(S(F)\) by the correctness of the EoL formulation and by the fact that we ran the \(g_v\) decision tree in \(S_v\) . This implies that for each non-zero leaf \(\ell\) of \(S_v\) , we can write \(D_\ell = D^{\prime }_\ell \overline{C}_\ell\) , and thus
\begin{equation*} S_v = \sum _{(-1)\text{-leaf } \ell } - D_\ell + \sum _{\text{1-leaf } \ell } D_\ell = \sum _{(-1)\text{-leaf } \ell } - D^{\prime }_\ell \overline{C}_\ell + \sum _{\text{1-leaf } \ell } D^{\prime }_\ell \overline{C}_\ell . \end{equation*}
If we sum up these polynomials for each \(v \in V\) and gather terms, then
\begin{equation*} \sum _{v \in V} S_v = \sum _{i=1}^m p_i \overline{C}_i \end{equation*}
for some polynomials \(p_i\) . Note that each polynomial has degree at most \(5d\) , since they are obtained from the underlying \(S_v\) decision trees.
To see that \(\sum _{i=1}^m p_i\overline{C}_i\) is a unary Nullstellensatz refutation of F, observe that since each \(S_v\) came from an EoL formulation, we have
\begin{equation*} \sum _{i=1}^m p_i(x) \overline{C}_i(x) = \sum _{v \in V} S_v(x) = (\#\text{sinks in } G_x) - (\#\text{non-distinguished sources in } G_x) = 1 \end{equation*}
for any input \(x \in \lbrace 0,1\rbrace ^n\) . Finally, we observe that all coefficients used in this proof are integers, and the number of distinct monomials produced is at most \(|V|2^{O(d)} = L2^{O(d)}\) from expanding the depth-d decision trees as polynomials. □
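The counting identity \(\sum _{v\in V} S_v(x) = 1\) at the heart of this proof can be sanity-checked on toy graphs. The following sketch (ours, not part of the formulation) models \(G_x\) as a fixed union of directed paths and evaluates the case analysis defining \(S_v\) directly; isolated nodes are excluded, as they do not arise from the matched-edge construction.

```python
# Toy check of the identity  sum_v S_v = 1  from the proof of Lemma 4.
# G_x is given as an explicit edge list (a union of directed paths,
# no isolated nodes); vstar is the distinguished source node v*.

def S(v, vstar, edges, nodes):
    indeg = {u: 0 for u in nodes}
    outdeg = {u: 0 for u in nodes}
    for (a, b) in edges:
        outdeg[a] += 1
        indeg[b] += 1
    if v != vstar:
        if indeg[v] == 0:    # non-distinguished source
            return -1
        if outdeg[v] == 0:   # proper sink
            return 1
        return 0
    # v = v*: contributes 1 iff v* is not a source
    return 1 if indeg[v] > 0 else 0

def total(vstar, edges, nodes):
    return sum(S(v, vstar, edges, nodes) for v in nodes)

# v* = 1 is a source: paths 1 -> 2 -> 3 and 4 -> 5
print(total(1, [(1, 2), (2, 3), (4, 5)], [1, 2, 3, 4, 5]))  # -> 1
# v* = 1 is not a source: path 2 -> 1 -> 3
print(total(1, [(2, 1), (1, 3)], [1, 2, 3]))                # -> 1
```

In the first configuration the sink 3 and the pair (source 4, sink 5) contribute \(+1\) net; in the second, \(v^*\) itself contributes \(+1\) while the source/sink pair cancels.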
The more difficult direction is the converse, proved next.
Lemma 5.
Let F be an unsatisfiable CNF formula. If there is a unary Nullstellensatz refutation of F with degree d and size L, then there is a depth- \(O(d)\) \({\text{ EoL}} _{O(L)}\) -formulation of \(S(F)\) .
Proof.
Let \(F = C_1 \wedge \cdots \wedge C_m\) and consider a degree-d, size-L unary Nullstellensatz refutation of F, which we write as
\begin{equation*} \sum _{i=1}^m p_i\overline{C}_i = 1, \end{equation*}
where each \(p_i\) is a multilinear polynomial over \(x_1, \dots , x_n\) and all coefficients are integers.
To build the EoL formulation, we expand the above proof out into its constituent monomials with multiplicity. That is, for each \(i \in [m]\) write the polynomial
\begin{equation*} p_i\overline{C}_i = \sum _{j} c_{i,j}q_{i,j}, \end{equation*}
where \(c_{i,j} \in \mathbb {Z}\) and \(q_{i,j}\) is a monomial obtained by expanding the polynomial directly and performing all necessary cancellations. Each node in our EoL formulation will represent one of the above monomials \(q_{i,j}\) and is considered a “ \(+\) ” or a “ \(-\) ” node, depending on that monomial’s sign. In total, we create \(m + 1\) sets of nodes \(V^*, V_1, \dots , V_m\) , defined as follows. The set \(V^*\) only contains the distinguished source node \(v^*\) , which we consider as a “ \(-\) ” node. For each \(i \in [m]\) the set \(V_i\) contains a node for each monomial \(q_{i,j}\) from the above expansion with multiplicity. So, in particular, we add \(|c_{i,j}|\) copies of the monomial \(q_{i,j}\) to \(V_i\) for each monomial \(q_{i,j}\) in the above expansion. Let V denote the set of all nodes produced by this construction. For every node \(v \in V\) , the decision tree for \(g_v\) will query no variables and output \(C_i\) if \(v \in V_i\) and an arbitrary clause if \(v = v^*\) ; our construction will explicitly prevent the source node \(v^*\) from being a solution.
Now, we must describe the successor and predecessor decision trees \(s_v, p_v\) at each node. It will be easier to describe the possible edges in \(G_x\) on an input \(x \in \lbrace 0,1\rbrace ^n\) ; all of the edges are organized into two different matchings as detailed next. See Figure 6 for a high-level illustration.
Outer Matching.
In this matching, we add edges between nodes in different node groups. All directed edges will be oriented from “ \(-\) ” nodes to “ \(+\) ” nodes. Since the polynomials form a Nullstellensatz refutation over \(\mathbb {Z}\) , we know that each time the monomial q appears with a “ \(+\) ” sign, it must also appear with a “ \(-\) ” sign, except for the single 1 term. Thus, by treating the distinguished source \(v^*\) as “ \(-1\) ,” we can create a perfect matching M on the nodes of V where every matched pair consists of a “ \(+\) ” node and a “ \(-\) ” node standing for the same monomial. Since we have gathered terms within the expansions \(p_i\overline{C}_i\) , all occurrences of monomials q within a set \(V_i\) have the same sign, and thus all the edges in this matching will be between nodes in different sets. Formally, in the EoL formulation, for each edge \(e = (u,v)\) in M corresponding to a monomial q, we add a directed edge from the “ \(-\) ” to the “ \(+\) ” node if and only if \(q(x) = 1\) . This condition can be determined by \(s_u\) and \(p_v\) by querying the variables occurring in q.
Inner Matching.
In this matching, we add directed edges from “ \(+\) ” nodes to “ \(-\) ” nodes within the same node group. Consider any set \(V_i\) . Formally, at each node occurring in the group \(V_i\) , we query all variables of the corresponding clause \(C_i\) in both the successor and predecessor functions for that node. For any \(x \in \lbrace 0,1\rbrace ^n\) , if \(C_i(x) = 1\) then \(\overline{C}_i = 0\) and thus \(p_i(x) \overline{C}_i(x) = 0\) . This means that under the partial restriction \(\rho\) consistent with x at the variables of \(C_i\) , all monomials remaining in \(p_i\overline{C}_i \upharpoonright \rho\) must cancel. We can therefore fix a perfect matching between the negative and positive instances of monomials in \(V_i\) under \(\rho\) , representing the cancellation of monomials under \(\rho\) . Then, each edge of this matching is included in the graph if and only if the monomials corresponding to its endpoints evaluate to 1 at x (note that the two endpoints will both evaluate to the same value, since they are matched under \(\rho\) ). However, if \(C_i(x) = 0\) , then we will simply not add any edges to the internal matching of \(V_i\) .
Let \(x \in \lbrace 0,1\rbrace ^n\) be any assignment to the variables of F. The edges of any node \(v \in V_i\) associated with a monomial q are determined by querying the variables of \(C_i\) and q. This implies that the depth of each successor and predecessor decision tree is \(O(d)\) , and the size is clearly \(O(L)\) , since every monomial in the proof is represented as a node.
We now verify correctness of the EoL formulation. Since it is well-defined, on every input x the graph \(G_x\) will have a solution. Let v be such a solution (either a sink or proper source node) in \(G_x\) . By construction, \(v \ne v^*\) , since the node \(v^*\) is always a source node. This implies that \(v \in V_i\) for some \(i \in [m]\) , and so v must be associated with a monomial q. By the construction of the inner and outer matching, v can only be a source or sink node in \(V_i\) if the inner matching is empty. But this can only happen if \(C_i(x) = 0\) , and thus \(C_i\) is a valid solution to \(S(F)\) . □
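As a minimal worked example (ours, not from the text), take \(F = x \wedge \overline{x}\) with the Nullstellensatz refutation

```latex
\underbrace{1 \cdot (1-x)}_{p_1\overline{C}_1} \;+\; \underbrace{1 \cdot x}_{p_2\overline{C}_2} \;=\; 1 .
```

Here \(V_1\) contains a “ \(+\) ” node for the monomial 1 and a “ \(-\) ” node for x, while \(V_2\) contains a single “ \(+\) ” node for x. The outer matching pairs \(v^*\) with the “ \(+\) ” copy of 1 and the two copies of x with each other. On input \(x = 0\) (falsifying \(C_1 = x\) ), the set \(V_1\) has no inner edges, so the node for the monomial 1 is a sink labelled \(C_1\) ; on input \(x = 1\) (falsifying \(C_2 = \overline{x}\) ), the path from \(v^*\) ends at the node of \(V_2\) , which is a sink labelled \(C_2\) .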
Fig. 6. High-level illustration of the EoL instance constructed in the proof of Lemma 5. Edges of the outer matching are shown in black, while those from the inner matching are in blue. In this example, the clause corresponding to \(V_3\) is not satisfied by assignment x and as a result no internal edges are added in \(V_3\) . Note that EoL solutions indeed only occur in \(V_3\) .

6.2 Unary Sherali–Adams and PPADS

We now show that low-degree unary Sherali–Adams proofs characterise \({\text{ PPADS}} ^{dt}\) . The proof of this fact follows the proof from the previous section quite closely, but requires some extra work to handle the extra conical junta terms.
Theorem 8.
Let F be an unsatisfiable CNF formula. Then,
If F has a degree-d, size-L unary Sherali–Adams proof, then \(S(F)\) has a depth- \(O(d)\) \({\text{ SoL}} _{O(L)}\) -formulation.
If \(S(F)\) has a depth-d \({\text{ SoL}} _{L}\) -formulation, then F has a degree- \(O(d)\) , size- \(L2^{O(d)}\) unary Sherali–Adams proof.
In particular, \({\text{ PPADS}} ^{dt}(S(F)) = \Theta (\mbox{uSA}(F))\) .
Corollary 4.
For any sequence \(F_n\) of \(\mbox{poly}(\log n)\) -width CNF formulas, \(F_n\) has a \(\mbox{poly}(\log n)\) -degree, \(n^{\mbox{poly}(\log n)}\) -size unary Sherali–Adams proof if and only if \(S(F_n) \in {\text{ PPADS}} ^{dt}\) .
Before we prove the theorem, it will be convenient to have the following simple normal form for Sherali–Adams proofs. Just like in the previous section, we will assume that all Sherali–Adams proofs are multilinearized, and it is known that this assumption does not change the degree or size of the proof by more than a constant factor [37].
Lemma 6.
Let F be an unsatisfiable CNF formula. If \(\sum _{i=1}^m p_i \overline{C}_i = 1 + J\) is a unary Sherali–Adams refutation of F with degree d and size L, then there is a degree-d, size-L unary Sherali–Adams refutation of F of the form \(\sum _{i=1}^m J_i\overline{C}_i = 1 + J_0\) , where \(J_i\) is a conical junta for each \(i = 0, 1, \dots , m\) .
Proof.
For each \(i \in [m]\) , we can expand \(p_i = \sum _{j} c_{i,j} q_{i,j}\) where \(c_{i,j}\) are integers and \(q_{i,j}\) are monomials. Each monomial \(q_{i,j}\) is a conjunction, so the expressions
\begin{equation*} J_i^{-} = \sum _{j: c_{i,j} \lt 0} |c_{i,j}|q_{i,j}, \quad J^+_i = \sum _{j: c_{i,j} \gt 0} c_{i,j}q_{i,j} \end{equation*}
are conical juntas for each \(i \in [m]\) . Writing \(p_i = J_i^+ - J_i^-\) , substituting into the Sherali–Adams refutation, and rearranging completes the proof. □
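For instance (our toy illustration), the polynomial \(p = 3x_1x_2 - 2x_3\) splits as

```latex
J^{+} = 3\,x_1x_2, \qquad J^{-} = 2\,x_3, \qquad p = J^{+} - J^{-},
```

where both \(J^{+}\) and \(J^{-}\) are conical juntas with integer coefficients, so the unary size (the sum of absolute values of coefficients) is preserved.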
We now begin the proof of Theorem 8. As before, we split the proof into two lemmas, one for each direction of the characterisation. The easier direction is again that an SoL-formulation implies a unary Sherali–Adams proof, and it almost exactly follows the proof of Lemma 4.
Lemma 7.
Let F be an unsatisfiable CNF formula. If there is a depth-d \({\text{ SoL}} _{L}\) -formulation of \(S(F)\) , then there is a unary Sherali–Adams refutation of F with degree \(O(d)\) and size \(L2^{O(d)}\) .
Proof.
The proof of this lemma is essentially the same as the proof of Lemma 4, so we will simply sketch it and note what needs to be modified. Suppose \(F := C_1 \wedge \cdots \wedge C_m\) and let \(\overline{C}_i\) be the negation of \(C_i\) represented as a polynomial. We have an SoL-formulation for \(S(F)\) , and so we have decision trees computing successors \(s_v\) and predecessors \(p_v\) for each of the nodes \(v \in V\) . As in the proof of Lemma 4, for each \(v \in V\) , we define a decision tree \(S_v\) of depth at most \(5d\) by
\begin{equation*} S_v(x) = {\left\lbrace \begin{array}{ll} 1 & \text{if } v \ne v^* \text{ is a source in } G_x, \\ -1 & \text{if either } v \text{ is a proper sink in } G_x \text{ or if } v = v^* \text{ and } v^* \text{ is not a source,} \\ 0 & \text{otherwise}, \end{array}\right.} \end{equation*}
where we note that we have switched the “ \(-1\) ” and the “ \(+1\) ” in the definition of \(S_v\) when compared to Lemma 4. As before, \(S_v(x)\) can be determined by first running the decision trees for \(s_v(x) = u\) and \(p_v(x) = w\) , then the decision trees for \(p_u(x), s_w(x)\) , and finally the decision tree for \(g_v(x)\) if the node v is a solution to SoL. From this, we can again represent
\begin{equation*} S_v = \sum _{(-1)\text{-leaf } \ell } - D_\ell + \sum _{\text{1-leaf } \ell } D_\ell , \end{equation*}
where the first sum is over leaves of \(S_v\) labelled with \(-1\) and the second is over leaves of \(S_v\) labelled with 1. However, now a node v is only a solution to SoL if \(S_v(x) = -1\) , and so for each \((-1)\) -leaf \(\ell\) of \(S_v\) , we can write \(D_\ell = D^{\prime }_\ell \cdot \overline{C}_\ell\) where \(C_\ell\) is the clause of F falsified at that leaf. This allows us to write
\begin{equation*} S_v = \sum _{(-1)\text{-leaf } \ell } - D_\ell + \sum _{\text{1-leaf } \ell } D_\ell = \sum _{(-1)\text{-leaf } \ell } - D^{\prime }_\ell \cdot \overline{C}_\ell + \sum _{\text{1-leaf } \ell } D_\ell . \end{equation*}
If we sum up these polynomials for each \(v \in V\) and gather terms, then we get
\begin{equation*} \sum _{v \in V} S_v = \sum _{i=1}^m -J_i \overline{C}_i + J_0 \end{equation*}
for some degree- \(O(d)\) conical juntas \(J_0, J_1, \dots , J_m\) . As in the proof of Lemma 4, we have that \(\sum _{v} S_v(x) = -1\) and the size and degree calculations are identical. □
The proof of the converse direction is also similar to the proof of Lemma 5, but requires some more substantial modification when compared to the previous proof. The main issue is how to handle the extra conical junta terms \(J_0\) in the unary Sherali–Adams refutation. As in the proof of Lemma 5, we will create a graph representing all the monomials in the unary Sherali–Adams proof. However, we will do some extra work to ensure that the nodes corresponding to monomials from the conical junta term \(J_0\) will always be source nodes. This ensures that any solutions will occur at nodes corresponding to some falsified clause in the formula.
Lemma 8.
Let F be an unsatisfiable CNF formula. If there is a unary Sherali–Adams refutation of F with degree d and size L, then there is a depth- \(O(d)\) \({\text{ SoL}} _{O(L)}\) -formulation of \(S(F)\) .
Proof.
Suppose \(F := C_1 \wedge \cdots \wedge C_m\) is on n variables and consider a unary Sherali–Adams refutation
\begin{equation*} \sum _{i=1}^m -J_i\overline{C}_i + J_0 = -1 \end{equation*}
of F where each \(J_i\) for \(i = 0, 1, \ldots , m\) is an integral conical junta. For notational convenience, we will let \(\overline{C}_0 := -1\) , and we will expand each conical junta \(J_i\) as a non-negative sum of conjunctions. While this notation is somewhat unusual, it allows us to write the refutation in a uniform way as
\begin{equation*} \sum _{i=1}^m -J_i\overline{C}_i + J_0 = \sum _{i=0}^m \sum _{j=1}^{t_i} -\lambda _{i,j} D_{i,j} \overline{C}_i = -1, \end{equation*}
where \(t_i\) is a non-negative integer, \(\lambda _{i,j}\) is a positive integer, and \(D_{i,j}\) is a conjunction for every \(i, j\) .
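To check that the two displays agree on the \(i = 0\) terms: since \(\overline{C}_0 = -1\) ,

```latex
\sum_{j=1}^{t_0} -\lambda_{0,j}\, D_{0,j}\, \overline{C}_0
\;=\; \sum_{j=1}^{t_0} \lambda_{0,j}\, D_{0,j}
\;=\; J_0 .
```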
To build the SoL formulation, we expand the above proof out into its constituent monomials with multiplicity. As in the proof of Lemma 5, each node in our SoL formulation will represent a monomial in the proof and is either a “ \(+\) ” or a “ \(-\) ” node, depending on that monomial’s sign. This time, however, we create a group of nodes \(V_{i,j}\) for each \(i = 0, 1, \dots , m\) and each \(j \in [t_i]\) , as well as a special group \(V^*\) . The group \(V^*\) only contains the distinguished node \(v^*\) , which we now consider as a “ \(+\) ” node. However, for each \(i, j\) , the group \(V_{i,j}\) will correspond to the polynomial \(-\lambda _{i,j}D_{i,j}\overline{C}_{i}\) . We expand this polynomial into a sum of monomials \(-\lambda _{i,j} D_{i,j}\overline{C}_{i} = \sum _q c_q q\) for some integers \(c_q\) and monomials q, and for each monomial q in this expansion, we create \(|c_q|\) nodes in \(V_{i,j}\) , each of which is a “ \(+\) ” node if \(c_q \gt 0\) and a “ \(-\) ” node otherwise. Let V denote the set of all nodes produced by this construction. For any node \(v \in V\) , if \(v \in V_{i,j}\) for some \(i \gt 0\) then the solution decision tree \(g_v\) will query no variables and simply output \(C_i\) as the solution to \(S(F)\) . Otherwise, \(g_v\) will output an arbitrary solution, as in this case by construction of the formulation the node v will never be a solution to SoL.
Now, we must describe the successor and predecessor decision trees at each node. As in the proof of Lemma 5, it will be easier to describe the possible edges in \(G_x\) as all of the edges are organized into two different matchings.
Outer Matching.
The definition of the outer matching is the same as in Lemma 5. In this matching, we add edges between nodes in different node groups. All directed edges will be oriented from “ \(+\) ” nodes to “ \(-\) ” nodes. Since the polynomials form an \(\text{SA}\) refutation over \(\mathbb {Z}\) , we know that each time the monomial q appears with a “ \(+\) ” sign, it must also appear with a “ \(-\) ” sign, except for the single \(-1\) term. Thus, by considering \(v^*\) as “ \(+1\) ,” we can create a perfect matching M of the nodes of V where all edges are between a “ \(+\) ” and a “ \(-\) ” node standing for the same monomial. Since we have gathered terms within the expansions of \(-\lambda _{i,j} D_{i,j} \overline{C}_i\) , all occurrences of monomials q within a single group \(V_{i,j}\) have the same sign and thus all the matchings are between nodes in different sets. For each edge e in M, we will add a directed edge between the “ \(+\) ” and the “ \(-\) ” node if and only if \(q(x) = 1\) ; this can be determined by querying all variables in q.
Inner Matching.
The inner matching is constructed similarly as in the proof of Lemma 5, but requires some modification. As in that proof, in the inner matching, we add directed edges from “ \(-\) ” nodes to “ \(+\) ” nodes within the same node group. However, we will now be careful to force any solution (i.e. a sink node) to occur at a “ \(-\) ” node in \(G_x\) . By our construction, the \(V^*\) group has no “ \(-\) ” nodes, and all “ \(-\) ” nodes in the group \(V_{0, j}\) for any \(j \in [t_0]\) will have successors, and thus any sink node must be associated with \(V_{i,j}\) for some \(i \gt 0\) .
Consider any set of the form \(V_{i,j}\) . (The group \(V^*\) contains only the single node corresponding to the monomial \(+1\) , so no internal edges are added there.) Formally, at each node occurring in the group \(V_{i,j}\) , we query all variables of \(\overline{C}_i\) and \(D_{i,j}\) (note that when \(i = 0\) , \(\overline{C}_0 = -1\) is a constant, and so we only query the \(D_{0,j}\) variables). For any assignment \(x \in \lbrace 0,1\rbrace ^n\) , if \(C_i(x) = 1\) then \(\overline{C}_i(x) = 0\) and thus \(D_{i,j}(x) \overline{C}_i(x) = 0\) . This means that under the partial restriction \(\rho\) consistent with x at the variables of \(C_i\) , all monomials in \(D_{i,j}\overline{C}_i \upharpoonright \rho\) must cancel to 0. We can therefore fix a directed perfect matching between the negative and positive copies of monomials in \(V_{i,j}\) , as in the proof of Lemma 5.
However, if \(\overline{C}_i(x) \ne 0\) , then \(-\lambda _{i,j}D_{i,j}(x)\overline{C}_i(x) = c\) for some integer c. If \(i \gt 0\) , then \(c \le 0\) , and so in this case, there will be \(|c|\) copies of “ \(-\) ” monomials in \(V_{i,j}\) that are not cancelled by “ \(+\) ” monomials internally. We can then fix a directed partial matching between monomials accordingly, but leaving the \(|c|\) “ \(-\) ” monomials without successors if required (these will become sink nodes). If \(i = 0\) , then \(c \ge 0\) , since \(\overline{C}_0 = -1\) , and so in this case there may be more “ \(+\) ” monomials than “ \(-\) ” monomials evaluating to 1. We can therefore fix a directed partial matching between monomials, now leaving some “ \(+\) ” monomials without predecessors (these will become new source nodes), but all “ \(-\) ” monomials will have successors and so they will not become proper sink nodes.
As described above, each decision tree in the reduction makes at most d queries, and the number of nodes in the final SoL instance is at most the size (number of monomials) of the underlying unary Sherali–Adams proof.
Finally, we verify the correctness of the SoL-formulation. Since the formulation is well-defined, on every input \(x \in \lbrace 0,1\rbrace ^n\) the graph \(G_x\) has a solution \(v \in V\) . By the definition of SoL, v must be a sink node, and therefore, by construction, v must be a “ \(-\) ” node, since “ \(+\) ” nodes always have successors by the construction of the outer matching. As described in the definition of the inner matching, every “ \(-\) ” node \(v \in V_{0, j}\) has a successor, and thus \(v \in V_{i, j}\) for some \(i \gt 0\) . But then, by the definition of the inner matching, if v is a sink node in \(V_{i,j}\) for \(i \gt 0\) , then \(C_i(x) = 0\) and the label of v is \(C_i\) , so the SoL-formulation correctly outputs a solution to \(S(F)\) . □

6.3 Reversible Resolution, SOPL, and EOPL

In this section, we define the Reversible Resolution systems (RevRes and RevResT), and prove our final characterisations capturing \({\text{ SOPL}} ^{dt}\) and \({\text{ EOPL}} ^{dt}\) .
Theorem 9.
Let F be an unsatisfiable CNF formula. Then,
If F has a width-d, size-L Reversible Resolution proof (with Terminals, respectively), then \(S(F)\) has a depth- \(O(d)\) \({\text{ SoPL}} _{O(L)}\) -formulation ( \({\text{ EoPL}} _{O(L)}\) -formulation, respectively).
If \(S(F)\) has a depth-d \({\text{ SoPL}} _L\) -formulation ( \({\text{ EoPL}} _L\) -formulation, respectively), then F has a width- \(O(d)\) , size- \(L^{O(1)}2^{O(d)}\) Reversible Resolution proof (with Terminals, respectively).
In particular, \({\text{ SOPL}} ^{dt}(S(F)) = \Theta (\mbox{RevRes}(F))\) and \({\text{ EOPL}} ^{dt}(S(F)) = \Theta (\mbox{RevResT}(F))\) .
Corollary 5.
For any sequence F of \(\mbox{poly}(\log n)\) -width CNF formulas, F has a \(\mbox{poly}(\log n)\) -width, \(n^{\mbox{poly}(\log n)}\) -size Reversible Resolution proof (with Terminals, respectively) if and only if \(S(F) \in {\text{ SOPL}} ^{dt}\) ( \(S(F) \in {\text{ EOPL}} ^{dt}\) , respectively).
Reversible Resolution and MaxSAT. We begin by formally defining Reversible Resolution refutations and comparing them to MaxSAT systems from the literature [10, 34, 54].
Definition 5.
Let F be an unsatisfiable CNF formula. If C is a clause, then the reversible weakening rule is the proof rule \(C \vdash C \vee x, C \vee \overline{x}\) , and the reversible resolution rule is the proof rule \(C \vee x, C \vee \overline{x} \vdash C\) . A reversible resolution refutation (RevRes) of F is a sequence of multisets of clauses \(\mathcal {C}_1, \mathcal {C}_2, \ldots , \mathcal {C}_t\) such that the following holds:
(1)
Every clause in \(\mathcal {C}_1\) occurs in F, possibly with multiplicity.
(2)
The multiset \(\mathcal {C}_t\) contains the empty clause \(\bot\) .
(3)
For each \(i = 1, 2, \ldots , t-1\) , the multiset \(\mathcal {C}_{i+1}\) is obtained from \(\mathcal {C}_i\) by selecting clauses in \(\mathcal {C}_{i}\) and replacing them with the result of one of the two reversible rules applied to those clauses.
The proof is a reversible resolution refutation with terminals (RevResT) if every clause in \(\mathcal {C}_t\) other than \(\bot\) is a weakening of a clause from F. The size of the proof is \(\sum _{i=1}^t |\mathcal {C}_i|\) —the number of clauses in all configurations. The width of the proof is the maximum width of any clause occurring in any configuration.
The key difference between the reversible resolution rule and the standard resolution rule is that the output of the reversible rule (viewed as a CNF formula) is logically equivalent to its input. Despite this restriction, it is clear that the reversible rules can simulate tree-like Resolution. Suppose a resolution step uses clauses \(C \vee x\) and \(D \vee \overline{x}\) to derive \(C \vee D\) . We can derive this in RevRes as follows. First, for each literal of D, apply the reversible weakening rule to \(C \vee x\) in turn to derive \(C \vee D \vee x\) (along with some extra clauses, which we can ignore). Similarly, derive \(C \vee D \vee \overline{x}\) . Then apply the reversible resolution rule to these two clauses to derive \(C \vee D\) .
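The bookkeeping above can be made concrete in a few lines of code. The following is a minimal, illustrative Python sketch (the representation — signed integers for literals, frozensets for clauses — and all function names are our own, not part of the formal system): it implements the two reversible rules, runs the simulated resolution step, and brute-forces the invariant that every assignment falsifies the same number of clauses before and after.

```python
from itertools import product

def weaken(config, clause, var):
    # Reversible weakening C ⊢ C∨x, C∨x̄: replace one copy of `clause`.
    c = list(config)
    c.remove(clause)
    return c + [clause | {var}, clause | {-var}]

def resolve(config, clause, var):
    # Reversible resolution C∨x, C∨x̄ ⊢ C: the inverse of weakening.
    c = list(config)
    c.remove(clause | {var})
    c.remove(clause | {-var})
    return c + [clause]

def falsified(config, assignment):
    # Number of clauses in the multiset falsified by a total assignment {var: bool}.
    def is_false(cl):
        return all(assignment[-l] if l < 0 else not assignment[l] for l in cl)
    return sum(is_false(cl) for cl in config)

# Simulate one resolution step: from C∨x = (x1∨x3) and D∨x̄ = (x2∨¬x3),
# derive C∨D = (x1∨x2), keeping the extra clauses produced by weakening.
config = [frozenset({1, 3}), frozenset({2, -3})]
history = [config]
config = weaken(config, frozenset({1, 3}), 2)    # adds x1∨x2∨x3, x1∨¬x2∨x3
config = weaken(config, frozenset({2, -3}), 1)   # adds x1∨x2∨¬x3, ¬x1∨x2∨¬x3
config = resolve(config, frozenset({1, 2}), 3)   # x1∨x2∨x3, x1∨x2∨¬x3 ⊢ x1∨x2
history.append(config)

# The reversible rules preserve the number of falsified clauses under
# every assignment — the invariant behind RevRes.
for bits in product([False, True], repeat=3):
    a = {1: bits[0], 2: bits[1], 3: bits[2]}
    assert falsified(history[0], a) == falsified(history[-1], a)
assert frozenset({1, 2}) in config
```

Note that the two extra clauses \(x_1 \vee \overline{x}_2 \vee x_3\) and \(\overline{x}_1 \vee x_2 \vee \overline{x}_3\) remain in the final configuration: reversibility forbids discarding them.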
However, Theorem 1 implies that RevRes cannot efficiently simulate Resolution. Intuitively, this is because of Property (3) in the definition of a reversible refutation: We must replace the clauses used in the rule with new clauses. Therefore, we cannot “duplicate” derived clauses for free, which is essential to obtain the full power of Resolution.
Indeed, the RevRes proof system is a slight strengthening of the proof system MaxSAT Resolution with Weakening (also denoted \(\mbox{MaxResW}\) ) studied in the literature on MaxSAT solvers [10, 34, 54]. The principal difference between MaxSAT Resolution and standard Resolution is that MaxSAT Resolution seeks to preserve the number of satisfied clauses under any assignment. For completeness, we define the MaxSAT Resolution proof system next.
Definition 6.
Let \(A = a_1 \vee \cdots \vee a_s\) and \(B = b_1 \vee \cdots \vee b_t\) be clauses over Boolean literals \(a_i, b_j\) . The MaxSAT resolution rule is the proof rule that, given \(x \vee A\) and \(\overline{x} \vee B\) , deduces the following set of clauses:
\begin{align*} &a_1 \vee \cdots \vee a_s \vee b_1 \vee \cdots \vee b_t, \\ &x \vee A \vee \bigvee _{i=1}^j b_i \vee \overline{b}_{j+1} \quad \forall j = 0, 1, \dots , t-1, \\ &\overline{x} \vee B \vee \bigvee _{i=1}^j a_i \vee \overline{a}_{j+1} \quad \forall j = 0, 1, \dots , s-1. \end{align*}
A MaxRes refutation of an unsatisfiable CNF F is a sequence of multisets of clauses \(\mathcal {C}_1, \dots , \mathcal {C}_t\) where \(\mathcal {C}_1\) contains exactly the clauses in F, \(\mathcal {C}_t\) contains a copy of the empty clause \(\bot\) , and the configuration \(\mathcal {C}_i\) for \(i \gt 1\) is obtained from \(\mathcal {C}_{i-1}\) by applying the MaxSAT resolution rule to some clauses in \(\mathcal {C}_{i-1}\) and replacing those clauses with the output of the rule. A MaxResW refutation is a \(\mbox{MaxRes}\) refutation that is also allowed to use the weakening rule \(C \vdash C \vee x, C \vee \overline{x}\) .
RevRes can simulate MaxResW proofs without much difficulty. The weakening rule in MaxResW is exactly the reversible weakening rule. To simulate the MaxSAT resolution rule, starting from \(x \vee A, \overline{x} \vee B\) , apply the reversible weakening rule on \(x \vee A\) to weaken it with the variable \(b_1\) , obtaining \(x \vee A \vee b_1, x \vee A \vee \overline{b}_1\) . Then, weaken \(x \vee A \vee b_1\) on the variable \(b_2\) to obtain the clauses \(x \vee A \vee b_1 \vee b_2, x \vee A \vee b_1 \vee \overline{b}_2\) . Repeating in this fashion on all literals of B, and similarly weakening \(\overline{x} \vee B\) on all literals of A, we obtain \(x \vee A \vee B\) , \(\overline{x} \vee A \vee B\) , and all the extra clauses output by the MaxSAT rule. Finally, applying the reversible resolution rule to \(x \vee A \vee B\) and \(\overline{x} \vee A \vee B\) deduces \(A \vee B\) .
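As a sanity check on the rule, the following hedged Python sketch (our own illustrative encoding, with literals as signed integers; not the paper's machinery) enumerates the output clauses of the MaxSAT resolution rule, with the side clauses indexed by \(j = 0, \dots, t-1\) and \(j = 0, \dots, s-1\) , and brute-forces its defining property: the number of falsified clauses is preserved under every assignment.

```python
from itertools import product

def maxres(A, B, x):
    """Clauses output by the MaxSAT resolution rule on x∨A and x̄∨B.
    A and B are lists of literals (signed ints); x is a positive variable."""
    out = [frozenset(A + B)]                            # the resolvent A∨B
    for j in range(len(B)):                             # x∨A∨b1∨…∨bj∨¬b_{j+1}
        out.append(frozenset([x] + A + B[:j] + [-B[j]]))
    for j in range(len(A)):                             # x̄∨B∨a1∨…∨aj∨¬a_{j+1}
        out.append(frozenset([-x] + B + A[:j] + [-A[j]]))
    return out

def falsified(clauses, assignment):
    def is_false(cl):
        return all(assignment[-l] if l < 0 else not assignment[l] for l in cl)
    return sum(is_false(cl) for cl in clauses)

# Example: resolve x3∨x1 with ¬x3∨x2; check the per-assignment invariant.
A, B, x = [1], [2], 3
inputs = [frozenset([x] + A), frozenset([-x] + B)]
outputs = maxres(A, B, x)
for bits in product([False, True], repeat=3):
    a = {1: bits[0], 2: bits[1], 3: bits[2]}
    assert falsified(inputs, a) == falsified(outputs, a)
```

(The invariant assumes the two premises share no variable other than x and produce no tautological side clauses, as in the example.)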
The converse direction, however, is not clear and could very well be false. A significant difference between RevRes and MaxResW is the fact that MaxResW proofs must have the initial configuration exactly equal to F, while RevRes can start with any multiset of clauses from F. As discussed above, this is because the goal of MaxRes is to preserve the number of satisfied clauses under any assignment, while RevRes has no such requirements and simply seeks to prove unsatisfiability.
We can formally interpret this as follows. Suppose we are given an unsatisfiable CNF formula \(F = C_1 \wedge \cdots \wedge C_m\) , where every clause \(C_i\) is equipped with a positive integer weight \(w_i\) . Since F is unsatisfiable, the maximum possible weight of satisfied clauses in any assignment to the variables of F is at most \(\sum _{i=1}^m w_i - 1\) . Thus, if we could prove that this is true for some choice of weights \(w_i \gt 0\) , then we have verified that the formula F is unsatisfiable.
RevRes implements this idea. Given F, we start by choosing positive integer weights \(w_i\) for each clause \(C_i\) and make \(w_i\) copies of \(C_i\) in the initial configuration \(\mathcal {C}_1\) . The two proof rules of RevRes preserve the number of satisfied clauses under any assignment. Hence, if \(\mathcal {C}_1, \dots , \mathcal {C}_t\) is a RevRes refutation of F, then, since \(\mathcal {C}_t\) contains at least one instance of \(\bot\) and \(\bot\) is always false, the maximum weight of satisfied clauses under any assignment is at most \(\sum _{i=1}^m w_i - 1\) , and so the formula F must be unsatisfiable. Interpreted in this way, RevRes sits between MaxResW and the weighted MaxSAT resolution systems defined in Reference [54].
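The counting argument can be checked by brute force on a toy instance. The following Python sketch (an illustration under our own encoding, with literals as signed integers) verifies, for a small unsatisfiable CNF with unit weights, that every assignment satisfies clauses of total weight at most \(\sum_i w_i - 1\) .

```python
from itertools import product

# F = x1 ∧ (¬x1 ∨ x2) ∧ ¬x2 is unsatisfiable; give each clause weight 1.
clauses = [frozenset({1}), frozenset({-1, 2}), frozenset({-2})]
weights = [1, 1, 1]

def sat_weight(assignment):
    # Total weight of clauses satisfied by a total assignment {var: bool}.
    return sum(w for cl, w in zip(clauses, weights)
               if any(assignment[l] if l > 0 else not assignment[-l] for l in cl))

best = max(sat_weight({1: a, 2: b}) for a, b in product([False, True], repeat=2))
assert best == sum(weights) - 1  # certifies that F is unsatisfiable
```

Exhibiting such a weight bound for some positive weights is exactly what a RevRes refutation accomplishes.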
Characterisation theorems. Unlike the characterisation theorems for unary Nullstellensatz and unary Sherali–Adams, the easier direction for this characterisation theorem is showing that RevRes proofs imply SoPL-formulations.
Lemma 9.
Let F be an unsatisfiable CNF formula. If there is a RevRes refutation of F with width d and size L, then there is a depth- \((d+1)\) \({\text{ SoPL}} _L\) -formulation of \(S(F)\) . Furthermore, if there is a RevResT refutation, then there is a depth- \((d+1)\) \({\text{ EoPL}} _L\) -formulation of \(S(F)\) .
Proof.
We focus on the case of RevRes and then describe what needs to be modified in the case of RevResT. Let \(F = C_1 \wedge \cdots \wedge C_m\) be an unsatisfiable CNF formula. Let \(\mathcal {C}_1, \mathcal {C}_2, \ldots , \mathcal {C}_\ell\) be a RevRes refutation of F of the prescribed size and width and let \(t := \max _{i \in [\ell ]} |\mathcal {C}_i|\) . By the size bound, we know that \(t, \ell \le L\) .
We create an SoPL-formulation of \(S(F)\) on a grid of size \(L \times L\) , although we will only use the subgrid of size \(\ell \times t\) and hardwire all other nodes to be inactive. This can be done for each node \((i, j)\) outside of the \(\ell \times t\) subgrid by setting the successor of \((i,j)\) to \({\textsf {null}}\) and the predecessor to an arbitrary value. The relationship between the grid of the SoPL-formulation and the RevRes proof is straightforward: the node \((i, j) \in [\ell ] \times [t]\) corresponds to the jth clause in the multiset \(\mathcal {C}_{\ell -i+1}\) . Without loss of generality, we assume \(\mathcal {C}_\ell\) is ordered so that its first clause is \(\bot\) , and thus the distinguished node \((1,1)\) in the SoPL instance corresponds to \(\bot\) .
Let \((i, j) \in [\ell ] \times [t]\) be any node in the grid and let \(C_{i,j}\) denote the corresponding clause in the proof. We define the successor function \(s_{i,j} : \lbrace 0,1\rbrace ^n \rightarrow [t] \cup \lbrace {\textsf {null}}\rbrace\) , the predecessor function \(p_{i,j} : \lbrace 0,1\rbrace ^n \rightarrow [t]\) , and the solution function \(g_{i,j} : \lbrace 0,1\rbrace ^n \rightarrow [m]\) . The solution function \(g_{i,j}\) queries no variables and outputs \(C_{i,j}\) if \(C_{i,j} \in F\) , and otherwise outputs an arbitrary solution (in the second case, by construction \((i,j)\) will never be a solution to SoPL). To define \(s_{i,j}\) and \(p_{i,j}\) , we introduce some notation. If \(C \in \mathcal {C}_i\) and \(C^{\prime } \in \mathcal {C}_{i+1}\) are clauses in adjacent configurations, then \(C^{\prime }\) is derived from C, written \(C \vdash C^{\prime }\) , if either \(C^{\prime }\) is the output of a reversible proof rule applied to C or if no proof rule was applied to C and \(C^{\prime } = C\) is just the same copy of C in the next configuration. For any \(x \in \lbrace 0,1\rbrace ^n\) define
\begin{equation*} s_{i,j}(x) := {\left\lbrace \begin{array}{ll}{\textsf {null}}& \text{ if } C_{i,j}(x) = 1, \\ k & \text{ if } i \lt \ell , C_{i,j}(x) = C_{i+1, k}(x) = 0, \text{ and } C_{i+1,k} \vdash C_{i,j}, \\ 1 & \text{ if } i = \ell \text{ and } C_{i,j}(x) = 0, \end{array}\right.} \end{equation*}
and similarly, if \(i \gt 1\) , define
\begin{equation*} p_{i,j}(x) := {\left\lbrace \begin{array}{ll}1 & \text{ if } C_{i,j}(x) = 1, \\ k & \text{ if } C_{i,j}(x) = C_{i-1, k}(x) = 0 \text{ and } C_{i,j} \vdash C_{i-1,k}. \end{array}\right.} \end{equation*}
Intuitively, if \(C_{i,j}(x) = 0\) , then we make the successor and predecessor of \(C_{i,j}\) point to the unique clauses in the adjacent configurations that are guaranteed to be false. These functions are well-defined, since the reversible rules are of the form \(C \vee x_i, C \vee \overline{x}_i \vdash C\) and \(C \vdash C \vee x_i, C \vee \overline{x}_i\) . In particular, under any assignment to the variables, the numbers of false clauses among the inputs and among the outputs of a rule are equal and at most 1, and thus if C is false, then there are unique false clauses in the adjacent configurations that are derived from or used to derive C. Finally, we note that the successor and predecessor functions can each be computed by querying all the variables in \(C_{i,j}\) and possibly one more variable (the one that was resolved or weakened on), and thus the decision tree depth of both of these functions is at most \(d+1\) .
Now, we argue that the SoPL formulation correctly solves \(S(F)\) . By the definition of the successor and predecessor functions, if any node \((i,j)\) on layer \(i \lt \ell\) is active, then that node has consistent pointers to successor nodes and predecessor nodes on the adjacent layers. This means that the node \((i,j)\) is a solution only if it is an active node on layer \(i = \ell\) , but such a node is active only if the corresponding clause \(C_{i, j} \in \mathcal {C}_1\) is false. But all such clauses occur in F, and in this case the solution function \(g_{i,j}\) outputs \(C_{i,j}\) , which is a correct solution to \(S(F)\) .
In case we started with a RevResT refutation, we observe that the same argument described above also works for EoPL with one extra observation: any clause in the final configuration \(\mathcal {C}_\ell\) that is falsified under an input x is a weakening of an input clause of F, and so the corresponding source node is a valid solution to the EoPL problem. □
It remains to prove the converse, which is harder. As a warmup, we begin by showing that the encoding of SoPL (EoPL) as an unsatisfiable CNF formula can be efficiently refuted in RevRes (RevResT, respectively). The general case will follow the structure of this proof closely. For the warmup it will be helpful to explicitly write the CNF encoding of SoPL and EoPL (Section 3).
Explicit Encodings for SoPL and EoPL.. As we have discussed in Section 2, any total search problem \(R_n \subseteq \lbrace 0,1\rbrace ^n \times O_n\) has a natural encoding as an unsatisfiable CNF formula by \(\bigwedge _{o \in O_n} \lnot T_o(x)\) where \(T_o(x)\) is the decision tree that checks if \((x, o) \in R_n\) . Since \(T_o\) is a low-depth decision tree, we can encode it as a low-width DNF formula, and thus the resulting CNF formula also has low width. In this section, we describe the unsatisfiable CNF formulas corresponding to \({\text{ SoPL}} _n\) and \({\text{ EoPL}} _n\) explicitly.
The successor and predecessor pointers in the \({\text{ SoPL}} _n\) instance will be encoded in binary, so, for convenience, assume \(n = 2^{\lambda } - 1\) for some integer \(\lambda \ge 1\) (the other cases can be handled similarly). For each node \((i, j)\) , the successor and predecessor pointers are encoded by blocks of Boolean variables \(s_{i,j} \in \lbrace 0,1\rbrace ^{\lambda }, p_{i,j} \in \lbrace 0,1\rbrace ^\lambda\) giving the value of the pointer in binary. The pointer \({\textsf {null}}\) is always encoded by the all-0 string. We will abuse notation and often treat \(s_{i,j}\) and \(p_{i,j}\) as actual elements of \([n] \cup \lbrace {\textsf {null}}\rbrace\) , rather than as short Boolean strings. So, we may write things like \(s_{i,j} = k\) for \(k \in [n]\) to mean that the bits of \(s_{i,j}\) equal the binary encoding of k.
As everything is encoded in binary, it will be helpful to introduce the following notation. In general, for a predicate \(P : \lbrace 0,1\rbrace ^n \rightarrow \lbrace 0,1\rbrace\) , we let \([\!\![ P ]\!\!]\) represent the CNF encoding of P over the n underlying Boolean variables. For example, \([\!\![ s_{i,j} = \ell ]\!\!]\) for \(\ell \in [n]\) represents the CNF encoding of the predicate “ \(s_{i, j} = \ell\) ” over the Boolean variables underlying \(s_{i,j}\) . Explicitly, \([\!\![ s_{i,j} = {\textsf {null}}]\!\!] = \bigwedge _{t=1}^{\lambda } \overline{s}_{i,j,t},\) and similarly \([\!\![ s_{i,j} \ne {\textsf {null}}]\!\!]\) can be represented by the clause \(\bigvee _{t=1}^{\lambda } s_{i,j,t}\) . We can also form more complicated statements, writing, e.g., \([\!\![ s_{i,j} = k \wedge p_{i+1, k} = j ]\!\!]\) to mean the CNF encoding of “the successor of \((i,j)\) is \((i+1, k)\) and the predecessor of \((i+1,k)\) is \((i,j)\) .”
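For intuition, \([\!\![ z \ne i ]\!\!]\) is a single clause of width \(\lambda\) : it contains the literal \(z_t\) if bit t of i is 0 and \(\overline{z}_t\) otherwise, and it is falsified exactly on the assignment \(z = i\) . A small illustrative Python sketch (the representation and names are ours, not the paper's):

```python
LAM = 3  # λ: bits per pointer block; values lie in {0, …, 2**LAM - 1}, null = 0

def neq_clause(i):
    """The single width-λ clause [[z ≠ i]] over bit variables z_1, …, z_λ:
    literal z_t (positive) when bit t of i is 0, and ¬z_t when it is 1."""
    return frozenset(t + 1 if ((i >> t) & 1) == 0 else -(t + 1)
                     for t in range(LAM))

def clause_true(clause, z):
    # z is an integer whose t-th bit is the value of variable z_{t+1}.
    return any(((z >> (abs(l) - 1)) & 1) == (1 if l > 0 else 0) for l in clause)

# [[z ≠ i]] is falsified exactly on the assignment z = i; the family
# {C ∨ [[z ≠ i]] : 0 ≤ i ≤ 2**LAM - 1} of Lemma 10 thus has 2**LAM clauses.
for i in range(2 ** LAM):
    for z in range(2 ** LAM):
        assert clause_true(neq_clause(i), z) == (z != i)
```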
Definition 7.
Let n be a positive integer, and for simplicity assume \(n = 2^\lambda - 1\) for some integer \(\lambda \ge 1\) . Consider the following unsatisfiable CNF formula \({\text{ SoPL}} _n\) . For each \((i, j) \in \lbrace 2, \dots , n-1\rbrace \times [n]\) , we have two blocks of \(\lambda\) variables \(s_{i,j} \in \lbrace 0,1\rbrace ^\lambda , p_{i,j} \in \lbrace 0,1\rbrace ^\lambda\) encoding the successor and predecessor pointers of the node \((i,j)\) in binary, where \({\textsf {null}}\) is encoded by \(0^\lambda\) . For each \(j \in [n]\) , we additionally have a block of \(\lambda\) variables \(s_{1,j} \in \lbrace 0,1\rbrace ^\lambda\) encoding the successor of \((1, j)\) , a block of \(\lambda\) variables \(p_{n, j} \in \lbrace 0,1\rbrace ^\lambda\) encoding the predecessor of \((n, j)\) , and a single variable \(s_{n,j} \in \lbrace 0,1\rbrace\) encoding whether or not \((n, j)\) is active.
The clauses of \({\text{ SoPL}} _n\) are the following:
For each \(j \in [n]\) , \([\!\![ s_{1,1} \ne j \vee p_{2,j} = 1]\!\!]\) and \([\!\![ s_{1,1} \ne 0 ]\!\!]\) , (active distinguished source)
For each \(j \in [n]\) , \(\overline{s}_{n,j}\) , (inactive sink)
For each \((i, j) \in \lbrace 1, \dots , n-2\rbrace \times [n]\) and each \(a, b \in [n]\) , \(c \in [n] \cup \lbrace 0\rbrace\) , \(a \ne c\) , (no proper sinks)
\begin{equation*} [\!\![ s_{i,j} \ne a \vee p_{i+1, a} \ne j \vee s_{i+1,a} \ne b \vee p_{i+2,b} \ne c ]\!\!] , \end{equation*}
as well as \([\!\![ s_{i,j} \ne a \vee p_{i+1, a} \ne j \vee s_{i+1,a} \ne 0 ]\!\!]\) . Similarly, for each \(a, b \in [n]\) ,
\begin{equation*} [\!\![ s_{n-1, a} \ne b \vee p_{n, b} \ne a \vee s_{n,b} = 1 ]\!\!] . \end{equation*}
The \({\text{ EoPL}} _n\) formula is obtained by adding the following extra clauses to \({\text{ SoPL}} _n\) :
For each \((i, j) \in \lbrace 2, \dots , n-1\rbrace \times [n]\) and each \(a, b \in [n]\) , \(c \in [n] \cup \lbrace 0\rbrace\) , \(c \ne j\) , (no proper sources)
\begin{equation*} [\!\![ s_{i,j} \ne a \vee p_{i+1,a} \ne j \vee p_{i,j} \ne b \vee s_{i-1,b} \ne c ]\!\!] , \end{equation*}
as well as \([\!\![ s_{i,j} \ne a \vee p_{i+1,a} \ne j \vee p_{i,j} \ne 0 ]\!\!]\) . Similarly, for any \(a, b \in [n]\) with \(a \ne 1\) ,
\begin{equation*} [\!\![ s_{1, a} \ne b \vee p_{2,b} \ne a ]\!\!] . \end{equation*}
From the above definition, we can see that both \({\text{ SoPL}} _n\) and \({\text{ EoPL}} _n\) are polynomial-size, \(O(\log n)\) -width CNF formulas, and they are unsatisfiable, since the families of clauses simply encode the contradictory statements “the \({\text{ SoPL}}/{\text{ EoPL}}\) problem has no solution.”
Proofs of Characterisations.. Now, before proving that we can refute \({\text{ SoPL}} _n\) in RevRes, we first prove a technical lemma that allows us to manipulate binary encodings in RevRes.
Lemma 10.
Let \(\lambda \gt 0\) be a positive integer, and let \(n = 2^{\lambda } - 1\) . Let C be a width-k clause that does not depend on a block of Boolean variables \(z \in \lbrace 0,1\rbrace ^\lambda\) . Using the reversible weakening rule, we can prove, from C, the set of clauses \(\lbrace [\!\![ C \vee z \ne i ]\!\!] : i = 0, \dots , n\rbrace\) in width \(k + \lambda\) and size \(2^{\lambda }\) . Conversely, from the above set of clauses, we can prove C using the reversible resolution rule in the same size and width.
Proof.
Starting from C, apply the reversible weakening rule on the first bit \(z_1\) to obtain \(C \vee z_1\) and \(C \vee \overline{z}_1\) . Weakening each of the results on \(z_2\) , \(z_3\) , ..., \(z_\lambda\) in turn yields exactly the set of clauses described in the lemma; the second statement follows from the reversibility of RevRes. □
Theorem 10.
For each positive integer n, there is an \(O(\log n)\) -width, polynomial-size RevRes refutation (RevResT refutation, respectively) of \({\text{ SoPL}} _n\) ( \({\text{ EoPL}} _n\) , respectively).
Proof.
We give the proof for \({\text{ SoPL}} _n\) and then describe what needs to be modified for \({\text{ EoPL}} _n\) . For each \((i,j) \in [n-1] \times [n]\) and each \(k \in [n]\) define the clause \(I_{i,j,k} := [\!\![ s_{i,j} \ne k \vee p_{i+1, k} \ne j ]\!\!]\) , and note that \(I_{i,j,k}\) has width \(2\log n\) in the variables \(s_{i,j}\) and \(p_{i+1,k}\) . With this notation, the set of clauses
\begin{equation*} I_{i,j} := \lbrace [\!\![ s_{i,j} \ne k \vee p_{i+1,k} \ne j]\!\!] | k \in [n]\rbrace \end{equation*}
encodes the statement “the node \((i,j)\) is inactive.” Similarly, for any \(j \in [n]\) , we define
\begin{equation*} I_{n,j} := \overline{s}_{n,j}, \end{equation*}
encoding that the node \((n, j)\) is inactive, and note that \(I_{n,j}\) is a clause in \({\text{ SoPL}} _n\) . Thus, for any \(i \in [n]\) , the collection of clauses \(\mathcal {I}_i := \bigcup _{j=1}^n I_{i,j}\) encodes the statement “every node on layer i is inactive.” We now state the main claim of the proof.
Claim 4.
For any \(i \in \lbrace 2, \dots , n\rbrace\) , there is a polynomial-size, \(O(\log n)\) -width RevRes proof of \(\mathcal {I}_{i-1}\) from \(\mathcal {I}_{i}\) and a polynomial-size collection of clauses from \({\text{ SoPL}} _n\) .
Let us first use the claim to finish the proof of the theorem. We start with the collection of clauses \(\mathcal {I}_n = \bigcup _{j=1}^n I_{n, j}\) , each of which is a clause from \({\text{ SoPL}} _n\) . Applying the claim yields the collection \(\mathcal {I}_{n-1}\) in polynomial size and \(O(\log n)\) width from \(\mathcal {I}_n\) and a polynomial-size collection of clauses from \({\text{ SoPL}} _n\) . Applying the claim \(n-2\) more times then yields \(\mathcal {I}_1\) in polynomial size and \(O(\log n)\) width. However, the clauses \(I_{1,1} \subseteq \mathcal {I}_1\) are exactly
\begin{equation*} [\!\![ s_{1,1} \ne j \vee p_{2, j} \ne 1 ]\!\!] \end{equation*}
for each \(j \in [n]\) . By resolving these clauses with the clauses in \([\!\![ s_{1,1} \ne j \vee p_{2, j} = 1 ]\!\!]\) in \({\text{ SoPL}} _n\) , we can deduce the family of clauses \([\!\![ s_{1,1} \ne j ]\!\!]\) for all \(j \ne 0\) , and the clause \([\!\![ s_{1,1} \ne 0 ]\!\!]\) is already in \({\text{ SoPL}} _n\) . Applying Lemma 10 to the clauses \(\lbrace [\!\![ s_{1,1} \ne i ]\!\!] | i = 0, \dots , n\rbrace\) deduces the empty clause \(\bot\) in \(O(\log n)\) width and \(O(n)\) size. In sum, the entire proof will have polynomial size and \(O(\log n)\) width.
So, it suffices to prove the claim.
Proof of Claim.
We show how to prove the general case where \(i \le n-1\) , and the case where \(i = n\) is handled by an essentially identical argument. Consider the family of clauses \(\mathcal {I}_{i} = \bigcup _{j=1}^n I_{i,j}\) . For each clause \(I_{i,j,k}\) apply Lemma 10 to weaken as follows. Initially, we weaken over all values of the predecessor pointer \(p_{i, j}\) , obtaining the family of clauses \([\!\![ I_{i,j,k} \vee p_{i,j} \ne a ]\!\!]\) for each \(a \in [n]\) . Then, from the clause in this family containing \(p_{i, j} \ne a\) , we weaken over all values of the successor pointer \(s_{i-1, a}\) , obtaining the family of clauses
\begin{align*} \mathcal {A}_i & = \lbrace [\!\![ I_{i,j,k} \vee p_{i,j} \ne a \vee s_{i-1,a} \ne b]\!\!] | j,k,a,b \in [n]\rbrace \\ & = \lbrace [\!\![ s_{i,j} \ne k \vee p_{i+1, k} \ne j \vee p_{i,j} \ne a \vee s_{i-1,a} \ne b ]\!\!] | j,k,a,b \in [n]\rbrace . \end{align*}
Partition this family of clauses into two sets as follows. Define
\begin{equation*} \mathcal {T}_i = \lbrace [\!\![ s_{i,j} \ne k \vee p_{i+1,k} \ne j \vee p_{i,j} \ne a \vee s_{i-1,a} \ne j ]\!\!] | j, k, a \in [n]\rbrace , \end{equation*}
which is the subfamily of clauses in \(\mathcal {A}_i\) that have \(b = j\) , and let \(\mathcal {J}_i = \mathcal {A}_i \setminus \mathcal {T}_i\) denote the subfamily where \(b \ne j\) . Next, we show how to use \(\mathcal {T}_i\) , along with some clauses in \({\text{ SoPL}} _n\) , to deduce \(\mathcal {I}_{i-1}\) in width \(O(\log n)\) and polynomial size. The clauses \(\mathcal {J}_i\) are “junk” clauses that are maintained for the rest of the proof and output along with the bottom clause \(\bot\) in the final configuration.
Fig. 7. Illustration of the objects in the proof of Claim 4. The blue edges are successor pointers and the red edges are predecessor pointers. Each clause says that at least one of the pointers in the above configuration must not be present.
To do this, we exploit the reversibility of RevRes and show how to deduce from \(\mathcal {I}_{i-1}\) the collection \(\mathcal {T}_i \cup \mathcal {F}_i\) using the reversible weakening rule, where \(\mathcal {F}_i\) is a polynomial-size set of clauses all from \({\text{ SoPL}} _n\) . By running this proof in reverse and connecting it with the proof described above, we prove \(\mathcal {I}_{i-1}\) from \(\mathcal {I}_{i}\) , and we can add the clauses \(\mathcal {F}_i\) to the initial configuration of the RevRes proof.
This proof is very similar to the proof of \(\mathcal {A}_i\) from \(\mathcal {I}_{i}\) . Starting from an arbitrary clause \(I_{i-1,a,j} \in \mathcal {I}_{i-1}\) , we apply Lemma 10 to weaken the clause on all possible values of the successor pointer \(s_{i, j}\) , obtaining \([\!\![ I_{i-1,a,j} \vee s_{i, j} \ne k ]\!\!]\) for all \(k \in [n]\) . Then, starting from the clause containing \(s_{i, j} \ne k\) , we weaken on all values of \(p_{i+1, k}\) , obtaining the family
\begin{align*} \mathcal {B}_i & = \lbrace [\!\![ I_{i-1,a,j} \vee s_{i, j} \ne k \vee p_{i+1, k} \ne b ]\!\!] | j,k,a,b \in [n]\rbrace \\ & = \lbrace [\!\![ s_{i-1,a} \ne j \vee p_{i,j} \ne a \vee s_{i, j} \ne k \vee p_{i+1, k} \ne b ]\!\!] | j,k,a,b \in [n]\rbrace . \end{align*}
We again partition into two sets. The first is, of course, \(\mathcal {T}_i\) , which is the case where \(j = b\) in \(\mathcal {B}_i\) . The second set is \(\mathcal {F}_i = \mathcal {B}_i \setminus \mathcal {T}_i\) , which is the case where \(j \ne b\) , and observe that every clause in \(\mathcal {F}_i\) is a no proper sink clause from \({\text{ SoPL}} _n\) . See Figure 7 for an illustration of these objects.
We can now finish the proof of the claim. Starting from \(\mathcal {I}_{i} \cup \mathcal {F}_i\) , use the clauses in \(\mathcal {I}_{i}\) to deduce the set of clauses \(\mathcal {T}_i \cup \mathcal {F}_i \cup \mathcal {J}_i\) . Then, run the proof deducing \(\mathcal {T}_i \cup \mathcal {F}_i\) from \(\mathcal {I}_{i-1}\) in reverse to finally deduce \(\mathcal {I}_{i-1} \cup \mathcal {J}_i\) . The total proof has polynomial size and \(O(\log n)\) -width, and therefore the claim is proved. □
To modify the proof for \({\text{ EoPL}} _n\) and RevResT, we make the following changes. First, we observe that all clauses in \(\mathcal {I}_1\) that come from \(I_{1, a}\) for \(a \ne 1\) are no proper sources clauses from \({\text{ EoPL}} _n\) . All other clauses in the above proof that occur in the final line come from sets of the form \(\mathcal {J}_i\) in the proof of the above claim. However, just like the clauses in \(\mathcal {F}_i\) are no proper sinks clauses from \({\text{ SoPL}} _n\) , the clauses in \(\mathcal {J}_i\) are exactly no proper sources clauses from \({\text{ EoPL}} _n\) . This completes the proof. □
It remains to modify the previous proof to accommodate decision-tree reductions to SoPL and EoPL. To do this, we mimic the previous proof, but replace the construction of the sets of clauses in the proof with appropriate queries to the decision trees (which RevRes can simulate) in the reduction.
Before we prove the theorem, we introduce some helpful notation for manipulating decision trees. If T is a decision tree, then \(\mathcal {P}(T)\) is the set of root-to-leaf paths in T. If o is an output (i.e., leaf label) of T, then define \(\mathcal {P}_o(T)\) to be the set of root-to-leaf paths in T that output o. Given any path \(P \in \mathcal {P}(T)\) , let \(C_P := \bigvee _{\ell \in P} \lnot \ell\) be the disjunction of the negations of the literals along P; thus, \(C_P(x) = 1\) iff P is not followed when T is evaluated on x. We also need an appropriate modification of Lemma 10 to arbitrary decision trees, which we prove next.
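The defining property of \(C_P\) can be checked mechanically. Below is an illustrative Python sketch (paths represented as lists of (variable, value) pairs and literals as signed integers — conventions of ours, not the paper's):

```python
from itertools import product

def path_clause(path):
    """C_P for a root-to-leaf path [(var, value), ...]: the disjunction of
    the negations of the literals learned along P."""
    # learned x_v = 1 gives literal ¬x_v; learned x_v = 0 gives literal x_v
    return frozenset(-v if val == 1 else v for v, val in path)

def clause_true(clause, assignment):
    return any(assignment[l] if l > 0 else not assignment[-l] for l in clause)

def follows(path, assignment):
    return all(assignment[v] == bool(val) for v, val in path)

# C_P(x) = 1 iff the path P is not followed on x.
path = [(1, 1), (3, 0), (2, 1)]  # learned x1 = 1, x3 = 0, x2 = 1
for bits in product([False, True], repeat=3):
    a = {1: bits[0], 2: bits[1], 3: bits[2]}
    assert clause_true(path_clause(path), a) == (not follows(path, a))
```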
Lemma 11.
Let C be a width-k clause, and let T be a depth-d decision tree querying a set of variables disjoint from C. Using the reversible weakening rule, we can prove, from C, the set of clauses \(\lbrace C \vee C_P | P \in \mathcal {P}(T)\rbrace\) in width \(d + k\) and size at most \(2^{d}\) . Conversely, from the above set of clauses, we can prove C using the reversible resolution rule in the same size and width.
Proof.
This proof is essentially the same as that of Lemma 10. Starting from C, apply the reversible weakening rule on the first variable \(x_i\) queried in the decision tree T to derive the clauses \(\lbrace C \vee x_i, C \vee \overline{x}_i\rbrace\) . From there, we continue to apply the reversible weakening rule to simulate the queries of the decision tree. For instance, if after learning \(x_i = 0\) the decision tree queries \(x_j\) , then we apply the reversible weakening rule to \(C \vee x_i\) to obtain \(C \vee x_i \vee x_j, C \vee x_i \vee \overline{x}_j\) . Continuing in this manner, we derive all clauses \(C \vee C_P\) for \(P \in \mathcal {P}(T)\) , and running the proof in reverse yields the second statement. □
Theorem 11.
Let F be an unsatisfiable CNF formula. If there is a depth-d \({\text{ SoPL}} _L\) -formulation ( \({\text{ EoPL}} _L\) -formulation, respectively) of \(S(F)\) , then there is a RevRes refutation (with terminals, respectively) of F with width \(O(d)\) and size \(L^{O(1)}2^{O(d)}\) .
Proof.
We follow the proof of Theorem 10 and focus on the case of SoPL. Assume \(F = C_1 \wedge \cdots \wedge C_m\) is defined on n variables \(x_1, \dots , x_n\) . In this proof, we think of CNF formulas and sets of clauses interchangeably. In the \({\text{ SoPL}} _L\) -formulation of \(S(F)\) , we have functions
\begin{equation*} s_{i,j} : \lbrace 0,1\rbrace ^n \rightarrow [L] \cup \lbrace {\textsf {null}}\rbrace ,\ p_{i,j}: \lbrace 0,1\rbrace ^n \rightarrow [L] \cup \lbrace {\textsf {null}}\rbrace ,\ g_{i,j} : \lbrace 0,1\rbrace ^n \rightarrow [m] \end{equation*}
computing successors, predecessors, and solutions for each internal node, and we identify each function with the depth-d decision tree computing it.
For each \((i, j) \in [L-1] \times [L]\) and each \(k \in [L]\) , consider the CNF formula
\begin{equation*} I_{i,j,k}(x) = [\!\![ s_{i,j}(x) \ne k \vee p_{i+1,k}(x) \ne j ]\!\!] . \end{equation*}
In other words, \(I_{i,j,k}\) is the analogue of the clause using the same notation from the proof of Theorem 10. We can use the decision trees for \(s_{i,j}\) and \(p_{i,j}\) to encode \(I_{i,j,k}\) as a CNF formula explicitly. To do this, define the decision tree \(T_{i,j}\) as follows: take the decision tree \(s_{i,j}\) and at each leaf labelled k, simulate the decision tree \(p_{i+1,k}\) (skipping queries to variables already made) to obtain an output a, and then output the pair \((k, a)\) . With this decision tree, we can define \(I_{i,j,k} = \lbrace C_P | P \in \mathcal {P}_{(k,j)}(T_{i,j})\rbrace\) . As in the proof of Theorem 10, define
\begin{equation*} I_{i,j} := \bigcup _{k=1}^L I_{i,j,k}, \quad \mathcal {I}_i := \bigcup _{j=1}^L I_{i,j}, \end{equation*}
where we recall that we consider CNFs and sets of clauses interchangeably. When \(i = L\) , then for any \(j \in [L]\) define the decision tree \(T_{L,j}\) that simulates the decision tree \(s_{L,j}\) and outputs 1 if \((L,j)\) is active and 0 otherwise. With this, we define \(I_{L,j} = \lbrace C_P | P \in \mathcal {P}_1(T_{L,j})\rbrace\) , and similarly define \(\mathcal {I}_L = \bigcup _{j=1}^L I_{L,j}\) . In this notation, the set of clauses \(\mathcal {I}_i\) again encodes “every node on layer i is inactive,” where now the activity of a node is determined by the underlying decision trees in the formulation.
The main step in this theorem is the following claim.
Claim 5.
For any \(i \in \lbrace 2,\dots ,L\rbrace\) , there is a size \(L^{O(1)}2^{O(d)}\) , \(O(d)\) -width RevRes proof of \(\mathcal {I}_{i-1}\) from \(\mathcal {I}_{i}\) and a collection of weakenings of clauses from F.
First, we use the claim to finish the proof of the theorem. We begin by deriving from F the clauses \(\mathcal {I}_{L}\) (let us briefly postpone this argument), and then apply the claim \(L-1\) times to derive \(\mathcal {I}_1\) . Let \(\mathcal {Q} = \bigcup _{k \ne 0} \mathcal {P}_{(k,1)}(T_{1,1})\) be the set of paths of \(T_{1,1}\) that end in a leaf labelled with \((k, 1)\) for some \(k \ne 0\) , and let \(\mathcal {R} = \mathcal {P}(T_{1,1}) \setminus \mathcal {Q}\) . Observe that \(I_{1,1} \subseteq \mathcal {I}_1\) is, by definition, the set of clauses \(\lbrace C_P | P \in \mathcal {Q}\rbrace\) .
Consider any path \(P \in \mathcal {R}\) , and note that P ends in a leaf labelled with \((k, a)\) where either \(k = 0\) or \(a \ne 1\) . Each such leaf witnesses that the distinguished node \((1,1)\) is inactive, and so we can then simulate the decision tree \(g_{1,1}\) and learn a solution of \(S(F)\) . Therefore, for every path \(P^{\prime } \in \mathcal {P}(g_{1,1})\) , the clause \(C_P \vee C_{P^{\prime }}\) is either a weakening of a clause in F or is trivially true because it contains both a literal and its negation. Hence, by applying Lemma 11, we can deduce the clause \(C_P\) from weakenings of clauses in F in size \(2^{O(d)}\) and width \(O(d)\) . Applying this argument for every \(P \in \mathcal {R}\) allows us to deduce the clauses \(\lbrace C_P | P \in \mathcal {R}\rbrace\) . We have now deduced all the clauses \(\lbrace C_P | P \in \mathcal {P}(T_{1,1})\rbrace\) , and so applying Lemma 11 to all of these clauses allows us to deduce \(\bot\) .
Let us now describe how to derive from F the clauses
\begin{equation*} \mathcal {I}_{L} = \lbrace [\!\![ (L,j) \text{ is inactive} ]\!\!] | j \in [L]\rbrace = \bigcup _{j = 1}^L \lbrace C_P | P \in \mathcal {P}_1(T_{L,j})\rbrace . \end{equation*}
For any \(j \in [L]\) consider the following decision tree \(T^{\prime }_{L,j}\) : first run the decision tree \(T_{L,j}\) that checks if \((L,j)\) is active and then, if \((L,j)\) is active, simulate the decision tree \(g_{L,j}\) to find a solution to \(S(F)\) . It follows that for any \(P \in \mathcal {P}_1(T_{L,j})\) and any \(P^{\prime } \in \mathcal {P}(g_{L,j})\) the clause \(C_P \vee C_{P^{\prime }}\) is a weakening of a clause of F or is trivially true. We can therefore deduce \(C_P\) from weakenings of clauses of F using Lemma 11, and repeating this argument for every \(j \in [L]\) and every \(P \in \mathcal {P}_1(T_{L,j})\) , we can derive every clause in \(\mathcal {I}_L\) . So, all that remains is to prove the claim.
Proof of Claim.
The proof of this claim is modelled on the proof of the similar claim from the previous theorem. We again handle the general case where \(i \le L-1\) ; the case where \(i = L\) proceeds similarly. Consider the sets of clauses \(\mathcal {I}_{i}\) and \(\mathcal {I}_{i-1}\) . Our first goal is to derive the analogue of the set \(\mathcal {A}_i\) in the proof of Claim 4.
Let \(j \in [L]\) be arbitrary and consider any clause \(C \in I_{i, j}\) . By definition, there is a \(k \ne 0\) such that \(C = C_P\) for some \(P \in \mathcal {P}_{(k,j)}(T_{i, j})\) . Starting from C in the proof, apply Lemma 11 to the decision tree \(p_{i,j}\) to derive a set of clauses, each of the form \(C \vee C_P\) , where \(P \in \mathcal {P}(p_{i,j})\) . Then, for every \(a \in [L]\) and every \(P \in \mathcal {P}_a(p_{i,j})\) , apply Lemma 11 again to \(C \vee C_P\) and the decision tree \(s_{i-1, a}\) to obtain \(C \vee C_P \vee C_{P^{\prime }}\) for every \(P^{\prime } \in \mathcal {P}(s_{i-1,a})\) . Performing this procedure for all \(C \in \mathcal {I}_i\) yields
\begin{align*} \mathcal {A}_i & := \lbrace [\!\![ s_{i,j} \ne k \vee p_{i+1, k} \ne j \vee p_{i,j} \ne a \vee s_{i-1,a} \ne b ]\!\!] | j,k,a,b \in [L]\rbrace \\ & = \lbrace C \vee C_P \vee C_{P^{\prime }} | C \in \mathcal {I}_i, a \in [L], P \in \mathcal {P}_a(p_{i,j}), P^{\prime } \in \mathcal {P}(s_{i-1,a})\rbrace . \end{align*}
We partition \(\mathcal {A}_i\) into two sets: the clauses in \(\mathcal {T}_i\) where \(b = j\) , and the clauses in \(\mathcal {J}_i = \mathcal {A}_i \setminus \mathcal {T}_i\) .
Now, as in the proof of Claim 4, we use \(\mathcal {T}_i\) along with some clauses in F to deduce \(\mathcal {I}_{i-1}\) , and we again will exploit the reversibility of RevRes to do so. Namely, starting from \(\mathcal {I}_{i-1}\) , we deduce \(\mathcal {T}_i \cup \mathcal {F}_i\) , where \(\mathcal {F}_i\) is a collection of (weakenings of) clauses from F, and we can then just run the proof in reverse.
Let D be any clause in \(\mathcal {I}_{i-1}\) , and note that there are \(j, a \in [L]\) such that \(D = C_P\) for some \(P \in \mathcal {P}_{(j,a)}(T_{i-1,a})\) . Starting from D, apply Lemma 11 with the decision tree \(T_{i, j}\) to obtain a collection of clauses of the form \(D \vee C_{P^{\prime }}\) where \(P^{\prime } \in \mathcal {P}(T_{i,j})\) . Let \((k, b)\) be the output of the decision tree \(T_{i,j}\) on the path \(P^{\prime }\) . If \(b = j\) , then the clause \(D \vee C_{P^{\prime }}\) belongs to \(\mathcal {T}_i\) . Moreover, if we repeat this argument for all \(D \in \mathcal {I}_{i-1}\) , then the collection of all such clauses obtained is exactly \(\mathcal {T}_i\) . This is because from \(\mathcal {I}_{i}\) , the collection of clauses \(\mathcal {T}_i\) was obtained by starting from all clauses at leaves of \(T_{i, j}\) labelled with \((k, j)\) and then querying \(p_{i, j}\) and \(s_{i-1, a}\) ; here, we have performed the exact same queries, except we have reversed the order in which we simulated the decision trees \(p_{i,j}\) and \(s_{i-1,a}\) .
However, if \(b \ne j\) , then the literals queried on the paths \(P \in \mathcal {P}_{(j,a)}(T_{i-1,a})\) and \(P^{\prime } \in \mathcal {P}_{(k,b)}(T_{i,j})\) together witness that the node \((i, j)\) is a proper sink node, and thus is a solution to the SoPL problem. Therefore, at the end of the path \(P \cup P^{\prime }\) , we can run the decision tree \(g_{i, j}\) to determine a solution to \(S(F)\) . This means that if \(P^{\prime \prime } \in \mathcal {P}(g_{i,j})\) is any root-to-leaf path in \(g_{i,j}\) , then the clause \(D \vee C_{P^{\prime }} \vee C_{P^{\prime \prime }} = C_P \vee C_{P^{\prime }} \vee C_{P^{\prime \prime }}\) must be a weakening of a clause in F (or, again, is trivially true). Let \(\mathcal {F}_i\) denote the set of all of these weakenings of clauses of F, obtained by running the above procedure for every clause \(D \in \mathcal {I}_{i-1}\) . We have therefore shown that from \(\mathcal {I}_{i-1}\) , we can derive \(\mathcal {T}_i \cup \mathcal {F}_i\) .
To finish the proof of the claim, we start with the clauses in \(\mathcal {I}_{i} \cup \mathcal {F}_i\) , deduce \(\mathcal {T}_i \cup \mathcal {J}_i\) from \(\mathcal {I}_i\) to obtain the clauses \(\mathcal {T}_i \cup \mathcal {J}_i \cup \mathcal {F}_i\) , and then deduce \(\mathcal {I}_{i-1}\) from \(\mathcal {T}_i \cup \mathcal {F}_i\) . This yields the clauses \(\mathcal {I}_{i-1} \cup \mathcal {J}_i\) , and all of these steps required size \(L^{O(1)}2^{O(d)}\) and width at most \(O(d)\) , completing the proof of the claim and the theorem. □
The above proof can be modified to capture EoPL in the same manner as the proof of Theorem 10. In particular, we can argue via the same techniques that the “junk” clauses in \(\mathcal {J}_i\) and the clauses in \(\mathcal {I}_1 \setminus I_{1,1}\) each encode violations of the “no proper source” constraints of EoPL, and thus can be used to deduce weakenings of clauses in F by querying the appropriate solution decision trees \(g_{i,j}\) . We omit the details.

7 Intersection Theorems

We can now finally prove Theorem 6, our intersection theorem for Reversible Resolution. To prove the theorem, we use the collapse theorems \({\text{ SOPL}} = {\text{ PLS}} \cap {\text{ PPADS}}\) and \({\text{ EOPL}} = {\text{ PLS}} \cap {\text{ PPAD}}\) [38]. In particular, examining the proofs of the collapse theorems from Reference [38], we can extract the following black-box analogues.
Theorem 12.
Let \(R \subseteq \lbrace 0,1\rbrace ^n \times O\) be a total search problem, and suppose that there is a depth- \(d_1\) , \({\text{ SoD}} _{s_1}\) -formulation of R and a depth- \(d_2\) , \({\text{ SoL}} _{s_2}\) -formulation of R. Then there is a depth \(O(d)\) \({\text{ SoPL}} _{s^3}\) -formulation of R where \(d = \max \lbrace d_1, d_2\rbrace\) and \(s = \max \lbrace s_1, s_2\rbrace\) .
Theorem 13.
Let \(R \subseteq \lbrace 0,1\rbrace ^n \times O\) be a total search problem, and suppose that there is a depth- \(d_1\) , \({\text{ SoD}} _{s_1}\) -formulation of R and a depth- \(d_2\) , \({\text{ EoL}} _{s_2}\) -formulation of R. Then there is a depth \(O(d)\) \({\text{ EoPL}} _{s^3}\) -formulation of R where \(d = \max \lbrace d_1, d_2\rbrace\) and \(s = \max \lbrace s_1, s_2\rbrace\) .
Theorem 6 is now an immediate corollary of the next theorem.
Theorem 14.
Let F be an unsatisfiable CNF formula. Let \(d_1, d_2, s_1, s_2\) be positive integers and let \(d = \max \lbrace d_1, d_2\rbrace\) and \(s = \max \lbrace s_1, s_2\rbrace\) .
If there is a width- \(d_1\) , size- \(s_1\) Resolution proof, and a degree- \(d_2\) , size- \(s_2\) unary Sherali–Adams proof of F, then there is a width- \(O(d)\) , size- \(s^{O(1)}2^{O(d)}\) RevRes proof of F.
If there is a width- \(d_1\) , size- \(s_1\) Resolution proof, and a degree- \(d_2\) , size- \(s_2\) unary Nullstellensatz proof of F, then there is a width- \(O(d)\) , size- \(s^{O(1)}2^{O(d)}\) RevResT proof of F.
In particular, \(\mbox{RevRes}(F) = \Theta (\mbox{Res}(F) + \mbox{uSA}(F))\) and \(\mbox{RevResT}(F) = \Theta (\mbox{Res}(F) + \mbox{uNS}(F))\) .
Proof.
Since RevRes can be efficiently simulated by both Resolution and unary Sherali–Adams, we have \(\mbox{Res}(F) = O(\mbox{RevRes}(F))\) and \(\mbox{uSA}(F) = O(\mbox{RevRes}(F))\) . For the converse direction, suppose that we have a width- \(d_1\) , size- \(s_1\) Resolution proof and a degree- \(d_2\) , size- \(s_2\) unary Sherali–Adams proof. By Reference [49, Theorem 8.18], there is a depth- \(O(d_1)\) \({\text{ SoD}} _{O(s_1)}\) -formulation of \(S(F)\) and by Theorem 8, there is a depth- \(O(d_2)\) \({\text{ SoL}} _{O(s_2)}\) -formulation of \(S(F)\) . Applying Theorem 12, this implies that there is a depth- \(O(d)\) \({\text{ SoPL}} _{s^3}\) -formulation of \(S(F)\) , where \(d = \max \lbrace d_1, d_2\rbrace\) and \(s = \max \lbrace s_1, s_2\rbrace\) . Finally, applying Theorem 9, we obtain a RevRes proof of F with width \(O(d)\) and size \(s^{O(1)}2^{O(d)}\) . We therefore have
\begin{equation*} \mbox{RevRes}(F) = O(d + \log s) = O(d_1 + d_2 + \log s_1 + \log s_2) = O(\mbox{Res}(F) + \mbox{uSA}(F)). \end{equation*}
A similar proof using Theorem 7 instead yields the characterisation of \(\mbox{RevResT}\) . □

8 Two Further Separations

In this section, we prove Theorems 4 and 5, restated below.
Theorem 4.
\({\text{ PLS}} ^{dt} \not\subseteq {\text{ PPP}} ^{dt}\) .
Theorem 5.
\({\text{ EOPL}} ^{dt} \not\subseteq {\text{ UEOPL}} ^{dt}\) .
The proofs of these theorems rely on a “glueing” technique that was implicitly used in Reference [4] and that we make more explicit in this article. We use the glueing technique as a tool to alleviate the lack of good proof systems characterizing PPP and UEOPL. In particular, the glueing technique reduces the separation in Theorem 4 to the easier separation \({\text{ PLS}} ^{dt}\not\subseteq {\text{ PPADS}} ^{dt}\) , which we already proved in Corollary 1, and Theorem 5 uses the glueing technique together with a query lower bound for EoPL from Reference [42]. The glueing technique was also recently generalized by Jain, Li, Robere and Xun [46] to prove lower bounds for classes above PPP corresponding to generalized pigeonhole principles.

8.1 Glueability

Let \(\text{R} = (R_n)\) , \(R_n\subseteq \lbrace 0,1\rbrace ^n\times O_n\) , be a \({\text{ TFNP}} ^{dt}\) problem. We consider partial assignments \(x\in {{\lbrace 0,1,*\rbrace }}^n\) that define partial inputs to \(R_n\) . An index i with \(x_i=*\) is interpreted as a Boolean variable whose value is not yet assigned. The size of a partial assignment is its number of non- \(*\) bits. We say that two partial assignments \(x, y \in {{\lbrace 0,1,*\rbrace }}^n\) are consistent if x and y agree on every coordinate where both are non- \(*\) . If x and y are consistent, then we can form the partial assignment \(x\cup y\) that assigns values to all variables assigned values in x or y. We further say that x is witnessing if there exists some solution \(o \in O_n\) such that for any \(y \in \lbrace 0,1\rbrace ^n\) consistent with x we have \(o \in R_n(y)\) .
Definition 8 (Glueable sets of assignments).
A set of partial assignments \(P\subseteq \lbrace 0,1,*\rbrace ^n\) is k-glueable if for each non-witnessing and consistent \(p, p^{\prime } \in P\) , their union \(p \cup p^{\prime }\) is non-witnessing, and moreover, if we restrict \(R_n\) by the assignment \(p\cup p^{\prime }\) , then the resulting search problem \((R_n\upharpoonright p\cup p^{\prime })\) has decision tree complexity greater than k.
This and the following definitions are mostly motivated by their use in Lemma 12 and Lemma 14. For instance, in Lemma 12, we consider P to be the set of all partial assignments obtained by collecting leaves pointing to a particular hole in the \({\text{ PPP}} ^{dt}\) -reduction. The main idea is that the glueability property of P then allows us to disambiguate between pigeons to find which one (if any) maps to the particular hole.
Definition 9 (Completions).
Let \(x \in {{\lbrace 0,1,*\rbrace }}^n\) be a partial assignment and T a decision tree over \(\lbrace 0,1\rbrace ^n\) . The completion \(C(T, x)\) of x by T is the set obtained by collecting all the partial assignments corresponding to leaves of T that are consistent with x and taking their union with x. That is, \(C(T, x) := \lbrace x \cup p: \text{$p$ is a leaf of $T$ consistent with $x$}\rbrace\) .
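These definitions can be mirrored in a few lines of code (our own illustrative model, not part of the formal development): a partial assignment is a dict {index: bit} with unassigned indices simply absent, playing the role of \(*\) , and a decision tree is represented by the list of its leaf assignments.

```python
def consistent(x, y):
    """x and y agree on every commonly assigned index."""
    return all(y[i] == b for i, b in x.items() if i in y)

def union(x, y):
    """The partial assignment x ∪ y (only defined when x and y are consistent)."""
    assert consistent(x, y)
    return {**x, **y}

def completion(leaves, x):
    """C(T, x): the leaves of T consistent with x, each unioned with x."""
    return [union(x, p) for p in leaves if consistent(x, p)]
```

For example, completing \(x = \lbrace x_0 = 1\rbrace\) by a tree whose leaves are \(\lbrace x_1 = 0\rbrace\) , \(\lbrace x_0 = 0, x_1 = 1\rbrace\) , and \(\lbrace x_0 = 1, x_1 = 1\rbrace\) discards the middle leaf (inconsistent with x) and unions x into the other two.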
Definition 10 (Glueable problem).
Let \(f:\mathbb {N} \rightarrow \mathbb {N}\) be a function. We say \({\rm\small R}\) is \(f(k)\) -glueable if any set \(P\subseteq {{\lbrace 0,1,*\rbrace }}^n\) of partial assignments of size at most k, where \(k\le \text{poly}(\log n)\) , can be completed by decision trees of depth at most \(f(k)\) such that the union of the completions is k-glueable. That is, there exists for each \(x \in P\) some decision tree \(T_x\) such that \(\cup _{x\in P} C(T_x, x)\) is k-glueable. We further say that \({\rm\small R}\) is glueable if it is \(\text{poly}(k)\) -glueable.
For example, it is implicit in Reference [4, Section 3.1] that the \({\text{ PPA}} ^{dt}\) -complete problem Lonely (given a matching of an odd number of nodes, find an isolated node) is \(O(k)\) -glueable. In the case of Lonely, if \(x \in {{\lbrace 0,1,*\rbrace }}^n\) asserts that node u points to node v, then \(T_x\) queries the pointing node for v so that a solution is immediately witnessed if u is isolated. We will shortly prove that SoD and EoPL are glueable, too. In what follows, we slightly depart from the above notation and also consider pointer-like partial assignments (as opposed to assignments over \({{\lbrace 0,1,*\rbrace }}\) only). Those are treated naturally; for instance, we can assume that reductions are constrained to query either all or no bits corresponding to a pointer.

8.2 \({\text{ PLS}} ^{dt} \nsubseteq {\text{ PPP}} ^{dt}\)

We introduce for convenience the \({\text{ Reversible-Pigeon}}\) problem, which is a variant of Pigeon where a reverse pointer is provided for each hole.
\(\boldsymbol{{{\rm{R}{\rm\small{EVERSIBLE-}}\rm{P}{\rm\small{IGEON}}}\ (\rm{RP}{\rm\small{IGEON}}}}_n)\) . This problem is the same as Pigeon except that we are also given reverse pointers \(p_u \in [n] \cup \lbrace {\textsf {null}}\rbrace\) for each hole \(u \in [n-1]\) . The goal is to output any solution of Pigeon or
2.
\(u \in [n]\) such that \(p_{s_u} \ne u\) . (successor/predecessor mismatch)
This problem is known to be \({\text{ PPADS}} ^{dt}\) -complete (see, e.g., Reference [38, Lemma 1]) so that \({\text{ RPigeon}} ^{dt} = {\text{ PPADS}} ^{dt}\) . The following key lemma is implicit in Reference [4, Section 3.1].
Lemma 12.
If \({\rm\small R} \in {\text{ PPP}} ^{dt}\) and \({\rm\small R}\) is glueable, then \({\rm\small R} \in {\text{ PPADS}} ^{dt}\) .
Proof.
Fix a \({\text{ Pigeon}} _m\) -formulation \((f_i, g_{i,i^{\prime }})_{i,i^{\prime } \in [m]}\) of \(R_n\) that witnesses \({\rm\small R} \in {\text{ PPP}} ^{dt}\) and let \((T_i, S_{i,i^{\prime }})\) be decision trees of depth \(k = \text{poly}(\log n)\) implementing this reduction. Since \({\rm\small R}\) is glueable, it is possible to complete the root-to-leaf paths of each \(T_i\) to get a reduction \((T_i^{\prime }, S_{i,i^{\prime }})\) of depth \(d = \text{poly}(\log n)\) for which the set \(P = \cup _{i \in [m]} \mbox{leaves}(T^{\prime }_i)\) is k-glueable. (Note that each \(S_{i,i^{\prime }}\) remains unchanged and has depth at most k.) We show how to construct decision trees \((H_j)_{j \in [m]}\) of depth \(\le d^2\) that compute reverse pointers for each hole of the \({\text{ Pigeon}} _m\) instance. We start with the following claim.
Claim 6.
Suppose \(p\in \mbox{leaves}(T^{\prime }_i)\) and \(p^{\prime }\in \mbox{leaves}(T^{\prime }_{i^{\prime }})\) are distinct leaves that are both non-witnessing and labelled with the same hole. Then p and \(p^{\prime }\) are inconsistent.
Proof.
If \(i=i^{\prime }\) , then the claim is true, since any two distinct leaves of the same tree are inconsistent. Suppose \(i\ne i^{\prime }\) and suppose for contradiction that p and \(p^{\prime }\) are consistent. Then, \((i, i^{\prime })\) is a valid solution to the \({\text{ Pigeon}} _m\) instance \((T^{\prime }_1(z), T^{\prime }_2(z), \dots , T^{\prime }_m(z))\) for any \(z \in \lbrace 0,1\rbrace ^n\) extending \(p \cup p^{\prime }\) . By correctness of the reduction, this further implies that \(S_{i, i^{\prime }}\) can solve \((R_n\upharpoonright p \cup p^{\prime })\) with at most k queries—but this contradicts the fact that P is k-glueable. □
Let us write \(P_j \subseteq P\) for the set of all non-witnessing partial assignments corresponding to leaves labelled with hole j. The predecessor tree \(H_j\) operates as follows. Pick an arbitrary leaf \(p \in P_j\) and query all the variables contained in p. At every leaf \(x \in {{\lbrace 0,1,*\rbrace }}^n\) of the current version of \(H_j\) , the next step depends on the set of x-consistent assignments \(P_j^x = \lbrace p \in P_j: p \text{ consistent with } x\rbrace\) .
(1)
If \(|P_j^x| = 0\) , then output label \({\textsf {null}}\) .
(2)
If \(|P_j^x| = 1\) , then output the unique \(i \in [m]\) (by Claim 6) such that \(P^x_j \cap \text{leaves}(T^{\prime }_i) \ne \emptyset\) .
(3)
If \(|P_j^x| \ge 2\) , then pick an arbitrary \(p \in P_j^x\) and recurse by querying its variables, and so on.
Note that each predecessor tree \(H_j\) has depth at most \(d^2\) : by pairwise inconsistency of \(P_j\) , at most d paths are queried, each of depth at most d. To complete the \({\text{ RPigeon}} _m\) -formulation of \(R_n\) , it remains to specify decision trees \((S_i)_{i \in [m]}\) that transform \({\text{ RPigeon}} _m\) -solutions of type (2) into \(R_n\) -solutions. Indeed, suppose \(T^{\prime }_i(z)=j\) but \(H_j(z)\ne i\) for some input z to \(R_n\) . Then, since \(H_j\) decides unambiguously which non-witnessing assignment in \(P_j\) is consistent with z (if any), it must be the case that the leaf outputting \(T^{\prime }_i(z)=j\) is not in \(P_j\) , which means that it is witnessing. Thus, \(S_i(z)\) simply runs \(T^{\prime }_i(z)\) and an \(R_n\) -solution must be witnessed during its execution. □
We note that the method used to disambiguate pigeons in Lemma 12 is common. For instance, it is key to prove the folklore certificate-to-query result \({\text{ D}} (f) \le {\text{ C}} ^1(f) \cdot {\text{ C}} ^0(f)\) for Boolean functions f. To show Theorem 4, the last missing piece is to show that SoD is glueable. Indeed, if \({\text{ SoD}} \in {\text{ PPP}} ^{dt}\) , then Lemma 12 would imply that \({\text{ PLS}} ^{dt} \subseteq {\text{ PPADS}} ^{dt}\) , which contradicts Corollary 1. We show that SoD is glueable in Lemma 13 below.
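A minimal sketch of this disambiguation trick (our own toy illustration of the idea behind the trees \(H_j\) , not the formal reduction): given a pairwise-inconsistent family of partial assignments, repeatedly querying the variables of one surviving candidate either eliminates that candidate or eliminates all the others.

```python
def disambiguate(P, query):
    """Find the unique p in P consistent with the hidden input (or None),
    where P is a pairwise-inconsistent list of partial assignments (dicts
    {index: bit}) and query(i) returns bit i of the input. Each round fully
    queries one surviving candidate; pairwise inconsistency then kills either
    that candidate or every other one, so at most |P| rounds suffice."""
    answers = {}
    cands = list(P)
    while True:
        # keep only the candidates that agree with every answer so far
        cands = [p for p in cands
                 if all(answers.get(i, b) == b for i, b in p.items())]
        if len(cands) <= 1:
            return cands[0] if cands else None
        for i in cands[0]:  # query all unanswered variables of one candidate
            if i not in answers:
                answers[i] = query(i)
```

Note that, as in the reduction, the unique surviving candidate is output without verifying its remaining variables; in Lemma 12, a wrong output of this kind is caught as a type-(2) successor/predecessor mismatch.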
For technical convenience, we consider here a minor variation of how we encode the successor pointers in the input to SoD. We let the input consist of successor pointers \(s_u \in [n]\) for each grid node \(u \in [n] \times [n]\) as well as an “active” bit \(a_u \in \lbrace 0,1\rbrace\) , where \(a_u=0\) means that u has a \({\textsf {null}}\) pointer. This is merely a different way to encode \({\textsf {null}}\) successors, and indeed, there is a trivial reduction to and from the original SoD problem. The advantage of this new encoding is that it allows for querying the activity of a node without querying its successor. This simplifies the completion process in the proof below.
Lemma 13.
SoD is glueable.
Proof.
We show that \({\text{ SoD}} _n\) is \(O(k)\) -glueable. Fix some partial \({\text{ SoD}} _n\) -assignment \(x = (s, a)\) of size at most \(k = \text{poly}(\log n)\) , that is, \(s_u \in [n] \cup \lbrace *\rbrace\) and \(a_u \in {{\lbrace 0,1,*\rbrace }}\) for each grid node \(u \in [n] \times [n]\) . The decision tree T completing x starts by checking whether x contains any active node below row \(n-k-1\) . If yes, then T picks any one such active node and follows the successor path until a sink is found, making the completion witnessing. Note that this step incurs at most \(O(k)\) queries. Finally, T ensures that any successor query in x is followed by a query to the active bit of the successor. This costs at most \(O(k)\) further queries.
Let P be an arbitrary set of partial assignments, each of size at most \(k = \text{poly}(\log n)\) , and let \(P^{\prime }\) be its completion with respect to the procedure defined above. We first show that \(P^{\prime }\) is k-glueable. Pick any two non-witnessing and consistent \(p, p^{\prime } \in P^{\prime }\) and suppose toward contradiction that their union \(p \cup p^{\prime }\) is witnessing. If it reveals a SoD solution u of type (1) or type (2), then one of p and \(p^{\prime }\) must check the active bit of u: a contradiction with the fact that p and \(p^{\prime }\) are non-witnessing. If instead \(p \cup p^{\prime }\) reveals a solution u of type (3), then one of p and \(p^{\prime }\) must check the successor \(s_u\) of u, but the completion T forces this check to be followed by a query to the active bit of \(s_u\) , making one of the initial partial assignments witnessing as well. Hence, \(p\cup p^{\prime }\) is non-witnessing.
We finally argue that \((R_n\upharpoonright p\cup p^{\prime })\) has query complexity greater than k by describing an adversary that can fool any further k queries to \(p \cup p^{\prime }\) without witnessing a solution. Recall that \(p \cup p^{\prime }\) makes no queries to nodes below row \(n-k-1\) . The adversary answers queries as follows. If the successor pointer of an active node is queried, then we answer with a pointer to any unqueried node on the next row and make it active (there always exists one as \(k \ll n\) ). If a node u is queried that is not the successor of any node, then we make u inactive ( \(a_u = 0\) and \(s_u\) is arbitrary). This scheme ensures that a solution can only lie on the very last row n, which is not reachable in k queries starting from row \(n - k - 1\) . □
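The adversary can be sketched as follows (a toy model of the argument, with 0-indexed rows and the distinguished source at \((0, 0)\) ; the encoding and helper names are our own):

```python
def make_adversary(n):
    """Adversary for SoD on an n-by-n grid. It maintains a consistent partial
    input in which the only active nodes lie on the revealed path from the
    source, so no sink (and hence no solution) is revealed before the last
    row as long as the number of queries is much smaller than n."""
    active = {(0, 0)}   # nodes committed to be active
    inactive = set()    # nodes committed to be inactive
    succ = {}           # committed successor pointers
    def query_active(u):
        if u not in active:
            inactive.add(u)  # commit untouched queried nodes to be inactive
        return u in active
    def query_successor(u):
        if u not in succ:
            if u not in active:
                inactive.add(u)
                succ[u] = None       # inactive nodes carry null pointers
            else:
                row = u[0]
                # route the path to a fresh, uncommitted node on the next row
                col = next(c for c in range(n)
                           if (row + 1, c) not in active
                           and (row + 1, c) not in inactive)
                succ[u] = (row + 1, col)
                active.add((row + 1, col))
        return succ[u]
    return query_successor, query_active
```

Each successor query on an active node advances the revealed path exactly one row, so after k queries the frontier of the path is still at least \(n - k\) rows above the bottom.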

8.3 \({\text{ EOPL}} ^{dt} \nsubseteq {\text{ UEOPL}} ^{dt}\)

We prove Theorem 5 using a similar plan as in Section 8.2 above. Namely, we first show (Lemma 14) that if we have a problem \({\rm\small R} \in {\text{ UEOPL}} ^{dt}\) that is glueable, then in fact \({\rm\small R} \in {\text{ FP}} ^{dt}\) , that is, \(R_n\) admits a shallow decision tree solving it. Second, we show (Lemma 15) that \({\text{ EoPL}} _n\) is glueable. The combination of these two lemmas implies that if \({\text{ EOPL}} ^{dt}\subseteq {\text{ UEOPL}} ^{dt}\) , then \({\text{ EOPL}} ^{dt}={\text{ FP}} ^{dt}\) . But it is known from prior work [42] (building on References [1, 70]) that \({\text{ EOPL}} ^{dt}\ne {\text{ FP}} ^{dt}\) . This proves Theorem 5.
It remains to prove Lemmas 14 and 15.
Lemma 14.
If \({\rm\small R} \in {\text{ UEOPL}} ^{dt}\) and \({\rm\small R}\) is glueable, then \({\rm\small R} \in {\text{ FP}} ^{dt}\) .
Proof.
Fix a \({\text{ UEoPL}} _m\) -formulation \((f_u, g_{u,u^{\prime }})_{u,u^{\prime } \in [m] \times [m]}\) of \(R_n\) that witnesses \({\rm\small R} \in {\text{ UEOPL}} ^{dt}\) and let \((T_u, S_{u,u^{\prime }})\) be decision trees of depth \(k = \text{poly}(\log n)\) implementing this reduction. Note that the leaves of each \(T_u\) are labelled by a successor and a predecessor pointer in \([m]\) . At the cost of doubling the depth of each \(T_u\) , we may assume that each leaf is additionally labelled with an “activity” bit, which can be computed by appending to each leaf labelled with successor v the decision tree \(T_v\) . Since \({\rm\small R}\) is glueable, it is possible to further complete the leaves of each \(T_u\) to get a reduction \((T_u^{\prime }, S_{u,u^{\prime }})\) of depth \(d = \text{poly}(\log n)\) for which the set of leaves \(P = \cup _{u \in [m] \times [m]} \mbox{leaves}(T^{\prime }_u)\) is k-glueable and each leaf label carries the aforementioned activity bit. Let us say that a node u is good for input z if the leaf reached by \(T_u^{\prime }(z)\) is non-witnessing and u is active.
Claim 7.
For every input z, there is at most one good node on each row.
Proof.
Fix a row \(j \in [m]\) and suppose for the sake of contradiction that the jth row contains two good nodes \(u^{\prime }\) and u on some input z. Let \(p\in \mbox{leaves}(T^{\prime }_u)\) and \(p^{\prime }\in \mbox{leaves}(T^{\prime }_{u^{\prime }})\) be the leaves reached on input z. Then p and \(p^{\prime }\) are a pair of non-witnessing and consistent assignments. Thus, \((u,u^{\prime })\) is a solution to \({\text{ UEoPL}} _m\) on any input that extends \(p\cup p^{\prime }\) . Hence, the depth-k decision tree \(S_{u,u^{\prime }}\) solves the search problem \((R_n\upharpoonright p\cup p^{\prime })\) . But this contradicts the fact that P is k-glueable. □
Using this claim similarly as in the proof of Lemma 12, we can construct, for each row \(j \in [m]\) , a decision tree \(A_j\) of depth \(\le d^2\) that computes the column-index of a good node on row j or outputs \({\textsf {null}}\) if the row contains no good node. The main argument is again the disambiguation trick.
We can now design an efficient decision tree for \(R_n\) : At the cost of running \(O(\log m) = \text{poly}(\log n)\) of the \(A_j\) trees, perform a binary search over the m rows to find a good node u on row j such that the next row \(j+1\) contains no good nodes. This means that either (i) the successor of u is inactive, in which case we have found a solution to \({\text{ UEoPL}} _m\) , and we can use the S-trees to find a solution to \(R_n\) , or (ii) the successor \(u^{\prime }\) of u is active and \(T^{\prime }_{u^{\prime }}(z)\) is witnessing, which solves \(R_n\) . □
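The binary search step can be sketched as follows (our own illustration, under the assumption the argument implicitly uses: the rows containing a good node form a non-empty prefix of the rows, with the last row containing none):

```python
def find_frontier(A, m):
    """Given oracles A(j) -> column of the good node on row j (or None),
    find a row whose good node exists but whose next row has none, using
    O(log m) oracle calls. Assumes row 1 is good and row m is not."""
    lo, hi = 1, m            # invariant: row lo is good, row hi is not
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if A(mid) is not None:
            lo = mid
        else:
            hi = mid
    return lo                # row lo is good, row lo + 1 is not
```

In the proof, each call A(j) costs one run of the depth- \(d^2\) tree \(A_j\) , so the whole search stays within \(\text{poly}(\log n)\) queries.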
We next show that an \({\text{ EOPL}} ^{dt}\) -complete problem is glueable. Instead of working with \({\text{ EoPL}} _n\) , it is convenient again to vary the input encoding. We define \({\text{ EoPL}} ^*\) as a version of EoPL where in addition to successor/predecessor pointers, we are also given an “activity” bit.
\({\boldsymbol{\rm{E}{\rm\small{ND-OF-}}\rm{P}{\rm\small{OTENTIAL-}}{\rm{L}{\rm\small{INE}}}^*(EoPL}^*_{n})}\) . In addition to predecessor/successor pointers, each \(u\in [n]\times [n]\) has an activity indicator bit \(a_u \in \lbrace 0,1\rbrace\) . We add the following solutions to EoPL:
5.
u, if u’s activity does not match \(a_u\) , (active bit mismatch)
6.
u, if \(a_u=0\) and ( \(s_u\ne {\textsf {null}}\) or \(p_u\ne {\textsf {null}}\) ), (inactive node with a pointer)
7.
u, if \(a_u=1\) and ( \(s_u= {\textsf {null}}\) or \(p_u= {\textsf {null}}\) ). (active node with a \({\textsf {null}}\) -pointer)
Note that \({\text{ EoPL}} ^*\) is efficiently reducible to and from EoPL, so that \({\text{ EoPL}} ^*\) is \({\text{ EOPL}} ^{dt}\) -complete.
Fig. 8.
Fig. 8. Example of a non-witnessing completion. A node is blue if \(a_u=1\) , orange if \(a_u=0\) , and white if \(a_u=*\) is not queried. The symbol \(\bot\) indicates a \({\textsf {null}}\) pointer. The first and last \(k+1\) rows contain no active nodes, besides those lying on the path starting at the distinguished node \((1,1)\) .
Lemma 15.
\({\text{ EoPL}} ^*\) is glueable.
Proof.
We show that \({\text{ EoPL}} ^*\) is \(O(k)\) -glueable. Fix some partial \({\text{ EoPL}} ^*\) -assignment \(x = (p, s, a)\) of size \(k = \text{poly}(\log n)\) . The tree T that completes x proceeds as follows. We start by querying all variables assigned in x. Then, we iterate each of the following steps until a solution is found or no further queries are made.
(1)
Always query activity bits and reverse pointers. If we have queried a \({\textsf {null}}\) -pointer \(s_v={\textsf {null}}\) or \(p_v={\textsf {null}}\) , then we also query the activity bit \(a_v\) . This activity bit is \(a_v=0\) unless we have found a solution of type (7).
Moreover, if we have queried a non- \({\textsf {null}}\) pointer \(s_u = v\) (respectively, \(p_u=v\) ), then we also query the bits \(a_u\) , \(a_v\) and the pointer \(p_v\) (respectively, \(s_v\) ). Note that both activity bits must be 1 and the reverse pointer must point back, \(p_v=u\) (respectively, \(s_v=u\) ), as otherwise, we can find a solution by making a couple more queries. Indeed, if \(a_u=0\) , then there is a solution of type (6). If \(a_u=1\) and \(a_v=0\) , then we can find a solution by determining the activity of v: either v is active, which is a mismatch with \(a_v=0\) (type (5)), or v is inactive, which creates a sink. Finally, if \(a_u=1\) and \(p_v\ne u\) , then u is inactive, which is a mismatch with \(a_u=1\) (type (5)).
(2)
Follow the distinguished path. Follow the successor path starting at the distinguished source node \((1, 1)\) until some node on row \(k + 1\) is reached or a sink is found.
(3)
Follow early paths. If we have queried \(a_u=1\) for some node u in the first \(k+1\) rows that does not lie on the path discovered in Item 2, then we follow u’s predecessor path until a solution is found.
(4)
Follow late paths. If we have queried \(a_u=1\) for some node u in the last \(k+1\) rows, then we follow u’s successor path until a solution is found.
This completion adds at most \(O(k)\) queries to x. An example of a completion that is non-witnessing is given in Figure 8. It is straightforward to argue that the resulting set of completed assignments is k-glueable using an adversary strategy similar to the one described in the proof of Lemma 13. □

Acknowledgments

We thank Albert Atserias, Ilario Bonacina, Pritish Kamath, and David Steurer for discussions, and the anonymous reviewers for their suggestions that helped us improve the presentation of the article.

Footnotes

1
This particular form is valid for refuting sets of polynomial equations and can be easily obtained from more general forms used for refuting sets of polynomial inequalities.
2
The existence of such a factorization is easy to see here, since both \(a_i\) and t are conjunctions of literals. More generally, the existence of such a factorization is guaranteed for any multilinear polynomials \(a_i\) and t satisfying \(a_i(x)=0\Rightarrow t(x)=0\) on the Boolean hypercube, where we simplify expressions using the constraints \(x_i^2-x_i=0\) . One way to prove this is by using the fact that two multilinear polynomials are syntactically identical if and only if they agree on the Boolean hypercube.

A Coefficient Size in Algebraic Proofs

In this Appendix, we show that if there are low-degree Nullstellensatz and Sherali–Adams refutations over \(\mathbb {Z}\) , then the coefficients in the refutations need not be too large in magnitude. In particular, if the degree of the proof is d, then the magnitude of the coefficients can be assumed to be at most \(\exp (n^{O(d)})\) without loss of generality. For Sherali–Adams this follows easily, as any Sherali–Adams refutation over the reals can be converted into a Sherali–Adams refutation over \(\mathbb {Z}\) without badly affecting the coefficient size.
Theorem 15.
Let F be an unsatisfiable CNF formula on n variables and m clauses. If there is a degree-d Sherali–Adams refutation of F, then there is a degree-d Sherali–Adams refutation of F over \(\mathbb {Z}\) where every coefficient is bounded in magnitude by \(\exp (n^{O(d)})\) .
Proof.
This is essentially the usual proof of completeness for Sherali–Adams (see, e.g., Reference [37]). Consider a degree-d Sherali–Adams refutation of F, which, by Lemma 6, we can write as
\begin{equation*} \sum _{i=1}^m -J_i\overline{C}_i + J = -1. \end{equation*}
We can express the existence of such a proof as a system of integer linear inequalities of the form \(Ax = b, x \ge 0\) over \(mn^{O(d)} = n^{O(d)}\) variables and \(n^{O(d)}\) constraints, where all coefficients of the matrix A and of b lie in \(\lbrace 1,0,-1\rbrace\) , and indeed b has a single non-zero entry with value \(-1\) (see Reference [37, Chapter 2] for an explicit description of the system). By known results on linear programming, this implies that the coefficients of the above Sherali–Adams refutation can be assumed to be rational with description length \(n^{O(d)}\) . Let L be the least common multiple of the denominators of all rational numbers occurring in the refutation. By multiplying through by L, we obtain the identity
\begin{equation*} \sum _{i=1}^m - LJ_i \overline{C}_i + LJ = -L. \end{equation*}
We can then add the integer \(L - 1\) to both sides (noting that \(LJ + L-1\) is a conical junta) to obtain an integer-coefficient Sherali–Adams refutation with the desired coefficient bound. □
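The final scaling step can be checked on a concrete toy instance. The following snippet (our own illustration, not from the paper) starts from a rational Sherali–Adams refutation of the contradiction \((x)\wedge (\lnot x)\) , with clause polynomials \(\overline{C}_1 = 1-x\) and \(\overline{C}_2 = x\) , multiplies through by the least common multiple L of the denominators, adds \(L-1\) to the junta part, and verifies that the resulting all-integer identity still evaluates to \(-1\) on the Boolean points.

```python
from fractions import Fraction
from math import lcm

# Rational SA refutation of (x) AND (NOT x):
#   -J1*(1-x) - J2*x + J = -1  with J1 = J2 = 3/2 and J = 1/2.
J1, J2, J = Fraction(3, 2), Fraction(3, 2), Fraction(1, 2)

def lhs(x, j1, j2, j):
    return -j1 * (1 - x) - j2 * x + j

# A multilinear identity in one variable holds iff it holds on {0, 1}.
assert all(lhs(x, J1, J2, J) == -1 for x in (0, 1))

# Scale by L = lcm of all denominators, then add L - 1 to the junta part
# (L*J + L - 1 is still a conical junta): an all-integer refutation.
L = lcm(J1.denominator, J2.denominator, J.denominator)
assert L == 2
assert all(lhs(x, L * J1, L * J2, L * J + (L - 1)) == -1 for x in (0, 1))
```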
For Nullstellensatz, the proof is slightly different, as we need to invoke known bounds on integer solutions to systems of linear equations.
Theorem 16.
Let F be an unsatisfiable CNF formula on n variables and m clauses. If there is a degree-d Nullstellensatz refutation of F over \(\mathbb {Z}\) , then there is a degree-d Nullstellensatz refutation over \(\mathbb {Z}\) where every coefficient has magnitude at most \(\exp ({n^{O(d)}})\) .
Proof.
This follows the standard proof of completeness for Nullstellensatz proofs (see, e.g., References [13, 63]). Write \(F = C_1 \wedge \cdots \wedge C_m\) and suppose F has n variables. A degree-d \(\mathbb {Z}\) -Nullstellensatz proof of F can be written as
\begin{equation*} \sum _{i=1}^m q_i\overline{C}_i = 1 \end{equation*}
for some integer-coefficient multilinear polynomials \(q_i\) . We can express the existence of such a proof as a system of \(\mathbb {Z}\) -linear equations \(Ax = b\) over \(mn^{O(d)}\) variables—roughly one variable for each monomial of degree at most d—where each coefficient in A and b is small. The result then follows by the known strongly polynomial time algorithms for finding integer solutions to systems of linear equations over \(\mathbb {Z}\) (in particular, via the Hermite Normal Form [50]).
The system of linear equations is defined as follows. For each \(i \in [m]\) and \(S \subseteq [n]\) with \(|S| \le d\) , we let \(\hat{q}_i(S) \in \mathbb {Z}\) denote the coefficient of the monomial \(x_S = \prod _{i \in S} x_i\) in the polynomial \(q_i\) . Letting \(C_{n, d}\) denote all subsets of \([n]\) of size at most d, we can write the Nullstellensatz refutation as
\begin{equation*} \sum _{i=1}^m \sum _{S \in C_{n,d}} \hat{q}_i(S) x_S \overline{C}_i = 1. \end{equation*}
From this, we get a system of \(\mathbb {Z}\) -linear equations over variables \(\hat{q}_i(S)\) for each \(i \in [m]\) , \(S \in C_{n,d}\) enforcing that all monomials in the proof of degree at least 1 must cancel out to 0, and the monomials of degree 0 must sum to 1. The system of equations has one constraint for each monomial \(x_S\) with \(S \in C_{n,d}\) and at most \(m |C_{n,d}| \le mn^{O(d)}\) variables; each coefficient in the system of linear equations is \(\pm 1\) from the expansion of \(x_S \overline{C}_i\) into a sum of monomials. By reducing to Hermite Normal Form, we can find an integer solution to this system with coefficients of magnitude at most \(\exp (n^{O(d)})\) . □
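The construction can be illustrated on the tiny contradiction \(F = (x)\wedge (\lnot x)\) , with clause polynomials \(\overline{C}_1 = 1-x\) and \(\overline{C}_2 = x\) . A degree-0 certificate \(q_1\overline{C}_1 + q_2\overline{C}_2 = 1\) gives one equation per monomial: the constant terms must sum to 1 and the x-coefficients must cancel. The sketch below (our own illustration; the elimination routine is a generic exact solver, not the Hermite Normal Form algorithm of [50]) builds that system and solves it over the rationals, recovering the integer certificate \(q_1 = q_2 = 1\) .

```python
from fractions import Fraction

def solve(A, b):
    """Exact Gauss-Jordan elimination over the rationals (square systems)."""
    n = len(A)
    M = [row[:] + [bi] for row, bi in zip(A, b)]  # augmented matrix
    for col in range(n):
        piv = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[piv] = M[piv], M[col]           # swap pivot row into place
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                f = M[r][col]
                M[r] = [v - f * w for v, w in zip(M[r], M[col])]
    return [M[r][n] for r in range(n)]

# One equation per monomial of q1*(1-x) + q2*x = 1:
#   monomial 1:   q1      = 1
#   monomial x:  -q1 + q2 = 0
A = [[Fraction(1), Fraction(0)],
     [Fraction(-1), Fraction(1)]]
b = [Fraction(1), Fraction(0)]

q = solve(A, b)
assert q == [1, 1]  # the integer Nullstellensatz certificate q1 = q2 = 1
assert all(q[0] * (1 - x) + q[1] * x == 1 for x in (0, 1))
```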
Finally, we consider RevRes proofs. Any Resolution proof of width w can be assumed to contain at most \(n^{O(w)}\) distinct clauses, simply because there are only that many clauses of width at most w. This argument fails for RevRes, since clauses can no longer be reused an unlimited number of times. By combining the previous results with the intersection theorem (Theorem 14), one can nevertheless immediately deduce the following result, which gives a weak bound on the size of bounded-width RevRes and RevResT proofs. We omit the proof.
Corollary 6.
Let F be an unsatisfiable CNF formula on n variables. If there is a width-d RevRes refutation of F (RevResT, respectively), then there is a width- \(O(d)\) and size \(\exp (n^{O(d)})\) RevRes refutation of F (RevResT, respectively).

References

[1]
David Aldous. 1983. Minimization algorithms and random walk on the d-cube. Ann. Probab. 11, 2 (1983), 403–413. Retrieved from http://www.jstor.org/stable/2243696
[2]
Albert Atserias and Massimo Lauria. 2019. Circular (yet sound) proofs. In Proceedings of the 22nd Theory and Applications of Satisfiability Testing (SAT’19). Springer, 1–18.
[3]
Albert Atserias, Massimo Lauria, and Jakob Nordström. 2016. Narrow proofs may be maximally long. ACM Trans. Comput. Logic 17, 3 (2016), 1–30.
[4]
Paul Beame, Stephen Cook, Jeff Edmonds, Russell Impagliazzo, and Toniann Pitassi. 1998. The relative complexity of NP search problems. J. Comput. Syst. Sci. 57, 1 (1998), 3–19.
[5]
Paul Beame, Russell Impagliazzo, Jan Krajíček, Toniann Pitassi, and Pavel Pudlák. 1994. Lower bounds on Hilbert’s Nullstellensatz and propositional proofs. In Proceedings of the 35th Symposium on Foundations of Computer Science (FOCS’94). 794–806.
[6]
Paul Beame and Søren Riis. 1998. More on the relative strength of counting principles. In Proceedings of the DIMACS Workshop on Proof Complexity and Feasible Arithmetics, Vol. 39. 13–35.
[7]
Eli Ben-Sasson. 2009. Size-space tradeoffs for resolution. SIAM J. Comput. 38, 6 (2009), 2511–2525.
[8]
Ilario Bonacina and Maria Luisa Bonet. 2022. On the strength of Sherali-Adams and Nullstellensatz as propositional proof systems. In Proceedings of the 37th Symposium on Logic in Computer Science (LICS’22). ACM.
[9]
Ilario Bonacina and Neil Thapen. 2022. A Separation of PLS from PPP. Technical Report. Electronic Colloquium on Computational Complexity (ECCC). Retrieved from https://eccc.weizmann.ac.il/report/2022/089/
[10]
María Luisa Bonet, Jordi Levy, and Felip Manyà. 2007. Resolution for Max-SAT. Artific. Intell. 171, 8-9 (2007), 606–618.
[11]
Joshua Buresh-Oppenheim, Matthew Clegg, Russell Impagliazzo, and Toniann Pitassi. 2002. Homogenization and the polynomial calculus. Comput. Complex. 11, 3-4 (2002), 91–108.
[12]
Joshua Buresh-Oppenheim and Tsuyoshi Morioka. 2004. Relativized NP search problems and propositional proof systems. In Proceedings of the 19th IEEE Conference on Computational Complexity (CCC’04). 54–67.
[13]
Samuel Buss. 1998. Lower bounds on Nullstellensatz proofs via designs. In Proof Complexity and Feasible Arithmetics. AMS, 59–71.
[14]
Sam Buss, Noah Fleming, and Russell Impagliazzo. 2022. TFNP Characterizations of Proof Systems and Monotone Circuits. Retrieved from https://eccc.weizmann.ac.il/report/2022/141/
[15]
Samuel Buss, Leszek Aleksander Kołodziejczyk, and Neil Thapen. 2014. Fragments of approximate counting. J. Symbol. Logic 79, 2 (2014), 496–525. Retrieved from http://www.jstor.org/stable/43303745
[16]
Xi Chen, Decheng Dai, Ye Du, and Shang-Hua Teng. 2009. Settling the complexity of Arrow-Debreu equilibria in markets with additively separable utilities. In Proceedings of the 50th Symposium on Foundations of Computer Science (FOCS’09). 273–282.
[17]
Xi Chen, Xiaotie Deng, and Shang-Hua Teng. 2009. Settling the complexity of computing two-player Nash equilibria. J. ACM 56, 3 (2009), 14:1–14:57.
[18]
Xi Chen, David Durfee, and Anthi Orfanou. 2015. On the complexity of Nash equilibria in anonymous games. In Proceedings of the 47th Symposium on Theory of Computing (STOC’15). 381–390.
[19]
Xi Chen, Dimitris Paparas, and Mihalis Yannakakis. 2017. The complexity of non-monotone markets. J. ACM 64, 3 (2017), 20:1–20:56.
[20]
Matthew Clegg, Jeff Edmonds, and Russell Impagliazzo. 1996. Using the Groebner basis algorithm to find proofs of unsatisfiability. In Proceedings of the 28th Symposium on Theory of Computing (STOC’96). 174–183.
[21]
Bruno Codenotti, Amin Saberi, Kasturi Varadarajan, and Yinyu Ye. 2008. The complexity of equilibria: Hardness results for economies via a correspondence with games. Theoret. Comput. Sci. 408, 2–3 (2008), 188–198.
[22]
Stephen Cook and Robert Reckhow. 1979. The relative efficiency of propositional proof systems. J. Symbol. Logic 44, 1 (1979), 36–50.
[23]
Stefan Dantchev and Barnaby Martin. 2012. Rank complexity gap for Lovász-Schrijver and Sherali-Adams proof systems. Comput. Complex. 22, 1 (Nov. 2012), 191–213.
[24]
Stefan Dantchev, Barnaby Martin, and Mark Rhodes. 2009. Tight rank lower bounds for the Sherali–Adams proof system. Theoret. Comput. Sci. 410, 21-23 (2009), 2054–2063.
[25]
Constantinos Daskalakis. 2019. Equilibria, fixed points, and computational complexity. In Proceedings of the International Congress of Mathematicians (ICM’19). World Scientific.
[26]
Constantinos Daskalakis, Paul Goldberg, and Christos Papadimitriou. 2009. The complexity of computing a Nash equilibrium. SIAM J. Comput. 39, 1 (2009), 195–259.
[27]
Constantinos Daskalakis and Christos Papadimitriou. 2011. Continuous local search. In Proceedings of the 22nd Symposium on Discrete Algorithms (SODA’11). SIAM, 790–804.
[28]
Susanna de Rezende, Mika Göös, and Robert Robere. 2022. Proofs, circuits, and communication. SIGACT News 53, 1 (2022).
[29]
Susanna de Rezende, Jakob Nordström, Or Meir, and Robert Robere. 2019. Nullstellensatz size-degree trade-offs from reversible pebbling. In Proceedings of the 34th Computational Complexity Conference (CCC’19), Amir Shpilka (Ed.), Vol. 137. Schloss Dagstuhl, 18:1–18:16.
[30]
Xiaotie Deng, Qi Qi, and Amin Saberi. 2012. Algorithmic solutions for envy-free cake cutting. Operat. Res. 60, 6 (2012), 1461–1476.
[31]
Alex Fabrikant, Christos Papadimitriou, and Kunal Talwar. 2004. The complexity of pure Nash equilibria. In Proceedings of the 36th ACM Symposium on Theory of Computing (STOC’04). 604–612.
[32]
John Fearnley, Paul W. Goldberg, Alexandros Hollender, and Rahul Savani. 2021. The complexity of gradient descent: CLS \(=\) PPAD \(\cap\) PLS. In Proceedings of the 53rd Symposium on Theory of Computing (STOC’21). 46–59.
[33]
John Fearnley, Spencer Gordon, Ruta Mehta, and Rahul Savani. 2020. Unique end of potential line. J. Comput. Syst. Sci. 114 (2020), 1–35.
[34]
Yuval Filmus, Meena Mahajan, Gaurav Sood, and Marc Vinyals. 2023. MaxSAT resolution and subcube sums. ACM Trans. Comput. Log. 24, 1 (2023), 8:1–8:27.
[35]
Aris Filos-Ratsikas and Paul Goldberg. 2022. The complexity of necklace splitting, consensus-halving, and discrete ham sandwich. SIAM J. Comput. (2022). (to appear).
[36]
Noah Fleming, Mika Göös, Stefan Grosser, and Robert Robere. 2022. On semi-algebraic proofs and algorithms. In Proceedings of the 13th Innovations in Theoretical Computer Science Conference (ITCS’22) (Leibniz International Proceedings in Informatics (LIPIcs), Vol. 215). Schloss Dagstuhl, 69:1–69:25.
[37]
Noah Fleming, Pravesh Kothari, and Toniann Pitassi. 2019. Semialgebraic proofs and efficient algorithm design. Found. Trends Theoret. Comput. Sci. 14, 1-2 (2019), 1–221.
[38]
Mika Göös, Alexandros Hollender, Siddhartha Jain, Gilbert Maystre, William Pires, Robert Robere, and Ran Tao. 2022. Further collapses in TFNP. In Proceedings of the 37th Computational Complexity Conference (CCC’22). 33:1–33:15.
[39]
Mika Göös, Pritish Kamath, Robert Robere, and Dmitry Sokolov. 2018. Adventures in monotone complexity and TFNP. In Proceedings of the 10th Innovations in Theoretical Computer Science Conference (ITCS’18), Vol. 124. 38:1–38:19.
[40]
Mika Göös and Toniann Pitassi. 2018. Communication lower bounds via critical block sensitivity. SIAM J. Comput. 47, 5 (2018), 1778–1806.
[41]
Tuomas Hakoniemi. 2021. Monomial size vs. Bit-complexity in Sums-of-Squares and Polynomial Calculus. In Proceedings of the 36th Symposium on Logic in Computer Science (LICS’21). IEEE.
[42]
Pavel Hubáček and Eylon Yogev. 2020. Hardness of continuous local search: Query complexity and cryptographic lower bounds. SIAM J. Comput. 49, 6 (2020), 1128–1172.
[43]
Pavel Hubáček, Erfan Khaniki, and Neil Thapen. 2024. TFNP intersections through the lens of feasible disjunction. In Proceedings of the 15th Innovations in Theoretical Computer Science Conference (ITCS’24) (Leibniz International Proceedings in Informatics (LIPIcs), Vol. 287), Venkatesan Guruswami (Ed.). Schloss Dagstuhl–Leibniz-Zentrum für Informatik, Dagstuhl, Germany, 63:1–63:24.
[44]
Trinh Huynh and Jakob Nordström. 2012. On the virtue of succinct proofs: Amplifying communication complexity hardness to time–space trade-offs in proof complexity. In Proceedings of the 44th Symposium on Theory of Computing (STOC’12). ACM, 233–248.
[45]
Dmitry Itsykson and Artur Riazanov. 2021. Proof complexity of natural formulas via communication arguments. In Proceedings of the 36th Computational Complexity Conference (CCC’21), Vol. 200. Schloss Dagstuhl, 3:1–3:34.
[46]
Siddhartha Jain, Jiawei Li, Robert Robere, and Zhiyang Xun. 2024. On Pigeonhole Principles and Ramsey in TFNP. arXiv:2401.12604.
[47]
David Johnson, Christos Papadimitriou, and Mihalis Yannakakis. 1988. How easy is local search? J. Comput. Syst. Sci. 37, 1 (1988), 79–100.
[48]
Stasys Jukna. 2012. Boolean Function Complexity: Advances and Frontiers. Algorithms and Combinatorics, Vol. 27. Springer.
[49]
Pritish Kamath. 2020. Some hardness escalation results in computational complexity theory. Ph.D. Dissertation. Massachusetts Institute of Technology. Retrieved from https://dspace.mit.edu/handle/1721.1/128290
[50]
Ravindran Kannan and Achim Bachem. 1979. Polynomial algorithms for computing the Smith and Hermite normal forms of an integer matrix. SIAM J. Comput. 8, 4 (1979), 499–507.
[51]
Jan Krajíček. 2019. Proof Complexity. Cambridge University Press.
[52]
Mark Krentel. 1989. Structure in locally optimal solutions. In Proceedings of the 30th Symposium on Foundations of Computer Science (FOCS’89). 216–221.
[53]
Mark Krentel. 1990. On finding and verifying locally optimal solutions. SIAM J. Comput. 19, 4 (1990), 742–749.
[54]
Javier Larrosa, Federico Heras, and Simon de Givry. 2008. A logical approach to efficient Max-SAT solving. Artific. Intell. 172, 2-3 (2008), 204–233.
[55]
Yuhao Li, William Pires, and Robert Robere. 2024. Intersection classes in TFNP and proof complexity. In Proceedings of the 15th Innovations in Theoretical Computer Science Conference (ITCS’24) (Leibniz International Proceedings in Informatics (LIPIcs), Vol. 287), Venkatesan Guruswami (Ed.). Schloss Dagstuhl–Leibniz-Zentrum für Informatik, Dagstuhl, Germany, 74:1–74:22.
[56]
László Lovász, Moni Naor, Ilan Newman, and Avi Wigderson. 1995. Search problems in the decision tree model. SIAM J. Discrete Math. 8, 1 (1995), 119–132.
[57]
Nimrod Megiddo and Christos Papadimitriou. 1991. On total functions, existence theorems and computational complexity. Theoret. Comput. Sci. 81, 2 (1991), 317–324.
[58]
Ruta Mehta. 2018. Constant rank two-player games are PPAD-hard. SIAM J. Comput. 47, 5 (Jan. 2018), 1858–1887.
[59]
Tsuyoshi Morioka. 2001. Classification of search problems and their definability in bounded arithmetic. Master’s thesis. University of Toronto. Retrieved from https://www.collectionscanada.ca/obj/s4/f2/dsk3/ftp04/MQ58775.pdf
[60]
Noam Nisan and Mario Szegedy. 1994. On the degree of Boolean functions as real polynomials. Comput. Complex. 4, 4 (Dec. 1994), 301–313.
[61]
Ryan O’Donnell. 2017. SOS is not obviously automatizable, even approximately. In Proceedings of the 8th Innovations in Theoretical Computer Science Conference (ITCS’17), Vol. 67. Schloss Dagstuhl, 59:1–59:10.
[62]
Christos Papadimitriou. 1994. On the complexity of the parity argument and other inefficient proofs of existence. J. Comput. Syst. Sci. 48, 3 (1994), 498–532.
[63]
Toniann Pitassi. 1996. Algebraic propositional proof systems. In Proceedings of the DIMACS Workshop on Descriptive Complexity and Finite Models(DIMACS Series in Discrete Mathematics and Theoretical Computer Science, Vol. 31). DIMACS/AMS, 215–244.
[64]
Pavel Pudlák. 2015. On the complexity of finding falsifying assignments for Herbrand disjunctions. Arch. Math. Logic 54, 7-8 (2015), 769–783.
[65]
Prasad Raghavendra and Benjamin Weitz. 2017. On the bit complexity of sum-of-squares proofs. In Proceedings of the 44th International Colloquium on Automata, Languages, and Programming (ICALP’17). 80:1–80:13.
[66]
Ran Raz and Avi Wigderson. 1992. Monotone circuits for matching require linear depth. J. ACM 39, 3 (July 1992), 736–744.
[67]
Alejandro Schäffer. 1991. Simple local search problems that are hard to solve. SIAM J. Comput. 20, 1 (1991), 56–87.
[68]
Hanif Sherali and Warren Adams. 1994. A hierarchy of relaxations and convex hull characterizations for mixed-integer zero–one programming problems. Discrete Appl. Math. 52, 1 (July 1994), 83–106.
[69]
Katerina Sotiraki, Manolis Zampetakis, and Giorgos Zirdelis. 2018. PPP-completeness with connections to cryptography. In Proceedings of the 59th IEEE Symposium on Foundations of Computer Science (FOCS’18). 148–158.
[70]
Shengyu Zhang. 2009. Tight bounds for randomized and quantum local search. SIAM J. Comput. 39, 3 (2009), 948–977.

Published In

Journal of the ACM, Volume 71, Issue 4, August 2024, 240 pages.
EISSN: 1557-735X. DOI: 10.1145/3613647. Editor: Venkatesan Guruswami.
This work is licensed under a Creative Commons Attribution International 4.0 License.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 August 2024
Online AM: 09 May 2024
Accepted: 16 April 2024
Revised: 29 January 2024
Received: 26 May 2023
Published in JACM Volume 71, Issue 4


Author Tags

  1. Sherali–Adams
  2. proof complexity
  3. total search problems


Funding Sources

  • Swiss State Secretariat for Education, Research and Innovation (SERI)
  • Quantum Systems Accelerator
  • DOE. W. P., R. R., and R. T.
  • NSERC
