Hostname: page-component-cb9f654ff-mnl9s Total loading time: 0 Render date: 2025-09-06T23:14:59.911Z Has data issue: false hasContentIssue false

Complexity of Faceted Explanations in Propositional Abduction

Published online by Cambridge University Press:  05 September 2025

JOHANNES SCHMIDT
Affiliation:
Jönköping University, Jönköping, Sweden (e-mail: johannes.schmidt@ju.se)
MOHAMED MAIZIA
Affiliation:
Jönköping University, Jönköping, Sweden Linköping University, Linköping, Sweden (e-mail: mohamed.maizia@ju.se)
VICTOR LAGERKVIST
Affiliation:
Linköping University, Linköping, Sweden (e-mails: victor.lagerkvist@liu.se, johannes.fichte@liu.se)
JOHANNES K. FICHTE
Affiliation:
Linköping University, Linköping, Sweden (e-mails: victor.lagerkvist@liu.se, johannes.fichte@liu.se)
Rights & Permissions [Opens in a new window]

Abstract

Abductive reasoning is a popular non-monotonic paradigm that aims to explain observed symptoms and manifestations. It has many applications, such as diagnosis and planning in artificial intelligence and database updates. In propositional abduction, we focus on specifying knowledge by a propositional formula. The computational complexity of tasks in propositional abduction has been systematically characterized – even with detailed classifications for Boolean fragments. Unsurprisingly, the most insightful reasoning problems (counting and enumeration) are computationally highly challenging. Therefore, we consider reasoning between decisions and counting, allowing us to understand explanations better while maintaining favorable complexity. We introduce facets to propositional abductions, which are literals that occur in some explanation (relevant) but not all explanations (dispensable). Reasoning with facets provides a more fine-grained understanding of variability in explanations (heterogeneous). In addition, we consider the distance between two explanations, enabling a better understanding of heterogeneity/homogeneity. We comprehensively analyze facets of propositional abduction in various settings, including an almost complete characterization in Post’s framework.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (https://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

1 Introduction

Pedro is a passionate sailor. Today is Wednesday and he is about to go on the usual race to enjoy some waves and get a good challenge. But for some reason, nobody is out there for a race (a manifestation). He looks up into the sky and is a bit undecided, could the weather forecast have predicted calm winds or an unexpected storm resulting in calling off the race (a hypothesis) trying to explain his observation by finding appropriate causes. This type of backward reasoning is called abductive reasoning, one of the fundamental reasoning techniques that is commonly believed to be naturally used by humans when searching for diagnostic explanations. Abduction has many applications (Dai et al. Reference Dai, Xu, Yu and Zhou2019; Dellsén Reference Dellsén2024; Ignatiev et al. Reference Ignatiev, Narodytska and Marques-Silva2019; Yu et al. Reference Yu, Ignatiev, Stuckey, Narodytska and Marques-Silva2023; Yu et al. Reference Yu, Ignatiev, Stuckey, Narodytska and Marques-Silva2023; Hu et al. Reference Hu, Dai, Jiang and Zhou2025) and is well-studied in the areas of artificial intelligence, knowledge representation, and non-monotonic reasoning (Minsky Reference Minsky1974; Kakas et al. Reference Kakas, Kowalski and Toni1992; Miller Reference Miller2019).

Qualitative reasoning problems like deciding whether an explanation exists or whether a proposition is relevant or necessary are computationally hard but still within range of modern solving approaches. More precisely, these problems are located on the second level of the polynomial hierarchy (PH) in the general case (Eiter and Gottlob Reference Eiter and Gottlob1995). However, asking for relevance or necessary propositions does not provide much insight into the variability of explanations. Enumeration and counting allow for more fine-grained reasoning but are computationally extremely expensive (Hermann and Pichler Reference Hermann and Pichler2010; Creignou et al. Reference Creignou, Kröll, Pichler, Skritek and Vollmer2019). Instead, we turn our attention to the world between relevant (belongs to some explanations) and necessary propositions (belongs to all explanations) and consider propositions that are relevant but not necessary, called facets.

In this work, we study the computational complexity of problems involving facets. To this end, we work in the universal algebraic setting by restricting the types of clauses/relations that are allowed (e.g., only Horn-clauses, or only 2-CNF). The resulting sets can be described by functions called polymorphisms and in the Boolean domain form a lattice known as Post’s lattice (Post Reference Post1941). This setting makes it possible to obtain much more fine-grained complexity results and two prominent and early results are Lewis’ dichotomy for propositional satisfiability (Lewis Reference Lewis1979) and Schaefer’s dichotomy for Boolean constraint satisfaction (Schaefer Reference Schaefer1978). However, this approach has been applied to many more problems (Creignou and Vollmer Reference Creignou and Vollmer2008), including non-monotonic reasoning and several variants of abduction (Nordh and Zanuttini Reference Nordh and Zanuttini2008). We follow this line of research for faceted abduction.

Our main contributions are the following:

  1. 1. We introduce facets to propositional abduction thereby enabling a better understanding of propositions in explanations.

  2. 2. We establish a systematic complexity characterization for deciding facets in propositional abduction illustrated in Figure 1a.

    1. (i) Our classification provides a complete picture in Post’s framework for all fragments, which can be described via clauses, for example, Horn, 2CNF, and dualHorn. Only two open cases remain: relations definable as Boolean linear equations of even length (with, or without, unit clauses).

    2. (ii) Deciding facets is often not much harder than deciding explanation existence. In some surprising cases, the complexity increases.

    3. (iii) Our facet results imply a corresponding classification for the problem of deciding relevance, which has received significant attention in the classical abduction literature and mentioned as an open question.

  3. 3. We study the related problem of determining whether there exist two explanations of sufficiently high diversity. This metric can be precisely related to the existence of facets and several notions from our facet classification carry over. However, this problem is provably much harder and becomes NP-hard already for a small fragment of Horn consisting of just implication $(x \rightarrow y)$ .

Fig. 1. Illustration of complexity results via Post’s lattice.

1.1 Related works

The computational complexity of propositional abduction is well studied. Selman and Levesque (Reference Selman and Levesque1990) showed that explanation existence is tractable for Horn theories for special cases of propositional abduction, but NP-hard when checking for an explanation that contains a particular proposition. Bylander et al. (Reference Bylander, Allemang, Tanner and Josephson1991) generalized the results to “best” explanations (most plausible combination of hypotheses that explains all the data) and present a tractable sub-class of abduction problems. Eiter and Gottlob (Reference Eiter and Gottlob1995) proved $\Sigma ^{{\mathrm{P}}}_2$ -completeness for propositional abduction. Creignou and Zanuttini (Reference Creignou and Zanuttini2006) established a precise trichotomy ( ${{\mathrm{P}}}, {\mathrm{NP}}, \Sigma ^{{\mathrm{P}}}_2$ ) in Schaefer’s framework where inputs are restricted to generalized conjunctive normal form or subsets. Nordh and Zanuttini (Reference Nordh and Zanuttini2008) lifted results to propositional knowledge bases and establish a tetrachotomy ( ${\mathrm{P}}$ , $\mathrm{NP}$ , $\text{co}$ - $\mathrm{NP}$ , $\Sigma ^{{\mathrm{P}}}_2$ ). Creignou et al. (Reference Creignou, Schmidt and Thomas2010) showed a complete complexity classification for all considerable sets of Boolean functions. Relevance and dispensability are comparably speaking not as well understood. An early result by Friedrich et al. (Reference Friedrich, Gottlob and Nejdl1990) show that it is NP-hard to determine relevance over (definite) Horn theories, and Eiter and Gottlob (Reference Eiter and Gottlob1995) additionally prove that it is ${\Sigma }^{{{\mathrm{P}}}}_2$ -hard to decide relevance if the knowledge base is an arbitrary propositional formula. Zanuttini (Reference Zanuttini2003) later asks if there is a simple relationship between deciding relevance and the complexity of the underlying abduction problem. We find it surprising that Zanuttini’s question (later repeated by Nordh and Zanuttini (Reference Nordh and Zanuttini2008)) still remains unanswered given that we by now have a complete understanding of the classical complexity of virtually all propositional abduction problems. In addition, counting and enumeration complexity is also well studied (Creignou et al. Reference Creignou, Schmidt and Thomas2010, Reference Creignou, Kröll, Pichler, Skritek and Vollmer2019; Hermann and Pichler Reference Hermann and Pichler2010). Fellows et al. (Reference Fellows, Pfandler, Rosamond and Rümmele2012); Pfandler et al. (Reference Pfandler, Rümmele and Szeider2013); and Mahmood et al. (Reference Mahmood, Meier and Schmidt2021) included semantical structural restrictions (parameterized complexity) in the complexity study. The concept of facets has originally been introduced in the context of answer-set programming to enforce/forbid atoms in solutions and systematically investigate solutions without counting or enumeration (Alrabbaa et al. Reference Alrabbaa, Rudolph and Schweizer2018; Fichte et al. Reference Fichte, Gaggl and Rusovac2022a). The complexity of ASP facets for tight, normal, and disjunctive programs was established very recently (Rusovac et al. Reference Rusovac, Hecher, Gebser, Gaggl and Fichte2024). Speck et al. (Reference Speck, Hecher, Gnad, Fichte and Corrêa2025) lifted facets to symbolic planning for reasoning faster on the plan space, and Fichte et al. (Reference Fichte, Fröhlich, Hecher, Lagerkvist, Mahmood, Meier and Persson2025) introduced facets to abstract argumentation. Eiter and Geibinger (Reference Eiter and Geibinger2023) studied justifications for the presence, or absence, of an atom in the context of answer-set programming including so-called contrastive explanations. They provide a basic complexity theoretical characterization. Diversity has been considered in the literature on propositional satisfiability and logic programming, for example (Misra et al. Reference Misra, Mittal and Rai2024; Böhl et al. Reference Böhl, Gaggl and Rusovac2023). Abductive logic programming (ALP) combines logic programming with abductive reasoning, which then allows for generating hypotheses to explain observed facts or goals. Eiter et al. (Reference Eiter, Gottlob and Leone1995) studied the complexity of ALP regarding consistency, relevance, and necessity but focusing on normal and disjunctive programs and commonly used semantics (well-founded, stable).

2 Preliminaries

We follow standard notions in computational complexity theory (Papadimitriou Reference Papadimitriou1994; Arora and Barak Reference Arora and Barak2009), propositional logic (Kleine Büning and Lettmann Reference Kleine Büning and Lettmann1999), and propositional abduction (Bylander et al. Reference Bylander, Allemang, Tanner and Josephson1991). Below, we briefly state relevant notations.

2.1 Computational complexity

Let $\Sigma$ and $\Sigma '$ be some finite alphabets. We call $I \in \Sigma ^*$ an instance and $||I||$ denotes the size of $I$ . A decision problem is some subset $L\subseteq \Sigma ^*$ . Recall that ${\mathrm{P}}$ and $\mathrm{NP}$ are the complexity classes of all deterministically and non-deterministically polynomial-time solvable decision problems (Cook Reference Cook1971). A polynomial-time many-to-one reduction ( $\leq ^{\mathrm{P}}_m$ ) from $L$ to $L'$ is a function $r : \Sigma ^* \rightarrow {\Sigma '}^*$ such that for all $I \in \Sigma ^*$ we have $I \in L$ if and only if $r(I) \in L'$ and $r$ are computable in time $\mathcal{O}({{||I||}\cdot c})$ for some constant $c$ . In other words, a polynomial-time many-to-one reduction transforms instances of the decision problem $L$ into instances of decision problem $L'$ in polynomial time. We also need the PH (Stockmeyer and Meyer Reference Stockmeyer and Meyer1973; Stockmeyer Reference Stockmeyer1976; Wrathall Reference Wrathall1976). In particular, $\Delta ^{{\mathrm{P}}}_0 := \Pi ^{{\mathrm{P}}}_0 := \Sigma ^{{\mathrm{P}}}_0 := {{\mathrm{P}}}$ and $\Delta ^{{\mathrm{P}}}_{i+1} := P^{\Sigma ^p_{i}}$ , $\Sigma ^{{\mathrm{P}}}_{i+1} := {\mathrm{NP}}^{\Sigma ^{{\mathrm{P}}}_{i}}$ , and $\Pi ^{{\mathrm{P}}}_{i+1} := \text{co}{\mathrm{NP}}^{\Sigma ^{{\mathrm{P}}}_i}$ for $i\gt 0$ where $C^{D}$ is the class $C$ of decision problems augmented by an oracle for some complete problem in class $D$ .

2.1.1 Propositional logic

A literal is a variable $x$ or its negation $\neg x$ . A clause is a disjunction of literals, often represented as a set. A clause of arity 1, that is, either $(x)$ or $(\neg x)$ , is a unit clause. We work in a general setting where atoms can be expressions of the form $R(x_1, \ldots , x_r)$ for variables $x_1, \ldots , x_r$ and an $r$ -ary relation $R \subseteq \{0,1\}^r$ . A function $f \colon \{x_1, \ldots , x_r\} \to \{0,1\}$ is then said to satisfy an atom $R(x_1, \ldots , x_r)$ if $(f(x_1), \ldots , f(x_r)) \in R$ . A (conjunctive) propositional formula $\varphi$ is a conjunction of atoms and we write $\mathrm{var}(\varphi )$ for its set of variables. A mapping $\sigma \colon \mathrm{var}(\varphi ) \mapsto \{0,1\}$ is called an assignment to the variables of $\varphi$ and a model of a formula $\varphi$ is an assignment to $\mathrm{var}(\varphi )$ that satisfies $\varphi$ . For two formulas $\psi$ and $\varphi$ , we write $\psi \models \varphi$ if every model of $\psi$ also satisfies $\varphi$ .

2.2 Restrictions of constraint languages

As alluded in Section 1, we work in a fine-grained setting where not all possible relations are allowed. Formally, we say that a constraint language $\Gamma$ is a set of Boolean relations, and a $\Gamma$ -formula is a propositional formula $\varphi$ where $R \in \Gamma$ for each atom $R(x_1, \ldots , x_r)$ . For a constraint language $\Gamma$ , we write ${SAT}(\Gamma )$ for the problem of deciding if a given $\Gamma$ -formula admits at least one model. If $\Gamma$ is naturally expressible as a set of clauses, we represent $R \in \Gamma$ in clausal form. This is the the case for most, but not all, cases that we consider in this paper. Usually, we do not distinguish between the relation, its defining clause, or an atom involving the clause. For example, we simply write $(x)$ for the unary relation $\{(1)\}$ , $(\neg x)$ for $\{(0)\}$ , $(x_1 \rightarrow x_2)$ or $(\neg x_1 \lor x_2)$ for $\{(0,0), (0,1), (1,1)\}$ , and so on. The empty set $\emptyset$ is the (nullary) relation that is always false, and we write $(x_1 = x_2)$ for the equality relation $\{(0,0), (1,1)\}$ . For a constraint language $\Gamma$ and $k \geq 1$ , we often use the notation $k$ - $\Gamma$ for the set of relations/clauses of arity at most $k$ . Thus, 2-CNF contains all 1/2-clauses, and 2-affine contains the unary/binary relations definable as equations mod 2. Additionally, for a language $\Gamma$ we, let (1) $\Gamma ^{\text{-}} = \Gamma \setminus \{(x), (\neg x)\}$ be $\Gamma$ without the two unit clauses, and (2) $\Gamma ^+ = \Gamma \cup \{(x), (\neg x)\}$ be $\Gamma$ expanded with the two unit clauses. A language $\Gamma$ is $b$ -valid for $b \in \{0,1\}$ , if $(b, \ldots , b)\in R$ for each $R \in \Gamma$ . We introduce the most important constraint languages for the purpose of this paper in Table 1. This only covers a small number of the possible constraint languages. For many applications, including the facet classification in this paper, doing an exhaustive case analysis of all possible constraint languages is too complicated, and one needs simplifying assumptions. Here, it is known that each constraint language $\Gamma$ can equivalently well be described as a set of functions closed under functional composition and containing all projections $\pi ^n_i(x_1, \ldots x_n) = x_i$ , clones. Thus, each clone groups together many similar constraint languages and the Boolean clones form a lattice known as Post’s lattice when ordered by set inclusion (Post Reference Post1941). Many classification tasks become substantially simpler via Post’s lattice, and it is well known that each clone corresponds to a dual relational object called a co-clone, which in turn induces a useful closure property on relations. In this paper we only need a small fragment of this algebraic theory and define this closure property via so-called primitive positive definitions (pp-definitions) and say that an $r$ -ary relation $R$ has a pp-definition over $\Gamma$ if

\begin{equation*}R(x_1, \dots , x_r) := \exists y_1, \dots , y_n\, .\, \varphi (x_1, \ldots , x_r, y_1, \ldots , y_n)\end{equation*}

where $\varphi$ is a $(\Gamma \cup \{(x = y)\})$ -formula. Thus, put otherwise, $R$ can be defined as the set of models of $\exists y_1, \ldots , y_n\, .\, \varphi (x_1, \ldots , x_r, y_1, \ldots , y_n)$ with respect to the free variables $x_1, \ldots , x_r$ . The reason for allowing the equality relation $(x = y)$ as an atom is that it leads to a much simpler algebraic theory. However, we sometimes need the corresponding definability notion without (free) equality. A pp-definition where $\varphi$ is a $\Gamma$ -formula is called an equality-free primitive positive definition (efpp-definition).

Table 1. Constraint languages and their corresponding co-clones. Here, $\text{Pos}(c)$ and $\text{Neg}(c)$ denote the number of positive and negative literals in a clause $c$ , respectively. For more details, we refer to the work by Böhler et al. (Reference Böhler, Reith, Schnoor and Vollmer2005) and the table in the supplemental material to this paper

Definition 2.1. For a constraint language $\Gamma$ , we let $\langle \Gamma \rangle$ and ${\langle \Gamma \rangle }_{\neq }$ , resp., be the smallest set of relations containing $\Gamma$ and where $R \in \Gamma$ for any (ef)pp-definable relation $R$ over $\Gamma$ .

The set $\Gamma$ is in this context said to be a base, and it is known that all co-clones can be defined in this way. For details, we refer to the work by Böhler et al. (Reference Böhler, Reith, Schnoor and Vollmer2005) and the Table in the supplemental material to this paper. These notions generalize and unify many types of reductions and definability notions in the literature.

Example 2.2. For example, consider the classical reduction from $4$ -SAT to $3$ -SAT by splitting a 4-clause $(\ell _1 \lor \ell _2 \lor \ell _3 \lor \ell _4)$ into two 3-clauses $(\ell _1 \lor \ell _2 \lor x)$ and $(\ell _3 \lor \ell _4 \lor \neg x)$ where $x$ is a fresh variable. This can be viewed as a pp-definition $(\ell _1 \lor \ell _2 \lor \ell _3 \lor \ell _4) \equiv \exists x \, . \, (\ell _1 \lor \ell _2 \lor x) \land (\ell _3 \lor \ell _4 \lor \neg x)$ , and we conclude that $k\text{-CNF} \subseteq {\langle 3\text{-CNF} \rangle }$ for any $k \geq 1$ .

A table of all Boolean co-clones is available in the supplemental material as well as another more elaborate example.

2.3 Propositional abduction

Let $\Gamma$ be a constraint language, for example, a set of clauses. An instance $I$ of the positive propositional abduction problem over $\Gamma$ , ${ABD}(\Gamma )$ for short, is a tuple $I=(\text{KB}, H, M)$ with $\text{KB}$ being a $\Gamma$ -formula over a finite set of Boolean variables called the knowledge base (or theory), $H \subseteq \mathrm{var}(\text{KB})$ called hypotheses, $M\subseteq \mathrm{var}(\text{KB})$ called manifestations. Since we have defined a $\Gamma$ -formula as a conjunctive formula with atoms from $\Gamma$ we sometimes take the liberty of viewing the knowledge base as a set rather than as a formula. A positive explanation $E$ , explanation for short, is a subset $E\subseteq H$ such that (i) $\text{KB} \land E$ is satisfiable and (ii) $\text{KB} \land E \models M$ . An explanation $E$ is (subset-)minimal if no other set $E' \subsetneq E$ is an explanation of $I$ .

The problem ${ABD}(\Gamma )$ asks whether there is an explanation, which in the decision context is the same as asking whether there is a minimal explanation. If $\Gamma$ is arbitrary, we omit $\Gamma$ from the problem and write $ABD$ . Note that the complexity of ${ABD}(\Gamma )$ is completely determined (Nordh and Zanuttini Reference Nordh and Zanuttini2008) and illustrated in Figure 1b.

We write ${\mathcal{E}}(I)$ to refer to the set of all explanations and ${\mathcal{E}_M}(I)$ for the set of all subset-minimal explanations. A variable $x \in H$ is relevant if $x$ belongs to some subset-minimal explanation $E \in {\mathcal{E}_M}(I)$ and necessary if $x$ belongs to all subset-minimal explanations $E \in {\mathcal{E}_M}(I)$ . We abbreviate the sets of all relevant and necessary variables by ${\mathcal{R}\text{el}\mathcal{E}}(I)$ and ${\mathcal{N}\text{ec}\mathcal{E}}(I)$ , respectively.

Example 2.3. Consider our abduction example from the introduction, which we slightly extend. Therefore, let the $ABD$ instance $I=(\text{KB}, H,M)$ consists of the knowledge base

\begin{equation*}\text{KB} = \{ \underline {\boldsymbol{w}}\boldsymbol{ednesday} \rightarrow \underline {\boldsymbol{r}}\boldsymbol{aining},\; \underline {\boldsymbol{w}}\boldsymbol{ednesday} \land \underline {\boldsymbol{c}}\boldsymbol{alm} \rightarrow \underline {\boldsymbol{n}}\boldsymbol{o\text{-}race}, \end{equation*}
\begin{equation*} \underline {\boldsymbol{w}}\boldsymbol{ednesday} \land \underline {\boldsymbol{s}}\boldsymbol{torm} \rightarrow \underline {\boldsymbol{n}}\boldsymbol{o\text{-}race}\}, \end{equation*}

the manifestation $n$ , and the hypothesis $\{w, c, s, r\}$ .

The set of all explanations ${\mathcal{E}}(I)$ consists of the explanations $\{w, c\}$ , $\{w, c, s\}$ , $\{w, c, r\}$ , $\{w, s\}$ , $\{w, s, c\}$ , $\{w, s, r\}$ , and $\{w, s, c, r\}$ . The explanations $\{w, c\}$ and $\{w, s\}$ are subset-minimal and hence constitute the set ${\mathcal{E}_M}(I)$ . When considering the elements of ${\mathcal{E}_M}(I)$ , we immediately observe that the set of relevant propositions ${\mathcal{R}\text{el}\mathcal{E}}(I)$ is formed of $w$ , $c$ , and $s$ . Whereas the only element in the necessary propositions ${\mathcal{N}\text{ec}\mathcal{E}}(I)$ is $w$ .

3 Facets in explanations

In non-monotonic reasoning, we commonly consider knowledge bases with multiple possible solutions, each leading to different conclusions. Central decision-based reasoning problems consider all possible solutions and consider how a variable relates to all solutions. When asking for brave (credulous) or cautious (skeptical) reasoning, we decide whether a variable belongs to one solution or all solutions, respectively. The underlying idea is that brave reasoning allows for multiple potential conclusions from a knowledge base, that is, a knowledge base may have uncertain outcomes. In contrast, skeptical reasoning requires a guaranteed outcome. This concept is also known in abductive reasoning with relevant and necessary explanations and has been considered in the literature, for example (Friedrich et al. Reference Friedrich, Gottlob and Nejdl1990; Eiter and Gottlob Reference Eiter and Gottlob1995; Eiter et al. Reference Eiter, Gottlob and Leone1997; Zanuttini Reference Zanuttini2003; Nordh and Zanuttini Reference Nordh and Zanuttini2008). However, a detailed complexity classification in Post’s lattice is open to date. In this section, we consider reasoning between relevant and necessary explanations, so-called facets. Intuitively, a variable $p$ is a facet if it is (i) part of some explanation (relevant), but (ii) not included in every explanation (dispensible). We start by defining facets formally.

Definition 3.1 (Facets). Let $I=(\text{KB}, H, M)$ be an $ABD$ instance. A variable $x \in H$ is a facet in the instance $I$ if $x \in {\mathcal{R}\text{el}\mathcal{E}}(I) \setminus {\mathcal{N}\text{ec}\mathcal{E}}(I)$ .

Based on this definition, we define a computational problem whose task is to decide whether a given variable is a facet or not.

The following example illustrates facets in the sailing scenario.

Example 3.2 (Cont.). Consider our $ABD$ instance from Example 2.3. Since ${\mathcal{R}\text{el}\mathcal{E}}(I) = \{\underline {\boldsymbol{w}}\boldsymbol{ednesday}, \underline {\boldsymbol{c}}\boldsymbol{alm}, \underline {\boldsymbol{s}}\boldsymbol{torm}\}$ and ${\mathcal{N}\text{ec}\mathcal{E}}(I) = \{\underline {\boldsymbol{w}}\boldsymbol{ednesday}\}$ , we observe that $c$ and $s$ are facets allowing for a variability in explanations whereas $\underline {\boldsymbol{w}}\boldsymbol{ednesday}$ occurs in all explanations and is thus not a facet. Note that without minimality in the relevance definition $\underline {\boldsymbol{r}}\boldsymbol{aining}$ would (contrary to intuition) qualify as relevant and also as a facet.

The dispensability condition can be checked fairly easy in many cases. To this end, we test, given $(\text{KB}, H, M, x)$ , whether $(\text{KB}, H \setminus \{x\}, M)$ admits an explanation or not. In particular, if ${ABD}(\Gamma )$ is in $\mathrm{P}$ , we can check this in polynomial time. Thus, the interesting computational aspect of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ is to decide when both relevance and dispensability can be decided without a major blow up in complexity. The core parts of our proofs give an immediate classification for relevance as well. Overall, we obtain an almost complete classification of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ .

Theorem 3.3. The classification of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ in Figure 1 a is correct.

Before we prove Theorem3.3 systematically, we observe that for most complexity questions of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ it is sufficient to consider the efpp-closure $\left \langle \Gamma \right \rangle _{\neq }$ of $\Gamma$ .

Lemma 3.4. Let $\Gamma$ and $\Gamma '$ be two constraint languages. If $\Gamma ' \subseteq \left \langle \Gamma \right \rangle _{\neq }$ , then $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma ') \leq ^{\mathrm{P}}_m \mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ .

Proof (Idea). We omit details, since the construction is exactly the same as in previous work ((Nordh and Zanuttini Reference Nordh and Zanuttini2008), Lemma 22), but the basic idea is simply to replace each relation by the set of constraints prescribed by the efpp-definition, and introducing fresh variables (kept outside the hypothesis) for any existentially quantified variables. This exactly preserves the set of (minimal) explanations.

Lifting Lemma3.4 to pp-definability does not appear to be possible in general: the classical trick when encountering an equality constraint $(x = y)$ is to identify the two variables throughout the instance. But consider, for example, an instance with knowledge base $\{(x = y), (x \rightarrow m), (y \rightarrow m)\}$ , $H = \{x,y\}\}$ , and $M = \{m\}$ . Then, $x$ and $y$ are both facets since $\{x\}$ and $\{y\}$ are both minimal explanations, but if we identify $y$ with $x$ and remove the equality constraint, we obtain the instance $\{x \rightarrow m\}$ where $x$ is not a facet, since there is only one minimal explanation $\{x\}$ . However, the loss of the equality relation in the efpp-closure $\left \langle \Gamma \right \rangle _{\neq }$ turns out to be manageable. We explain, in the proof of Theorem3.3, why our results in the next two sections extend to all co-clones. To obtain the systematic cases, we require numerous lemmas, which we establish in the following.

3.1 Computational upper bounds

Recall from Figure 1b that ${ABD}(\Gamma )$ is always either in (i) ${\mathrm{P}}$ , (ii) (co)NP, or (iii) $\Sigma ^P_2$ . Hence, our first task is to identify the corresponding classes for the $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ problem. Ideally, one could hope that $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ can be solved without a large increase in complexity, for example, going from ${\mathrm{P}}$ to being $\mathrm{NP}$ -hard. We will see that this can often, but not always, be achieved. We begin by analyzing the simple language $\{x \rightarrow y\}$ where the only allowed constraint is an implication between two variables. From Figure 1b, we know that ${ABD}(\{x \rightarrow y\})$ is in $\mathrm{P}$ , and this can be extended to $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\{x \rightarrow y\})$ via a more involved algorithm.

Lemma 3.5. $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\{x \rightarrow y\}) \in P$ .

Proof. Let $(\text{KB}, H, M, x)$ be an instance of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\{x \rightarrow y\})$ , that is, $\text{KB}$ only consists of implications. Note that $\text{KB}$ is 1-valid. Consequently, there is an explanation if and only if $H$ is an explanation. To see this, we observe that $H$ is always consistent with $\text{KB}$ . Thus, it has “maximal” entailment power. To explain a single $m \in M$ , a single $h \in H$ is always sufficient. For $m \in M$ , we denote by $h(m) = \{ h \in H \;|\; \text{KB} \land h \models m\}$ , that is, all hypotheses from $H$ that explain $m$ .

Now, we observe that $h(m)$ can be computed in polynomial time, for each $m\in M$ . Denote by $M_x \subseteq M$ the manifestations from $M$ that are explained by $x$ alone, that is, $M_x = \{ m \in M \;|\; \text{KB} \land x \models m\}$ . The set $M_x$ can also be computed in polynomial time by repeatedly checking whether $\text{KB} \land x \models m$ . Since $M_x$ is explained by $x$ , we make $x$ “relevant” by finding an $E \subseteq H\setminus \{x\}$ that avoids explaining at least one $m \in M_x$ . A maximal candidate for this is $H \setminus h(m)$ . We can accomplish this as follows:

This runs in p-time, since entailment for ${SAT}({x \rightarrow y, x, \bar x})$ is in p-time (Schaefer Reference Schaefer1978).

We continue with $\text{dualHorn}$ where $ABD$ is also in P. Here, membership in P for $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\text{dualHorn})$ is less obvious. Given that ${ABD}(\text{Horn})$ is NP-complete, we could suspect that checking for relevance and dispensability is computationally more expensive. First, we need the following technical lemma, where we recall that $\text{dualHorn}^{\text{-}} = \text{dualHorn} \setminus \{x, \neg x\}$ is the set obtained from $\text{dualHorn}$ by removing the two unit clauses.

Lemma 3.6 ( $\star$ Footnote 1 ). $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\textit{dualHorn}) \leq ^{\mathrm{P}}_m \mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\textit{dualHorn}^{\text{-}})$ .

Next, we show that the result for $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\{x \rightarrow y\})$ can be extended to $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\text{dualHorn}^{\text{-}})$ , and, thus, also to $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\text{dualHorn})$ via Lemma 3.6.

Lemma 3.7 ( $\star$ ). $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\text{dualHorn}^{\text{-}}) \in {{\mathrm{P}}}$ .

Our second major tractability result concerns 2- $\text{affine}$ , that is, either unit clauses or relations definable by $(x \oplus y = 0)$ (equality) or $(x \oplus y = 1)$ (inequality).

Lemma 3.8. $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\text{2-affine}) \in {{\mathrm{P}}}$ .

Proof. Let $(\text{KB}, H, M, x)$ be an instance of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\text{2-affine})$ . We assume that each relation in $\text{KB}$ is represented by precisely one linear equation of arity at most 2, see (Creignou et al. Reference Creignou, Olive and Schmidt2011) and (Mahmood et al. Reference Mahmood, Meier and Schmidt2021). First, if $\text{KB}$ is not satisfiable we answer no. Second, we propagate all unit clauses as in Lemma 3.6. Each remaining equation then expresses either equality or inequality between two variables. With the transitivity of the equality relation and the fact that in the Boolean case $a \neq b \neq c$ implies $a = c$ , we can identify equivalence classes of variables such that each two classes are either independent or they must have contrary truth values. We call a pair of dependent equivalence classes $(X, Y)$ a cluster, that is $X$ and $Y$ must take contrary truth values. Denote by $X_1, \dots , X_p$ the equivalence classes that contain variables from $M$ such that $X_i \cap M \neq \emptyset$ . Denote by $Y_1, \dots , Y_p$ the equivalence classes such that for each $i$ the pair $(X_i, Y_i)$ represents a cluster. We make the following stepwise observations: (1) there is an explanation if and only if $H \cap X_i \neq \emptyset$ for every $1 \leq i \leq p$ , (2) a subset-minimal explanation is constructed by taking exactly one representative from each $X_i$ . (3) $x$ can be made relevant if $x \in X_i$ , for some $i$ . (4) $x$ is a facet if additionally each $X_i$ contains at least one representative different from $x$ . These checks can be done in polynomial time. We conclude that $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\text{2-affine}) \in {{\mathrm{P}}}$ .

We continue with the corresponding membership questions for complexity classes above ${\mathrm{P}}$ . Here, we make a case distinction of whether the underlying satisfiability problem is in P, and in particular whether ${SAT}(\Gamma ^+)$ is in P. We begin with the following lemma.

Lemma 3.9 ( $\star$ ). If ${SAT}(\Gamma ^+) \in {{\mathrm{P}}}$ , then there is a polynomial time algorithm to determine whether a given $E \subseteq H$ is an explanation for a given abduction instance $(\text{KB}, H, M)$ .

In particular, we obtain the following general statement, which shows that the complexity of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ for many natural cases does not jump to ${\Sigma }^{{{\mathrm{P}}}}_2$ .

$\star$ ).

Lemma 3.10 ( For any constraint language $\Gamma$ , ${SAT}(\Gamma ^+) \in {{\mathrm{P}}} \Rightarrow \mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma ) \in {\mathrm{NP}}$ .

This covers a substantial number of cases since ${SAT}(\Gamma ^+)$ is in P when $\Gamma$ is Schaefer, that is, contained in ${\mathsf{IV}}_2$ ( $\text{dualHorn}$ ), ${\mathsf{IE}}_2$ ( $\text{Horn}$ ), ${\mathsf{IL}}_2$ ( $\text{affine}$ ), or ${\mathsf{ID}}_2$ ( $\text{2-CNF}$ ). Our last major tractable case concerns the set of essentially negative clauses $\text{EN}$ .

Lemma 3.11. $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\text{EN}) \in P$ .

Proof. We assume an arbitrary instance of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(EN)$ : $(\text{KB},H,M,x)$ . We first apply unit propagation, exactly as in Lemma 3.6. We can now assume that the instance only contains negative clauses of size $\geq 2$ and equality clauses. We organize all variables which are equal to each other into equivalence classes as in Lemma3.8, with the exception that all classes are independent in this case. If some equivalence class that contains an $m_i \in M$ does not contain variables from $H$ , this $m_i$ cannot be entailed. Thus, the abduction problem has no solutions and no facets.

Otherwise, if all classes that contain an $m_i \in M$ contain at least one variable from $H$ , we set all variables in these classes to true and check if this is consistent with $\text{KB}$ . This, guarantees the existence of abduction solutions. If this is the case, we can check if $x$ is a facet. For this, we need two conditions: let $x \in C$ where $C$ is an equivalence class. First, there must be at least one manifestation $m_i \in C$ , else $x$ cannot imply any $m_i$ and thus is never needed in a subset-minimal solution ( $x$ would not be relevant). Second, there must be at least one variable $x_i\in C$ different from $x$ , otherwise $x$ will always be needed to explain $m_i$ and there can be no solution without it ( $x$ would be necessary).

3.2 Computational lower bounds

We begin with a general result that implies that the facet problem is always at least as hard as the underlying abduction problem, provided the Boolean equality relation can be expressed.

Lemma 3.12. ${ABD}(\Gamma ) \leq ^{\mathrm{P}}_m \mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ if $(x = y) \in \Gamma$ for any constraint language $\Gamma$ .

Proof. Given an instance $(\text{KB}, H, M)$ of ${ABD}(\Gamma )$ we let $x,y,m$ be fresh variables. We define the instance $(\text{KB}', H', M', x)$ of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ as $\text{KB}' = \text{KB} \cup \{(x=m), (y=m)\},\ H' = H \cup \{x,y\},\ M' = M \cup \{m\}$ . We claim that $(\text{KB}, H, M)$ admits an explanation if and only if $x$ is a facet in $(\text{KB}', H', M')$ . (“ $\Rightarrow$ ”): assume that $E \subseteq H$ is a subset-minimal explanation. Then, $E \cup \{x\}$ and $E \cup \{y\}$ are both subset-minimal explanations in $(\text{KB}', H', M')$ , so $x$ is a facet. (“ $\Leftarrow$ ”): assume that $x$ is a facet in $(\text{KB}', H', M')$ . Then, there exists a subset minimal explanation $E' \subseteq H'$ where $x \in E'$ , and it follows that $E' \setminus \{x\}$ is a (subset-minimal) explanation for $(\text{KB}, H, M)$ .

By combining this with Lemmas3.4 and 3.13, we inherit all hardness results from $ABD$ . All non-polynomial cases of ${ABD}(\Gamma )$ satisfy $\Gamma \not \subseteq {\mathsf{IS}^{}_{12}}$ and $\Gamma \not \subseteq {\mathsf{IS}^{}_{02}}$ , that is, such languages are not essentially negative and not essentially positive (Nordh and Zanuttini Reference Nordh and Zanuttini2008).

Lemma 3.13. ((Mahmood et al. Reference Mahmood, Meier and Schmidt2021), Lemma 9) Let $\Gamma$ be a constraint language. If $\Gamma \not \subseteq {\mathsf{IS}^{}_{12}}$ and $\Gamma \not \subseteq {\mathsf{IS}^{}_{02}}$ , then $(x=y) \in \left \langle \Gamma \right \rangle _{\neq }$ and $\left \langle \Gamma \right \rangle = \left \langle \Gamma \right \rangle _{\neq }$ .

However, the facet problem $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}$ is generally even harder than $ABD$ . We present a technical lemma, providing us the unit clause $(x)$ for free. Then, we will present languages where $ABD$ is polynomial and $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}$ is NP-hard, as well as languages where $ABD$ is coNP-complete, while $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}$ is $\Sigma _2^P$ -complete.

Lemma 3.14 ( $\star$ ). For any constraint language $\Gamma$ , it holds that $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma \cup \{(x)\}) \leq ^{\mathrm{P}}_m \mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ .

We are now ready to state a crucial reduction, which is at the heart of the increased complexity of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}$ versus $ABD$ . The $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}$ -problem allows to simulate negative unit clauses, provided that the language is 1-valid and can express implication $(x \rightarrow y)$ .

Lemma 3.15. If $\Gamma$ is 1-valid, then ${ABD}(\Gamma \cup \{(\neg x)\}) \leq ^{\mathrm{P}}_m \mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma \cup \{x \rightarrow y\})$ .

Proof. Let $(\text{KB}, H, M)$ be an instance of ${ABD}(\Gamma \cup \{(\neg x)\})$ . If $\text{KB}$ contains two unit clauses $\neg x$ and $\neg y$ for distinct variables $x$ and $y$ , we may simply identify $x$ with $y$ and obtain an equivalent instance. Thus, we may wlog assume that $\text{KB} = \varphi \land (\neg z)$ , $z \in \mathrm{var}(\varphi )$ , for a $\Gamma$ -formula $\varphi$ . Let $x,y,m$ denote fresh variables and define $V = \mathrm{var}(\varphi ) \cup H \cup M \cup \{x,y,m\}$ . We define the target instance $(\text{KB}', H', M', x)$ as

\begin{equation*}\text{KB}' = \varphi \land \bigwedge _{x_i \in V} (z \rightarrow x_i) \land (x \rightarrow m) \land (y \rightarrow m),\quad H' = H \cup \{x,y\},\quad M' = M \cup \{m\}.\end{equation*}

Note that $\text{KB}'$ is a $\Gamma \cup \{x \rightarrow y\}$ -formula, as required.

In the following, we prove correctness formally. Observe first that for $z=0$ , $\text{KB}$ and $\varphi$ have exactly the same models (upto the fresh variables $x,y,m$ ). For $z=1$ , $\varphi$ may admit additional models. However, due to $\text{KB}'$ containing the construct $\bigwedge _{x_i \in V} (z \rightarrow x_i)$ , the only additional model is the all-1 model.

Correctness:

(“ $\Rightarrow$ ”): Be $E\subseteq H$ an explanation for $(\text{KB}, H,M)$ . Then, with the above observation that the only additional model is the all-1 model (which satisfies $M$ and $m$ ), it is easily observed that

  1. 1. $E' = E \cup \{x\}$ constitutes an explanation for $(\text{KB}', H', M')$

  2. 2. $E' \setminus \{x\}$ is no explanation for $(\text{KB}', H', M')$

  3. 3. there is an explanation without $x$ , namely $E \cup \{y\}$

In summary, $x$ is a facet.

(“ $\Leftarrow$ ”): Be $x$ a facet for $(\text{KB}', H', M')$ . Then there is a set $E' \subseteq H'$ such that

  1. 1. $E'$ is an explanation for $(\text{KB}', H', M')$

  2. 2. $E'\setminus \{x\}$ is no explanation for $(\text{KB}', H', M')$

  3. 3. there is an explanation for $(\text{KB}', H', M')$ without $x$

From the construction it is easily observed that $E'$ must be of the form $E' = E \cup \{x\}$ , for an $E \subseteq H$ . Since $E'\setminus \{x\} = E$ fails as explanation for $(\text{KB}', H', M')$ , and $E$ is obviously consistent with (1-valid) $\text{KB}'$ , we conclude that $E$ fails due to $\text{KB}' \land E$ not entailing $M' = M \cup \{m\}$ . That is,

(1) \begin{align} \text{KB}' \land E \not \models M \cup \{m\} \end{align}

Further, since $E \cup \{x\}$ is an explanation, we know that $\text{KB}' \land E \cup \{x\}$ does entail $M' = M \cup \{x\}$ . Since by construction, $x$ can not be responsible for entailing $M$ , we conclude that $\text{KB}' \land E$ entails $M$ . That is,

(2) \begin{align} \text{KB}' \land E \models M \end{align}

From (1) and (2) we conclude that that $\text{KB}' \land E \not \models m$ . From this we conclude that $\text{KB}' \land E$ admits models where $z = 0$ (otherwise, $\text{KB}' \land E$ would entail $m$ , due to $\text{KB}'$ containing $z \rightarrow m$ ). Therefore, we conclude that $\text{KB}'[z=0] \land E$ admits models (is consistent) and entails $M$ . Since $\text{KB}'[z=0] \equiv \text{KB}$ (upto the “irrelevant” variables $x,y,m$ ) we conclude that 1) $\text{KB} \land E$ is consistent, and 2) $\text{KB} \land E \models M$ . That is, $E$ is an explanation for $(\text{KB}, H, M)$ .

We are now ready to derive the hardness results in a series of short, technical lemmas.

Lemma 3.16 ( $\star$ ). If ${\mathsf{II}}_1 = \left \langle \Gamma \right \rangle$ or ${\mathsf{IE}}_1 = \left \langle \Gamma \right \rangle$ , then ${ABD}(\Gamma \cup \{(\neg x)\}) \leq ^{\mathrm{P}}_m \mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ .

Lemma 3.17 ( $\star$ ). If $\mathsf{IN} \subseteq \left \langle \Gamma \right \rangle$ , then $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ is ${\Sigma }^{{{\mathrm{P}}}}_2$ -hard.

Lemma 3.18 ( $\star$ ). If ${\mathsf{IE}} \subseteq \left \langle \Gamma \right \rangle$ , then $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ is $\mathrm{NP}$ -hard.

We remark that Friedrich et al. (Reference Friedrich, Gottlob and Nejdl1990) prove NP-hardness for the relevance problem for ${\mathsf{IE}}_1$ . While this proof can be adapted to our setting it does not generalize to the other cases in this section. By combining all results, we obtain the main result of the paper.

Proof of Theorem 3.3. First, we observe that the each language $\Gamma$ considered in Lemma3.7, Lemma3.8, or Lemma3.11 either contains or can define equality $(x = y)$ . Hence, Lemma3.4 is applicable and proves tractability for any language $\Delta$ such that $\Delta \subseteq \left \langle \Gamma \right \rangle _{\neq } = {\langle \Gamma \rangle }$ . This covers all tractable cases in Figure 1a.

For intractability, all NP-complete cases except ${\mathsf{IE}}_1$ and $\mathsf{IE}$ follow from Lemma3.10 (since ${SAT}(\Gamma ^+)$ is in P for every such $\Gamma$ ), Lemma3.12, and Lemma3.13. The former two cases are instead proven to be NP-hard in Lemma3.18, and inclusion in NP follows from Lemma3.10. Last, ${\Sigma }^{{{\mathrm{P}}}}_2$ -hardness for all remaining cases follow from Lemma3.17, and inclusion in ${\Sigma }^{{{\mathrm{P}}}}_2$ is straightforward via arguments similar to Lemma3.10.

We view the two missing cases $\mathsf{IL}$ (even linear equations) and ${\mathsf{IL}}_1$ (even linear equations, and unit clauses) as interesting future research questions. However, via Lemma3.10 we may at least observe that $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma ) \in {\mathrm{NP}}$ for any base $\Gamma$ of $\mathsf{IL}$ or ${\mathsf{IL}}_1$ . Hence, the only question remaining is whether these problems are in P, NP-complete, or — unlikely but still possible — NP-intermediate.

We conclude this section with the observation that all membership and hardness proofs, which we have given for the $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}$ problem are also applicable to the relevance problem (deciding if $x \in H$ is relevant).

Theorem 3.19. The classification of $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ in Figure 1 a also describes the complexity of the relevance problem.

Proof. Recall that an instance of the relevance problem is given by $ I = (\text{KB}, H, M, x)$ and the question is whether $x \in {\mathcal{R}\text{el}\mathcal{E}}(I)$ , that is, whether $x$ belongs to a subset-minimal explanation. We revisit now all membership and hardness proofs for $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}$ and observe that they can easily be adapted to the relevance problem.

Membership in ${\mathrm{P}}$ . First observe that the reduction of Lemma 3.6 to get rid of unit clauses can be performed analogously on the relevance problem. Next we observe that all algorithms showing P-membership, that is, Lemmas3.5, 3.7, 3.8, and 3.11, first decide whether the given $x$ is relevant, and then in a second (independent) step decide whether $x$ is dispensable (not necessary). By dropping the dispensability check, we obtain a polynomial time algorithm to decide the relevance problem.

Membership in $\mathrm{NP}$ . Analogously to the P-membership algorithms we drop the step of the dispensability check: in Lemma3.10 omit guessing an $E_2 \subseteq H \setminus \{x\}$ and verifying that $E_2$ is an explanation.

Membership in ${\Sigma }^{{{\mathrm{P}}}}_2$ . Inclusion in ${\Sigma }^{{{\mathrm{P}}}}_2$ is straightforward via arguments similar to Lemma3.10 using an NP-oracle.

$\mathrm{NP}$ -hardness and ${\Sigma }^{{{\mathrm{P}}}}_2$ -hardness. First observe that Lemma3.4 is also applicable to relevance, since the underlying reduction preserves the exact set of explanations. Next observe that the reduction from Lemma3.12 carries over one-to-one to relevance. It is optional to simplify the proof by removing the clause $(y = m)$ and the variable $y$ (whose only purpose was to assure that $x$ is not necessary). Lemma3.14 is easily observed to carry over one-to-one to relevance. Lemma3.15 carries over one-to-one, again it is optional to simplify the proof by removing $y$ and the clause $(y \rightarrow m)$ . Finally, Lemmas3.16, 3.17, and 3.18 hold analogously, since all used lemmas carry over, as shown above.

In summary, this theorem is proven analogously to Theorem3.3.

4 Diverse explanations

In the previous section on facets, we considered whether there exists a variable that is relevant in explanations but dispensable. Thereby, we obtain a notion on flexibility on one variable belonging to explanations. Now, we lift flexibility from one variable to a set of variables in explanations and ask whether there exist two explanations of sufficiently high diversity. It turns out that this metric can be precisely related to the existence of facets and several notions from our facet classification carry over. However, measuring the distance is provably much harder and becomes NP-hard already for the small fragment of Horn consisting of a single implication $(x \rightarrow y)$ . Before, we illustrate our complexity results in detail, we define our distance measure and computational problem.

Definition 4.1. Let $I=(\text{KB}, H, M)$ be an $ABD$ instance, and $E_1\subseteq H$ and $E_2\subseteq H$ be two sets of variables over the hypotheses $H$ . Then, the distance $d(E_1, E_2)$ between $E_1$ and $E_2$ is the cardinality of their symmetric difference. More formally,

\begin{align*} d(E_1, E_2) := &\, {|\,E_1 \triangle E_2\,|} = {|\,(E_1 \cup E_2) \setminus (E_1 \cap E_2)\,|} \\ =&\, {|\,\{x \in H \;|\; x \in E_1 \text{ and } x \notin E_2 \text{ or } x \notin E_1 \text{ and } x \in E_2\}\,|}. \end{align*}

If $E_1$ and $E_2$ are explanations, i.e., $E_1, E_2 \in {\mathcal{E}}(I)$ , and $d(E_1,E_2) \geq k$ , then we call $E_1$ and $E_2$ $k$ -diverse explanations.

Note that the maximum distance is $|H|$ , which is reached by $d(H, \emptyset )$ . Our notion of distance is in line with the corresponding notion for ${SAT}(\Gamma )$ (Misra et al. Reference Misra, Mittal and Rai2024) and many other diversity problems studied in AI. Note that we do not require that the two explanations are minimal, since the distance notion does not require this.

Next, we define the diversity problem for abduction.

We have the following relationship to facets.

Proposition 4.2. Let $I = (\text{KB}, H, M)$ be an instance of ${ABD}(\Gamma )$ and $E_1, E_2 \in {\mathcal{E}_M}(I)$ . Then, every $x \in E_1 \triangle E_2$ is a facet.

Proof. Let $x \in E_1 \triangle E_2 = (E_1 \cup E_2) \setminus (E_1 \cap E_2)$ . We observe that if $x$ is not a facet then either (1) $x \in E_1 \cap E_2$ since it is part of every explanation, or (2) $x \notin E_1 \cup E_2$ since it is not included in any subset-minimal explanation. Hence, $x \notin E_1 \triangle E_2$ , meaning that $x \in E_1 \triangle E_2$ is only possible if $x$ is a facet.

From the relationship between facets and the distance notion, which we establish in Proposition4.2, we can suspect similarities between the computational problems $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ and $\mathrm{D\scriptstyle {IV}-ABD}(\Gamma )$ . First, we establish that all hardness results are inherited from $\textrm {ABD}$ , analogously to Lemma3.12 and Lemma3.4 for $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ .

Lemma 4.3 ( $\star$ ). ${ABD}(\Gamma ) \leq ^{\mathrm{P}}_m \mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}(\Gamma )$ if $(x = y) \in \Gamma$ .

Also, ${\Sigma }^{{{\mathrm{P}}}}_2$ -hardness for 1-valid and complementive languages is obtained analogously.

Lemma 4.4 ( $\star$ ). If $\mathsf{IN} \subseteq \left \langle \Gamma \right \rangle$ , then $\mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}(\Gamma )$ is ${\Sigma }^{{{\mathrm{P}}}}_2$ -hard.

However, the problem $\mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}$ is generally harder than $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}$ . Below, we establish $\mathrm{NP}$ -hardness for simple implicative languages. We reduce from the problem $\mathrm{D\scriptstyle {IV}}\text{-}\mathrm{P\scriptstyle {OS2}}\mathrm{SAT}$ where we are given a positive 2-CNF formula $\varphi$ and an integer $k$ . Therefore, we require two models of Hamming distance at least $k$ , which is $\mathrm{NP}$ -hard (Misra et al. Reference Misra, Mittal and Rai2024).

Lemma 4.5 ( $\star$ ). $\mathrm{D\scriptstyle {IV}}\text{-}\mathrm{P\scriptstyle {OS2}}\mathrm{SAT} \leq ^{\mathrm{P}}_m \mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}(\{x \rightarrow y\})$ .

Thus, despite the similarities between $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ and $\mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}(\Gamma )$ , the latter seems to be significantly harder. However, we observe two tractable cases.

Lemma 4.6 ( $\star$ ). $\mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}(\text{2-affine}), \mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}(\text{EP}) \in {{\mathrm{P}}}$ .

5 Conclusion

In this paper, we introduce faceted reasoning to propositional abduction. We illustrate that this reasoning allows more fine-grained decisions than previously explored notions such as relevance and necessity/dispensability. We relate facets to the problem of finding diverse explanations. We establish an almost complete complexity classification in Post’s lattice. In many cases, facets can be found without a major blow-up in complexity. This is particularly interesting, given that counting minimal explanations is almost always substantially harder (Hermann and Pichler Reference Hermann and Pichler2010). Our facet classification also implies a corresponding classification for the relevance problem, thus answering an open question (Zanuttini Reference Zanuttini2003; Nordh and Zanuttini Reference Nordh and Zanuttini2008). For diversity, our results are less conclusive, but any major tractable cases seem unlikely, since it is hard already for the fragment $(x \rightarrow y)$ .

5.1 Completing the trichotomy

The two open cases, affine equations of odd length with, or without, unit clauses, could be interesting to resolve. It seems unlikely that $\mathrm{I\scriptstyle {S}}\mathrm{F\scriptstyle {ACET}}(\Gamma )$ could be tractable for such languages, but, at the same time, any hardness proof likely needs to involve significant new ideas. Note that affine languages were also absent in earlier counting complexity classifications (Hermann and Pichler Reference Hermann and Pichler2010). Hence, there is a blind spot for complexity of abduction. One possible way forward could be to first classify the complexity of $\mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}(\Gamma )$ for all affine languages, which seems likely to be hard.

5.2 Parameterized complexity and diversity

It is reasonable that $\mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}(\Gamma )$ can be fully classified, but without any large, tractable cases. However, more fine-grained investigations involving problem structure (parameter) remains interesting. For $\mathrm{D\scriptstyle {IV}}\text{-}\mathrm{ABD}(\Gamma )$ a natural parameter is the maximum allowed distance. A systematic classification could not only open up new efficiently solvable cases but could also prove to be a useful framework for proving hardness for other types of diversity problems, especially, since the classical complexity of abduction is much richer than the one for SAT.

5.3 Abductive logic programming (ALP)

Since detailed complexity results on logic programs (Truszczyński Reference Truszczyński2011) and results on the complexity of ALP regarding consistency, relevance, and necessity (Eiter et al. Reference Eiter, Gottlob and Leone1995) exist, it could be interesting to extend our results to stable models in ALP where the input is given in form of rules.

5.4 Applications

Faceted reasoning can aid the search for heterogeneous explanations, which could be valuable in any domain with classical applications of abduction, for example, diagnosis, and explainable AI. More concretely, a practical application of facets are logistics applications where solutions need to be explained such as in the Beluga AI Competition (Gnad et al. Reference Gnad, Hecher, Gaggl, Rusovac, Speck and Fichte2025). There, several tasks aim at flexibility in explanations or alternatives. Finally, we hope that our results on diverse explanations will spark interest into a deeper complexity study and exploration to use these for tasks like model debugging or decision support. Beyond abductive reasoning, we expect that facets and diversity could be interesting for epistemic logic programs (Eiter et al. Reference Eiter, Fichte, Hecher and Woltran2024), default logic (Fichte et al. Reference Fichte, Meier and Schindler2024), and probabilistic reasoning (Fichte et al. Reference Fichte, Hecher and Nadeem2022b).

Acknowledgments

The work has received funding from the Swedish research council under grant VR-2022-03214 and ELLIIT funded by the Swedish government.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S1471068425100215.

Footnotes

*

Author names are stated in reverse alphabetical order.

1 We prove statements marked by “ $\star$ ” in the supplemental material.

References

Alrabbaa, C., Rudolph, S. and Schweizer, L. 2018. Faceted answer-set navigation. In Proc. RuleML+RR’18. Springer, 211225.Google Scholar
Arora, S. and Barak, B. 2009. Computational Complexity - A Modern Approach. Cambridge University Press.10.1017/CBO9780511804090CrossRefGoogle Scholar
Böhl, E., Gaggl, S. A. and Rusovac, D. 2023. Representative answer sets: Collecting something of everything. In ECAI. IOS Press, 271278.Google Scholar
Böhler, E., Reith, S., Schnoor, H. and Vollmer, H. 2005. Bases for Boolean co-clones. Information Processing Letters 96, 2, 5966.10.1016/j.ipl.2005.06.003CrossRefGoogle Scholar
Bylander, T., Allemang, D., Tanner, M. C. and Josephson, J. R. 1991. The computational complexity of abduction. Artificial Intelligence 49, 1, 2560.10.1016/0004-3702(91)90005-5CrossRefGoogle Scholar
Cook, S. A. 1971. The complexity of theorem-proving procedures. In STOC 1971. ACM, 151158.Google Scholar
Creignou, N., Kröll, M., Pichler, R., Skritek, S. and Vollmer, H. 2019. A complexity theory for hard enumeration problems. Discrete Applied Mathematics 268, 191209.10.1016/j.dam.2019.02.025CrossRefGoogle Scholar
Creignou, N., Olive, F. and Schmidt, J. 2011. Enumerating all solutions of a Boolean CSP by non-decreasing weight. In Proc. SAT. Springer, Vol. 6695, 120133.Google Scholar
Creignou, N., Schmidt, J. and Thomas, M. 2010. Complexity of propositional abduction for restricted sets of Boolean functions. In Proc. KR. AAAI Press, 816.Google Scholar
Creignou, N. and Vollmer, H. 2008. Boolean Constraint Satisfaction Problems: When Does Post’s Lattice Help?. Springer, 337.Google Scholar
Creignou, N. and Zanuttini, B. 2006. A complete classification of the complexity of propositional abduction. SIAM Journal on Computing 36, 1, 207229.10.1137/S0097539704446311CrossRefGoogle Scholar
Dai, W.-Z., Xu, Q., Yu, Y. and Zhou, Z.-H. 2019. Bridging machine learning and logical reasoning by abductive learning. In Proc. NeurIPS 2019. Curran Associates, Inc, Vol, 32.Google Scholar
Dellsén, F. 2024. Abductive reasoning in science. In Elements in the Philosophy of Science. Cambridge University Press.Google Scholar
Eiter, T., Fichte, J. K., Hecher, M. and Woltran, S. 2024. Epistemic logic programs: Non-ground and counting complexity. In Proc. IJCAI. ijcai.org, 33333341.Google Scholar
Eiter, T. and Geibinger, T. 2023. Explaining answer-set programs with abstract constraint atoms. In Proc. IJCAI. International Joint Conferences on Artificial Intelligence Organization, 31933202.Google Scholar
Eiter, T. and Gottlob, G. 1995. The complexity of logic-based abduction. Journal of the ACM 42, 1, 342.10.1145/200836.200838CrossRefGoogle Scholar
Eiter, T., Gottlob, G. and Leone, N. 1995. Complexity results for abductive logic programming. In Proc. LPNMR. Springer, 114.Google Scholar
Eiter, T., Gottlob, G. and Leone, N. 1997. Semantics and complexity of abduction from default theories. Artificial Intelligence 90, 1, 177223.10.1016/S0004-3702(96)00040-9CrossRefGoogle Scholar
Fellows, M. R., Pfandler, A., Rosamond, F. A. and Rümmele, S. 2012. The parameterized complexity of abduction. In Proc. AAAI. AAAI Press, 743749.Google Scholar
Fichte, J. K., Fröhlich, N., Hecher, M., Lagerkvist, V., Mahmood, Y., Meier, A. and Persson, J. 2025. Facets in argumentation: A formal approach to argument significance. In Proc. IJCAI 2025. ijcai.org, In press.Google Scholar
Fichte, J. K., Gaggl, S. A. and Rusovac, D. 2022a. Rushing and strolling among answer sets – navigation made easy. In Proc. AAAI. AAAI Press, 56515659.Google Scholar
Fichte, J. K., Hecher, M. and Nadeem, M. A. 2022b. Plausibility reasoning via projected answer set counting - A hybrid approach. In Proc. IJCAI. ijcai.org, 26202626.Google Scholar
Fichte, J. K., Meier, A. and Schindler, I. 2024. Strong backdoors for default logic. ACM Transactions on Computational Logic 25, 3, 124.10.1145/3655024CrossRefGoogle Scholar
Friedrich, G., Gottlob, G. and Nejdl, W. 1990. Hypothesis classification, abductive diagnosis and therapy. In ESE 1990. Springer, 6978.Google Scholar
Gnad, D., Hecher, M., Gaggl, S., Rusovac, D., Speck, D. J. and Fichte, J. K. 2025. Interactive exploration of plan spaces. In Proc. KR 2025. To appear. IJCAI Organization, In press.Google Scholar
Hermann, M. and Pichler, R. 2010. Counting complexity of propositional abduction. Journal of Computer and System Sciences 76, 7, 634649.10.1016/j.jcss.2009.12.001CrossRefGoogle Scholar
Hu, W.-C., Dai, W.-Z., Jiang, Y. and Zhou, Z.-H. 2025. Efficient rectification of neuro-symbolic reasoning inconsistencies by abductive reflection. In Proc. AAAI 2025, 1733317341.Google Scholar
Ignatiev, A., Narodytska, N. and Marques-Silva, J. 2019. Abduction-based explanations for machine learning models. In Proc. AAAI 2019. AAAI Press, 15111519.Google Scholar
Kakas, A. C., Kowalski, R. A. and Toni, F. 1992. Abductive logic programming. Journal of Logic and Computation 2, 6, 719770.10.1093/logcom/2.6.719CrossRefGoogle Scholar
Kleine Büning, H. and Lettmann, T. 1999. Propositional Logic – Deduction and Algorithms, Vol. 48 of Cambridge tracts in theoretical computer science. Cambridge University Press.Google Scholar
Lewis, H. 1979. Satisfiability problems for propositional calculi. Mathematical Systems Theory 13, 4553.10.1007/BF01744287CrossRefGoogle Scholar
Mahmood, Y., Meier, A. and Schmidt, J. 2021. Parameterized complexity of abduction in Schaefer’s framework. Journal of Logic and Computation 31, 1, 266296.10.1093/logcom/exaa079CrossRefGoogle Scholar
Miller, T. 2019. Explanation in artificial intelligence: Insights from the social sciences. Artificial Intelligence 267, 138.10.1016/j.artint.2018.07.007CrossRefGoogle Scholar
Minsky, M. 1974. A framework for representing knowledge. Technical report, Massachusetts Institute of Technology, USA. https://dspace.mit.edu/bitstream/handle/1721.1/6089/AIM-306.pdf?sequence=2&isAllowed=y.Google Scholar
Misra, N., Mittal, H. and Rai, A. 2024. On the parameterized complexity of diverse SAT. In ISAAC 2024. Dagstuhl, 50:150:18.Google Scholar
Nordh, G. and Zanuttini, B. 2008. What makes propositional abduction tractable. Artificial Intelligence 172, 10, 12451284.10.1016/j.artint.2008.02.001CrossRefGoogle Scholar
Papadimitriou, C. H. 1994. Computational Complexity. Addison-Wesley.Google Scholar
Pfandler, A., Rümmele, S. and Szeider, S. 2013. Backdoors to abduction. In Proc. IJCAI 2013. IJCAI/AAAI, 10461052.Google Scholar
Post, E. 1941. The two-valued iterative systems of mathematical logic. Annals of Mathematics Studies 5, 1122.Google Scholar
Rusovac, D., Hecher, M., Gebser, M., Gaggl, S. A. and Fichte, J. K. 2024. Navigating and Querying Answer Sets: How Hard Is It Really and Why? In Proc. KR 2024. IJCAI Organization, 642653.10.24963/kr.2024/60CrossRefGoogle Scholar
Schaefer, T. J. 1978. The complexity of satisfiability problems. In Proc. STOC 1978. ACM, 216226.Google Scholar
Selman, B. and Levesque, H. J. 1990. Abductive and default reasoning: A computational core. In Proc. AAAI 1990. AAAI Press / The MIT Press, 343348.Google Scholar
Speck, D., Hecher, M., Gnad, D., Fichte, J. K. and Corrêa, A. B. 2025. Counting and reasoning with plans. In Proc. AAAI 2025. AAAI Press, 2668826696.Google Scholar
Stockmeyer, L. J. 1976. The polynomial-time hierarchy. Theoretical Computer Science 3, 1, 122.10.1016/0304-3975(76)90061-XCrossRefGoogle Scholar
Stockmeyer, L. J. and Meyer, A. R. 1973. Word problems requiring exponential time: Preliminary report. In STOC 1973. ACM, 19.Google Scholar
Truszczyński, M. 2011. Trichotomy and dichotomy results on the complexity of reasoning with disjunctive logic programs. Theory and Practice of Logic Programming 11, 6, 881904.10.1017/S1471068410000463CrossRefGoogle Scholar
Wrathall, C. 1976. Complete sets and the polynomial-time hierarchy. Theoretical Computer Science 3, 1, 2333.10.1016/0304-3975(76)90062-1CrossRefGoogle Scholar
Yu, J., Ignatiev, A., Stuckey, P. J., Narodytska, N. and Marques-Silva, J. 2023. Eliminating the impossible, whatever remains must be true: On extracting and applying background knowledge in the context of formal explanations. In Proc. AAAI 2023. AAAI Press, 41234131.Google Scholar
Zanuttini, B. 2003. New polynomial classes for logic-based abduction. Journal of Artificial Intelligence Research 19, 110.10.1613/jair.1170CrossRefGoogle Scholar
Figure 0

Fig. 1. Illustration of complexity results via Post’s lattice.

Figure 1

Table 1. Constraint languages and their corresponding co-clones. Here, $\text{Pos}(c)$ and $\text{Neg}(c)$ denote the number of positive and negative literals in a clause $c$, respectively. For more details, we refer to the work by Böhler et al. (2005) and the table in the supplemental material to this paper

Supplementary material: File

Schmidt et al. supplementary material

Schmidt et al. supplementary material
Download Schmidt et al. supplementary material(File)
File 627.4 KB