1 Introduction
Aggregate expressions are very useful and have been added to classical logic, query languages, constraint languages, and also to logic programming (LP) and answer set programming (ASP). The effort it takes to add aggregates to (syntax and semantics of) a logic is very language dependent. For example, to extend first order logic (FO) with a Count aggregate, we extend the definition of “term” with a new inductive rule: “ If $a_1,\ldots,a_n$ are variables, and $\psi$ a formula, then $Count(\{(a_1,\ldots,a_n),\psi\})$ is a term” and the definition of “interpretation of a term t in structure I” (used in the definition of $\models$ ) with: “ If $t = Count(\{(a_1,..,a_n),\psi\})$ then $t^I$ is $\# \{(d_1,\ldots,d_n)\in Dom(I)^n | I[a_1:d_1,\ldots,a_n:d_n] \models \psi$ }, that is, the number of tuples that satisfy $\psi$ in I.”. The method is simple and follows Frege’s compositionality principle.
In LP and ASP, it is much more difficult. Research into extensions with aggregates (well-founded and stable semantics) started with the work of Kemp and Stuckey (Reference Kemp and Stuckey1991). Many approaches exist, but so far no consensus on how to handle aggregates in those logics has been reached.
We propose a framework for defining semantics for extensions of LP and ASP based on Approximation Fixpoint Theory (AFT) introduced by Denecker et al. (Reference Denecker, Marek and Truszczyński2000) and (2004). AFT is an abstract lattice theoretic formalization of constructive methods for non-monotonic operators. It defines different types of constructions and fixpoints to a lattice operator in an approximation space, including supported, Kripke–Kleene (KK), stable and well-founded (WF) fixpoints. The theory has been applied to a range of non-monotonic logics to characterize existing as well as new semantics: for example, LP and ASP as shown in the paper by Denecker et al. (Reference Denecker, Bruynooghe and Vennekens2012), autoepistemic and default logic as shown in the paper by Denecker et al. (Reference Denecker, Marek and Truszczynski2003), higher order LP as shown in the paper by Charalambidis et al. (Reference Charalambidis, Rondogiannis and Symeonidou2018), argumentation frameworks and abstract dialectal frameworks as shown in the papers by Strass (Reference Strass2013) and Bogaerts (Reference Bogaerts2019). AFT has been applied to aggregate LP and ASP resulting in the ultimate stable and well-founded semantics in the work by Denecker et al. (Reference Denecker, Pelov and Bruynooghe2001) and the broader framework in the paper by Pelov et al. (Reference Pelov, Denecker and Bruynooghe2007) where stable and well-founded semantics are induced by a choice of a 3-valued truth function.
Here, we clarify and expand this work. First, to make the AFT framework more accessible to the ASP community, we show how each approximation truth assignment can be broken up in a lower and an upper ternary satisfaction relation which, in the context of ASP can be easily related to the reduct approach originally used by Gelfond and Lifschitz (Reference Gelfond and Lifschitz1988) to define stable semantics. Then we focus on aggregate ASP using examples from the literature. Where possible, we consider aggregate atoms with positive and negative literals as conditions. However, the semantics described by Gelfond and Zhang (Reference Gelfond and Zhang2019) does not allow negation by default inside an aggregate atom, therefore we only consider positive conditions for this specific case. It is shown that not only the semantics of Denecker et al. (Reference Denecker, Pelov and Bruynooghe2001), Pelov et al. (Reference Pelov, Denecker and Bruynooghe2007) but also those of Liu et al. (Reference Liu, Pontelli, Son and Truszczyński2010) and Gelfond and Zhang (Reference Gelfond and Zhang2019) are instances of our framework. But not all proposed semantics for aggregate ASP semantics belong to our framework; for example those of Ferraris (Reference Ferraris2011), Marek and Remmel (Reference Marek and Remmel2004), Faber et al. (Reference Faber, Pfeifer and Leone2011). We investigate the reason for this. The paper contributes to the discussion about semantics for Aggregate ASP by clarifying some important principles of NMR and by showing where they are applied and where other principles are applied.
2 Approximation fixpoint theory
Here, we recall the basics of AFT from the work by Denecker et al. (Reference Denecker, Marek and Truszczyński2000). In many non-monotonic languages, a theory defines a semantic lattice Footnote 1 operator $O: L\rightarrow L$ . If O is monotone, its least fixpoint is often taken as the semantics of the theory. Otherwise, AFT can be applied. The first step is to associate the approximation space $L^2$ to L. A pair $(x,y)\in L^2$ is an approximation of any $z\in[x,y]$ . With $x \leq y$ , the interval is non-empty and the pair is consistent. $L^c$ is the subspace of consistent pairs. $L^2$ and $L^c$ possess (i) a precision order, $(x,y)\leq_p (u,v)$ if $x\leq u$ and $y\geq v$ and (ii) the embedding of L, namely the set of exact pairs (x,x) which approximate only x. The least precise point is $(\bot,\top)$ , with $\bot=glb(L),\top=lub(L)$ .
The second step is to assign an approximating operator A to O: a $\leq_p$ -monotone operator on $L^2$ or $L^c$ such that if (x,y) approximates z then A(x,y) approximates O(z); an approximator on $L^2$ also has to be symmetric: $A(x,y)=(u,v)$ iff $A(y,x)=(v,u)$ . Thus, increasing the precision of the input to an approximator A increases the precision of the output. With an approximator A, several types of fixpoints are definable. The least fixpoint (lfp) construction $(\bot,\top)$ , $A(\bot,\top)$ ,, $A^\alpha(\bot,\top)$ , produces the KK fixpoint $\textit{KK}(A)=\textit{lfp}(A)$ . The KK fixpoint is a pair $(x,y) \in L^2$ . If exact, $x (= y)$ is the only fixpoint of O. Otherwise, it approximates all fixpoints of O, including “self-supported” ones that are often not minimal in L. Stable and WF fixpoint definitions contain mechanisms to reduce self-support. Footnote 2 A stable fixpoint $x\in L$ is one such that $x=\textit{lfp}(\lambda z: A(z,x)_1)$ , where $A(z,x)_1$ is the first component of the pair A(z,x). The WF fixpoint $\textit{WF}(A)$ is the least precise pair (x,y) with $x=\textit{lfp}(\lambda z: A(z,y)_1)$ and $y=\textit{lfp}(\lambda z: A(x,z)_2)$ . It is the least precise fixpoint of the monotone operator $(x,y)\mapsto (\textit{lfp}(\lambda z:A(z,y)_1),\textit{lfp}(\lambda z:A(x,z)_2))$ . We have that $\textit{KK}(A)$ and $\textit{WF}(A)$ are consistent and for each stable fixpoint x, $\textit{KK}(A)\leq_p \textit{WF}(A)\leq_p x$ and x is a minimal fixpoint of O. Footnote 3
AFT induces relationships between fixpoints of different approximators of O. If approximator A is pointwise less precise than B, then $\textit{KK}(A)\leq_p \textit{KK}(B)$ , $\textit{WF}(A)\leq_p \textit{WF}(B)$ and any stable fixpoint of A is a stable fixpoint of B. Thus, with increasing precision of the approximator, KK and WF fixpoints increase in precision, and the set of stable fixpoints grows. There exists a most precise approximator $\textit{Ult}_O$ of O, with the most precise KK and WF fixpoint and the largest set of stable fixpoints. For consistent pairs (x,y), $\textit{Ult}_O(x,y)$ is the most precise pair approximating O([x,y]), that is, $(\textit{glb}(O([x,y]),\textit{lub}(O([x,y])))$ . It follows, perhaps surprisingly, that if an even moderately precise approximator A has a stable fixpoint x, then x is approximated by the most precise WF fixpoint $\textit{WF}(\textit{Ult}_O)$ associated with O (but keep in mind that unprecise approximators are unlikely to have stable fixpoints). This counter-intuitive fact tells us that in all cases, if stable fixpoints exist, they are in the proximity of the most precise WF fixpoint that can be associated to O. This may explain the good quality of stable semantics in capturing intuitions. But while KK and WF are constructive, stable semantics is not really. It only has a constructive test: testing if x is stable is by testing if x is the limit of the $\textit{lfp}$ construction of $\lambda z: A(z,x)_1$ .
We now sketch how to use AFT to define constructive semantics for programs based on some logic ${\mathcal{L}}$ . Footnote 4 We assume ${\mathcal{L}}$ is a first-order logic with a (Herbrand) model semantics defined using a 2-valued truth function $\mathcal{H}^2$ , or equivalently, a satisfaction relation $\models_2$ (where $I\models_2 \phi$ iff $\mathcal{H}_{I}^2(\phi)={\mathbf{t}}$ ). An ${\mathcal{L}}$ -program is then defined as follows:
Definition 1 ( ${\mathcal{L}}$ -program) An ${\mathcal{L}}$ -program is a set of rules r of the form $p \leftarrow \psi$ such that the body $\psi$ is a formula of ${\mathcal{L}}$ and the head p is a propositional atom.
Importantly, $\leftarrow$ is not a connective of ${\mathcal{L}}$ but AFT defines its meaning as a construction operator. LP and (non-disjunctive) ASP are instances of this where ${\mathcal{L}}$ is simply the logic of conjunctions of literals p or $\neg p$ under standard interpretation. Given an ${\mathcal{L}}$ -program P, the corresponding lattice $L,\leq$ is the set of P’s Herbrand interpretations ordered by the truth order ( ${\mathbf{f}}<{\mathbf{t}}$ ). The sets $L^c$ and $L^2$ correspond to 3- and 4-valued interpretations. Any 3- or 4-valued interpretation ${\mathcal{I}}$ can be split into a pair (I,J) by splitting truth values as in ${\mathbf{t}}\sim({\mathbf{t}},{\mathbf{t}}), {\mathbf{f}}\sim({\mathbf{f}},{\mathbf{f}}), {\mathbf{u}}\sim({\mathbf{f}},{\mathbf{t}})$ and $\mathbf{i}\sim({\mathbf{t}},{\mathbf{f}})$ . If ${\mathcal{I}}$ is 3-valued, then $I\leq J$ , and I is a lower bound, J an upper bound of ${\mathcal{I}}$ . The 3- and 4-valued structures are equipped with a truth order, which isomorphically corresponds to the product order $\leq$ of $L^2$ and $L^c$ , and with a precision order $\leq_p$ ( ${\mathbf{u}} <_p {\mathbf{t}} <_p \mathbf{i}, {\mathbf{u}}<_p{\mathbf{f}}<_p\mathbf{i}$ which corresponds to the precision order of $L^2$ and $L^c$ . Any truth or precision monotone operator $\Gamma$ on 2-, 3- or 4-valued structures corresponds to a truth or precision monotone operator on $L, L^c, L^2$ .
An ${\mathcal{L}}$ -program P is characterised by the immediate consequence operator ${T_{P}}:L\rightarrow L$ . This operator fulfills the role of O, the approximated operator. The immediate consequence operator ${T_{P}}:L\rightarrow L$ for a program P is such that ${T_{P}}(I)=J$ if for every ground atom p, $\mathcal{H}_{J}^2(p) = \textit{lub}_\leq(\{\mathcal{H}_{I}^2(\psi) | (p \leftarrow \psi) \in P\})$ . There are two ways to define AFT semantics using an approximator $A (= A_P)$ for ${T_{P}}$ .
One way is to use $A_P = \textit{Ult}({T_{P}})$ , the most precise approximator of ${T_{P}}$ in $L^c$ leading to ultimate versions of the family of AFT semantics as described by Denecker et al. (Reference Denecker, Marek and Truszczyński2004) and used by Denecker et al. (Reference Denecker, Pelov and Bruynooghe2001) for defining semantics of Aggregate LP. This is the most precise approach, but computationally costly. The other way is to extend ${\mathcal{L}}$ ’s truth assignment to 3- or 4-valued interpretations and by interpreting ${T_{P}}$ ’s definition in this broader context. For example, a 3-valued truth assignment $\mathcal{H}^3$ induces a 3-valued immediate consequence operator ${\Phi_{P}}$ where ${\Phi_{P}}({\mathcal{I}})={\mathcal{I}}'$ if for every atom p, $\mathcal{H}_{{\mathcal{I}}'}^3(p) = \textit{lub}_{\leq}\{\mathcal{H}_{{\mathcal{I}}}^3(\psi)|(p\leftarrow \psi) \in P\}$ . The operator ${\Phi_{P}}$ corresponds isomorphically to an $L^c$ approximator $A_P$ on pairs $I<J$ of 2-valued interpretations. But for $A_P$ to be an approximator of ${T_{P}}$ , the 3-valued truth assignment $\mathcal{H}^3$ should satisfy a condition introduced for 3-valued logic by Kleene (Reference Kleene1952):
Definition 2 (Regular truth assignment) A 3-valued truth assignment $\mathcal{H}^3$ of $\mathcal{L}$ is regular iff for all formulas $\psi$ , for all 3-valued structures ${\mathcal{I}}$ interpretating $\psi$ : (1) (extension of $\mathcal{H}^2)$ if ${\mathcal{I}}$ is 2-valued, then $\mathcal{H}_{{\mathcal{I}}}^2(\psi)=\mathcal{H}_{{\mathcal{I}}}^3(\psi)$ and (ii) (precision monotonocity) if ${\mathcal{I}}\leq_p {\mathcal{I}}'$ then $\mathcal{H}_{{\mathcal{I}}}^3(\psi) \leq_p \mathcal{H}_{{\mathcal{I}}'}^3(\psi)$ . Footnote 5
For example, Kleene’s strong 3-valued truth assignment $\mathcal{H}^{SK}$ of FO (introduced by Kleene Reference Kleene1952) and Belnap’s 4-valued extension (introduced by Belnap Reference Belnap1977) are regular. They induce multi-valued extensions ${\Phi_{P}}$ of ${T_{P}}$ first introduced by Fitting (Reference Fitting1985). Later, ${\Phi_{P}}$ was found to correspond to an approximator $A_P$ of ${T_{P}}$ whose KK, WF and stable fixpoints correspond to the semantics of the same name.
In LP and ASP, it is often taken for granted that LP’s nonmonotonicity is due to its non-classical negation not. But it is evident in AFT-based semantics, that the main non-classical connective is the rule operator: its semantics is defined via operators and constructive processes while negation in bodies is treated like the other FO connectives, using three-valued logic only for approximation of the standard classical connectives.
The AFT road is quite unlike other semantic techniques in ASP. In the next section, we reformulate the framework in more accessible terms for the ASP community.
3 Ternary satisfaction relations
Ternary satisfaction relations were used originally by Liu et al. (Reference Liu, Pontelli, Son and Truszczyński2010) in the context of Aggregate ASP semantics where they were called sub-satisfiability relations.
We still assume a base logic ${\mathcal{L}}$ equipped with satisfaction relation $\models_2$ . Below, we restrict ourselves to 3-valued interpretations corresponding to pairs (I,J) of 2-valued Herbrand interpretations $I\subseteq J$ (but extension to non-Herbrand interpretations is possible).
Definition 3 (Ternary satisfaction relations) A ternary satisfaction relation (TSR) $\models_3$ of $\mathcal{L}$ is a relation between pairs (I,J) of interpretations such that $I\subseteq J$ , and formulas $\psi$ of $\mathcal{L}$ such that $I\models_2 \psi$ iff $(I,I)\models_3 \psi$ . It is lower-monotone if $(I, J) \models_3 \psi$ implies $(I', J) \models_3 \psi$ when $I\subseteq I' \subseteq J$ . It is lower-regular (upper-regular) if $(I, J) \models_3 \psi$ implies $(I', J') \models_3 \psi$ when $(I, J) \leq_p (I', J')$ ( $(I', J') \leq_p (I, J)$ ).
For any pair of a lower-regular TSR $\models_3$ and an upper-regular TSR $\models_3^\uparrow$ , it always holds that $\models_3\subseteq\models_3^\uparrow$ since $(I,J)\models_3\psi$ implies $(I,I)\models_2\psi$ which implies $(I, J)\models_3^\uparrow \psi$ . Also, a three-valued truth-function $\mathcal{H}^3$ corresponds one to one to pairs $(\models_3,\models_3^\uparrow)$ of TSRs satisfying $\models_3 \subseteq \models_3^\uparrow$ . The correspondence is: (i) $(I, J) \models_3 \psi$ iff $\mathcal{H}_{(I, J)}^3(\psi) = {\mathbf{t}}$ and (ii) $(I, J) \models_3^\uparrow \psi$ iff $\mathcal{H}_{(I, J)}^3(\psi) \in \{{\mathbf{t}}, {\mathbf{u}}\}$ .
Proposition 1 $\mathcal{H}^3$ is regular iff $\models_3$ is a lower- and $\models_3^\uparrow$ an upper-regular TSR. Footnote 6
Taking ${\mathcal{L}}$ as FO and $\mathcal{H}^3$ as the strong Kleene truth assignment, it is a folk result that $(I,J)\models_3 \psi$ if $\psi$ evaluates to true when interpreting all positively occurring atoms in I and all negatively occurring ones in J. For $(I,J)\models_3^\uparrow \psi$ , exchange the roles of I and J.
For each ${\mathcal{L}}$ -program P, the lower- and upper-regular TSR induce two distinct operators on consistent pairs $I\subseteq J$ : $A_P^{\models_3}(I, J) = \{p| \exists (p \leftarrow \psi) \in P: (I, J)\models_3 \psi\}$ and $A_P^{\models_3^\uparrow}(I, J) = \{p| \exists (p \leftarrow \psi) \in P: (I, J)\models_3^\uparrow \psi \}$ . Now, we define $A_P(I, J) = (A_P^{\models_3}(I, J), A_P^{\models_3^\uparrow}(I, J))$ .
Proposition 2 If $\models_3$ and $\models_3^\uparrow$ are lower- and upper-regular TSRs, then $A_P$ is an $L^c$ approximator. Moreover it is isomorphic to the 3-valued ${\Phi_{P}}$ induced by the 3-valued truth assignment $\mathcal{H}^3$ combining $\models_3$ and $\models_3^\uparrow$ .
Now, supported, KK, WF and stable models of the ${\mathcal{L}}$ -programs P can be defined in terms of $\models_3$ and $\models_3^\uparrow$ . Interestingly, J is a stable model of P iff J is the least fixpoint of $A_P^{\models_3}(I,J) =\lambda I\in [\bot,J]: \{p| \exists (p \leftarrow \psi) \in P: (I, J)\models_3 \psi\}$ . No need of $\models_3^\uparrow$ ! Clearly there exists an asymmetry between truth and falsity in stable models. While information about the truth of formulas, encoded by $\models_3$ , is essential to determine the stable fixpoints of a program, information about their falsity, given by $\models_3^\uparrow$ , is disregarded.
4 Generalizing the concept of answer set
Here, we generalize stable models of ${\mathcal{L}}$ -programs to answer sets. Now, ${\mathcal{L}}$ , the logic of the rule bodies, has, besides a satisfaction relation $\models_2$ also a TSR $\models_3$ .
Definition 4 I is an answer set of P if (1) for every $(p\leftarrow \psi)\in P$ , if $I\models_2\psi$ then $I\models_2 p$ ; (2) there is no $J\subset I$ such that for every $(p\leftarrow \psi)\in P$ , if $(J,I)\models_3 \psi$ then $(J,I)\models_3 p$ .
Proposition 3 (semi-constructive answer sets) If $\models_3$ is lower-monotone, then for ${\mathcal{L}}$ -programs P, I is an answer set of P iff I is the limit of the increasing sequence $\langle I_\alpha\rangle_{\alpha\geq 0}$ where (1) $I_0=\emptyset$ , (2) $I_{\alpha+1}=A_P^{\models_3}(I_\alpha,I)$ if $I_\alpha\subseteq I$ , (3) $I_\lambda = \bigcup_{\alpha<\lambda}I_\alpha$ for limit ordinal $\lambda$ .
In general, the (transfinite) fixpoint sequence may leave $[\emptyset,J]$ , or end up with a fixpoint $I \subseteq J$ . In case it is J, it is an answer set. Let $\models_3$ be lower-regular. Combining it with any upper-regular TSR $\models_3^\uparrow$ induces a regular truth assignment $\mathcal{H}^3$ , as well as an entire family of AFT fixpoints and models of ${\mathcal{L}}$ -programs: KK models, WF models and AFT-stable models approximated by the WF models. The AFT-stable models in this framework depend only on $\models_3$ and they correspond exactly to answer sets of $\models_3$ , since a lower-regular TSR is also lower-monotone.
Proposition 4 Answer sets and AFT-stable models coincide for ${\mathcal{L}}$ programs with a lower regular TSR.
To finish this section, we analyze the link with the original definition of answer set given by Gelfond and Lifschitz (Reference Gelfond and Lifschitz1988). There, ${\mathcal{L}}$ is the logic of ground sets/conjunctions of literals with FO’s standard satisfaction relation $\models_2$ . Let P be an ${\mathcal{L}}$ -program.
Definition 5 (Gelfond-Lifschitz reduct and answer set) The Gelfond-Lifschitz reduct (defined by Gelfond and Lifschitz Reference Gelfond and Lifschitz1988) $P^J$ of P for an interpretation J is obtained from P by deleting
-
all rules with a negative literal $\neg l$ such that $l \in J$ .
-
all negative literals in the bodies of the remaining rules.
J is a GL-answer set of P if $J \models_2 P^J$ and there is no $I\subset J$ such that $I\models_2 P^J$ .
We now identify the lower-regular TSR $\models_{GL}$ such that answer sets of programs induced by $\models_{GL}$ , coincide with GL-answer sets. The TSR that is needed evaluates bodies $\psi$ in pairs (I,J) by interpreting atomic literals of $\psi$ in I and negative literals in J. But as explained in the previous section, that is how the lower-regular TSR $\models_{SK}$ of the strong Kleene truth assignment operates: $(I,J)\models_{SK} \psi$ iff atoms in $\psi$ hold in I and negative literals hold in J. Thus, $\models_{GL}$ is the restriction of $\models_{SK}$ to conjunctions of literals. With this in mind, the following proposition is straightforward.
Proposition 5 For $I\in[\bot, J]$ , $I\models_2 P^J$ iff for every rule $p\leftarrow\psi\in P$ , if $(I,J)\models_{GL} \psi$ then $(I,J)\models_{GL} p$ . J is a GL-answer set of P iff J is an AFT-stable model of P under strong Kleene truth assignment.
Thus, this type of answer set fits in the AFT-landscape of KK, WF and stable models.
5 Aggregates programs in the AFT framework
Aggregate Programs For the remainder of the text, we will consider ${\mathcal{L}}$ -programs where ${\mathcal{L}}$ is the logic of conjunctions of literals and positive aggregate atoms. Footnote 7
An aggregate atom $a^\textit{Aggr}$ is of the form: $\textit{Agg}(\{a_1: \textit{cond}_1,..., a_n: \textit{cond}_n\})*w$ with aggregate symbol $\textit{Agg}$ (e.g., SUM), comparison connective $*$ (e.g., $\leq, =, \neq,\dots$ ), numerical value w and multiset $\{a_1: \textit{cond}_1,..., a_n: \textit{cond}_n\}$ wher each $cond_i$ is a literal and each $a_i$ is a weight. The $\models_2$ relation for ${\mathcal{L}}$ is naturally extended with a rule for evaluating aggregate atoms, so also ${T_{P}}$ is defined. This enables the first approach to apply AFT on this type of programs, using the 3-valued ultimate approximator $\textit{Ult}_{{T_{P}}}$ yielding the ultimate KK, WF and stable semantics. Up to the syntax, this is the semantics of Denecker et al. (Reference Denecker, Pelov and Bruynooghe2001). The second approach to apply AFT is based on defining a regular 3-valued truth assignment for aggregate atoms, leading to a three valued operator ${\Phi^{Aggr}_{P}}$ . Up to the syntax, this was the approach followed by Pelov et al. (Reference Pelov, Denecker and Bruynooghe2007).
We now start the study of existing approaches for handling aggregates in ASP. Some of these use more extensive programs than the ones analysed here, however our analysis only considers the simpler ${\mathcal{L}}$ -programs. Proposition 3 shows that there is a constructive test for answer sets of non-disjunctive ${\mathcal{L}}$ -programs for semantics with lower-monotone TSR’s. A lower-regular TSR is lower-monotone by definition. Thus, this constructive test is applicable for all semantics that fit in the AFT-framework. However, not all semantics for ASP programs in the literature have lower-regular or even lower-monotone ternary satisfaction relations.
Before we start, we define the precision order on the TSRs analogous to the precision order on truth assignments defined by Pelov et al. (Reference Pelov, Denecker and Bruynooghe2007).
Definition 6 (Precision relation over lower ternary satisfaction relations) A TSR $\models_a$ is less precise than a TSR $\models_b$ , or $\models_a \leq_p \models_b$ , iff for every formula $\psi$ and every pair of two-valued interpretations (I, J): $(I, J) \models_a \psi$ implies $(I, J) \models_b \psi$ .
Proposition 6 Let $\models_a, \models_b$ be TSRs that coincide with $\models_{GL}$ on aggregate free bodies. If $\models_a \leq_p \models_b$ and J is an answer set associated with $\models_a$ (an a-answer set), then J is a b-answer set.
5.1 Approaches that fit in the AFT framework
Pelov et al. (Reference Pelov, Denecker and Bruynooghe2007) This paper introduces several regular truth assignments for aggregate atoms. This is equivalent with expanding the lower- and upper-regular TSR with a rule for aggregate atoms. For defining answer sets for the syntax of this paper, the lower one suffices. Footnote 8 The least precise approximation, $\mathcal{H}^{triv}$ assigns to an aggregate atom $a^{Aggr}$ the same value as I and J when I and J agree on the conditions in $a^{\textit{Aggr}}$ and ${\mathbf{u}}$ when they disagree. The corresponding lower TSR is:
Definition 7 ( $\models_{\textit{triv}}$ ) $\models_{\textit{triv}}$ extends $\models_{\textit{GL}}$ with: $(I, J) \models_{\textit{triv}} a^\textit{Aggr} $ iff $J\models_2 a^{Aggr}$ and ${cond_i}^I={cond_i}^J$ for every condition $cond_i$ in $a^\textit{Aggr}$ .
The most precise regular truth assignment $\mathcal{H}^{\textit{ult}}$ assigns ${\mathbf{t}}$ ( ${\mathbf{f}}$ ) to an aggregate expression in (I,J) if it is ${\mathbf{t}}$ ( ${\mathbf{f}}$ ) in every $Z\in [I,J]$ . Otherwise, it assigns ${\mathbf{u}}$ . The corresponding lower TSR is:
Definition 8 ( $\models_{ult}$ ) $\models_{\textit{ult}}$ extends $\models_{\textit{GL}}$ with: $(I, J) \models_{\textit{ult}} a^{\textit{Aggr}} $ iff for each Z such that $I \subseteq Z \subseteq J$ : $Z \models_2 a^{\textit{Aggr}}$ .
Pelov (Reference Pelov2004) shows that for stratified aggregate programs (where predicates in aggregate expressions are defined at a lower level), the trivial and the ultimate truth assignments lead to the same semantics.
While very precise, Pelov (Reference Pelov2004) shows that the complexity of computing KK, WF and stable models under $\models_{\textit{ult}}$ moves to the next level of the polynomial hierarchy. To avoid this, he offers a less precise alternative called the bounded truth assignment.
Phrased in terms of the present aggregate programs, it uses functions $\textit{LB}_{\textit{Agg}}, \textit{UB}_{\textit{Agg}}: \mathcal{P}(\tilde{D_1}) \rightarrow D_2$ that maps any three-valued multiset $\{\textit{ms}\}^{{\mathcal{I}}}$ to respectively the minimum and the maximum of $\{Agg(\{ms\})^{I'}|I'\in [I, J]\}$ ; that is, $\textit{LB}_{\textit{Agg}}$ represents the lower bound for the aggregate function $\textit{Agg}$ on the possible multisets and $\textit{UB}_{\textit{Agg}}$ the upperbound. The truth value for aggregate atoms with sum and product is based on these bounds. The corresponding lower TSR is:
Definition 9 ( $\models_{\textit{bnd}}$ ) $\models_{\textit{bnd}}$ agrees with $\models_{\textit{ult}}$ except for aggregate atoms of the form $\textit{Agg}(\{\textit{ms}\})*w$ with $\textit{Agg} \in \{\mathit{SUM}, \mathit{PROD}\}$ and $* \in \{=, \neq\}$
-
$(I, J) \models_{\textit{bnd}} \textit{Agg}(\{\textit{ms}\})=w $ iff $\textit{LB}_{\textit{Agg}}(\{\textit{ms}\}^{(I, J)}) = w = \textit{UB}_{\textit{Agg}}(\{\textit{ms}\}^{(I, J)})$ .
-
$(I, J) \models_{\textit{bnd}} \textit{Agg}(\{\textit{ms}\}) \neq w $ iff $\textit{LB}_{\textit{Agg}}(\{\textit{ms}\}^{(I, J)}) > w$ or $\textit{UB}_{\textit{Agg}}(\{\textit{ms}\}^{(I, J)}) < w$ .
Pelov (Reference Pelov2004) lists polynomial algorithms to compute both bounds for all aggregate atoms discussed in his thesis. The same holds for the common aggregate atoms in ASP. Consequently, the complexity of computing the different types of models remains on the same level as for the non-aggregate case. Yet, the bound semantics is precise enough to solve many useful aggregate programs with recursion over the aggregates.
Proposition 7 The TSRs $\models_{\textit{triv}}$ , $\models_{\textit{ult}}$ and $\models_{\textit{bnd}}$ are lower-regular. Since $\models_{triv}\leq_p \models_{bnd} \leq_p \models_{\textit{ult}}$ , an answer set of $\models_{triv}$ is one of $\models_{bnd}$ , and one of $\models_{bnd}$ is one of $\models_{\textit{ult}}$ .
Liu et al. (Reference Liu, Pontelli, Son and Truszczyński2010). They introduced a kind of TSR to define semantics for abstract constraints. While the restrictions imposed on these relations are different and do not necessarily fit into AFT, their main example, the sub-satisfiability relation as proposed by Son et al. (2007) does. For any abstract constraint $\alpha$ : $(I, J) \models_{\textit{LPST}} \alpha$ if and only if for each interpretation Z such that $I \subseteq Z \subseteq J$ , it holds that $Z \models_2 \alpha$ .
Definition 10 ( $\models_{\textit{LPST}}$ ) The TSR $\models_{\textit{LPST}}$ extends $\models_{\textit{GL}}$ with: If $a^{\textit{Aggr}}$ is an aggregate atom, then $(I, J) \models_{\textit{LPST}} a^{\textit{Aggr}} $ iff for each Z such that $I \subseteq Z \subseteq J$ : $Z \models_2 a^{\textit{Aggr}}$ .
This is the same satisfaction relation as $\models_{\textit{ult}}$ in Definition 8, hence it is lower-regular and defines the same answer sets.
Gelfond and Zhang (Reference Gelfond and Zhang2019). They construct a reduct for aggregate programs with respect to a three-valued interpretation. We only consider the case where the interpretation is two-valued. Gelfond and Zhang (Reference Gelfond and Zhang2019) allow two kinds of negation: negation by default, which corresponds to negation as presented in this paper, and explicit negation, which is not a part of the syntax of the programs considered here but can be simulated by the well-known translation of explicitly negated atoms of a predicate p into atoms of a newly introduced predicate $p^*$ . Since Gelfond and Zhang (Reference Gelfond and Zhang2019) do not allow default negation within an aggregate atom, here it suffices to consider programs with positive conditions inside an aggregate atom. The reduction process is split into two main parts. The first part constructs a reduct regarding the aggregate atoms. It consists of two steps:
-
1. Removing all rules with aggregate atoms that evaluate to ${\mathbf{f}}$ in the interpretation.
-
2. Replacing every remaining aggregate atom by the conjunction of the subset of its conditions that are ${\mathbf{t}}$ in the interpretation.
In other words, given a rule r in a program P: $p \leftarrow l_1 \wedge ... \wedge l_n$ , such that for an $l_i$ it holds that $l_i = Agg(\{a_1: \textit{cond}_1,..., a_n: \textit{cond}_n\}) * w$ , then the rule is deleted in the reduct $P^J$ if $l_i$ evaluates to ${\mathbf{f}}$ in J. Otherwise the rule is replaced by $p \leftarrow l_1 \wedge ... \wedge l_{i-1} \wedge l_{i+1} \wedge ... \wedge l_n \wedge (\bigwedge \{\textit{cond}_j \in \{ \textit{cond}_1,\ldots,\textit{cond}_n\}| J \models_2 \textit{cond}_j\}$ .
The second part transforms the preliminary reduct after the first phase to its Gelfond-Lifschitz reduct. In this way, it preserves the capability to deal with ordinary propositional atoms. From this reduct one can inductively define the TSR $\models_{\textit{GZ}}$ :
Definition 11 ( $\models_{\textit{GZ}}$ ) $\models_{\textit{GZ}}$ extends $\models_{\textit{GL}}$ with: Let $a^{\textit{Aggr}} = \textit{Agg}(\{a_1: \textit{cond}_1,..., a_n: \textit{cond}_n\}) * w$ . $(I, J) \models_{\textit{GZ}} a^{\textit{Aggr}} $ iff $J\models_2 a^{Aggr}$ and $(I, J) \models_{\textit{GZ}} \bigwedge \{\textit{cond}_j \in \{ \textit{cond}_1,\ldots,\textit{cond}_n\}| J \models_2 \textit{cond}_j\}$ .
Proposition 8 For aggregate programs containing only positive conditions in aggregate atoms, the TSR $\models_{\textit{GZ}}$ is identical to the TSR $\models_{\textit{triv}}$ and lower-regular for consistent pairs, that is, with (I, J) a consistent pair, $(I, J) \models_{\textit{GZ}} a^{\textit{Aggr}} $ iff $(I, J) \models_{\textit{triv}} a^{\textit{Aggr}}$ .
Precision Complexity Trade-off. One expects more effort gives more precise approximations. From Theorem 7.4 in the paper by Pelov et al. (Reference Pelov, Denecker and Bruynooghe2007), it follows that if the evaluation of an expression with respect to a lower-regular ternary satisfaction relation $\models_3$ is polynomially computable, then checking whether or not a model is an answer set is in P and deciding whether an answer set for a program exists, is in NP. This is the case for $\models_{triv}$ and $\models_{bnd}$ and for instances of the framework that coincide with $\models_{triv}$ ; for instances that coincide with $\models_{ult}$ the check problem is in NP and the exists-problem is in $\Sigma_2^P$ .
5.2 Other ternary satisfaction relations
In the AFT framework, semantics of ASP aggregate atoms are based on lower regular TSRs. As we show in this section, also other well-known semantics can be characterized using TSRs, however, they are not regular. This is no coincidence. The definition of answer set semantics in terms of TSRs strongly resembles another well-known semantic method of ASP, namely using the logic of here-and-there (HT) (for an overview of HT and its applications to ASP, see the work by Cabalar et al. Reference Cabalar, Pearce and Valverde2017). Due to very different points of view on answer sets, the two frameworks obtain different requirements for the TSRs. AFT treats answer sets as the result of constructive processes; the rule operator serves to produce them. A production is safe if the TSR is lower-regular. In contrast, HT takes a non-constructive take on answer sets. In HT, extensions build on the inherently non-regular TSRs derived from the three-valued logic G3 introduced by GÖdel (Reference GÖdel1932) where the rule operator is treated as HT-material implication.
Reference Marek and Remmel Marek and Remmel (2004) . They study Set Constraints (SC) Programming. It builds an NSS-reduct for an SC-program. Liu et al. (Reference Liu, Pontelli, Son and Truszczyński2010) prove that this semantics for set constraints is also obtained by the following satisfaction rule for a set constraint $\alpha$ : $(I, J)\models_{\textit{MR}} \alpha$ , if $J\models_2 \alpha$ and there exists an interpretation $Z \subseteq I$ such that $Z\models_2 \alpha$ . This leads to the TSR $\models_{\textit{MR}}$ :
Definition 12 ( $\models_{\textit{MR}}$ ) $\models_{\textit{MR}}$ extends $\models_{\textit{GL}}$ with: $(I, J) \models_{\textit{MR}} a^{\textit{Aggr}} $ iff $J\models_2 a^{\textit{Aggr}}$ and there exists an interpretation Z such that $Z \subseteq I$ and $Z \models_2 a^{\textit{Aggr}}$ .
Consider the aggregate atom $\mathit{SUM}(\{1:p, -1:q\}) \geq 0$ and two intervals, namely $(\emptyset, \{p, q, s\}) <_p (\emptyset, \{q\})$ . We have that $(\emptyset, \{p, q, s\}) \models_{\textit{MR}} \mathit{SUM}(\{1:p, -1:q\}) \geq 0$ while $(\{p\}, \{q\}) \not\models_{\textit{MR}} \mathit{SUM}(\{1:p, -1:q\}) \geq 0$ . Hence, $\models_\textit{MR}$ is not lower-regular.
Proposition 9 (i) $\models_{\textit{MR}}$ extends $\models_2$ , that is, $(I,I) \models_{\textit{MR}} \psi $ iff $I \models_2 \psi$ . (ii) $\models_{\textit{MR}}$ is lower-monotone, that is, if $I \subseteq I'$ and $(I, J) \models_{\textit{MR}} \psi$ , then $(I', J)\models_{\textit{MR}} \psi$ .
While non-regular, the ternary relation $\models_{\textit{MR}}$ still extends the satisfaction relation $\models_2$ and induces a monotone operator $\lambda I: A_{\models_{MR}}(I,J)$ . Thus, an answer set may still be defined as an interpretation J that is a least fixpoint of this operator. But, due to non-regularity, some unexpected answer sets are obtained.
Example 1 Take the program:
The bodies of these rules are equivalent to $p\lor\neg q$ , respectively s and q. Therefore, one expects its answer sets to be the same as those of the simplified program:
To check that $J=\{p, q, s\}$ is an answer set, we observe that $(\emptyset, J)\models_{\textit{MR}} \mathit{SUM}(\{1:p, -1:q\})\geq 0$ , hence the first iteration derives the head s. Two more iterations reconstruct the fixpoint $\{p, q, s\}$ which therefore is an answer set according to these semantics.
On the other hand, the Gelfond-Lifschitz reduct of the simplified program with respect to $J = \{p, q, s\}$ is:
Since $\emptyset$ is a model of the reduct, $\{p,q,s\}$ is not an answer set of the simplified program. The culprit for “too many” answer sets of the aggregate program is the non-regularity of $\models_{MR}$ which leads to the unsafe derivation of aggregate atoms: in the first derivation, $(\emptyset,\{p, q, s\})\models_{MR}\mathit{SUM}(\{1:p, -1:q\})\geq 0$ which derived s, but this derivation was unsafe since this aggregate atom is not satisfied in the more precise $(\emptyset,\{q\})$ . However, things change when looking only at convex aggregate atoms.
Definition 13 (Convex aggregate atom) An aggregate atom $a^{\textit{Aggr}}$ is convex iff for all interpretations I, Z, J, such that $I \subseteq Z \subseteq J$ , it holds that if $I\models_2 a^{\textit{Aggr}}$ and $J\models_2 a^{\textit{Aggr}}$ , then $Z\models_2 a^{\textit{Aggr}}$ .
Proposition 10 For convex aggregate atoms, $\models_{\textit{MR}}$ behaves lower-regular and equivalent with $\models_{\textit{ult}}$ .
Reference Faber, Pfeifer and Leone Faber et al. (2011) . They provide semantics for a broader class of ASP programs including negated aggregate atoms. This approach constructs a reduct $P^J$ for a program P with respect to an interpretation J by deleting all rules of which the body is not satisfied in J. In general, the immediate consequence operator $T_{P^J}$ is not monotone, so answer sets cannot be defined using the $\textit{lfp}$ construction of this operator. Nevertheless, the $\textit{FPL}$ -answer sets are the answer sets of the following TSR:
Definition 14 ( $\models_{\textit{FPL}}$ ) We define $\models_{\textit{FPL}}$ as follows: $(I, J)\models_{\textit{FPL}} \psi$ iff $I \models_2 \psi$ and $J \models_2 \psi$ .
Proposition 11 $\models_{\textit{FPL}}$ extends $\models_2$ , that is, $(I,I) \models_{\textit{FPL}} \psi $ iff $I \models_2 \psi$ . For conjunctions of aggregate free literals, $\models_{\textit{FPL}}$ coincides with $\models_{\textit{GL}}$ .
This very simple TSR is neither lower-regular nor lower-monotone, thus the constructive test is inapplicable.
As a consequence, discrepancies between some aggregate programs and their aggregate-free simplification are also present. Using again Example 1, $(\emptyset, \{p, q, s\})\models_{\textit{FPL}} \mathit{SUM}(\{1:p, -1:q\})\geq 0$ . But again, $(\emptyset, \{q\})\not\models_{\textit{FPL}} \mathit{SUM}(\{1:p, -1:q\}) \geq 0$ . Similarily to $\models_{\textit{MR}}$ , the $\textit{FPL}$ -semantics leads to $\{p, q, s\}$ as an answer set. It is obvious that if $(I, J)\models_{\textit{FPL}} \psi$ , then $(I, J)\models_{\textit{MR}} \psi$ , since $I \subseteq I$ . Accordingly $\models_{\textit{FPL}}$ is less precise then $\models_{\textit{MR}}$ . Analogously, if $(I, J)\models_{\textit{ult}} \psi$ , then $(I, J)\models_{\textit{FPL}} \psi$ since the interpretations I and J are both elements of [I, J].
Proposition 12 For convex aggregate atoms, the TSR $\models_{\textit{FPL}}$ behaves lower-regular and equivalent to $\models_{\textit{ult}}$ .
Reference Ferraris Ferraris (2011) . This semantics is closely related to the $\textit{FPL}$ -semantics. Actually, they coincide for negation-free programs. Ferraris (Reference Ferraris2011) also cover more extensive instances of answer set programs, such as arbitrary propositional theories. It obtains the reduct $P^J$ for a program P and interpretation J by replacing all maximal subformulas in P that are unsatisfied in J by $\bot$ . This corresponds to the TSR $\models_{\textit{F}}$ :
Definition 15 ( $\models_{\textit{F}}$ ) $\models_{\textit{F}}$ extends $\models_{\textit{GL}}$ with: Let $a^{\textit{Aggr}} = \textit{Agg}(\{a_1:cond_1. \ldots, a_n:cond_n\})*w$ , $(I, J) \models_{\textit{F}} a^{\textit{Aggr}}$ iff $J \models_2 a^{\textit{Aggr}}$ and $I \models_2 \textit{Agg}(\{a_i:cond_i \in \{a_1:cond_1, \ldots, a_n:cond_n\}|J \models_2 cond_i\})*w$ .
Since $\models_{\textit{F}}$ and $\models_{\textit{FPL}}$ coincide for negation-free programs, an analogous discussion of Example 1 leads to the conclusion that the satisfaction relation $\models_{\textit{F}}$ is not lower-regular due to the lack of monotonicity.
Proposition 13 $\models_{\textit{F}}$ extends $\models_2$ , that is, $(I,I) \models_{\textit{F}} \psi $ iff $I \models_2 \psi$ .
Proposition 14 For convex aggregate atoms, $\models_{\textit{F}}$ behaves lower-monotone, that is, if $I \subseteq I'$ and $(I, J) \models_{\textit{F}} \psi$ , then $(I', J)\models_{\textit{F}} \psi$ .
Proposition 15 For anti-monotone aggregate atoms, $\models_{\textit{F}}$ behaves lower-regular and equivalent with $\models_{\textit{ult}}$ .
Table 1 gives an overview. Row 1 indicates for which aggregate atoms the ternary satisfaction relation behaves lower-regular. Row 2 shows which semantics from Pelov et al. (Reference Pelov, Denecker and Bruynooghe2007) coincides with this semantics for the subclass of programs such that the semantics behaves lower-regular. Row 3 indicates for which aggregate atoms the semantics is monotone in the first component such that the answer sets can be constructed as the $\textit{lfp}$ of ${\Phi_{P}}$ . Row 4 gives the set $S_a$ of semantics discussed in this paper that are strictly more precise than the considered instance a. Consequently every a-answer will always be an answer set for every semantics in $S_a$ . This does not generally hold the other way around. Interestingly, all non-regular semantics coincide with the ult-semantics of Pelov (Reference Pelov2004) for the subclass of programs in which they behave lower-regular. Footnote 10 For programs outside this subclass, the non-regular semantics differ and may derive more answer sets than the ult-semantics.
6 Conclusions and future work
Approximation Fixpoint Theory describes various types of constructions from nonmonotonic operators and was designed to formalize the view of Logic Programming as constructive definitions, a view at least implicit in stratified logic programs, in the KK and WF semantics, but also in the logic FO(ID) introduced by Denecker and Ternovska (Reference Denecker and Ternovska2008). We studied aggregate programs from this view point, showing how regular (3- or 4-valued) extensions of the strong Kleene truth assignment induce extensions of KK, WF and stable semantics. We showed that regular truth assignments correspond one-to-one to pairs of a lower-regular and an upper-regular ternary satisfaction relation $(I,J)\models_3 \psi$ , where the lower-regular one suffices for defining stable models. To study the relation with ASP, we then made a generalized study of TSRs as a tool to define answer sets. We analysed different properties of TSRs, and many semantics of aggregate programs in the literature to determine the corresponding lower ternary satisfaction relation and the properties that influence them, such as convexity, (anti-)monotonicity, and the sign of conditions in aggregates. We obtained many results linking many ASP semantics in various degrees to the AFT-framework.
In the ASP community, other views of LP and ASP exist than that as a logic of constructive definitions. They are developed in various frameworks such as the framework of HT (for more details see the paper by Cabalar et al. Reference Cabalar, Pearce and Valverde2017) or of Ferraris and Lifschitz (for more details see the paper by Ferraris et al. Reference Ferraris, Lee and Lifschitz2007). These frameworks often extend the original logic programming formalism in various directions, for example, with disjunction in the head, other negations. They may entirely redefine full FO and the meaning of its connectives. The base idea is that a program corresponds to a theory in some logic (e.g., HT or FO) from which answer set are derived using some equilibrium characterisation. Although these semantics are not constructive in the sense of AFT, there are surely many interesting mathematical relationships to AFT. For instance, it is striking that the logic of HT is also defined in terms of a ternary satisfaction relation. It is a goal for future work to investigate this.
Supplementary material
To view supplementary material for this article, please visit http://10.1017/S1471068422000126.