Fixpoint Semantics and Optimization of Recursive Datalog Programs with Aggregates

Zaniolo, Carlo; Yang, Mohan; Interlandi, Matteo; Das, Ariyam; Shkapsky, Alexander; Condie, Tyson


arXiv:1707.05681 (cs)

[Submitted on 18 Jul 2017 (v1), last revised 21 Jul 2017 (this version, v2)]


Abstract: A highly desirable Datalog extension, investigated by many researchers over the last thirty years, is to allow the basic SQL aggregates min, max, count and sum in recursive rules. In this paper, we propose a simple, comprehensive solution that extends the declarative least-fixpoint semantics of Horn clauses, along with the optimization techniques used in the bottom-up implementation approach adopted by many Datalog systems. We start by identifying a large class of programs of great practical interest in which the use of min or max in recursive rules does not compromise the declarative fixpoint semantics of the programs using those rules. Then, we revisit the monotonic versions of the count and sum aggregates proposed in (Mazuran et al. 2013b) and named, respectively, mcount and msum. Since mcount, and also msum on positive numbers, are monotonic in the lattice of set-containment, they preserve the fixpoint semantics of Horn clauses. However, in many applications of practical interest, their use can lead to inefficiencies that can be eliminated by combining them with max, whereby mcount and msum become the standard count and sum. Therefore, the semantics and optimization techniques of Datalog are extended to recursive programs with min, max, count and sum, making possible the advanced applications of superior performance and scalability demonstrated by BigDatalog (Shkapsky et al. 2016) and Datalog-MC (Yang et al. 2017). This paper is under consideration for acceptance in TPLP.
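As a rough illustration of what using min inside a recursive rule means, the following Python sketch computes shortest path lengths by bottom-up fixpoint iteration, keeping only the minimum distance per node pair at every step instead of materializing all path lengths and minimizing afterwards. It is not taken from the paper or from BigDatalog: the function name `shortest_paths`, the edge-triple representation, and the delta-driven loop (in the spirit of semi-naive evaluation) are assumptions made for this example, and it presumes strictly positive edge weights so that the iteration terminates on cyclic graphs.

```python
from typing import Dict, List, Tuple

Edge = Tuple[str, str, int]  # (source, target, positive weight); hypothetical layout


def shortest_paths(edges: List[Edge]) -> Dict[Tuple[str, str], int]:
    """Bottom-up fixpoint with min applied inside the recursion (illustrative)."""
    best: Dict[Tuple[str, str], int] = {}

    # Exit rule: every edge is a candidate path; keep the minimum per pair.
    for x, y, d in edges:
        if d < best.get((x, y), float("inf")):
            best[(x, y)] = d

    # Only facts that improved in the previous round are extended in the next one.
    delta = dict(best)
    while delta:
        new_delta: Dict[Tuple[str, str], int] = {}
        for (x, z), d1 in delta.items():
            for z2, y, d2 in edges:
                if z2 != z:
                    continue
                d = d1 + d2
                # min inside the recursion: non-minimal candidates are dropped here.
                if d < best.get((x, y), float("inf")):
                    best[(x, y)] = d
                    new_delta[(x, y)] = d
        delta = new_delta  # an empty delta means the fixpoint has been reached
    return best


if __name__ == "__main__":
    graph = [("a", "b", 1), ("b", "c", 2), ("a", "c", 5), ("c", "a", 1)]
    for pair, dist in sorted(shortest_paths(graph).items()):
        print(pair, dist)
```

On a cyclic graph, the version without min in the recursion would generate ever longer paths and never terminate, whereas this loop stops once no pair's distance improves.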

Comments:	Paper presented at the 33rd International Conference on Logic Programming (ICLP 2017), Melbourne, Australia, August 28 to September 1, 2017. 16 pages, LaTeX (arXiv:1707.05681)
Subjects:	Databases (cs.DB)
Cite as:	arXiv:1707.05681 [cs.DB]
	(or arXiv:1707.05681v2 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.1707.05681

Submission history

From: Mohan Yang [view email]
[v1] Tue, 18 Jul 2017 15:25:35 UTC (95 KB)
[v2] Fri, 21 Jul 2017 06:32:51 UTC (95 KB)
