research-article

The Space-Efficient Core of Vadalog

Published: 25 June 2019

Abstract

Vadalog is a system for performing complex reasoning tasks such as those required in advanced knowledge graphs. The logical core of the underlying Vadalog language is the warded fragment of tuple-generating dependencies (TGDs). This formalism ensures tractable reasoning in data complexity, while a recent analysis focusing on a practical implementation led to the reasoning algorithm around which the Vadalog system is built. A fundamental question that has emerged in the context of Vadalog is the following: can we limit the recursion allowed by wardedness in order to obtain a formalism that provides a convenient syntax for expressing useful recursive statements, and at the same time achieves space-efficiency? After analyzing several real-life examples of warded sets of TGDs provided by our industrial partners, as well as recent benchmarks, we observed that recursion is often used in a restricted way: the body of a TGD contains at most one atom whose predicate is mutually recursive with a predicate in the head. We show that this type of recursion, known as piece-wise linear in the Datalog literature, is the answer to our main question. We further show that piece-wise linear recursion alone, without the wardedness condition, is not enough as it leads to the undecidability of reasoning. We finally study the relative expressiveness of the query languages based on (piece-wise linear) warded sets of TGDs.
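
The piece-wise linear condition described above can be checked syntactically. The following is a minimal sketch, not the authors' implementation: it assumes each TGD is reduced to the predicate names of its head and body atoms (arguments and existential quantifiers are dropped, since the condition depends only on predicates), and it treats two predicates as mutually recursive when each is reachable from the other in the predicate dependency graph. The names `Rule`, `transitive_closure`, and `is_piecewise_linear` are hypothetical and chosen for illustration only. The sketch does not check wardedness.

```python
# A rule is (head_predicates, body_predicates). Repeated predicates in the
# body stand for distinct body atoms. This encoding is an assumption made
# for illustration; it is not the Vadalog syntax.
Rule = tuple[list[str], list[str]]


def transitive_closure(edges: dict[str, set[str]]) -> dict[str, set[str]]:
    """Return reach[p]: all predicates reachable from p via one or more edges."""
    reach = {p: set(qs) for p, qs in edges.items()}
    changed = True
    while changed:
        changed = False
        for p in reach:
            new = set()
            for q in reach[p]:
                new |= reach[q]
            if not new <= reach[p]:
                reach[p] |= new
                changed = True
    return reach


def is_piecewise_linear(rules: list[Rule]) -> bool:
    """True if every rule body has at most one atom whose predicate is
    mutually recursive with a predicate in the head of that rule."""
    # Dependency graph: an edge from each body predicate to each head predicate.
    edges: dict[str, set[str]] = {}
    for heads, body in rules:
        for p in heads + body:
            edges.setdefault(p, set())
        for b in body:
            edges[b].update(heads)
    reach = transitive_closure(edges)

    def mutually_recursive(p: str, q: str) -> bool:
        return q in reach[p] and p in reach[q]

    for heads, body in rules:
        recursive_atoms = [
            b for b in body if any(mutually_recursive(b, h) for h in heads)
        ]
        if len(recursive_atoms) > 1:
            return False
    return True


# path(x, y) :- edge(x, y).    path(x, z) :- edge(x, y), path(y, z).
linear_tc: list[Rule] = [(["path"], ["edge"]), (["path"], ["edge", "path"])]

# path(x, y) :- edge(x, y).    path(x, z) :- path(x, y), path(y, z).
nonlinear_tc: list[Rule] = [(["path"], ["edge"]), (["path"], ["path", "path"])]
```

Under these assumptions, `is_piecewise_linear(linear_tc)` holds because the recursive rule has a single `path` atom in its body, while `is_piecewise_linear(nonlinear_tc)` fails because the recursive rule joins two `path` atoms.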

    Published In

    PODS '19: Proceedings of the 38th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems
    June 2019
    494 pages
    ISBN:9781450362276
    DOI:10.1145/3294052

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. complexity
    2. datalog
    3. expressive power
    4. query answering
    5. reasoning
    6. tuple-generating dependencies

    Qualifiers

    • Research-article

    Conference

    SIGMOD/PODS '19: International Conference on Management of Data
    June 30 - July 5, 2019
    Amsterdam, Netherlands

    Acceptance Rates

    PODS '19 Paper Acceptance Rate 29 of 87 submissions, 33%;
    Overall Acceptance Rate 642 of 2,707 submissions, 24%

    Cited By

    • (2023) KG-Roar: Interactive Datalog-Based Reasoning on Virtual Knowledge Graphs. Proceedings of the VLDB Endowment 16:12 (4014-4017). DOI: 10.14778/3611540.3611609. Online publication date: 1-Aug-2023
    • (2022) Adventures with Datalog: Walking the Thin Line Between Theory and Practice. AIxIA 2022 – Advances in Artificial Intelligence (489-500). DOI: 10.1007/978-3-031-27181-6_34. Online publication date: 28-Nov-2022
    • (2021) Datalog Unchained. Proceedings of the 40th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems (57-69). DOI: 10.1145/3452021.3458815. Online publication date: 20-Jun-2021
    • (2021) Stable Model Semantics for Guarded Existential Rules and Description Logics: Decidability and Complexity. Journal of the ACM 68:5 (1-87). DOI: 10.1145/3447508. Online publication date: 22-Oct-2021
    • (2021) Eliminating Harmful Joins in Warded Datalog+/−. Rules and Reasoning (267-275). DOI: 10.1007/978-3-030-91167-6_18. Online publication date: 8-Sep-2021
    • (2019) Vadalog: Recent Advances and Applications. Logics in Artificial Intelligence (21-37). DOI: 10.1007/978-3-030-19570-0_2. Online publication date: 6-May-2019
