Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3428757.3429120acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
research-article

FedQPL: A Language for Logical Query Plans over Heterogeneous Federations of RDF Data Sources

Published: 27 January 2021 Publication History
  • Get Citation Alerts
  • Abstract

    Federations of RDF data sources provide great potential when queried for answers and insights that cannot be obtained from one data source alone. A challenge for planning the execution of queries over such a federation is that the federation may be heterogeneous in terms of the types of data access interfaces provided by the federation members. This challenge has not received much attention in the literature. This paper provides a solid formal foundation for future approaches that aim to address this challenge. Our main conceptual contribution is a formal language for representing query execution plans; additionally, we identify a fragment of this language that can be used to capture the result of selecting relevant data sources for different parts of a given query. As technical contributions, we show that this fragment is more expressive than what is supported by existing source selection approaches, which effectively highlights an inherent limitation of these approaches. Moreover, we show that the source selection problem is NP-hard and in σP2, and we provide an extensive set of rewriting rules that can be used as a basis for query optimization.

    References

    [1]
    Ibrahim Abdelaziz, Essam Mansour, Mourad Ouzzani, Ashraf Aboulnaga, and Panos Kalnis. 2017. Lusail: A System for Querying Linked Data at Scale. Proceedings of the VLDB Endowment (PVLDB) 11, 4 (2017).
    [2]
    Maribel Acosta, Olaf Hartig, and Juan F. Sequeda. 2019. Federated RDF Query Processing. In Encyclopedia of Big Data Technologies. Springer.
    [3]
    Maribel Acosta, Maria-Esther Vidal, Tomas Lampo, Julio Castillo, and Edna Ruckhaus. 2011. ANAPSID: An Adaptive Query Processing Engine for SPARQL Endpoints. In Proc. of the 11th International Semantic Web Conference (ISWC).
    [4]
    Amr Azzam, Javier D. Fernández, Maribel Acosta, Martin Beno, and Axel Polleres. 2020. SMART-KG: Hybrid Shipping for SPARQL Querying on the Web. In Proceedings of The Web Conference (WWW).
    [5]
    Philip A. Bernstein, Nathan Goodman, Eugene Wong, Christopher L. Reeve, and James B. Rothnie Jr. 1981. Query Processing in a System for Distributed Databases (SDD-1). ACM Trans. Database Syst. 6, 4 (1981).
    [6]
    Carlos Buil-Aranda, Aidan Hogan, Jürgen Umbrich, and Pierre-Yves Vandenbussche. 2013. SPARQL Web-Querying Infrastructure: Ready for Action?. In Proceedings of the 12th International Semantic Web Conference (ISWC).
    [7]
    Angelos Charalambidis, Antonis Troumpoukis, and Stasinos Konstantopoulos. 2015. SemaGrow: Optimizing Federated SPARQL Queries. In Proc. of the 11th International Conference on Semantic Systems (SEMANTICS).
    [8]
    Sijin Cheng and Olaf Hartig. 2020. FedQPL: A Language for Logical Query Plans over Heterogeneous Federations of RDF Data Sources (Extended Version). Online at http://olafhartig.de/FedQPL.ext.pdf.
    [9]
    Lee Feigenbaum, Gregory Todd Williams, Kendall Grant Clark, and Elias Torres. 2013. SPARQL 1.1 Protocol. W3C Recommendation.
    [10]
    Olaf Görlitz and Steffen Staab. 2011. Federated Data Management and Query Optimization for Linked Open Data. In New Directions in Web Data Management 1.
    [11]
    Olaf Görlitz and Steffen Staab. 2011. SPLENDID: SPARQL Endpoint Federation Exploiting VOID Descriptions. In Proc. of the 2nd International Workshop on Consuming Linked Data (COLD).
    [12]
    Laura M. Haas, Donald Kossmann, Edward L. Wimmers, and Jun Yang. 1997. Optimizing Queries Across Diverse Data Sources. In Proc. of the 23rd International Conference on Very Large Data Bases (VLDB).
    [13]
    Steve Harris, Andy Seaborne, and Eric Prud'hommeaux. 2013. SPARQL 1.1 Query Language. W3C Recommendation, Online at http://www.w3.org/TR/sparql11-query/.
    [14]
    Olaf Hartig, Christian Bizer, and Johann Christoph Freytag. 2009. Executing SPARQL Queries over the Web of Linked Data. In Proc. of the 8th International Semantic Web Conf. (ISWC).
    [15]
    Olaf Hartig and Carlos Buil-Aranda. 2016. Bindings-Restricted Triple Pattern Fragments. In 15th Ontologies, Databases, and Applications of Semantics (ODBASE).
    [16]
    Olaf Hartig, Ian Letter, and Jorge Pérez.2017. A Formal Framework for Comparing Linked Data Fragments. In Proc. of the 16th Int. Semantic Web Conf. (ISWC).
    [17]
    Richard M. Karp. 1972. Reducibility Among Combinatorial Problems. In Proceedings of a Symposium on the Complexity of Computer Computations.
    [18]
    Günter Ladwig and Thanh Tran. 2010. Linked Data Query Processing Strategies. In Proceedings of the 9th International Semantic Web Conference (ISWC).
    [19]
    Thomas Minier, Hala Skaf-Molli, and Pascal Molli. 2019. SaGe: Web Preemption for Public SPARQL Query Services. In Proceedings of the Web Conference (WWW).
    [20]
    Damla Oguz, Belgin Ergenc, Shaoyi Yin, Oguz Dikenelli, and Abdelkader Hameurlain. 2015. Federated Query Processing on Linked Data: A Qualitative Survey and Open Challenges. Knowledge Eng. Review 30, 5 (2015).
    [21]
    Jorge Pérez, Marcelo Arenas, and Claudio Gutierrez. 2009. Semantics and Complexity of SPARQL. ACM Trans. Database Syst. 34, 3 (2009).
    [22]
    Bastian Quilitz and Ulf Leser. 2008. Querying Distributed RDF Data Sources with SPARQL. In Proceedings of the 5th European Semantic Web Conference (ESWC).
    [23]
    Muhammad Saleem, Alexander Potocki, Tommaso Soru, Olaf Hartig, and Axel-Cyrille Ngonga Ngomo. 2018. CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation. In Proc. of the 14th Int. Conf. on Semantic Systems.
    [24]
    Michael Schmidt, Olaf Görlitz, Peter Haase, Günter Ladwig, Andreas Schwarte, and Thanh Tran. 2011. FedBench: A Benchmark Suite for Federated Semantic Data Query Processing. In Proc. of the 10th Int. Semantic Web Conference (ISWC).
    [25]
    Michael Schmidt, Michael Meier, and Georg Lausen. 2010. Foundations of SPARQL Query Optimization. In Proc. of the 13th Int. Conference on Database Theory (ICDT).
    [26]
    Andreas Schwarte, Peter Haase, Katja Hose, Ralf Schenkel, and Michael Schmidt. 2011. FedX: Optimization Techniques for Federated Query Processing on Linked Data. In Proceedings of the 10th International Semantic Web Conference (ISWC).
    [27]
    Jürgen Umbrich, Katja Hose, Marcel Karnstedt, Andreas Harth, and Axel Polleres. 2011. Data Summaries for Processing Live Queries over Linked Data. World Wide Web 14, 5-6 (2011).
    [28]
    Ruben Verborgh, Miel Vander Sande, Olaf Hartig, Joachim Van Herwegen, Laurens De Vocht, Ben De Meester, Gerald Haesendonck, and Pieter Colpaert. 2016. Triple Pattern Fragments: A Low-Cost Knowledge Graph Interface for the Web. Journal of Web Semantics 37 (2016).
    [29]
    Maria-Esther Vidal, Simón Castillo, Maribel Acosta, Gabriela Montoya, and Guillermo Palma. 2016. On the Selection of SPARQL Endpoints to Efficiently Execute Federated SPARQL Queries. Trans. Large-Scale Data- and Knowledge-Centered Systems 25 (2016).
    [30]
    Maria-Esther Vidal, Edna Ruckhaus, Tomas Lampo, Amadís Martínez, Javier Sierra, and Axel Polleres. 2010. Efficiently Joining Group Patterns in SPARQL Queries. In Proceedings of the 7th Extended Semantic Web Conference (ESWC).

    Cited By

    View all
    • (2023)FedShop: A Benchmark for Testing the Scalability of SPARQL Federation EnginesThe Semantic Web – ISWC 202310.1007/978-3-031-47243-5_16(285-301)Online publication date: 27-Oct-2023
    • (2023)Link Traversal Query Processing Over Decentralized Environments with Structural AssumptionsThe Semantic Web – ISWC 202310.1007/978-3-031-47240-4_1(3-22)Online publication date: 27-Oct-2023
    • (2023)Considering Vocabulary Mappings in Query Plans for Federations of RDF Data SourcesCooperative Information Systems10.1007/978-3-031-46846-9_2(21-40)Online publication date: 25-Oct-2023
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    iiWAS '20: Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services
    November 2020
    492 pages
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    In-Cooperation

    • Johannes Kepler University, Linz, Austria

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 January 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Federation
    2. Linked Data Fragments
    3. RDF
    4. SPARQL

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    • Swedish Research Council

    Conference

    iiWAS '20

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)21
    • Downloads (Last 6 weeks)2

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)FedShop: A Benchmark for Testing the Scalability of SPARQL Federation EnginesThe Semantic Web – ISWC 202310.1007/978-3-031-47243-5_16(285-301)Online publication date: 27-Oct-2023
    • (2023)Link Traversal Query Processing Over Decentralized Environments with Structural AssumptionsThe Semantic Web – ISWC 202310.1007/978-3-031-47240-4_1(3-22)Online publication date: 27-Oct-2023
    • (2023)Considering Vocabulary Mappings in Query Plans for Federations of RDF Data SourcesCooperative Information Systems10.1007/978-3-031-46846-9_2(21-40)Online publication date: 25-Oct-2023
    • (2023)Distributed Query Processing and Reasoning Over Linked Big DataThe Recent Advances in Transdisciplinary Data Science10.1007/978-3-031-23387-6_11(158-170)Online publication date: 29-Jan-2023
    • (2022)A geospatial source selector for federated GeoSPARQL queryingOpen Research Europe10.12688/openreseurope.14605.22(48)Online publication date: 6-Oct-2022
    • (2022)Towards Query Processing over Heterogeneous Federations of RDF Data SourcesThe Semantic Web: ESWC 2022 Satellite Events10.1007/978-3-031-11609-4_11(57-62)Online publication date: 20-Jul-2022

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media