DOI: 10.1145/3485447.3511947
research-article

Federated SPARQL Query Processing over Heterogeneous Linked Data Fragments

Published: 25 April 2022

    Abstract

    Linked Data Fragments (LDFs) are Web interfaces for querying knowledge graphs on the Web. These interfaces, such as SPARQL endpoints or Triple Pattern Fragments (TPF) servers, differ in the SPARQL expressions they can evaluate and in the metadata they provide. So far, federated query processing has focused on federations with a single type of LDF interface, typically SPARQL endpoints. In this work, we address the challenges of SPARQL query processing over federations with heterogeneous LDF interfaces. To this end, we propose an interface-aware framework and illustrate its applicability with a prototypical approach. Results on the FedBench benchmark show that this interface-aware approach, which exploits the capabilities of the heterogeneous interfaces in a federation, substantially improves performance.
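
    To make the notion of interfaces that "differ in the SPARQL expressions they can evaluate" concrete, here is a minimal sketch. It is not the framework proposed in the paper. It only assumes two well-known facts: a SPARQL endpoint can evaluate a whole basic graph pattern in one request, while a TPF server evaluates a single triple pattern per request. The names `Interface`, `Capabilities`, `CAPABILITIES`, and `split_into_requests` are hypothetical and introduced purely for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List, Optional, Tuple

# A triple pattern as (subject, predicate, object); variables start with "?".
TriplePattern = Tuple[str, str, str]


class Interface(Enum):
    """Hypothetical labels for two LDF interface types named in the abstract."""
    SPARQL_ENDPOINT = "sparql-endpoint"
    TPF = "tpf"


@dataclass(frozen=True)
class Capabilities:
    """Illustrative capability record (an assumption, not the paper's model)."""
    # Largest number of triple patterns one request may carry; None = no limit.
    max_patterns_per_request: Optional[int]


CAPABILITIES: Dict[Interface, Capabilities] = {
    # A SPARQL endpoint evaluates a full basic graph pattern server-side.
    Interface.SPARQL_ENDPOINT: Capabilities(max_patterns_per_request=None),
    # A TPF server answers one triple pattern per request; joins run client-side.
    Interface.TPF: Capabilities(max_patterns_per_request=1),
}


def split_into_requests(
    patterns: List[TriplePattern], interface: Interface
) -> List[List[TriplePattern]]:
    """Group triple patterns into the requests a given interface can evaluate."""
    if not patterns:
        return []
    limit = CAPABILITIES[interface].max_patterns_per_request
    size = len(patterns) if limit is None else limit
    return [patterns[i:i + size] for i in range(0, len(patterns), size)]


if __name__ == "__main__":
    bgp = [
        ("?drug", "rdf:type", "ex:Drug"),
        ("?drug", "ex:name", "?name"),
    ]
    for interface in Interface:
        print(interface.value, split_into_requests(bgp, interface))
```

    Under these assumptions, the same two-pattern query becomes one request against a SPARQL endpoint and two requests against a TPF server, which is the kind of difference an interface-aware planner would have to account for.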

    Published In

    WWW '22: Proceedings of the ACM Web Conference 2022
    April 2022
    3764 pages
    ISBN:9781450390965
    DOI:10.1145/3485447
    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. Federation
    2. Linked Data Fragments
    3. RDF
    4. SPARQL

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    WWW '22
    WWW '22: The ACM Web Conference 2022
    April 25 - 29, 2022
    Virtual Event, Lyon, France

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Cited By

    • (2024) smart-KG: Partition-Based Linked Data Fragments for querying knowledge graphs. Semantic Web, 1-45. DOI: 10.3233/SW-243571. Online publication date: 20-Mar-2024
    • (2024) FedUP: Querying Large-Scale Federations of SPARQL Endpoints. Proceedings of the ACM on Web Conference 2024, 2315-2324. DOI: 10.1145/3589334.3645704. Online publication date: 13-May-2024
    • (2023) Data Management and Ontology Development for Provenance-Aware Organizations in Linked Data Space. European Journal of Technic. DOI: 10.36222/ejt.1402149. Online publication date: 26-Dec-2023
    • (2023) Tunable Query Optimizer for Web APIs and User Preferences. Proceedings of the 12th Knowledge Capture Conference 2023, 92-100. DOI: 10.1145/3587259.3627542. Online publication date: 5-Dec-2023
    • (2023) Link Traversal Query Processing Over Decentralized Environments with Structural Assumptions. The Semantic Web – ISWC 2023, 3-22. DOI: 10.1007/978-3-031-47240-4_1. Online publication date: 6-Nov-2023
    • (2023) Knowledge Engineering in the Era of Artificial Intelligence. Advances in Databases and Information Systems, 3-15. DOI: 10.1007/978-3-031-42914-9_1. Online publication date: 4-Sep-2023
    • (2022) Systematic Construction of Knowledge Graphs for Research-Performing Organizations. Information 13:12 (562). DOI: 10.3390/info13120562. Online publication date: 30-Nov-2022
    • (2022) Bringing Federated Semantic Queries to the GIS-Based Scenario. ISPRS International Journal of Geo-Information 11:2 (86). DOI: 10.3390/ijgi11020086. Online publication date: 25-Jan-2022
    • (2022) A geospatial source selector for federated GeoSPARQL querying. Open Research Europe 2 (48). DOI: 10.12688/openreseurope.14605.2. Online publication date: 6-Oct-2022
    • (2022) Utility-aware Semantics for Alternative Service Expressions in Federated SPARQL Queries. 2022 IEEE International Conference on Web Services (ICWS), 208-218. DOI: 10.1109/ICWS55610.2022.00042. Online publication date: Jul-2022
