Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3448016.3452765acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
short-paper

SOFOS: Demonstrating the Challenges of Materialized View Selection on Knowledge Graphs

Published: 18 June 2021 Publication History

Abstract

Analytical queries over RDF data are becoming prominent as a result of the proliferation of knowledge graphs. Yet, RDF databases are not optimized to perform such queries efficiently, leading to long processing times. A well known technique to improve the performance of analytical queries is to exploit materialized views.Although popular in relational databases, view materialization for RDF and SPARQL has not yet transitioned into practice, due to the non-trivial application to the RDF graph model. Motivated by a lack of understanding of the impact of view materialization alternatives for RDF data, we demonstrate Sofos, a system that implements and compares several cost models for view materialization. Sofos is, to the best of our knowledge, the first attempt to adapt cost models, initially studied in relational data, to the generic RDF setting, and to propose new ones, analyzing their pitfalls and merits. Sofos takes an RDF dataset and an analytical query for some facet in the data, and compares and evaluates alternative cost models, displaying statistics and insights about time, memory consumption, and query characteristics.

Supplementary Material

MP4 File (3448016.3452765.mp4)
Analytical queries over RDF data are becoming prominent as a result of the proliferation of knowledge graphs. Yet, RDF databases are not optimized to perform such queries efficiently, leading to long processing times. A well known technique to improve the performance of analytical queries is to exploit materialized views. Although popular in relational databases, view materialization for RDF and SPARQL has not yet transitioned into practice, due to the non-trivial application to the RDF graph model. Motivated by a lack of understanding of the impact of view materialization alternatives for RDF data, we demonstrate SOFOS, a system that implements and compares several cost models for view materialization. SOFOS is, to the best of our knowledge, the first attempt to adapt cost models, initially studied in relational data, to the generic RDF setting, and to propose new ones, analyzing their pitfalls and merits. SOFOS takes an RDF dataset and an analytical query for some facet in the data, and compares and evaluates alternative cost models, displaying statistics and insights about time, memory consumption, and query characteristics.

References

[1]
Gü nes Alucc, Olaf Hartig, M. Tamer Ö zsu, and Khuzaima Daudjee. 2014. Diversified Stress Testing of RDF Data Management Systems. In ISWC.
[2]
Angela Bonifati, Wim Martens, and Thomas Timm. 2019. An analytical study of large SPARQL query logs. The VLDB Journal (aug 2019).
[3]
Dario Colazzo, Francc ois Goasdoué, Ioana Manolescu, and Alexandra Roatics. 2014. RDF analytics: lenses over semantic graphs. In WWW. ACM, 467--478.
[4]
Lorena Etcheverry and Alejandro A Vaisman. 2012. QB4OLAP: a new vocabulary for OLAP cubes on the semantic web. In COLD, Vol. 905. 27--38.
[5]
Yuanbo Guo, Zhengxiang Pan, and Jeff Heflin. 2005. LUBM: A benchmark for OWL knowledge base systems. JWS, Vol. 3, 2--3 (2005), 158--182.
[6]
Nurefsan Gür, Jacob Nielsen, Katja Hose, and Torben Bach Pedersen. 2017. GeoSemOLAP: Geospatial OLAP on the Semantic Web Made Easy. In WWW Companion.
[7]
Venky Harinarayan, Anand Rajaraman, and Jeffrey D Ullman. 1996. Implementing data cubes efficiently. SIGMOD Rec., Vol. 25, 2 (1996), 205--216.
[8]
Dilshod Ibragimov, Katja Hose, Torben Bach Pedersen, and Esteban Zimányi. 2016. Optimizing aggregate SPARQL queries using materialized RDF views. In ISWC. 341--359.
[9]
Tapio Niemi, Jyrki Nummenmaa, and Peter Thanisch. 2001. Constructing OLAP cubes based on queries. In DOLAP. 9--15.
[10]
Natasha Noy, Yuqing Gao, Anshu Jain, Anant Narayanan, Alan Patterson, and Jamie Taylor. 2019. Industry-scale knowledge graphs: Lessons and challenges. ACM Queue, Vol. 17, 2 (2019), 48--75.
[11]
Jennifer Ortiz, Magdalena Balazinska, Johannes Gehrke, and S Sathiya Keerthi. 2019. An empirical analysis of deep learning for cardinality estimation. arXiv preprint arXiv:1905.06425 (2019).
[12]
W3C RDF Working Group. 2014. Resource description framework. W3C. http://www.w3.org/RDF/.
[13]
Andy Seaborne and Eric Prud'hommeaux. 2006. SPARQL query language for RDF. Technical Report. W3C.
[14]
Arnaud Soulet and Fabian M. Suchanek. 2019. Anytime Large-Scale Analytics of Linked Open Data. In ISWC. 576--592.
[15]
Georgia Troullinou, Haridimos Kondylakis, Evangelia Daskalaki, and Dimitris Plexousakis. 2015. RDF Digest: Ontology Exploration using Summaries. In ISWC.
[16]
Georgia Troullinou, Haridimos Kondylakis, Kostas Stefanidis, and Dimitris Plexousakis. 2018. Exploring RDFS KBs Using Summaries. In ISWC. 268--284.
[17]
Marcin Wylot, Manfred Hauswirth, Philippe Cudré-Mauroux, and Sherif Sakr. 2018. RDF Data Storage and Query Processing Schemes: A Survey. ACM Comput. Surv., Vol. 51, 4 (2018), 84:1--84:36.

Cited By

View all
  • (2025)Temporal graph processing in modern memory hierarchiesInformation Systems10.1016/j.is.2024.102462127(102462)Online publication date: Jan-2025
  • (2024)UniView: A Unified Autonomous Materialized View Management System for Various DatabasesProceedings of the VLDB Endowment10.14778/3685800.368587317:12(4353-4356)Online publication date: 8-Nov-2024
  • (2024)Optimising Queries for Pattern Detection Over Large Scale Temporally Evolving GraphsIEEE Access10.1109/ACCESS.2024.341735212(86790-86808)Online publication date: 2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data
June 2021
2969 pages
ISBN:9781450383431
DOI:10.1145/3448016
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 June 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. RDF
  2. knowledge graphs
  3. view selection

Qualifiers

  • Short-paper

Funding Sources

  • Marie Sk?odowska-Curie grantagreement
  • Hellenic Foundation for Research and Innovation (H.F.R.I.)

Conference

SIGMOD/PODS '21
Sponsor:

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)30
  • Downloads (Last 6 weeks)3
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Temporal graph processing in modern memory hierarchiesInformation Systems10.1016/j.is.2024.102462127(102462)Online publication date: Jan-2025
  • (2024)UniView: A Unified Autonomous Materialized View Management System for Various DatabasesProceedings of the VLDB Endowment10.14778/3685800.368587317:12(4353-4356)Online publication date: 8-Nov-2024
  • (2024)Optimising Queries for Pattern Detection Over Large Scale Temporally Evolving GraphsIEEE Access10.1109/ACCESS.2024.341735212(86790-86808)Online publication date: 2024
  • (2024)Property Graphs at Scale: A Roadmap and Vision for the Future (Short Paper)Advanced Information Systems Engineering Workshops10.1007/978-3-031-61003-5_16(180-185)Online publication date: 1-Jun-2024
  • (2023)iSummary: Workload-Based, Personalized Summaries for Knowledge GraphsThe Semantic Web10.1007/978-3-031-33455-9_12(192-208)Online publication date: 28-May-2023
  • (2022)Multi‐objective materialized view selection using flamingo search optimization algorithmSoftware: Practice and Experience10.1002/spe.317453:4(988-1012)Online publication date: 2-Dec-2022
  • (2021)A survey on semantic schema discoveryThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-021-00717-x31:4(675-710)Online publication date: 27-Nov-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media