Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3442381.3449877acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Trav-SHACL: Efficiently Validating Networks of SHACL Constraints

Published: 03 June 2021 Publication History

Abstract

Knowledge graphs have emerged as expressive data structures for Web data. Knowledge graph potential and the demand for ecosystems to facilitate their creation, curation, and understanding, is testified in diverse domains, e.g., biomedicine. The Shapes Constraint Language (SHACL) is the W3C recommendation language for integrity constraints over RDF knowledge graphs. Enabling quality assements of knowledge graphs, SHACL is rapidly gaining attention in real-world scenarios. SHACL models integrity constraints as a network of shapes, where a shape contains the constraints to be fullfiled by the same entities. The validation of a SHACL shape schema can face the issue of tractability during validation. To facilitate full adoption, efficient computational methods are required. We present Trav-SHACL, a SHACL engine capable of planning the traversal and execution of a shape schema in a way that invalid entities are detected early and needless validations are minimized. Trav-SHACL reorders the shapes in a shape schema for efficient validation and rewrites target and constraint queries for fast detection of invalid entities. Trav-SHACL is empirically evaluated on 27 testbeds executed against knowledge graphs of up to 34M triples. Our experimental results suggest that Trav-SHACL exhibits high performance gradually and reduces validation time by a factor of up to 28.93 compared to the state of the art.

References

[1]
Maribel Acosta, Maria-Esther Vidal, and York Sure-Vetter. 2017. Diefficiency Metrics: Measuring the Continuous Efficiency of Query Processing Approaches. In International Semantic Web Conference ISWC. Springer, 3–19.
[2]
Medina Andreşel, Julien Corman, Magdalena Ortiz, Juan L. Reutter, Ognjen Savković, and Mantas Šimkus. 2020. Stable Model Semantics for Recursive SHACL. In ACM–The Web Conference. 1570–1580.
[3]
Sören Auer, Viktor Kovtun, Manuel Prinz, Anna Kasprzik, Markus Stocker, and Maria-Esther Vidal. 2018. Towards a Knowledge Graph for Science. In International Conference on Web Intelligence, Mining and Semantics, WIMS.
[4]
Sebastian R. Bader, Jaroslav Pullmann, Christian Mader, Sebastian Tramp, Christoph Quix, Andreas W. Müller, Haydar Akyürek, Matthias Böckmann, Benedikt T. Imbusch, Johannes Lipp, Sandra Geisler, and Christoph Lange. 2020. The International Data Spaces Information Model - An Ontology for Sovereign Exchange of Digital Content. In International Semantic Web Conference ISWC.
[5]
Iovka Boneva, Jose E. Labra Gayo, and Eric G. Prud’hommeaux. 2017. Semantics and Validation of Shapes Schemas for RDF. In International Semantic Web Conference ISWC. Springer, 104–120.
[6]
Stefano Ceri, Georg Gottlob, and Letizia Tanca. 1989. What you Always Wanted to Know About Datalog (And Never Dared to Ask). IEEE Trans. Knowl. Data Eng. 1, 1 (1989), 146–166.
[7]
Andrea Cimmino, Alba Fernández-Izquierdo, and Raúl García-Castro. 2020. Astrea: Automatic Generation of SHACL Shapes from Ontologies. In The Semantic Web – ESWC 2020. Springer, 497–513.
[8]
Julien Corman, Fernando Florenzano, Juan L. Reutter, and Ognjen Savković. 2019. SHACL2SPARQL: Validating a SPARQL Endpoint against Recursive SHACL Constraints. In International Semantic Web Conference ISWC Satellite Events.
[9]
Julien Corman, Fernando Florenzano, Juan L. Reutter, and Ognjen Savković. 2019. Validating SHACL Constraints over a SPARQL Endpoint. In International Semantic Web Conference ISWC. Springer, 145–163.
[10]
Julien Corman, Juan L. Reutter, and Ognjen Savković. 2018. Semantics and Validation of Recursive SHACL. In International Semantic Web Conference ISWC. Springer, 318–336.
[11]
Nicholson DN and Greene CS. 2020. Constructing knowledge graphs and their biomedical applications.Comput Struct Biotechnol J.(2020).
[12]
Yuanbo Guo, Zhengxiang Pan, and Jeff Heflin. 2005. LUBM: A Benchmark for OWL Knowledge Base Systems. Web Semantics 3, 2–3 (2005), 158–182.
[13]
Aidan Hogan, Eva Blomqvist, Michael Cochez, Claudia d’Amato, Gerard de Melo, Claudio Gutierrez, José Emilio Labra Gayo, Sabrina Kirrane, Sebastian Neumaier, Axel Polleres, Roberto Navigli, Axel-Cyrille Ngonga Ngomo, Sabbir M. Rashid, Anisa Rula, Lukas Schmelzeisen, Juan F. Sequeda, Steffen Staab, and Antoine Zimmermann. 2020. Knowledge Graphs. CoRR abs/2003.02320(2020).
[14]
Holger Knublauch, James A. Hendler, and Kingsley Idehen. 2011. SPIN - Overview and Motivation. W3C Submission. https://www.w3.org/Submission/2011/SUBM-spin-overview-20110222/
[15]
Holger Knublauch and Dimitris Kontokostas. 2017. Shapes Constraint Language (SHACL). W3C Recommendation. https://www.w3.org/TR/2017/REC-shacl-20170720/
[16]
Martin Leinberger, Philipp Seifer, Tjitze Rienstra, Ralf Lämmel, and Steffen Staab. 2020. Deciding SHACL Shape Containment through Description Logics Reasoning. In International Semantic Web Conference ISWC. Springer.
[17]
Boris Motik, Ian Horrocks, and Ulrike Sattler. 2007. Adding Integrity Constraints to OWL. In OWLED 2007 - OWL: Experiences and Directions, Vol. 258. CEUR Workshop Proceedings (CEUR-WS.org), Aachen.
[18]
Boris Motik, Ian Horrocks, and Ulrike Sattler. 2009. Bridging the Gap between OWL and Relational Databases. Web Semantics: Science, Services and Agents on the World Wide Web 7, 2(2009), 74–89.
[19]
Natalya Fridman Noy, Yuqing Gao, Anshu Jain, Anant Narayanan, Alan Patterson, and Jamie Taylor. 2019. Industry-scale knowledge graphs: lessons and challenges. Commun. ACM 62, 8 (2019), 36–43.
[20]
Paolo Pareti, George Konstantinidis, Fabio Mogavero, and Timothy J. Norman. 2020. SHACL Satisfiability and Containment. In International Semantic Web Conference ISWC. Springer.
[21]
Jorge Pérez, Marcelo Arenas, and Claudio Gutiérrez. 2009. Semantics and complexity of SPARQL. ACM Trans. Database Syst. 34, 3 (2009), 16:1–16:45.
[22]
Blerina Spahiu, Andrea Maurino, and Matteo Palmonari. 2018. Towards Improving the Quality of Knowledge Graphs with Data-driven Ontology Patterns and SHACL. In International Semantic Web Conference ISWC Satellite Events. 103–117.
[23]
Blerina Spahiu, Riccardo Porrini, Matteo Palmonari, Anisa Rula, and Andrea Maurino. 2016. ABSTAT: Ontology-Driven Linked Data Summaries with Pattern Minimalization. In The Semantic Web– ESWC 2016 Satellite Events. Springer.
[24]
Jiao Tao, Evren Sirin, Jie Bao, and Deborah L. McGuinness. 2010. Integrity Constraints in OWL. In AAAI Conference on Artificial Intelligence (AAAI).
[25]
Katherine Thornton, Harold Solbrig, Gregory S. Stupp, Jose Emilio Labra Gayo, Daniel Mietchen, Eric Prud’hommeaux, and Andra Waagmeester. 2019. Using Shape Expressions (ShEx) to Share RDF Data Models and to Guide Curation with Rigorous Validation. In The Semantic Web – ESWC 2019. Springer, 606–620.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '21: Proceedings of the Web Conference 2021
April 2021
4054 pages
ISBN:9781450383127
DOI:10.1145/3442381
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 June 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Knowledge Graph Constraints
  2. Quality Assessment
  3. SHACL Validation

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

WWW '21
Sponsor:
WWW '21: The Web Conference 2021
April 19 - 23, 2021
Ljubljana, Slovenia

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)97
  • Downloads (Last 6 weeks)5
Reflects downloads up to 31 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Enabling Efficient and Semantic-Aware Constraint Validation in Knowledge GraphsThe Semantic Web: ESWC 2024 Satellite Events10.1007/978-3-031-78955-7_11(104-114)Online publication date: 28-Jan-2025
  • (2024)InterpretME: A tool for interpretations of machine learning models over knowledge graphsSemantic Web10.3233/SW-233511(1-21)Online publication date: 5-Jan-2024
  • (2024)PALADINInformation Fusion10.1016/j.inffus.2024.102557112:COnline publication date: 1-Dec-2024
  • (2024)Compiling SHACL Into SQLThe Semantic Web – ISWC 202410.1007/978-3-031-77850-6_4(59-77)Online publication date: 27-Nov-2024
  • (2024)Modeling and Accessing Smart Materials with Integrity Constraints in the Shapes Constraint Language and Ontologies—The SmaDi WayAdvanced Engineering Materials10.1002/adem.202401017Online publication date: 30-Nov-2024
  • (2023)LPG-Based Knowledge Graphs: A Survey, a Proposal and Current TrendsInformation10.3390/info1403015414:3(154)Online publication date: 1-Mar-2023
  • (2023)Knowledge graphs for enhancing transparency in health data ecosystems1Semantic Web10.3233/SW-22329414:5(943-976)Online publication date: 8-May-2023
  • (2023)Knowledge Representation and The Semantic Web: An Historical Overview of Influences on Emerging ToolsRecent Advances in Computer Science and Communications10.2174/266625581566622052714561016:6Online publication date: Jul-2023
  • (2023)Extraction of Validating Shapes from Very Large Knowledge GraphsProceedings of the VLDB Endowment10.14778/3579075.357907816:5(1023-1032)Online publication date: 1-Jan-2023
  • (2023)A Method for Data Quality Validation Based on Shapes Constraint Language2023 2nd International Conference on Big Data, Information and Computer Network (BDICN)10.1109/BDICN58493.2023.00024(83-87)Online publication date: Jan-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media