Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Obi-Wan: ontology-based RDF integration of heterogeneous data

Published: 01 August 2020 Publication History

Abstract

We consider the problem of integrating heterogeneous data (relational, JSON, key-values, graphs etc.) and querying it efficiently. Traditional data integration systems fall into two classes: data warehousing, where all data source content is materialized in a single repository, and mediation, where data remains in their original stores and all data can be queried through a mediator.
We propose to demonstrate Obi-Wan, a novel mediator following the Ontology-Based Data access (OBDA) paradigm. Obi-Wan integrates data sources of many data models under an interface based on RDF graphs and ontologies (classes, properties, and relations between them). The novelty of Obi-Wan is to combine maximum integration power (GLAV mappings, see below) with the highest query answering power supported by an RDF mediator: RDF queries not only over the data but also over the integration ontologies. This makes it more flexible and powerful than comparable systems.

References

[1]
N. Abdallah, F. Goasdoué, and M. Rousset. DL-LITER in the light of propositional logic for decentralized data management. In IJCAI, 2009.
[2]
R. Alotaibi, D. Bursztyn, A. Deutsch, I. Manolescu, and S. Zampetakis. Towards Scalable Hybrid Stores: Constraint-Based Rewriting to the Rescue. In SIGMOD, June 2019.
[3]
J.-F. Baget, M. Leclère, M. Mugnier, S. Rocher, and C. Sipieter. Graal: A toolkit for query answering with existential rules. In RuleML, 2015.
[4]
R. Bonaque, T. D. Cao, et al. Mixed-instance querying: A lightweight integration architecture for data journalism. PVLDB, 9(13):1513--1516, 2016.
[5]
M. Buron, F. Goasdoué, I. Manolescu, and M. Mugnier. Reformulation-based query answering for RDF graphs with RDFS ontologies. In ESWC, 2019.
[6]
M. Buron, F. Goasdoué, I. Manolescu, and M. Mugnier. Ontology-based RDF integration of heterogeneous data. In EDBT, 2020.
[7]
D. Calvanese, B. Cogrel, S. Komla-Ebri, R. Kontchakov, D. Lanti, M. Rezk, M. Rodriguez-Muro, and G. Xiao. Ontop: Answering SPARQL queries over relational databases. Semantic Web, 8(3), 2017.
[8]
D. Calvanese, G. De Giacomo, D. Lembo, et al. The MASTRO system for ontology-based data access. Semantic Web, 2(1), 2011.
[9]
D. Calvanese, G. De Giacomo, D. Lembo, M. Lenzerini, R. Rosati, and M. Ruzzi. Using owl in data integration. In Semantic Web Information Management. 2009.
[10]
D. Calvanese, G. De Giacomo, M. Lenzerini, and M. Y. Vardi. Query processing under GLAV mappings for relational and graph databases. PVLDB, 6(2):61--72, 2012.
[11]
G. De Giacomo, D. Lembo, M. Lenzerini, A. Poggi, and R. Rosati. Using Ontologies for Semantic Data Integration. 2018.
[12]
A. Deutsch and V. Tannen. MARS: A system for publishing XML from mixed and redundant storage. In PVLDB, pages 201--212, 2003.
[13]
A. Doan, A. Halevy, and Z. G. Ives. Principles of Data Integration. Morgan Kaufmann, Waltham, MA, 2012.
[14]
J. Duggan, A. J. Elmore, M. Stonebraker, et al. The BigDAWG polystore system. SIGMOD, 44(2), 2015.
[15]
H. Garcia-Molina, Y. Papakonstantinou, D. Quass, et al. The TSIMMIS approach to mediation: Data models and languages. JIIS, 8(2), 1997.
[16]
F. Goasdoué, V. Lattès, and M. Rousset. The use of CARIN language and algorithms for information integration: The PICSEL system. IJCIS, 2000.
[17]
I. Manolescu, D. Florescu, and D. Kossmann. Answering XML queries on heterogeneous data sources. In VLDB, pages 241--250, 2001.
[18]
S. Nadal, K. Rabbani, O. Romero, and S. Tadesse. ODIN: A Dataspace Management System. 2019.
[19]
A. Poggi, D. Lembo, D. Calvanese, G. De Giacomo, M. Lenzerini, and R. Rosati. Linking data to ontologies. J. Data Semantics, 10, 2008.
[20]
M. Rodriguez-Muro, R. Kontchakov, and M. Zakharyaschev. Ontology-based data access: Ontop of databases. In ISWC, 2013.
[21]
J. F. Sequeda, M. Arenas, and D. P. Miranker. OBDA: query rewriting or materialization? in practice, both! In ISWC, 2014.
[22]
G. Smits, O. Pivert, H. Jaudoin, and F. Paulus. AGGREGO SEARCH: interactive keyword query construction. In EDBT, 2014.

Cited By

View all
  • (2023)Scalable Reasoning on Document Stores via Instance-Aware Query RewritingProceedings of the VLDB Endowment10.14778/3611479.361148116:11(2699-2713)Online publication date: 24-Aug-2023
  • (2023)Declarative RDF graph generation from heterogeneous (semi-)structured dataWeb Semantics: Science, Services and Agents on the World Wide Web10.1016/j.websem.2022.10075375:COnline publication date: 1-Jan-2023
  • (2023)Integration of Knowledge Bases and External Information Sources via Magic Properties and Query-Driven Entity LinkingInformation Integration and Web Intelligence10.1007/978-3-031-48316-5_30(309-324)Online publication date: 4-Dec-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 13, Issue 12
August 2020
1710 pages
ISSN:2150-8097
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 August 2020
Published in PVLDB Volume 13, Issue 12

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)51
  • Downloads (Last 6 weeks)7
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Scalable Reasoning on Document Stores via Instance-Aware Query RewritingProceedings of the VLDB Endowment10.14778/3611479.361148116:11(2699-2713)Online publication date: 24-Aug-2023
  • (2023)Declarative RDF graph generation from heterogeneous (semi-)structured dataWeb Semantics: Science, Services and Agents on the World Wide Web10.1016/j.websem.2022.10075375:COnline publication date: 1-Jan-2023
  • (2023)Integration of Knowledge Bases and External Information Sources via Magic Properties and Query-Driven Entity LinkingInformation Integration and Web Intelligence10.1007/978-3-031-48316-5_30(309-324)Online publication date: 4-Dec-2023
  • (2021)Querying multi-source heterogeneous fuzzy spatiotemporal dataJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-20235740:5(9843-9854)Online publication date: 1-Jan-2021
  • (2021)View selection over knowledge graphs in triple storesProceedings of the VLDB Endowment10.14778/3484224.348422714:13(3281-3294)Online publication date: 1-Sep-2021
  • (2020)Query Rewriting on Path Views Without Integrity ConstraintsFrom Data to Models and Back10.1007/978-3-030-70650-0_10(155-173)Online publication date: 20-Oct-2020

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media