Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A Rule-Based Transducer for Querying Incompletely Aligned Datasets

Published: 27 September 2018 Publication History
  • Get Citation Alerts
  • Abstract

    A growing number of Linked Open Data sources (from diverse provenances and about different domains) that can be freely browsed and searched to find and extract useful information have been made available. However, access to them is difficult for different reasons. This study addresses access issues concerning heterogeneity. It is common for datasets to describe the same or overlapping domains while using different vocabularies. Our study presents a transducer that transforms a SPARQL query suitably expressed in terms of the vocabularies used in a source dataset into another SPARQL query suitably expressed for a target dataset involving different vocabularies. The transformation is based on existing alignments between terms in different datasets. Whenever the transducer is unable to produce a semantically equivalent query because of the scarcity of term alignments, the transducer produces a semantic approximation of the query to avoid returning the empty answer to the user. Transformation across datasets is achieved through the management of a wide range of transformation rules. The feasibility of our proposal has been validated with a prototype implementation that processes queries that appear in well-known benchmarks and SPARQL endpoint logs. Results of the experiments show that the system is quite effective in achieving adequate transformations.

    References

    [1]
    Silvana Castano, Alfio Ferrara, Stefano Montanelli, and Gaia Varese. 2011. Ontology and instance matching. In Knowledge-Driven Multimedia Information Extraction and Ontology Evolution. Springer, 167--195.
    [2]
    Gianluca Correndo, Manuel Salvadores, Ian Millard, Hugh Glaser, and Nigel Shadbolt. 2010. SPARQL query rewriting for implementing data integration over linked data. In Proceedings of the 2010 EDBT/ICDT Workshops (EDBT’10). ACM, New York, Article 4, 11 pages.
    [3]
    Richard Cyganiak, David Wood, and Markus Lanthaler. 2014. RDF 1.1 Concepts and Abstract Sysntax. World Wide Web Consortium. W3C Recommendation February 25, 2014.
    [4]
    Jérôme David, Jérôme Euzenat, François Scharffe, and Cássia Trojahn dos Santos. 2011. The alignment API 4.0. Semantic Web 2, 1 (2011), 3--10.
    [5]
    Gonzalo Diaz, Marcelo Arenas, and Michael Benedikt. 2016. Sparqlbye: Querying RDF data by example. Proceedings of the VLDB Endowment 9, 13 (2016), 1533--1536.
    [6]
    Dennis Diefenbach, Vanessa Lopez, Kamal Singh, and Pierre Maret. 2018. Core techniques of question answering systems over knowledge bases: A survey. Knowledge and Information Systems 55, 3 (2018), 529--569.
    [7]
    Peter Dolog, Heiner Stuckenschmidt, and Holger Wache. 2006. Robust query processing for personalized information access on the semantic web. In Proceedings of the 7th International Conference on Flexible Query Answering Systems. Springer-Verlag, 343--355.
    [8]
    Jérôme Euzenat and Pavel Shvaiko. 2013. Ontology Matching (2nd ed.). Springer.
    [9]
    Riccardo Frosini, Andrea Calì, Alexandra Poulovassilis, and Peter T. Wood. 2017. Flexible query processing for SPARQL. Semantic Web 8, 4 (2017), 533--563.
    [10]
    Takahisa Fujino and Naoki Fukuta. 2014. Utilizing weighted ontology mappings on federated SPARQL querying. In Semantic Technology, Wooju Kim, Ying Ding, and Hong-Gee Kim (Eds.). Springer International Publishing, 331--347.
    [11]
    Pascal Gillet, Cassia Trojahn, Ollivier Haemmerlé, and Camille Pradel. 2013. Complex correspondences for query patterns rewriting. In Proceedings of the 8th International Conference on Ontology Matching-Volume 1111. CEUR-WS. org, 49--60.
    [12]
    Hugh Glaser, Afraz Jaffri, and Ian Millard. 2009. Managing co-reference on the semantic web. In WWW2009 Workshop: Linked Data on the Web (LDOW’09). Retrieved from http://eprints.soton.ac.uk/267587/.
    [13]
    Olaf Görlitz, Matthias Thimm, and Steffen Staab. 2012. Splodge: Systematic generation of SPARQL benchmark queries for linked open data. In International Semantic Web Conference. Springer, 116--132.
    [14]
    Olaf Hartig. 2011. Zero-knowledge query planning for an iterator implementation of link traversal based query execution. In Proceedings of the 8th Extended Semantic Web Conference on the Semantic Web: Research and Applications-Part I. Springer-Verlag, 154--169.
    [15]
    Bernhard Haslhofer, Flávio Martins, and João Magalhães. 2013. Using SKOS vocabularies for improving web search. In Proceedings of the 22nd International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 1253--1258.
    [16]
    Konrad Höffner, Sebastian Walter, Edgard Marx, Ricardo Usbeck, Jens Lehmann, and Axel-Cyrille Ngonga Ngomo. 2017. Survey on challenges of question answering in the semantic web. Semantic Web 8, 6 (2017), 895--920.
    [17]
    Günter Ladwig and Thanh Tran. 2010. Linked data query processing strategies. In Proceedings of the 9th International Semantic Web Conference on the Semantic Web-Part I. Springer-Verlag, 453--469.
    [18]
    Jens Lehmann and Lorenz Bühmann. 2011. AutoSPARQL: Let users query your knowledge base. In Proceedings of the 8th Extended Semantic Web Conference on the Semantic Web: Research and Applications-Part I. 63--79.
    [19]
    Vanessa Lopez, Miriam Fernández, Enrico Motta, and Nico Stieler. 2012. PowerAqua: Supporting users in querying and exploring the semantic web. Semantic Web 3, 3 (2012), 249--265.
    [20]
    Konstantinos Makris, Nikos Bikakis, Nektarios Gioldasis, and Stavros Christodoulakis. 2012. SPARQL-RW: Transparent query access over mapped RDF data sources. In Proceedings of the 15th International Conference on Extending Database Technology. ACM, 610--613.
    [21]
    Mohamed Morsey, Jens Lehmann, Sören Auer, and Axel-Cyrille Ngonga Ngomo. 2011. DBpedia SPARQL benchmark--Performance assessment with real queries on real data. In International Semantic Web Conference. Springer, 454--469.
    [22]
    Bastian Quilitz and Ulf Leser. 2008. Querying distributed RDF data sources with SPARQL. In Proceedings of the 5th European Semantic Web Conference on the Semantic Web: Research and Applications. Springer-Verlag, 524--538.
    [23]
    Kuldeep B. R. Reddy and P. Sreenivasa Kumar. 2013. Efficient trust-based approximate SPARQL querying of the web of linked data. In Uncertainty Reasoning for the Semantic Web II. Springer, 315--330.
    [24]
    Philip Resnik. 1995. Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence - Volume 1 (IJCAI’95). Morgan Kaufmann Publishers, San Francisco, 448--453. Retrieved from http://dl.acm.org/citation.cfm?id=1625855.1625914.
    [25]
    Marta Sabou, Mathieu d’Aquin, and Enrico Motta. 2008. SCARLET: Semantic relation discovery by harvesting online ontologies. In Proceedings of the 5th European Semantic Web Conference on the Semantic Web: Research and Applications. Springer-Verlag, 854--858.
    [26]
    Kai Schlegel, Florian Stegmaier, Sebastian Bayerl, Michael Granitzer, and Harald Kosch. 2014. Balloon fusion: SPARQL rewriting based on unified co-reference information. In 2014 IEEE 30th International Conference on Data Engineering Workshops (ICDEW’14). IEEE, 254--259.
    [27]
    Michael Schmidt, Olaf Görlitz, Peter Haase, Günter Ladwig, Andreas Schwarte, and Thanh Tran. 2011. FedBench: A benchmark suite for federated semantic data query processing. In International Semantic Web Conference, 2011 (ISWC’11). Springer, Berlin, Heidelberg, 585--600.
    [28]
    Michael Schmidt, Thomas Hornung, Georg Lausen, and Christoph Pinkel. 2009. SP 2Bench: A SPARQL performance benchmark. In IEEE 25th International Conference on Data Engineering, 2009 (ICDE’09). IEEE, 222--233.
    [29]
    Andreas Schwarte, Peter Haase, Katja Hose, Ralf Schenkel, and Michael Schmidt. 2011. FedX: Optimization techniques for federated query processing on linked data. In International Semantic Web Conference, 2011 (ISWC'11). Springer, Berlin, Heidelberg, 601--616.
    [30]
    Oshani Seneviratne and Rachel Sealfon. 2010. QueryMed: An intuitive federated SPARQL query builder for biomedical RDF data. Retrieved from http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.389.4282.
    [31]
    Saeedeh Shekarpour, Edgard Marx, Axel-Cyrille Ngonga Ngomo, and SÃren Auer. 2015. SINA: Semantic interpretation of user queries for question answering on interlinked data. Web Semantics: Science, Services and Agents on the World Wide Web 30 (2015), 39--51.
    [32]
    W3C SPARQL Working Group. 2013. SPARQL 1.1 Overview. Retrieved from https://www.w3.org/TR/sparql11-overview/. W3C Recommendation March 21, 2013.
    [33]
    W3C SPARQL Working Group. 2013. SPARQL 1.1 Query Language. Retrieved from https://www.w3.org/TR/2013/REC-sparql11-query-20130321/. W3C Recommendation March 21, 2013.
    [34]
    Yuan Tian, Jürgen Umbrich, and Yong Yu. 2011. Enhancing source selection for live queries over linked data via query log mining. In Proceedings of the 2011 Joint International Conference on the Semantic Web. Springer-Verlag, 176--191.
    [35]
    Raquel Trillo, Jorge Gracia, Mauricio Espinoza, and Eduardo Mena. 2007. Discovering the semantics of user keywords. Journal of Universal Computer Science 13, 12 (2007), 1908--1935.
    [36]
    Zhibiao Wu and Martha Palmer. 1994. Verbs semantics and lexical selection. In Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, 133--138.
    [37]
    Ying Zhang, Pham Minh Duc, Oscar Corcho, and Jean-Paul Calbimonte. 2012. SRBench: A streaming RDF/SPARQL benchmark. In International Semantic Web Conference. Springer, 641--657.
    [38]
    Ziqi Zhang, Anna Lisa Gentile, Eva Blomqvist, Isabelle Augenstein, and Fabio Ciravegna. 2017. An unsupervised data-driven method to discover equivalent relations in large linked datasets. Semantic Web 8, 2 (2017), 197--223.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on the Web
    ACM Transactions on the Web  Volume 12, Issue 4
    November 2018
    215 pages
    ISSN:1559-1131
    EISSN:1559-114X
    DOI:10.1145/3281744
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 September 2018
    Accepted: 01 May 2018
    Revised: 01 May 2018
    Received: 01 July 2015
    Published in TWEB Volume 12, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. RDF
    2. SPARQL
    3. Semantic web
    4. linked open data
    5. query transformation

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • FEDER/TIN2016-78011-C4-2-R project
    • FEDER/TIN2013-46238-C4-1-R project
    • Iñaki Goenaga (FCT-IG) Technology Centers Foundation

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 104
      Total Downloads
    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media