Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2902251.2902298acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
research-article

Designing a Query Language for RDF: Marrying Open and Closed Worlds

Published: 15 June 2016 Publication History

Abstract

When querying an RDF graph, a prominent feature is the possibility of extending the answer to a query with optional information. However, the definition of this feature in SPARQL --the standard RDF query language-- has raised some important issues. Most notably, the use of this feature increases the complexity of the evaluation problem, and its closed-world semantics is in conflict with the underlying open-world semantics of RDF. Many approaches for fixing such problems have been proposed, being the most prominent the introduction of the semantic notion of weakly-monotone SPARQL query. Weakly-monotone SPARQL queries have shaped the class of queries that conform to the open-world semantics of RDF. Unfortunately, finding an effective way of restricting SPARQL to the fragment of weakly-monotone queries has proven to be an elusive problem. In practice, the most widely adopted fragment for writing SPARQL queries is based on the syntactic notion of well designedness. This notion has proven to be a good approach for writing SPARQL queries, but its expressive power has yet to be fully understood. The starting point of this paper is to understand the relation between well-designed queries and the semantic notion of weak monotonicity. It is known that every well-designed SPARQL query is weakly monotone; as our first contribution we prove that the converse does not hold, even if an extension of this notion based on the use of disjunction is considered. Given this negative result, we embark on the task of defining syntactic fragments that are weakly-monotone, and have higher expressive power than the fragment of well-designed queries. To this end, we move to a more general scenario where infinite RDF graphs are also allowed, so that interpolation techniques studied for first-order logic can be applied. With the use of these techniques, we are able to define a new operator for SPARQL that gives rise to a query language with the desired properties (over finite and infinite RDF graphs). It should be noticed that every query in this fragment is weakly monotone if we restrict to the case of finite RDF graphs. Moreover, we use this result to provide a simple characterization of the class of monotone CONSTRUCT queries, that is, the class of SPARQL queries that produce RDF graphs as output. Finally, we pinpoint the complexity of the evaluation problem for the query languages identified in the paper.

References

[1]
S. Ahmetaj, W. Fischl, R. Pichler, M. Simkus, and S. Skritek. Towards reconciling SPARQL and certain answers. In Proceedings of the 24th International Conference on World Wide Web, WWW 2015, pages 23--33, 2015.
[2]
M. Ajtai and Y. Gurevich. Monotone versus positive. J. ACM, 34(4):1004--1015, Oct. 1987.
[3]
R. Angles and C. Gutierrez. The expressive power of SPARQL. In ISWC, pages 114--129, 2008.
[4]
R. Angles and C. Gutierrez. The expressive power of SPARQL. Springer, 2008.
[5]
M. Arenas, S. Conca, and J. Pérez. Counting beyond a yottabyte, or how SPARQL 1.1 property paths will prevent adoption of the standard. In Proceedings of the 21st international conference on World Wide Web, pages 629--638. ACM, 2012.
[6]
M. Arenas and J. Pérez. Querying semantic web data with sparql. In Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pages 305--316. ACM, 2011.
[7]
P. Barceló, R. Pichler, and S. Skritek. Efficient evaluation and approximation of well-designed pattern trees. In Proceedings of the 34th ACM Symposium on Principles of Database Systems, PODS 2015, pages 131--144, 2015.
[8]
M. Benedikt, B. ten Cate, and E. Tsamoura. Generating low-cost plans from proofs. In Proceedings of the 33rd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS'14, pages 200--211, 2014.
[9]
T. Berners-Lee, J. Hendler, O. Lassila, et al. The semantic web. Scientific American, 284(5):28--37, 2001.
[10]
C. Buil-Aranda, M. Arenas, Ó. Corcho, and A. Polleres. Federating queries in SPARQL 1.1: Syntax, semantics and evaluation. J. Web Sem., 18(1):1--17, 2013.
[11]
S. R. Buss and L. Hay. On truth-table reducibility to sat. Information and Computation, 91(1):86 -- 102, 1991.
[12]
M. W. Chekol, J. Euzenat, P. Genevès, and N. Layaıda. SPARQL query containment under SHI axioms. In AAAI, 2012.
[13]
M. W. Chekol, J. Euzenat, P. Genevès, and N. Layaıda. SPARQL query containment under RDFS entailment regime. In IJCAR, pages 134--148, 2012.
[14]
M. Durst and M. Suignard. Rfc 3987, internationalized resource identifiers (iris), 2005.
[15]
T. Furche, B. Linse, F. Bry, D. Plexousakis, and G. Gottlob. RDF querying: Language constructs and evaluation methods compared. In Reasoning Web, pages 1--52. Springer, 2006.
[16]
C. A. Galindo-Legaria. Outerjoins as disjunctions. SIGMOD Rec., 23(2):348--358, May 1994.
[17]
F. Geerts, G. Karvounarakis, V. Christophides, and I. Fundulaki. Algebraic structures for capturing the provenance of SPARQL queries. In ICDT, 2013.
[18]
H. Halpin and J. Cheney. Dynamic provenance for SPARQL updates. In ISWC, 2014.
[19]
L. A. Hemachandra. The strong exponential hierarchy collapses. Journal of Computer and System Sciences, 39(3):299 -- 322, 1989.
[20]
E. V. Kostylev, J. L. Reutter, and M. Ugarte. CONSTRUCT queries in SPARQL. In 18th International Conference on Database Theory, ICDT 2015, March 23--27, 2015, Brussels, Belgium, pages 212--229, 2015.
[21]
A. Letelier, J. Pérez, R. Pichler, and S. Skritek. Static analysis and optimization of semantic web queries. In Proceedings of the 31st Symposium on Principles of Database Systems, PODS '12, pages 89--100. ACM, 2012.
[22]
A. Letelier, J. Pérez, R. Pichler, and S. Skritek. Static analysis and optimization of semantic web queries. ACM TODS, 38(4):25, 2013.
[23]
K. Losemann and W. Martens. The complexity of evaluating path expressions in SPARQL. In Proceedings of the 31st symposium on Principles of Database Systems, pages 101--112. ACM, 2012.
[24]
R. C. Lyndon. An interpolation theorem in the predicate calculus. Pacific J. of Mathematics, 9(1):129--142, 1959.
[25]
F. Manola and E. Miller. RDF Primer. W3C Recommendation, 10 February 2004. Available at http://www.w3.org/TR/2004/REC-rdf-primer-20040210/.
[26]
M. Otto. An interpolation theorem. Bulletin of Symbolic Logic, 6(4):447--462, 2000.
[27]
J. Pérez, M. Arenas, and C. Gutierrez. Semantics and complexity of sparql. In ISWC, pages 30--43, 2006.
[28]
J. Pérez, M. Arenas, and C. Gutierrez. Semantics and complexity of SPARQL. ACM TODS, 34(3), 2009.
[29]
F. Picalausa and S. Vansummeren. What are real SPARQL queries like? In SWIM, 2011.
[30]
R. Pichler and S. Skritek. Containment and equivalence of well-designed SPARQL. In Proceedings of the 33rd ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pages 39--50. ACM, 2014.
[31]
A. Polleres and J. P. Wallner. On the relation between SPARQL1.1 and answer set programming. Journal of Applied Non-Classical Logics, 23(1--2):159--212, 2013.
[32]
E. Prud'hommeaux and A. Seaborne. SPARQL query language for RDF. W3C Recommendation, 2008. Available at http://www.w3.org/TR/rdf-sparql-query/.
[33]
V. G. Ramanathan. Light at the end of the tunnel. 2013.
[34]
M. Schmidt, M. Meier, and G. Lausen. Foundations of sparql query optimization. In Proceedings of the 13th International Conference on Database Theory, ICDT '10, pages 4--33. ACM, 2010.
[35]
J. Sequeda, M. Arenas, and D. P. Miranker. On directly mapping relational databases to RDF and OWL. In Proceedings of the 21st World Wide Web Conference 2012, WWW 2012, pages 649--658, 2012.
[36]
L. J. Stockmeyer. The polynomial-time hierarchy. Theor. Comput. Sci., 3(1):1--22, 1976.
[37]
D. Toman and G. E. Weddell. Fundamentals of Physical Design and Query Compilation. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2011.
[38]
B. Trakhtenbrot. The impossibility of an algorithm for the decidability problem on finite classes. Proceedings of the USSR Academy of Sciences (in Russian), 70(4):569--572, 1950.
[39]
M. Y. Vardi. The complexity of relational query languages. In Proceedings of the fourteenth annual ACM symposium on Theory of computing, pages 137--146. ACM, 1982.
[40]
W3C SPARQL Working Group. SPARQL 1.1 Query language. W3C Recommendation, 21 March 2013. Available at http://www.w3.org/TR/sparql11-query/.
[41]
G. Wechsung. On the boolean closure of NP. In Fundamentals of Computation Theory, pages 485--493. Springer, 1985.

Cited By

View all
  • (2018)Efficient Evaluation and Static Analysis for Well-Designed Pattern Trees with ProjectionACM Transactions on Database Systems10.1145/323398343:2(1-44)Online publication date: 21-Aug-2018
  • (2018)Certain Answers Meet Zero-One LawsProceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3196959.3196983(195-207)Online publication date: 27-May-2018
  • (2018)Complexity and Expressive Power of Weakly Well-Designed SPARQLTheory of Computing Systems10.1007/s00224-017-9802-962:4(772-809)Online publication date: 1-May-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
PODS '16: Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems
June 2016
504 pages
ISBN:9781450341912
DOI:10.1145/2902251
  • General Chair:
  • Tova Milo,
  • Program Chair:
  • Wang-Chiew Tan
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 June 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. computational complexity
  2. construct queries
  3. incomplete information
  4. query languages
  5. semantic web

Qualifiers

  • Research-article

Funding Sources

  • Millennium Nucleus Center for Semantic Web Research
  • Conicyt - Programa de Formación de Capital Humano Avanzado
  • Innoviris the Brussels Institute for Research and Innovation.

Conference

SIGMOD/PODS'16
Sponsor:
SIGMOD/PODS'16: International Conference on Management of Data
June 26 - July 1, 2016
California, San Francisco, USA

Acceptance Rates

PODS '16 Paper Acceptance Rate 31 of 94 submissions, 33%;
Overall Acceptance Rate 642 of 2,707 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Efficient Evaluation and Static Analysis for Well-Designed Pattern Trees with ProjectionACM Transactions on Database Systems10.1145/323398343:2(1-44)Online publication date: 21-Aug-2018
  • (2018)Certain Answers Meet Zero-One LawsProceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3196959.3196983(195-207)Online publication date: 27-May-2018
  • (2018)Complexity and Expressive Power of Weakly Well-Designed SPARQLTheory of Computing Systems10.1007/s00224-017-9802-962:4(772-809)Online publication date: 1-May-2018
  • (2018)Certain Answers for SPARQL with Blank NodesThe Semantic Web – ISWC 201810.1007/978-3-030-00671-6_20(337-353)Online publication date: 18-Sep-2018
  • (2017)Designing a Query Language for RDFACM Transactions on Database Systems10.1145/312924742:4(1-46)Online publication date: 27-Oct-2017

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media