Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Naïve Evaluation of Queries over Incomplete Databases

Published: 30 December 2014 Publication History

Abstract

The term naïve evaluation refers to evaluating queries over incomplete databases as if nulls were usual data values, that is, to using the standard database query evaluation engine. Since the semantics of query answering over incomplete databases is that of certain answers, we would like to know when naïve evaluation computes them, that is, when certain answers can be found without inventing new specialized algorithms. For relational databases it is well known that unions of conjunctive queries possess this desirable property, and results on preservation of formulae under homomorphisms tell us that, within relational calculus, this class cannot be extended under the open-world assumption.
Our goal here is twofold. First, we develop a general framework that allows us to determine, for a given semantics of incompleteness, classes of queries for which naïve evaluation computes certain answers. Second, we apply this approach to a variety of semantics, showing that for many classes of queries beyond unions of conjunctive queries, naïve evaluation makes perfect sense under assumptions different from open world. Our key observations are: (1) naïve evaluation is equivalent to monotonicity of queries with respect to a semantics-induced ordering, and (2) for most reasonable semantics of incompleteness, such monotonicity is captured by preservation under various types of homomorphisms. Using these results we find classes of queries for which naïve evaluation works, for example, positive first-order formulae for the closed-world semantics. Even more, we introduce a general relation-based framework for defining semantics of incompleteness, show how it can be used to capture many known semantics and to introduce new ones, and describe classes of first-order queries for which naïve evaluation works under such semantics.

Supplementary Material

a31-gheerbrant-apndx.pdf (gheerbrant.zip)
Supplemental movie, appendix, image and software files for, Naïve Evaluation of Queries over Incomplete Databases

References

[1]
S. Abiteboul, R. Hull, and V. Vianu, 1995. Foundations of Databases. Addison-Wesley.
[2]
S. Abiteboul, P. Kanellakis, and G. Grahne. 1991. On the representation and querying of sets of possible worlds. Theor. Comput. Sci. 78, 1, 158--187.
[3]
S. Abiteboul, L. Segoufin, and V. Vianu. 2006. Representing and querying xml with incomplete information. ACM Trans. Database Syst. 31, 1, 208--254.
[4]
M. Ajtai, and Y. Gurevich, 1987. Monotone versus positive. J. ACM 34, 4, 1004--1015.
[5]
M. Arenas, P. Barcelo, L. Libkin, and F. Murlak. 2014. Foundations of Data Exchange. Cambridge University Press.
[6]
M. Arenas, J. Perez, and J. Reutter. 2011. Data exchange beyond complete data. In Proceedings of the 30th ACM Symposium on Principles of Database Systems (PODS'11). 83--94.
[7]
A. Atserias, A. Dawar, and P. Kolaitis. 2006. On preservation under homomorphisms and unions of conjunctive queries. J. ACM 53, 2, 208--237.
[8]
P. Barcelo, L. Libkin, A. Poggi, and C. Sirangelo. 2010. XML with incomplete information. J. ACM 58, 1.
[9]
P. Barcelo, L. Libkin, and J. Reutter. 2014. Querying regular graph patterns. J. ACM 61, 1.
[10]
P. Buneman, A. Jung, and A. Ohori. 1991. Using powerdomains to generalize relational databases. Theor. Comput. Sci. 91, 1, 23--55.
[11]
A. Cali, D. Lembo, and R. Rosati. 2003. On the decidability and complexity of query answering over inconsistent and incomplete databases. In Proceedings of the ACM Symposium on Principles of Database Systems (PODS'03). 260--271.
[12]
A. K. Chandra and P. M. Merlin. 1977. Optimal implementation of conjunctive queries in relational data bases. In Proceedings of the 9th Annual ACM Symposium on Theory of Computing (STOC'77). 77--90.
[13]
C. Chang and H. Keisler. 2012. Model Theory. Dover.
[14]
K. Compton. 1983. Some useful preservation theorems. J. Symbol. Logic 48, 2, 427--440.
[15]
C. Date and H. Darwin. 1996. A Guide to the SQL Standard. Addison-Wesley.
[16]
R. Fagin, P. Kolaitis, and L. Popa. 2005. Data exchange: Getting to the core. ACM Trans. Database Syst. 30, 1, 174--210.
[17]
A. Gheerbrant, L. Libkin, and C. Sirangelo. 2013. When is naive evaluation possible? In Proceedings of the ACM Symposium on Principles of Database Systems (PODS'13). 201--212.
[18]
A. Gheerbrant, L. Libkin, and T. Tan. 2012. On the complexity of query answering over incomplete xml documents. In Proceedings of the International Conference on Database Theory (ICDT'12). 169--181.
[19]
C. Gunter. 1992. Semantics of Programming Languages: Structures and Techniques. MIT Press.
[20]
P. Hell and J. Nesetril. 1992. The core of a graph. Discr. Math. 109, 1--3, 127--126.
[21]
P. Hell and J. Nesetril. 2004. Graphs and Homomorphisms. Oxford University Press.
[22]
A. Hernich. 2011. Answering non-monotonic queries in relational data exchange. Logical Methods Comput. Sci. 7, 3.
[23]
T. Imielinski and W. Lipski. 1984. Incomplete information in relational databases. J. ACM 31, 4, 761--791.
[24]
H. J. Keisler. 1965a. Finite approximations of infinitely long formulas. In Proceedings of the Symposium on the Theory of Models. North Holland, 158--169.
[25]
H. J. Keisler. 1965b. Some applications of infinitely long formulas. J. Symbol. Logic 30, 3, 339--349.
[26]
M. Lenzerini. 2002. Data integration: A theoretical perspective. In Proceedings of the 21st ACM Symposium on Principles of Database Systems (PODS'02). 233--246.
[27]
M. Levene and G. Loizou. 1993. Semantics for null extended nested relations. ACM Trans. Database Syst. 18, 3, 414--459.
[28]
L. Libkin. 1995. A semantics-based approach to design of query languages for partial information. In Semantics in Databases. Lecture Notes in Computer Science, vol. 1358, Springer, 170--208.
[29]
L. Libkin. 2011. Incomplete information and certain answers in general data models. In Proceedings of the ACM Symposium on Principles of Database Systems (PODS'11). 59--70.
[30]
L. Libkin. 2014a. Certain answers as objects and knowledge. In Principles of the 14th International Conference on Knowledge Representation and Reasoning (KR'14).
[31]
L. Libkin. 2014b. Incomplete information: What went wrong and how to fix it. In Proceedings of the ACM Symposium on Principles of Database Systems (PODS'14). 1--13.
[32]
J. Minker. 1982. On indefinite databases and the closed world assumption. In Proceedings of the 6th Conference on Automated Deduction (CADE'82). 292--308.
[33]
C. Nikolaou and M. Koubarakis. 2013. Incomplete information in rdf. In Proceedings of the 7th International Conference on Web Reasoning and Rule Systems (RR'13). 138--152.
[34]
A. Ohori. 1990. Semantics of types for database objects. Theor. Comput. Sci. 76, 53--91.
[35]
R. Reiter. 1977. On closed world data bases. In Logic and Data Bases. Springer, 55--76.
[36]
R. Reiter. 1982. Towards a logical reconstruction of relational database theory. In On Conceptual Modelling. Springer, 191--233.
[37]
B. Rossman. 2008. Homomorphism preservation theorems. J. ACM 55, 3.
[38]
B. Rounds. 1991. Situation-theoretic aspects of databases. In Situation Theory and Applications. CSLI Lecture Notes, vol. 26, University of Chicago Press, 229--256.
[39]
A. Stolboushkin. 1995. Finitely monotone properties. In Proceedings of the IEEE Symposium on Logic in Computer Science (LICS'95). 324--330.
[40]
M. Vardi. 1986. On the integrity of databases with incomplete information. In Proceedings of the ACM Symposium on Principles of Database Systems (PODS'86). 252--266.

Cited By

View all
  • (2022)Troubles with nulls, views from the usersProceedings of the VLDB Endowment10.14778/3551793.355181815:11(2613-2625)Online publication date: 1-Jul-2022
  • (2020)Coping with Incomplete Data: Recent AdvancesProceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3375395.3387970(33-47)Online publication date: 14-Jun-2020
  • (2020)Queries with Arithmetic on Incomplete DatabasesProceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3375395.3387666(179-189)Online publication date: 14-Jun-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Database Systems
ACM Transactions on Database Systems  Volume 39, Issue 4
Invited Articles Issue, SIGMOD 2013, PODS 2013 and ICDT 2013
December 2014
341 pages
ISSN:0362-5915
EISSN:1557-4644
DOI:10.1145/2691190
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 December 2014
Accepted: 01 August 2014
Revised: 01 May 2014
Received: 01 October 2013
Published in TODS Volume 39, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Incompleteness
  2. certain answers
  3. homomorphisms
  4. naive tables/evaluation
  5. orderings

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • EPSRC

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)11
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Troubles with nulls, views from the usersProceedings of the VLDB Endowment10.14778/3551793.355181815:11(2613-2625)Online publication date: 1-Jul-2022
  • (2020)Coping with Incomplete Data: Recent AdvancesProceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3375395.3387970(33-47)Online publication date: 14-Jun-2020
  • (2020)Queries with Arithmetic on Incomplete DatabasesProceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3375395.3387666(179-189)Online publication date: 14-Jun-2020
  • (2019)Temporal data exchangeInformation Systems10.1016/j.is.2019.07.004Online publication date: Jul-2019
  • (2018)Explainable certain answersProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304889.3304893(1683-1690)Online publication date: 13-Jul-2018
  • (2018)Certain Answers Meet Zero-One LawsProceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3196959.3196983(195-207)Online publication date: 27-May-2018
  • (2017)Correctness of SQL Queries on Databases with NullsACM SIGMOD Record10.1145/3156655.315665746:3(5-16)Online publication date: 31-Oct-2017
  • (2017)Indexing for Large Scale Data Querying Based on Spark SQL2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)10.1109/ICEBE.2017.25(103-108)Online publication date: Nov-2017
  • (2016)Making SQL Queries Correct on Incomplete DatabasesProceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/2902251.2902297(211-223)Online publication date: 15-Jun-2016
  • (2016)SQL’s Three-Valued Logic and Certain AnswersACM Transactions on Database Systems10.1145/287720641:1(1-28)Online publication date: 18-Mar-2016
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media