Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1989284.1989294acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
research-article

Incomplete information and certain answers in general data models

Published: 13 June 2011 Publication History

Abstract

While incomplete information is ubiquitous in all data models - especially in applications involving data translation or integration - our understanding of it is still not completely satisfactory. For example, even such a basic notion as certain answers for XML queries was only introduced recently, and in a way seemingly rather different from relational certain answers.
The goal of this paper is to introduce a general approach to handling incompleteness, and to test its applicability in known data models such as relations and documents. The approach is based on representing degrees of incompleteness via semantics-based orderings on database objects. We use it to both obtain new results on incompleteness and to explain some previously observed phenomena. Specifically we show that certain answers for relational and XML queries are two instances of the same general concept; we describe structural properties behind the naive evaluation of queries; answer open questions on the existence of certain answers in the XML setting; and show that previously studied ordering-based approaches were only adequate for SQL's primitive view of nulls. We define a general setting that subsumes relations and documents to help us explain in a uniform way how to compute certain answers, and when good solutions can be found in data exchange. We also look at the complexity of common problems related to incompleteness, and generalize several results from relational and XML contexts.

References

[1]
S. Abiteboul, O. Duschka. Complexity of answering queriesusing materialized views. In PODS 1998, pages 254--263.
[2]
S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley, 1995.
[3]
S. Abiteboul, P Kanellakis, and G. Grahne. On the representation and querying of sets of possible worlds. TCS, 78(1):158--187, 1991.
[4]
S. Abiteboul, L. Segoufin, and V. Vianu. Representing and querying XML with incomplete information. ACM TODS, 31(1):208--254, 2006.
[5]
. Antova, C. Koch, D. Olteanu. 1010 6 worlds and beyond: efficient representationand processing of incomplete information. VLDB J. 18(5): 1021--1040 (2009).
[6]
M. Arenas, P. Barceló, L. Libkin, F. Murlak. Relational and XML Data Exchange. Morgan & Claypool, 2010.
[7]
P. Barceló, L. Libkin, A. Poggi, and C. Sirangelo. XML with incomplete information. J. ACM 58(1): 1--62 (2010).
[8]
H. Bjorklund, W. Martens, and T. Schwentick. Conjunctive query containment over trees. In DBPL'07, pages 66--80.
[9]
P. Buneman, A. Jung, A. Ohori. Using powerdomains to generalize relational databases. Theoretical Computer Science 91(1991), 23--55.
[10]
P. Buneman, S. Davidson, A. Watters. A semantics for complex objects and approximate answers. Journal of Computer and System Sciences 43(1991), 170--218.
[11]
A. Cali, G. Gottlob, T. Lukasiewicz. Datalog: a unified approach to ontologies and integrityconstraints. In ICDT'10, pages 14--30.
[12]
A. Cali, D. Lembo, and R. Rosati. On the decidability and complexity of query answering over inconsistent and incomplete databases. In PODS';03, pages 260--271.
[13]
S. Cohen and Y. Sagiv. An abstract framework for generating maximal answers toqueries. In ICDT 2005, pages 129--143.
[14]
C. Date and H. Darwin. A Guide to the SQL Standard. Addison-Wesley, 1996.
[15]
C. David, L. Libkin, F. Murlak. Certain answers for XML queries. In PODS 2010, pages 191--202.
[16]
A. Deutsch, A. Nash, J. Remmel. The chase revisited. In PODS'08, pages 149--158.
[17]
P. Erdos. Graph theory and probability. Canad. J. Math. 11 (1959), 34--38.
[18]
R. Fagin, Ph. Kolaitis, R. Miller, and L. Popa. Data exchange: semantics and query answering. Theoretical Computer Science, 336(1):89--124, 2005.
[19]
J. Flum and M. Grohe. Parameterized Complexity Theory. Springer, 2006.
[20]
G. Gottlob, C. Koch, and K. Schulz. Conjunctive queries over trees. J. ACM 53(2):238--272, 2006.
[21]
C. Gunter. Semantics of Programming Languages. The MIT Press, 1992.
[22]
M. Gyssens, J. Paredaens, J. Van den Bussche, D. Van Gucht. A graph-oriented object database model IEEE TKDE 6(4):572--586, 1994.
[23]
P. Hell and J. Nesetril. phGraphs and Homomorphisms. Oxford University Press, 2004.
[24]
J. Hubicka and J. Nesetril. Finite paths are universal. Order 22(1):21--40, 2005.
[25]
T. Imielinski and W. Lipski. Incomplete information in relational databases. J. ACM, 31(4):761--791, 1984.
[26]
P. Kolaitis and M. Vardi. A logical approach to constraint satisfaction. In Finite Model Theory and Its Applications, Springer2007, pages 339--370.
[27]
G. Kuper and M. Vardi. The logical data model. ACM Trans. Database Syst. (TODS) 18(3):379--413 (1993)
[28]
M. Lenzerini. Data integration: a theoretical perspective. In PODS'02, pages 233--246.
[29]
M. Levene and G. Loizou. Semantics of null extended nested relations. ACM Trans. Database Systems 18 (1992), 414--459.
[30]
L. Libkin. A semantics-based approach to design of query languages forpartial information. In Semantics in Databases, LNCS 1358, 1998, pages170--208.
[31]
A. Ohori. Semantics of types for database objects. Theoretical Computer Science 76 (1990), 53--91.
[32]
D. Olteanu, C. Koch, L. Antova. World-set decompositions: expressiveness and efficient algorithms. TCS 403 (2008), 265--284.
[33]
B. Rossman. Homomorphism preservation theorems. J. ACM 55(3): (2008).
[34]
B. Rounds. Situation-theoretic aspects of databases. In Proceedings of Conference on Situation Theory and Applications, CSLI vol. 26, 1991, pages 229--256.
[35]
D. Suciu. Probabilistic databases. Encyclopedia of Database Systems, 2009, pages 2150--2155.
[36]
Vardi. On the integrity of databases with incomplete information. In PODS'86, pages 252--266.
[37]
W. Wechler. Universal Algebra for Computer Scientists. Springer, 1992.

Cited By

View all
  • (2022)Query Processing over Incomplete DatabasesundefinedOnline publication date: 25-Feb-2022
  • (2021)UCQ-Rewritings for disjunctive knowledge and queries with negated atomsSemantic Web10.3233/SW-20039912:4(685-709)Online publication date: 1-Jan-2021
  • (2020)Coping with Incomplete Data: Recent AdvancesProceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3375395.3387970(33-47)Online publication date: 14-Jun-2020
  • Show More Cited By

Index Terms

  1. Incomplete information and certain answers in general data models

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    PODS '11: Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
    June 2011
    332 pages
    ISBN:9781450306607
    DOI:10.1145/1989284
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 June 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. certain answers
    2. homomorphisms
    3. incompleteness
    4. naive tables/evaluation
    5. orderings
    6. xml

    Qualifiers

    • Research-article

    Conference

    SIGMOD/PODS '11
    Sponsor:

    Acceptance Rates

    PODS '11 Paper Acceptance Rate 25 of 113 submissions, 22%;
    Overall Acceptance Rate 642 of 2,707 submissions, 24%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 30 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Query Processing over Incomplete DatabasesundefinedOnline publication date: 25-Feb-2022
    • (2021)UCQ-Rewritings for disjunctive knowledge and queries with negated atomsSemantic Web10.3233/SW-20039912:4(685-709)Online publication date: 1-Jan-2021
    • (2020)Coping with Incomplete Data: Recent AdvancesProceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3375395.3387970(33-47)Online publication date: 14-Jun-2020
    • (2019)Skyline queries over incomplete data streamsThe VLDB Journal10.1007/s00778-019-00577-628:6(961-985)Online publication date: 17-Oct-2019
    • (2018)Explainable certain answersProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304889.3304893(1683-1690)Online publication date: 13-Jul-2018
    • (2018)Incomplete data managementFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-016-6195-x12:1(4-25)Online publication date: 1-Feb-2018
    • (2016)Querying incomplete information in RDF with SPARQLArtificial Intelligence10.1016/j.artint.2016.04.005237:C(138-171)Online publication date: 1-Aug-2016
    • (2016)Certain answers as objects and knowledgeArtificial Intelligence10.1016/j.artint.2015.11.004232:C(1-19)Online publication date: 1-Mar-2016
    • (2015)How to define certain answersProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832747.2832854(4282-4288)Online publication date: 25-Jul-2015
    • (2015)Recovering Exchanged DataProceedings of the 34th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/2745754.2745770(105-116)Online publication date: 20-May-2015
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media