Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1807085.1807113acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
research-article

Computing query probability with incidence algebras

Published: 06 June 2010 Publication History

Abstract

We describe an algorithm that evaluates queries over probabilistic databases using Mobius' inversion formula in incidence algebras. The queries we consider are unions of conjunctive queries (equivalently: existential, positive First Order sentences), and the probabilistic databases are tuple-independent structures. Our algorithm runs in PTIME on a subset of queries called "safe" queries, and is complete, in the sense that every unsafe query is hard for the class FP#P. The algorithm is very simple and easy to implement in practice, yet it is non-obvious. Mobius' inversion formula, which is in essence inclusion-exclusion, plays a key role for completeness, by allowing the algorithm to compute the probability of some safe queries even when they have some subqueries that are unsafe. We also apply the same lattice-theoretic techniques to analyze an algorithm based on lifted conditioning, and prove that it is incomplete.

References

[1]
N. Creignou and M. Hermann. Complexity of generalized satisfiability counting problems. Inf. Comput, 125(1):1--12, 1996.
[2]
Nadia Creignou. A dichotomy theorem for maximum generalized satisfiability problems. J. Comput. Syst. Sci., 51(3):511--522, 1995.
[3]
N. Dalvi, K. Schnaitter, and D. Suciu. Computing query probability with incidence algebras. Technical Report UW-CSE-10-03-02, University of Washington, March 2010.
[4]
N. Dalvi and D. Suciu. Efficient query evaluation on probabilistic databases. In VLDB, Toronto, Canada, 2004.
[5]
N. Dalvi and D. Suciu. The dichotomy of conjunctive queries on probabilistic structures. In PODS, pages 293--302, 2007.
[6]
N. Dalvi and D. Suciu. Management of probabilistic data: Foundations and challenges. In PODS, pages 1--12, Beijing, China, 2007. (invited talk).
[7]
Adnan Darwiche. A differential approach to inference in bayesian networks. Journal of the ACM, 50(3):280--305, 2003.
[8]
Erich Grädel, Yuri Gurevich, and Colin Hirsch. The complexity of query reliability. In PODS, pages 227--234, 1998.
[9]
Technical Report UW-CSE-10-03-02, University of Washington, March 2010.
[10]
Dan Olteanu and Jiewen Huang. Secondary-storage confidence computation for conjunctive queries with inequalities. In SIGMOD, pages 389--402, 2009.
[11]
Dan Olteanu, Jiewen Huang, and Christoph Koch. Sprout: Lazy vs. eager query plans for tuple-independent probabilistic databases. In ICDE, pages 640--651, 2009.
[12]
D. Poole. First-order probabilistic inference. In IJCAI, 2003.
[13]
Yehoushua Sagiv and Mihalis Yannakakis. Equivalences among relational expressions with the union and difference operators. Journal of the ACM, 27:633--655, 1980.
[14]
P. Sen, A.Deshpande, and L. Getoor. Bisimulation-based approximate lifted inference. In UAI, 2009.
[15]
Parag Singla and Pedro Domingos. Lifted first-order belief propagation. In AAAI, pages 1094--1099, 2008.
[16]
Richard P. Stanley. Enumerative Combinatorics. Cambridge University Press, 1997.
[17]
Ingo Wegener. BDDs--design, analysis, complexity, and applications. Discrete Applied Mathematics, 138(1-2):229--251, 2004.

Cited By

View all
  • (2023)Probabilistic Query Evaluation: The Combined FPRAS LandscapeProceedings of the 42nd ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3584372.3588677(339-347)Online publication date: 18-Jun-2023
  • (2022)Probabilistic DatabasesundefinedOnline publication date: 2-Mar-2022
  • (2020)Queries with difference on probabilistic databasesProceedings of the VLDB Endowment10.14778/3402707.34027414:11(1051-1062)Online publication date: 3-Jun-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
PODS '10: Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
June 2010
350 pages
ISBN:9781450300339
DOI:10.1145/1807085
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 June 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. incidence algebra
  2. mobius inversion
  3. probabilistic database

Qualifiers

  • Research-article

Conference

SIGMOD/PODS '10
Sponsor:
SIGMOD/PODS '10: International Conference on Management of Data
June 6 - 11, 2010
Indiana, Indianapolis, USA

Acceptance Rates

PODS '10 Paper Acceptance Rate 27 of 113 submissions, 24%;
Overall Acceptance Rate 642 of 2,707 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Probabilistic Query Evaluation: The Combined FPRAS LandscapeProceedings of the 42nd ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3584372.3588677(339-347)Online publication date: 18-Jun-2023
  • (2022)Probabilistic DatabasesundefinedOnline publication date: 2-Mar-2022
  • (2020)Queries with difference on probabilistic databasesProceedings of the VLDB Endowment10.14778/3402707.34027414:11(1051-1062)Online publication date: 3-Jun-2020
  • (2019): Weighted and Projected SamplingTools and Algorithms for the Construction and Analysis of Systems10.1007/978-3-030-17462-0_4(59-76)Online publication date: 6-Apr-2019
  • (2018)State-space abstractions for probabilistic inferenceJournal of Artificial Intelligence Research10.1613/jair.1.1126163:1(789-848)Online publication date: 1-Sep-2018
  • (2018)Dan Suciu Speaks Out on Research, Shyness and Being a ScientistACM SIGMOD Record10.1145/3186549.318655746:4(28-34)Online publication date: 22-Feb-2018
  • (2016)Tractable Lineages on Treelike InstancesProceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/2902251.2902301(355-370)Online publication date: 15-Jun-2016
  • (2014)Oblivious bounds on the probability of boolean functionsACM Transactions on Database Systems10.1145/253264139:1(1-34)Online publication date: 6-Jan-2014
  • (2014)Querying and Learning in Probabilistic DatabasesReasoning Web. Reasoning on the Web in the Big Data Era10.1007/978-3-319-10587-1_8(313-368)Online publication date: 2014
  • (2013)Provenance-based dictionary refinement in information extractionProceedings of the 2013 ACM SIGMOD International Conference on Management of Data10.1145/2463676.2465284(457-468)Online publication date: 22-Jun-2013
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media