Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

The complexity of causality and responsibility for query answers and non-answers

Published: 01 October 2010 Publication History

Abstract

An answer to a query has a well-defined lineage expression (alternatively called how-provenance) that explains how the answer was derived. Recent work has also shown how to compute the lineage of a non-answer to a query. However, the cause of an answer or non-answer is a more subtle notion and consists, in general, of only a fragment of the lineage. In this paper, we adapt Halpern, Pearl, and Chockler's recent definitions of causality and responsibility to define the causes of answers and non-answers to queries, and their degree of responsibility. Responsibility captures the notion of degree of causality and serves to rank potentially many causes by their relative contributions to the effect. Then, we study the complexity of computing causes and responsibilities for conjunctive queries. It is known that computing causes is NP-complete in general. Our first main result shows that all causes to conjunctive queries can be computed by a relational query which may involve negation. Thus, causality can be computed in PTIME, and very efficiently so. Next, we study computing responsibility. Here, we prove that the complexity depends on the conjunctive query and demonstrate a dichotomy between PTIME and NP-complete cases. For the PTIME cases, we give a non-trivial algorithm, consisting of a reduction to the max-flow computation problem. Finally, we prove that, even when it is in PTIME, responsibility is complete for LOGSPACE, implying that, unlike causality, it cannot be computed by a relational query.

References

[1]
S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley, 1995.
[2]
P. Buneman, S. Khanna, and W. C. Tan. Why and where: A characterization of data provenance. In ICDT, 2001.
[3]
A. Chapman and H. V. Jagadish. Why not? In SIGMOD, 2009.
[4]
J. Cheney, L. Chiticariu, and W. C. Tan. Provenance in databases: Why, how, and where. Foundations and Trends in Databases, 1(4):379--474, 2009.
[5]
H. Chockler and J. Y. Halpern. Responsibility and blame: A structural-model approach. J. Artif. Intell. Res. (JAIR), 22:93--115, 2004.
[6]
H. Chockler, J. Y. Halpern, and O. Kupferman. What causes a system to satisfy a specification? ACM Trans. Comput. Log., 9(3), 2008.
[7]
Y. Cui, J. Widom, and J. L. Wiener. Tracing the lineage of view data in a warehousing environment. ACM Trans. Database Syst., 25(2):179--227, 2000.
[8]
N. Dalvi and D. Suciu. Management of probabilistic data: Foundations and challenges. In PODS, pages 1--12, Beijing, China, 2007. (invited talk).
[9]
T. Eiter and T. Lukasiewicz. Complexity results for structure-based causality. Artif. Intell., 142(1):53--89, 2002. (Conference version in IJCAI, 2002).
[10]
T. Eiter and T. Lukasiewicz. Causes and explanations in the structural-model approach: Tractable cases. Artif. Intell., 170(6--7):542--580, 2006.
[11]
G. Gottlob, N. Leone, and F. Scarcello. The complexity of acyclic conjunctive queries. J. ACM, 48(3):431--498, 2001.
[12]
T. J. Green, G. Karvounarakis, and V. Tannen. Provenance semirings. In PODS, 2007.
[13]
J. Y. Halpern and J. Pearl. Causes and explanations: A structural-model approach. Part I: Causes. Brit. J. Phil. Sci., 56:843--887, 2005. (Conference version in UAI, 2001).
[14]
M. Herschel, M. A. Hernández, and W. C. Tan. Artemis: A system for analyzing missing answers. PVLDB, 2(2):1550--1553, 2009.
[15]
J. Huang, T. Chen, A. Doan, and J. F. Naughton. On the provenance of non-answers to queries over extracted data. PVLDB, 1(1):736--747, 2008.
[16]
D. Lewis. Causation. The Journal of Philosophy, 70(17):556--567, 1973.
[17]
A. Meliou, W. Gatterbauer, J. Halpern, C. Koch, K. F. Moore, and D. Suciu. Causality in databases. IEEE Data Engineering Bulletin, Sept. 2010.
[18]
A. Meliou, W. Gatterbauer, K. F. Moore, and D. Suciu. The complexity of causality and responsibility for query answers and non-answers. CoRR, abs/1009.2021, 2010.
[19]
A. Meliou, W. Gatterbauer, K. F. Moore, and D. Suciu. Why so? or Why no? Functional causality for explaining query answers. In MUD, 2010. Full version: CoRR abs/0912.5340 (2009).
[20]
P. Menzies. Counterfactual theories of causation. Stanford Encylopedia of Philosophy, 2008.
[21]
D. Olteanu and J. Huang. Secondary-storage confidence computation for conjunctive queries with inequalities. In SIGMOD, 2009.
[22]
P. Senellart and G. Gottlob. On the complexity of deriving schema mappings from database instances. PODS, 2008.
[23]
Q. T. Tran and C.-Y. Chan. How to conquer why-not questions. In SIGMOD, 2010.
[24]
International multidisciplinary workshop on causality. IRIT, Toulouse, June 2009.

Cited By

View all
  • (2025)Differentially private explanations for aggregate query answersThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-024-00895-434:2Online publication date: 1-Mar-2025
  • (2024)PD-Explain: A Unified Python-Native Framework for Query Explanations Over DataFramesProceedings of the VLDB Endowment10.14778/3685800.368590317:12(4473-4476)Online publication date: 8-Nov-2024
  • (2024)The Generalized Causal-Effect Score in Data Management (short paper)Proceedings of the Conference on Governance, Understanding and Integration of Data for Effective and Responsible AI10.1145/3665601.3669843(32-35)Online publication date: 9-Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 4, Issue 1
October 2010
45 pages

Publisher

VLDB Endowment

Publication History

Published: 01 October 2010
Published in PVLDB Volume 4, Issue 1

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Differentially private explanations for aggregate query answersThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-024-00895-434:2Online publication date: 1-Mar-2025
  • (2024)PD-Explain: A Unified Python-Native Framework for Query Explanations Over DataFramesProceedings of the VLDB Endowment10.14778/3685800.368590317:12(4473-4476)Online publication date: 8-Nov-2024
  • (2024)The Generalized Causal-Effect Score in Data Management (short paper)Proceedings of the Conference on Governance, Understanding and Integration of Data for Effective and Responsible AI10.1145/3665601.3669843(32-35)Online publication date: 9-Jun-2024
  • (2024)The Complexity of Resilience Problems via Valued Constraint Satisfaction ProblemsProceedings of the 39th Annual ACM/IEEE Symposium on Logic in Computer Science10.1145/3661814.3662071(1-14)Online publication date: 8-Jul-2024
  • (2024)Counterfactual Explanation at Will, with Zero Privacy LeakageProceedings of the ACM on Management of Data10.1145/36549332:3(1-29)Online publication date: 30-May-2024
  • (2024)Banzhaf Values for Facts in Query AnsweringProceedings of the ACM on Management of Data10.1145/36549262:3(1-26)Online publication date: 30-May-2024
  • (2024)Postulates for Provenance: Instance-based provenance for first-order logicProceedings of the ACM on Management of Data10.1145/36515962:2(1-16)Online publication date: 14-May-2024
  • (2024)Fast Shapley Value Computation in Data Assemblage Tasks as Cooperative Simple GamesProceedings of the ACM on Management of Data10.1145/36393112:1(1-28)Online publication date: 26-Mar-2024
  • (2024)Relative Keys: Putting Feature Explanation into ContextProceedings of the ACM on Management of Data10.1145/36392632:1(1-28)Online publication date: 26-Mar-2024
  • (2024)CleanEr: Interactive, Query-Guided Error Mitigation for Data Cleaning Systems2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00418(5421-5424)Online publication date: 13-May-2024
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media