research-article

Causal Discovery via Causal Star Graphs

Authors:

Jing GengAuthors Info & Claims

ACM Transactions on Knowledge Discovery from Data, Volume 17, Issue 7

Article No.: 98, Pages 1 - 24

https://doi.org/10.1145/3586997

Published: 14 April 2023 Publication History

Abstract

Discovering causal relationships among observed variables is an important research focus in data mining. Existing causal discovery approaches are mainly based on constraint-based methods and functional causal models (FCMs). However, the constraint-based method cannot identify the Markov equivalence class and the functional causal models cannot identify the complex interrelationships when multiple variables affect one variable. To address the two aforementioned problems, we propose a new graph structure Causal Star Graph (CSG) and a corresponding framework Causal Discovery via Causal Star Graphs (CD-CSG) to divide a causal directed acyclic graph into multiple CSGs for causal discovery. In this framework, we also propose a generalized learning in CSGs based on a variational approach to learn the representative intermediate variable of CSG’s non-central variables. Through the generalized learning in CSGs, the asymmetry in the forward and backward model of CD-CSG can be found to identify the causal directions in the directed acyclic graphs. We further divide the CSGs into three categories and provide the causal identification principle under each category in our proposed framework. Experiments using synthetic data show that the causal relationships between variables can be effectively identified with CD-CSG and the accuracy of CD-CSG is higher than the best existing model. By applying CD-CSG to real-world data, our proposed method can greatly augment the applicability and effectiveness of causal discovery.

References

[1]

Steen A. Andersson, David Madigan, and Michael D. Perlman. 1997. A characterization of Markov equivalence classes for acyclic digraphs. The Annals of Statistics 25, 2 (1997), 505–541.

[2]

Joshua D. Angrist and Guido W. Imbens. 1995. Two-stage least squares estimation of average causal effects in models with variable treatment intensity. Journal of the American Statistical Association 90, 430 (1995), 431–442.

[3]

Joshua D. Angrist, Guido W. Imbens, and Donald B. Rubin. 1996. Identification of causal effects using instrumental variables. Journal of the American Statistical Association 91, 434 (1996), 444–455.

[4]

Peter M. Bentler and David G. Weeks. 1980. Linear structural equations with latent variables. Psychometrika 45, 3 (1980), 289–308.

[5]

Patrick Blöbaum, Dominik Janzing, Takashi Washio, Shohei Shimizu, and Bernhard Schölkopf. 2018. Cause-effect inference by comparing regression errors. In Proceedings of the International Conference on Artificial Intelligence and Statistics. PMLR, 900–909.

[6]

Facundo Bromberg and Dimitris Margaritis. 2009. Improving the reliability of causal discovery from small datasets using argumentation. Journal of Machine Learning Research 10, 2 (2009), 141–180.

[7]

Peter Bühlmann, Jonas Peters, and Jan Ernest. 2014. CAM: Causal additive models, high-dimensional order search and penalized regression. The Annals of Statistics 42, 6 (2014), 2526–2556.

[8]

Ruichu Cai, Wei Chen, Kun Zhang, and Zhifeng Hao. 2017. A survey on non-temporal series observational data based causal discovery. Chinese Journal of Computers 40, 6 (2017), 1470–1490.

[9]

Ruichu Cai, Jie Qiao, Kun Zhang, Zhenjie Zhang, and Zhifeng Hao. 2019. Causal discovery with cascade nonlinear additive noise model. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 1609–1615.

Digital Library

[10]

Billy Chang, Uwe Krüger, Rafal Kustra, and Junping Zhang. 2013. Canonical correlation analysis based on Hilbert–Schmidt independence criterion and centered kernel target alignment. In Proceedings of the 30th International Conference on Machine Learning.316–324.

[11]

David Maxwell Chickering. 2002. Optimal structure identification with greedy search. Journal of Machine Learning Research 3, Nov. (2002), 507–554.

[12]

Diego Colombo and Marloes H. Maathuis. 2014. Order-independent constraint-based causal structure learning. Journal of Machine Learning Research 15, 1 (2014), 3741–3782.

Digital Library

[13]

Diego Colombo and Marloes H. Maathuis. 2014. Order-independent constraint-based causal structure learning. Journal of Machine Learning Research 15, 1 (2014), 3741–3782.

Digital Library

[14]

Diego Colombo, Marloes H. Maathuis, Markus Kalisch, and Thomas S. Richardson. 2011. Learning high-dimensional DAGs with latent and selection variables. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence. 850.

[15]

Gregory F. Cooper and Changwon Yoo. 1999. Causal discovery from a mixture of experimental and observational data. In Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence. 116–125.

[16]

Povilas Daniusis, Dominik Janzing, Joris M. Mooij, Jakob Zscheischler, Bastian Steudel, Kun Zhang, and Bernhard Schölkopf. 2010. Inferring deterministic causal relations. In Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence. 143–150.

[17]

José A. R. Fonollosa. 2019. Conditional distribution variability measures for causality detection. In Proceedings of the Cause Effect Pairs in Machine Learning. Springer, 339–347.

[18]

H. Altay Guvenir, Burak Acar, Gulsen Demiroz, and Ayhan Cekin. 1997. A supervised machine learning algorithm for arrhythmia analysis. In Proceedings of the Computers in Cardiology 1997. 433–436.

[19]

Jan Hauffa, Wolfgang Bräu, and Georg Groh. 2019. Detection of topical influence in social networks via granger-causal inference: A Twitter case study. In Proceedings of the International Conference on Advances in Social Networks Analysis and Mining. 969–977.

Digital Library

[20]

Alain Hauser and Peter Bühlmann. 2012. Characterization and greedy learning of interventional Markov equivalence classes of directed acyclic graphs. Journal of Machine Learning Research 13, 1 (2012), 2409–2464.

Digital Library

[21]

Patrik O. Hoyer, Dominik Janzing, Joris M. Mooij, Jonas Peters, and Bernhard Schölkopf. 2008. Nonlinear causal discovery with additive noise models. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems. 689–696.

[22]

Shoubo Hu, Zhitang Chen, Vahid Partovi Nia, Lai-Wan Chan, and Yanhui Geng. 2018. Causal inference and mechanism clustering of a mixture of additive noise models. In Proceedings of the Annual Conference on Neural Information Processing Systems. 5212–5222.

[23]

Dominik Janzing, Joris M. Mooij, Kun Zhang, Jan Lemeire, Jakob Zscheischler, Povilas Daniusis, Bastian Steudel, and Bernhard Schölkopf. 2012. Information-geometric approach to inferring causal directions. Artificial Intelligence 182-183 (2012), 1–31.

Digital Library

[24]

Diviyan Kalainathan, Olivier Goudet, and Ritik Dutta. 2020. Causal discovery toolbox: Uncovering causal relationships in python. Journal of Machine Learning Research 21, 1 (2020), 37:1–37:5.

[25]

Markus Kalisch and Peter Bühlman. 2007. Estimating high-dimensional directed acyclic graphs with the PC-algorithm. Journal of Machine Learning Research 8, 3 (2007), 613–636.

Digital Library

[26]

Diederik P. Kingma and Max Welling. 2014. Auto-encoding variational bayes. In Proceedings of the 2nd International Conference on Learning Representations.

[27]

Thuc Duy Le, Tao Hoang, Jiuyong Li, Lin Liu, Huawen Liu, and Shu Hu. 2019. A fast PC algorithm for high dimensional causal discovery with multi-core PCs. IEEE/ACM Transactions on Computational Biology and Bioinformatics 16, 5 (2019), 1483–1495.

Digital Library

[28]

Honghao Li, Vincent Cabeli, Nadir Sella, and Hervé Isambert. 2019. Constraint-based causal structure learning with consistent separating sets. In Proceedings of the Annual Conference on Neural Information Processing Systems 2019. 14257–14266.

[29]

Joris M. Mooij and Dominik Janzing. 2010. Distinguishing between cause and effect. In Proceedings of the Causality: Objectives and Assessment.147–156.

[30]

Joris M. Mooij, Jonas Peters, Dominik Janzing, Jakob Zscheischler, and Bernhard Schölkopf. 2016. Distinguishing cause from effect using observational data: Methods and benchmarks. Journal of Machine Learning Research 17, 1 (2016), 32:1–32:102.

[31]

Warwick J. Nash, Tracy L. Sellers, Simon R. Talbot, Andrew J. Cawthorn, and Wes B. Ford. 1994. The population biology of abalone (Haliotis species) in Tasmania. i. blacklip abalone (h. rubra) from the north coast and islands of bass strait. Sea Fisheries Division, Technical Report 48 (1994), p411.

[32]

Judea Pearl. 2000. Causality: Models, Reasoning, and Inference. Cambridge University Press.

Digital Library

[33]

Judea Pearl. 2014. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Elsevier.

Digital Library

[34]

Kurt Driessens and Saso Dzeroski. 2005. Combining model-based and instance-based learning for first order regression. Machine Learning, Proceedings of the Twenty-Second International Conference (ICML’05, Bonn, Germany, August 7-11, 2005), ACM International Conference Proceeding Series, Vol. 119, ACM, 193–200.

[35]

Karen Sachs, Omar Perez, Dana Pe’er, Douglas A. Lauffenburger, and Garry P. Nolan. 2005. Causal protein-signaling networks derived from multiparameter single-cell data. Science 308, 5721 (2005), 523–529.

[36]

Shohei Shimizu, Patrik O. Hoyer, Aapo Hyvärinen, and Antti J. Kerminen. 2006. A linear non-gaussian acyclic model for causal discovery. Journal of Machine Learning Research 7, 10 (2006), 2003–2030.

Digital Library

[37]

Peter Spirtes, Clark N. Glymour, Richard Scheines, and David Heckerman. 2000. Causation, Prediction, and Search. MIT Press.

[38]

Peter Spirtes, Christopher Meek, and Thomas Richardson. 1999. An algorithm for causal inference in the presence of latent variables and selection bias. Computation, Causation, and Discovery 21 (1999), 1–252.

[39]

Helga Stoyan and Uwe Jansen. 2013. Umweltstatistik: Statistische Verarbeitung und Analyse Von Umweltdaten. Springer-Verlag.

[40]

Xiaohai Sun, Dominik Janzing, Bernhard Schölkopf, and Kenji Fukumizu. 2007. A kernel-based causal learning algorithm. In Proceedings of the 24th International Conference on Machine Learning.855–862.

Digital Library

[41]

Tim Van Erven and Peter Harremos. 2014. Rényi divergence and kullback-leibler divergence. IEEE Transactions on Information Theory 60, 7 (2014), 3797–3820.

[42]

Tyler J. VanderWeele and Weihua An. 2013. Social networks and causal inference. Handbook of Causal Analysis for Social Research (2013), 353–374.

[43]

Thomas Verma and Judea Pearl. 1990. Equivalence and synthesis of causal models. In Proceedings of the 6th Annual Conference on Uncertainty in Artificial Intelligence. 255–270.

[44]

Yixin Wang and David M. Blei. 2019. The blessings of multiple causes. Journal of the American Statistical Association 114, 528 (2019), 1574–1596.

[45]

I-Cheng Yeh. 2006. Analysis of strength of concrete using design of experiments and neural networks. Journal of Materials in Civil Engineering 18, 4 (2006), 597–604.

[46]

Kun Zhang and Aapo Hyvärinen. 2009. On the identifiability of the post-nonlinear causal model. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. 647–655.

[47]

Kun Zhang, Jonas Peters, Dominik Janzing, and Bernhard Schölkopf. 2011. Kernel-based conditional independence test and application in causal discovery. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence. 804–813.

[48]

Kun Zhang, Zhikun Wang, Jiji Zhang, and Bernhard Schölkopf. 2016. On estimation of functional causal models: General results and application to the post-nonlinear causal model. ACM Transactions on Intelligent Systems and Technology 7, 2 (2016), 13:1–13:22.

Digital Library

[49]

Xiangzhou Zhang, Yong Hu, Kang Xie, Shouyang Wang, EWT Ngai, and Mei Liu. 2014. A causal feature selection algorithm for stock prediction modeling. Neurocomputing 142 (2014), 48–59.

Digital Library

[50]

Xinhua Zhang, Le Song, Arthur Gretton, and Alexander J. Smola. 2008. Kernel measures of independence for non-iid data. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems. 1937–1944.

Cited By

Yi JChen Z(2024)Deconfounded Cross-modal Matching for Content-based Micro-video Background Music RecommendationACM Transactions on Intelligent Systems and Technology10.1145/365004215:3(1-25)Online publication date: 15-Apr-2024
https://dl.acm.org/doi/10.1145/3650042
Zhao BWang SChi LYuan HYuan YLi QGeng JZhang S(2024)Coresets for fast causal discovery with the additive noise modelPattern Recognition10.1016/j.patcog.2023.110149148:COnline publication date: 17-Apr-2024
https://dl.acm.org/doi/10.1016/j.patcog.2023.110149

Index Terms

Causal Discovery via Causal Star Graphs
1. Computing methodologies
  1. Artificial intelligence
    1. Knowledge representation and reasoning
      1. Causal reasoning and diagnostics
2. Information systems
  1. Information systems applications
    1. Data mining

Recommendations

Coresets for fast causal discovery with the additive noise model
Abstract
Causal discovery reveals the true causal relationships behind data and discovering causal relationships from observed data is a particularly challenging problem, especially in large-scale datasets. The functional causal model is an effective ...
Highlights
- New coresets proposed for the additive noise model greatly reduces the data size for causal discovery.
- A time-efficient algorithm, FANM, is proposed for causal discovery based on the coresets.
- The coreset construction is applied to ...
Causal Discovery with Confounding Cascade Nonlinear Additive Noise Models
Identification of causal direction between a causal-effect pair from observed data has recently attracted much attention. Various methods based on functional causal models have been proposed to solve this problem, by assuming the causal process satisfies ...
Disentangling causality: assumptions in causal discovery and inference
Abstract
Causality has been a burgeoning field of research leading to the point where the literature abounds with different components addressing distinct parts of causality. For researchers, it has been increasingly difficult to discern the assumptions ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data

ACM Transactions on Knowledge Discovery from Data Volume 17, Issue 7

August 2023

319 pages

ISSN:1556-4681

EISSN:1556-472X

DOI:10.1145/3589018

Editor:
Charu Aggarwal
IBM T. J. Watson Research, USA

Issue’s Table of Contents

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 April 2023

Online AM: 06 March 2023

Accepted: 27 February 2023

Revised: 21 September 2022

Received: 15 February 2022

Published in TKDD Volume 17, Issue 7

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Key R&D Program of China
National Natural Science Fund of China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
598
Total Downloads

Downloads (Last 12 months)327
Downloads (Last 6 weeks)21

Reflects downloads up to 12 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Yi JChen Z(2024)Deconfounded Cross-modal Matching for Content-based Micro-video Background Music RecommendationACM Transactions on Intelligent Systems and Technology10.1145/365004215:3(1-25)Online publication date: 15-Apr-2024
https://dl.acm.org/doi/10.1145/3650042
Zhao BWang SChi LYuan HYuan YLi QGeng JZhang S(2024)Coresets for fast causal discovery with the additive noise modelPattern Recognition10.1016/j.patcog.2023.110149148:COnline publication date: 17-Apr-2024
https://dl.acm.org/doi/10.1016/j.patcog.2023.110149

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Full Text

View this article in Full Text.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View full text|Download PDF

View Issue’s Table of Contents