Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Causal Discovery via Causal Star Graphs

Published: 14 April 2023 Publication History

Abstract

Discovering causal relationships among observed variables is an important research focus in data mining. Existing causal discovery approaches are mainly based on constraint-based methods and functional causal models (FCMs). However, the constraint-based method cannot identify the Markov equivalence class and the functional causal models cannot identify the complex interrelationships when multiple variables affect one variable. To address the two aforementioned problems, we propose a new graph structure Causal Star Graph (CSG) and a corresponding framework Causal Discovery via Causal Star Graphs (CD-CSG) to divide a causal directed acyclic graph into multiple CSGs for causal discovery. In this framework, we also propose a generalized learning in CSGs based on a variational approach to learn the representative intermediate variable of CSG’s non-central variables. Through the generalized learning in CSGs, the asymmetry in the forward and backward model of CD-CSG can be found to identify the causal directions in the directed acyclic graphs. We further divide the CSGs into three categories and provide the causal identification principle under each category in our proposed framework. Experiments using synthetic data show that the causal relationships between variables can be effectively identified with CD-CSG and the accuracy of CD-CSG is higher than the best existing model. By applying CD-CSG to real-world data, our proposed method can greatly augment the applicability and effectiveness of causal discovery.

References

[1]
Steen A. Andersson, David Madigan, and Michael D. Perlman. 1997. A characterization of Markov equivalence classes for acyclic digraphs. The Annals of Statistics 25, 2 (1997), 505–541.
[2]
Joshua D. Angrist and Guido W. Imbens. 1995. Two-stage least squares estimation of average causal effects in models with variable treatment intensity. Journal of the American Statistical Association 90, 430 (1995), 431–442.
[3]
Joshua D. Angrist, Guido W. Imbens, and Donald B. Rubin. 1996. Identification of causal effects using instrumental variables. Journal of the American Statistical Association 91, 434 (1996), 444–455.
[4]
Peter M. Bentler and David G. Weeks. 1980. Linear structural equations with latent variables. Psychometrika 45, 3 (1980), 289–308.
[5]
Patrick Blöbaum, Dominik Janzing, Takashi Washio, Shohei Shimizu, and Bernhard Schölkopf. 2018. Cause-effect inference by comparing regression errors. In Proceedings of the International Conference on Artificial Intelligence and Statistics. PMLR, 900–909.
[6]
Facundo Bromberg and Dimitris Margaritis. 2009. Improving the reliability of causal discovery from small datasets using argumentation. Journal of Machine Learning Research 10, 2 (2009), 141–180.
[7]
Peter Bühlmann, Jonas Peters, and Jan Ernest. 2014. CAM: Causal additive models, high-dimensional order search and penalized regression. The Annals of Statistics 42, 6 (2014), 2526–2556.
[8]
Ruichu Cai, Wei Chen, Kun Zhang, and Zhifeng Hao. 2017. A survey on non-temporal series observational data based causal discovery. Chinese Journal of Computers 40, 6 (2017), 1470–1490.
[9]
Ruichu Cai, Jie Qiao, Kun Zhang, Zhenjie Zhang, and Zhifeng Hao. 2019. Causal discovery with cascade nonlinear additive noise model. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 1609–1615.
[10]
Billy Chang, Uwe Krüger, Rafal Kustra, and Junping Zhang. 2013. Canonical correlation analysis based on Hilbert–Schmidt independence criterion and centered kernel target alignment. In Proceedings of the 30th International Conference on Machine Learning.316–324.
[11]
David Maxwell Chickering. 2002. Optimal structure identification with greedy search. Journal of Machine Learning Research 3, Nov. (2002), 507–554.
[12]
Diego Colombo and Marloes H. Maathuis. 2014. Order-independent constraint-based causal structure learning. Journal of Machine Learning Research 15, 1 (2014), 3741–3782.
[13]
Diego Colombo and Marloes H. Maathuis. 2014. Order-independent constraint-based causal structure learning. Journal of Machine Learning Research 15, 1 (2014), 3741–3782.
[14]
Diego Colombo, Marloes H. Maathuis, Markus Kalisch, and Thomas S. Richardson. 2011. Learning high-dimensional DAGs with latent and selection variables. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence. 850.
[15]
Gregory F. Cooper and Changwon Yoo. 1999. Causal discovery from a mixture of experimental and observational data. In Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence. 116–125.
[16]
Povilas Daniusis, Dominik Janzing, Joris M. Mooij, Jakob Zscheischler, Bastian Steudel, Kun Zhang, and Bernhard Schölkopf. 2010. Inferring deterministic causal relations. In Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence. 143–150.
[17]
José A. R. Fonollosa. 2019. Conditional distribution variability measures for causality detection. In Proceedings of the Cause Effect Pairs in Machine Learning. Springer, 339–347.
[18]
H. Altay Guvenir, Burak Acar, Gulsen Demiroz, and Ayhan Cekin. 1997. A supervised machine learning algorithm for arrhythmia analysis. In Proceedings of the Computers in Cardiology 1997. 433–436.
[19]
Jan Hauffa, Wolfgang Bräu, and Georg Groh. 2019. Detection of topical influence in social networks via granger-causal inference: A Twitter case study. In Proceedings of the International Conference on Advances in Social Networks Analysis and Mining. 969–977.
[20]
Alain Hauser and Peter Bühlmann. 2012. Characterization and greedy learning of interventional Markov equivalence classes of directed acyclic graphs. Journal of Machine Learning Research 13, 1 (2012), 2409–2464.
[21]
Patrik O. Hoyer, Dominik Janzing, Joris M. Mooij, Jonas Peters, and Bernhard Schölkopf. 2008. Nonlinear causal discovery with additive noise models. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems. 689–696.
[22]
Shoubo Hu, Zhitang Chen, Vahid Partovi Nia, Lai-Wan Chan, and Yanhui Geng. 2018. Causal inference and mechanism clustering of a mixture of additive noise models. In Proceedings of the Annual Conference on Neural Information Processing Systems. 5212–5222.
[23]
Dominik Janzing, Joris M. Mooij, Kun Zhang, Jan Lemeire, Jakob Zscheischler, Povilas Daniusis, Bastian Steudel, and Bernhard Schölkopf. 2012. Information-geometric approach to inferring causal directions. Artificial Intelligence 182-183 (2012), 1–31.
[24]
Diviyan Kalainathan, Olivier Goudet, and Ritik Dutta. 2020. Causal discovery toolbox: Uncovering causal relationships in python. Journal of Machine Learning Research 21, 1 (2020), 37:1–37:5.
[25]
Markus Kalisch and Peter Bühlman. 2007. Estimating high-dimensional directed acyclic graphs with the PC-algorithm. Journal of Machine Learning Research 8, 3 (2007), 613–636.
[26]
Diederik P. Kingma and Max Welling. 2014. Auto-encoding variational bayes. In Proceedings of the 2nd International Conference on Learning Representations.
[27]
Thuc Duy Le, Tao Hoang, Jiuyong Li, Lin Liu, Huawen Liu, and Shu Hu. 2019. A fast PC algorithm for high dimensional causal discovery with multi-core PCs. IEEE/ACM Transactions on Computational Biology and Bioinformatics 16, 5 (2019), 1483–1495.
[28]
Honghao Li, Vincent Cabeli, Nadir Sella, and Hervé Isambert. 2019. Constraint-based causal structure learning with consistent separating sets. In Proceedings of the Annual Conference on Neural Information Processing Systems 2019. 14257–14266.
[29]
Joris M. Mooij and Dominik Janzing. 2010. Distinguishing between cause and effect. In Proceedings of the Causality: Objectives and Assessment.147–156.
[30]
Joris M. Mooij, Jonas Peters, Dominik Janzing, Jakob Zscheischler, and Bernhard Schölkopf. 2016. Distinguishing cause from effect using observational data: Methods and benchmarks. Journal of Machine Learning Research 17, 1 (2016), 32:1–32:102.
[31]
Warwick J. Nash, Tracy L. Sellers, Simon R. Talbot, Andrew J. Cawthorn, and Wes B. Ford. 1994. The population biology of abalone (Haliotis species) in Tasmania. i. blacklip abalone (h. rubra) from the north coast and islands of bass strait. Sea Fisheries Division, Technical Report 48 (1994), p411.
[32]
Judea Pearl. 2000. Causality: Models, Reasoning, and Inference. Cambridge University Press.
[33]
Judea Pearl. 2014. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Elsevier.
[34]
Kurt Driessens and Saso Dzeroski. 2005. Combining model-based and instance-based learning for first order regression. Machine Learning, Proceedings of the Twenty-Second International Conference (ICML’05, Bonn, Germany, August 7-11, 2005), ACM International Conference Proceeding Series, Vol. 119, ACM, 193–200.
[35]
Karen Sachs, Omar Perez, Dana Pe’er, Douglas A. Lauffenburger, and Garry P. Nolan. 2005. Causal protein-signaling networks derived from multiparameter single-cell data. Science 308, 5721 (2005), 523–529.
[36]
Shohei Shimizu, Patrik O. Hoyer, Aapo Hyvärinen, and Antti J. Kerminen. 2006. A linear non-gaussian acyclic model for causal discovery. Journal of Machine Learning Research 7, 10 (2006), 2003–2030.
[37]
Peter Spirtes, Clark N. Glymour, Richard Scheines, and David Heckerman. 2000. Causation, Prediction, and Search. MIT Press.
[38]
Peter Spirtes, Christopher Meek, and Thomas Richardson. 1999. An algorithm for causal inference in the presence of latent variables and selection bias. Computation, Causation, and Discovery 21 (1999), 1–252.
[39]
Helga Stoyan and Uwe Jansen. 2013. Umweltstatistik: Statistische Verarbeitung und Analyse Von Umweltdaten. Springer-Verlag.
[40]
Xiaohai Sun, Dominik Janzing, Bernhard Schölkopf, and Kenji Fukumizu. 2007. A kernel-based causal learning algorithm. In Proceedings of the 24th International Conference on Machine Learning.855–862.
[41]
Tim Van Erven and Peter Harremos. 2014. Rényi divergence and kullback-leibler divergence. IEEE Transactions on Information Theory 60, 7 (2014), 3797–3820.
[42]
Tyler J. VanderWeele and Weihua An. 2013. Social networks and causal inference. Handbook of Causal Analysis for Social Research (2013), 353–374.
[43]
Thomas Verma and Judea Pearl. 1990. Equivalence and synthesis of causal models. In Proceedings of the 6th Annual Conference on Uncertainty in Artificial Intelligence. 255–270.
[44]
Yixin Wang and David M. Blei. 2019. The blessings of multiple causes. Journal of the American Statistical Association 114, 528 (2019), 1574–1596.
[45]
I-Cheng Yeh. 2006. Analysis of strength of concrete using design of experiments and neural networks. Journal of Materials in Civil Engineering 18, 4 (2006), 597–604.
[46]
Kun Zhang and Aapo Hyvärinen. 2009. On the identifiability of the post-nonlinear causal model. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. 647–655.
[47]
Kun Zhang, Jonas Peters, Dominik Janzing, and Bernhard Schölkopf. 2011. Kernel-based conditional independence test and application in causal discovery. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence. 804–813.
[48]
Kun Zhang, Zhikun Wang, Jiji Zhang, and Bernhard Schölkopf. 2016. On estimation of functional causal models: General results and application to the post-nonlinear causal model. ACM Transactions on Intelligent Systems and Technology 7, 2 (2016), 13:1–13:22.
[49]
Xiangzhou Zhang, Yong Hu, Kang Xie, Shouyang Wang, EWT Ngai, and Mei Liu. 2014. A causal feature selection algorithm for stock prediction modeling. Neurocomputing 142 (2014), 48–59.
[50]
Xinhua Zhang, Le Song, Arthur Gretton, and Alexander J. Smola. 2008. Kernel measures of independence for non-iid data. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems. 1937–1944.

Cited By

View all
  • (2024)Deconfounded Cross-modal Matching for Content-based Micro-video Background Music RecommendationACM Transactions on Intelligent Systems and Technology10.1145/365004215:3(1-25)Online publication date: 15-Apr-2024
  • (2024)Coresets for fast causal discovery with the additive noise modelPattern Recognition10.1016/j.patcog.2023.110149148:COnline publication date: 17-Apr-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 17, Issue 7
August 2023
319 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3589018
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 April 2023
Online AM: 06 March 2023
Accepted: 27 February 2023
Revised: 21 September 2022
Received: 15 February 2022
Published in TKDD Volume 17, Issue 7

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Causal discovery
  2. causal star graph
  3. additive noise model
  4. functional causal model

Qualifiers

  • Research-article

Funding Sources

  • National Key R&D Program of China
  • National Natural Science Fund of China

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)327
  • Downloads (Last 6 weeks)21
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Deconfounded Cross-modal Matching for Content-based Micro-video Background Music RecommendationACM Transactions on Intelligent Systems and Technology10.1145/365004215:3(1-25)Online publication date: 15-Apr-2024
  • (2024)Coresets for fast causal discovery with the additive noise modelPattern Recognition10.1016/j.patcog.2023.110149148:COnline publication date: 17-Apr-2024

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media