Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1281192.1281203acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Temporal causal modeling with graphical granger methods

Published: 12 August 2007 Publication History

Abstract

The need for mining causality, beyond mere statistical correlations, for real world problems has been recognized widely. Many of these applications naturally involve temporal data, which raises the challenge of how best to leverage the temporal information for causal modeling. Recently graphical modeling with the concept of "Granger causality", based on the intuition that a cause helps predict its effects in the future, has gained attention in many domains involving time series data analysis. With the surge of interest in model selection methodologies for regression, such as the Lasso, as practical alternatives to solving structural learning of graphical models, the question arises whether and how to combine these two notions into a practically viable approach for temporal causal modeling. In this paper, we examine a host of related algorithms that, loosely speaking, fall under the category of graphical Granger methods, and characterize their relative performance from multiple viewpoints. Our experiments show, for instance, that the Lasso algorithm exhibits consistent gain over the canonical pairwise graphical Granger method. We also characterize conditions under which these variants of graphical Granger methods perform well in comparison to other benchmark methods. Finally, we apply these methods to a real world data set involving key performance indicators of corporations, and present some concrete results.

Supplementary Material

JPG File (p66-arnold-200.jpg)
JPG File (p66-arnold-768.jpg)
Low Resolution (p66-arnold-200.mov)
High Resolution (p66-arnold-768.mov)

References

[1]
T. Chu and C. Glymour. Semi-parametric Causal Inference for Nonlinear Time Series Data. J. of Machine Learning Res., submitted, 2006.
[2]
T. Chu, D. Danks, and C. Glymour. Data Driven Methods for Nonlinear Granger Causality: Climate Teleconnection Mechanisms, 2004.
[3]
D. Coppersmith and S. Winograd. Matrix multiplication via arithmetic progressions. Journal of Symbolic Computation, 9:251280, 1990.
[4]
A. Dobra, B. Jones, C. Hans, J. Nevins, M. West. Sparse graphical models for exploring gene expression data. J. of Multivariate Analysis, special issue on Multivariate Methods in Genomic Data Analysis, 90, 196--212, 2003.
[5]
M. Drton and M. D. Perlman. A SINful Approach to Gaussian Graphical Model Selection. Department of Statistics, University of Washington, Technical Report 457, 2004.
[6]
B. Efron, T. Hastie, I. Johnstone and R. Tibshirani. Least Angle Regression (with discussion). Annals of Statistics, 2003.
[7]
M. Eichler. Graphical modeling of multivariate time series with latent variables. Preprint, Universiteit Maastricht, 2006.
[8]
N. Friedman, I. Nachman, and D. Peer. Learning Bayesian network structure from massive datasets: The "sparse candidate" algorithm. In (UAI'99), 1999.
[9]
P. D. Gilbert. Combining VAR Estimation and State Space Model Reduction for Simple Good Predictions. J. of Forecasting: Special Issue on VAR Modelling, 14:229250, 1995.
[10]
G. Golub, M. Heath, and G. Wahba. Generalized cross validation as a method for choosing a good ridge parameter. Technometrics, 21: 215--224.
[11]
C. W. J. Granger. Investigating causal relations by econometric models and cross-spectral methods. Econometrica, 37: 424--438l, 1969.
[12]
D. Heckerman. A Tutorial on Learning with Bayesian Networks, In Learning in Graphical Models, M. Jordan, ed., MIT Press, Cambridge, MA, 1999.
[13]
P. O. Hoyer, S. Shimizu, and A. J. Kerminen. Estimation of linear, non-gaussian causal models in the presence of confounding latent variables. In (PGM'06), pp. 155--162, 2006.
[14]
M. Kalisch and P. Buehlmann. Estimating high-dimensional directed acyclic graphs with the PC algorithm. Technical report No. 130, ETH Zurich, 2005.
[15]
R. Kaplan and D. Norton. The Balanced Scorecard - Measures that Drive Performance. Harvard Business Review. 71--79, 1992.
[16]
R. Kaplan and D. Norton. The Balanced Scorecard: Translating Strategy into Action. Harvard Business School Press. Boston, MA, 1996.
[17]
S. Lauritzen. Graphical Models. Oxford University Press. 1996.
[18]
C. Meek. Graphical Models: Selecting Causal and Statistical Models. PhD thesis, Carnegie Mellon University, Philosophy Department, 1996.
[19]
N. Meinshausen and P. Buhlmann. High dimensional graphs and variable selection with the Lasso. Annals of Statistics, 34(3), 1436--1462, 2006.
[20]
A. Moneta and P. Spirtes. Graphical models for the identification of causal structures in multivariate time series models. In Proc. Fifth Intl. Conf. on Computational Intelligence in Economics and Finance, 2006.
[21]
R. Opgen-Rhein and K. Strimmer. Learning causal networks from systems biology time course data: an effective model selection procedure for the vector autoregressive process. BMC Bioinformatics 8 (suppl.) in press, 2007.
[22]
J. Pearl. Causality. Cambridge University Press, Cambridge, UK, 2000.
[23]
S. Roweis and Z. Ghahramani. A Unifying Review of Linear Gaussian Models, Neural Computation, Vol. 11, No. 2, 305--345, 1999.
[24]
S. Shimizu, A. Hyvärinen, P. O. Hoyer, and Y. Kano. Finding a causal ordering via independent component analysis. Computational Statistics & Data Analysis, 50(11): 3278--3293, 2006.
[25]
R. Silva, R. Scheines, C. Glymour, P. Spirtes. Learning the structure of linear latent variable models. J. of Machine Learning Res., 7(Feb): 191--246, 2006.
[26]
P. Spirtes, C. Glymour, and R. Scheines. Causation, Prediction, and Search. MIT Press, New York, NY, second edition, 2000.
[27]
Standard & Poor's Compustat Data. Available from http://www.compustat.com.
[28]
R. Scheines, P. Spirtes, C. Glymour, and C. Meek". TETRAD II: Tools for Discovery. Lawrence Erlbaum Associates, Hillsdale, NJ. 1994.
[29]
R. Tibshirani. Regression shrinkage and selection via the lasso. J. Royal. Statist. Soc B., Vol. 58, No. 1, pages 267--288, 1996.
[30]
P. A. Valdes-Sosa et. al. Estimating brain functional connectivity with sparse multivariate autoregression. Philos Trans R Soc Lond B Biol Sci. 2005 May 29; 360(1457): 969--81, 2005.

Cited By

View all
  • (2025)Causal similarity learning with multi-level predictive relation aggregation for grouped root cause diagnosis of industrial faultsControl Engineering Practice10.1016/j.conengprac.2024.106140154(106140)Online publication date: Jan-2025
  • (2025)Visual causal analysis of multivariate time seriesJournal of Visualization10.1007/s12650-024-01041-6Online publication date: 8-Jan-2025
  • (2024)Implicit-Causality-Exploration-Enabled Graph Neural Network for Stock PredictionInformation10.3390/info1512074315:12(743)Online publication date: 21-Nov-2024
  • Show More Cited By

Index Terms

  1. Temporal causal modeling with graphical granger methods

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2007
    1080 pages
    ISBN:9781595936097
    DOI:10.1145/1281192
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2007

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. causal modeling
    2. graphical models
    3. time series data

    Qualifiers

    • Article

    Conference

    KDD07

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)97
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 11 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Causal similarity learning with multi-level predictive relation aggregation for grouped root cause diagnosis of industrial faultsControl Engineering Practice10.1016/j.conengprac.2024.106140154(106140)Online publication date: Jan-2025
    • (2025)Visual causal analysis of multivariate time seriesJournal of Visualization10.1007/s12650-024-01041-6Online publication date: 8-Jan-2025
    • (2024)Implicit-Causality-Exploration-Enabled Graph Neural Network for Stock PredictionInformation10.3390/info1512074315:12(743)Online publication date: 21-Nov-2024
    • (2024)Towards the disappearing truthProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i15.29662(17167-17174)Online publication date: 20-Feb-2024
    • (2024)Causal Discovery from Temporal Data: An Overview and New PerspectivesACM Computing Surveys10.1145/370529757:4(1-38)Online publication date: 23-Nov-2024
    • (2024)(Vision Paper) A Vision for Spatio-Causal Situation Awareness, Forecasting, and PlanningACM Transactions on Spatial Algorithms and Systems10.1145/367255610:2(1-42)Online publication date: 1-Jul-2024
    • (2024)Learning Flexible Time-windowed Granger Causality Integrating Heterogeneous Interventional Time Series DataProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3672023(4408-4418)Online publication date: 25-Aug-2024
    • (2024)Forecasting the Impact of Anomalous Events on Business Process PerformanceProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632432(266-274)Online publication date: 4-Jan-2024
    • (2024)CausalMMM: Learning Causal Structure for Marketing Mix ModelingProceedings of the 17th ACM International Conference on Web Search and Data Mining10.1145/3616855.3635766(238-246)Online publication date: 4-Mar-2024
    • (2024)KPIRoot: Efficient Monitoring Metric-based Root Cause Localization in Large-scale Cloud Systems2024 IEEE 35th International Symposium on Software Reliability Engineering (ISSRE)10.1109/ISSRE62328.2024.00046(403-414)Online publication date: 28-Oct-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media