skip to main content
10.1145/1281192.1281203acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Temporal causal modeling with graphical granger methods

Published: 12 August 2007 Publication History

Abstract

The need for mining causality, beyond mere statistical correlations, for real world problems has been recognized widely. Many of these applications naturally involve temporal data, which raises the challenge of how best to leverage the temporal information for causal modeling. Recently graphical modeling with the concept of "Granger causality", based on the intuition that a cause helps predict its effects in the future, has gained attention in many domains involving time series data analysis. With the surge of interest in model selection methodologies for regression, such as the Lasso, as practical alternatives to solving structural learning of graphical models, the question arises whether and how to combine these two notions into a practically viable approach for temporal causal modeling. In this paper, we examine a host of related algorithms that, loosely speaking, fall under the category of graphical Granger methods, and characterize their relative performance from multiple viewpoints. Our experiments show, for instance, that the Lasso algorithm exhibits consistent gain over the canonical pairwise graphical Granger method. We also characterize conditions under which these variants of graphical Granger methods perform well in comparison to other benchmark methods. Finally, we apply these methods to a real world data set involving key performance indicators of corporations, and present some concrete results.

Supplementary Material

JPG File (p66-arnold-200.jpg)
JPG File (p66-arnold-768.jpg)
Low Resolution (p66-arnold-200.mov)
High Resolution (p66-arnold-768.mov)

References

[1]
T. Chu and C. Glymour. Semi-parametric Causal Inference for Nonlinear Time Series Data. J. of Machine Learning Res., submitted, 2006.
[2]
T. Chu, D. Danks, and C. Glymour. Data Driven Methods for Nonlinear Granger Causality: Climate Teleconnection Mechanisms, 2004.
[3]
D. Coppersmith and S. Winograd. Matrix multiplication via arithmetic progressions. Journal of Symbolic Computation, 9:251280, 1990.
[4]
A. Dobra, B. Jones, C. Hans, J. Nevins, M. West. Sparse graphical models for exploring gene expression data. J. of Multivariate Analysis, special issue on Multivariate Methods in Genomic Data Analysis, 90, 196--212, 2003.
[5]
M. Drton and M. D. Perlman. A SINful Approach to Gaussian Graphical Model Selection. Department of Statistics, University of Washington, Technical Report 457, 2004.
[6]
B. Efron, T. Hastie, I. Johnstone and R. Tibshirani. Least Angle Regression (with discussion). Annals of Statistics, 2003.
[7]
M. Eichler. Graphical modeling of multivariate time series with latent variables. Preprint, Universiteit Maastricht, 2006.
[8]
N. Friedman, I. Nachman, and D. Peer. Learning Bayesian network structure from massive datasets: The "sparse candidate" algorithm. In (UAI'99), 1999.
[9]
P. D. Gilbert. Combining VAR Estimation and State Space Model Reduction for Simple Good Predictions. J. of Forecasting: Special Issue on VAR Modelling, 14:229250, 1995.
[10]
G. Golub, M. Heath, and G. Wahba. Generalized cross validation as a method for choosing a good ridge parameter. Technometrics, 21: 215--224.
[11]
C. W. J. Granger. Investigating causal relations by econometric models and cross-spectral methods. Econometrica, 37: 424--438l, 1969.
[12]
D. Heckerman. A Tutorial on Learning with Bayesian Networks, In Learning in Graphical Models, M. Jordan, ed., MIT Press, Cambridge, MA, 1999.
[13]
P. O. Hoyer, S. Shimizu, and A. J. Kerminen. Estimation of linear, non-gaussian causal models in the presence of confounding latent variables. In (PGM'06), pp. 155--162, 2006.
[14]
M. Kalisch and P. Buehlmann. Estimating high-dimensional directed acyclic graphs with the PC algorithm. Technical report No. 130, ETH Zurich, 2005.
[15]
R. Kaplan and D. Norton. The Balanced Scorecard - Measures that Drive Performance. Harvard Business Review. 71--79, 1992.
[16]
R. Kaplan and D. Norton. The Balanced Scorecard: Translating Strategy into Action. Harvard Business School Press. Boston, MA, 1996.
[17]
S. Lauritzen. Graphical Models. Oxford University Press. 1996.
[18]
C. Meek. Graphical Models: Selecting Causal and Statistical Models. PhD thesis, Carnegie Mellon University, Philosophy Department, 1996.
[19]
N. Meinshausen and P. Buhlmann. High dimensional graphs and variable selection with the Lasso. Annals of Statistics, 34(3), 1436--1462, 2006.
[20]
A. Moneta and P. Spirtes. Graphical models for the identification of causal structures in multivariate time series models. In Proc. Fifth Intl. Conf. on Computational Intelligence in Economics and Finance, 2006.
[21]
R. Opgen-Rhein and K. Strimmer. Learning causal networks from systems biology time course data: an effective model selection procedure for the vector autoregressive process. BMC Bioinformatics 8 (suppl.) in press, 2007.
[22]
J. Pearl. Causality. Cambridge University Press, Cambridge, UK, 2000.
[23]
S. Roweis and Z. Ghahramani. A Unifying Review of Linear Gaussian Models, Neural Computation, Vol. 11, No. 2, 305--345, 1999.
[24]
S. Shimizu, A. Hyvärinen, P. O. Hoyer, and Y. Kano. Finding a causal ordering via independent component analysis. Computational Statistics & Data Analysis, 50(11): 3278--3293, 2006.
[25]
R. Silva, R. Scheines, C. Glymour, P. Spirtes. Learning the structure of linear latent variable models. J. of Machine Learning Res., 7(Feb): 191--246, 2006.
[26]
P. Spirtes, C. Glymour, and R. Scheines. Causation, Prediction, and Search. MIT Press, New York, NY, second edition, 2000.
[27]
Standard & Poor's Compustat Data. Available from http://www.compustat.com.
[28]
R. Scheines, P. Spirtes, C. Glymour, and C. Meek". TETRAD II: Tools for Discovery. Lawrence Erlbaum Associates, Hillsdale, NJ. 1994.
[29]
R. Tibshirani. Regression shrinkage and selection via the lasso. J. Royal. Statist. Soc B., Vol. 58, No. 1, pages 267--288, 1996.
[30]
P. A. Valdes-Sosa et. al. Estimating brain functional connectivity with sparse multivariate autoregression. Philos Trans R Soc Lond B Biol Sci. 2005 May 29; 360(1457): 969--81, 2005.

Cited By

View all
  • (2025)Sparse estimation of parameter support sets for generalized vector autoregressions by resampling and model aggregationJournal of Statistical Computation and Simulation10.1080/00949655.2025.2469905(1-16)Online publication date: 5-Mar-2025
  • (2025)Causal similarity learning with multi-level predictive relation aggregation for grouped root cause diagnosis of industrial faultsControl Engineering Practice10.1016/j.conengprac.2024.106140154(106140)Online publication date: Jan-2025
  • (2025)Visual causal analysis of multivariate time seriesJournal of Visualization10.1007/s12650-024-01041-6Online publication date: 8-Jan-2025
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
August 2007
1080 pages
ISBN:9781595936097
DOI:10.1145/1281192
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 August 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. causal modeling
  2. graphical models
  3. time series data

Qualifiers

  • Article

Conference

KDD07

Acceptance Rates

KDD '07 Paper Acceptance Rate 111 of 573 submissions, 19%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)95
  • Downloads (Last 6 weeks)9
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Sparse estimation of parameter support sets for generalized vector autoregressions by resampling and model aggregationJournal of Statistical Computation and Simulation10.1080/00949655.2025.2469905(1-16)Online publication date: 5-Mar-2025
  • (2025)Causal similarity learning with multi-level predictive relation aggregation for grouped root cause diagnosis of industrial faultsControl Engineering Practice10.1016/j.conengprac.2024.106140154(106140)Online publication date: Jan-2025
  • (2025)Visual causal analysis of multivariate time seriesJournal of Visualization10.1007/s12650-024-01041-6Online publication date: 8-Jan-2025
  • (2024)Implicit-Causality-Exploration-Enabled Graph Neural Network for Stock PredictionInformation10.3390/info1512074315:12(743)Online publication date: 21-Nov-2024
  • (2024)Towards the disappearing truthProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i15.29662(17167-17174)Online publication date: 20-Feb-2024
  • (2024)Causal Discovery from Temporal Data: An Overview and New PerspectivesACM Computing Surveys10.1145/370529757:4(1-38)Online publication date: 23-Nov-2024
  • (2024)(Vision Paper) A Vision for Spatio-Causal Situation Awareness, Forecasting, and PlanningACM Transactions on Spatial Algorithms and Systems10.1145/367255610:2(1-42)Online publication date: 1-Jul-2024
  • (2024)Learning Flexible Time-windowed Granger Causality Integrating Heterogeneous Interventional Time Series DataProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3672023(4408-4418)Online publication date: 25-Aug-2024
  • (2024)Forecasting the Impact of Anomalous Events on Business Process PerformanceProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632432(266-274)Online publication date: 4-Jan-2024
  • (2024)CausalMMM: Learning Causal Structure for Marketing Mix ModelingProceedings of the 17th ACM International Conference on Web Search and Data Mining10.1145/3616855.3635766(238-246)Online publication date: 4-Mar-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media