skip to main content
research-article

Causal Discovery via Causal Star Graphs

Published:14 April 2023Publication History
Skip Abstract Section

Abstract

Discovering causal relationships among observed variables is an important research focus in data mining. Existing causal discovery approaches are mainly based on constraint-based methods and functional causal models (FCMs). However, the constraint-based method cannot identify the Markov equivalence class and the functional causal models cannot identify the complex interrelationships when multiple variables affect one variable. To address the two aforementioned problems, we propose a new graph structure Causal Star Graph (CSG) and a corresponding framework Causal Discovery via Causal Star Graphs (CD-CSG) to divide a causal directed acyclic graph into multiple CSGs for causal discovery. In this framework, we also propose a generalized learning in CSGs based on a variational approach to learn the representative intermediate variable of CSG’s non-central variables. Through the generalized learning in CSGs, the asymmetry in the forward and backward model of CD-CSG can be found to identify the causal directions in the directed acyclic graphs. We further divide the CSGs into three categories and provide the causal identification principle under each category in our proposed framework. Experiments using synthetic data show that the causal relationships between variables can be effectively identified with CD-CSG and the accuracy of CD-CSG is higher than the best existing model. By applying CD-CSG to real-world data, our proposed method can greatly augment the applicability and effectiveness of causal discovery.

REFERENCES

  1. [1] Andersson Steen A., Madigan David, and Perlman Michael D.. 1997. A characterization of Markov equivalence classes for acyclic digraphs. The Annals of Statistics 25, 2 (1997), 505541.Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Angrist Joshua D. and Imbens Guido W.. 1995. Two-stage least squares estimation of average causal effects in models with variable treatment intensity. Journal of the American Statistical Association 90, 430 (1995), 431442.Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Angrist Joshua D., Imbens Guido W., and Rubin Donald B.. 1996. Identification of causal effects using instrumental variables. Journal of the American Statistical Association 91, 434 (1996), 444455.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Bentler Peter M. and Weeks David G.. 1980. Linear structural equations with latent variables. Psychometrika 45, 3 (1980), 289308.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Blöbaum Patrick, Janzing Dominik, Washio Takashi, Shimizu Shohei, and Schölkopf Bernhard. 2018. Cause-effect inference by comparing regression errors. In Proceedings of the International Conference on Artificial Intelligence and Statistics. PMLR, 900909.Google ScholarGoogle Scholar
  6. [6] Bromberg Facundo and Margaritis Dimitris. 2009. Improving the reliability of causal discovery from small datasets using argumentation. Journal of Machine Learning Research 10, 2 (2009), 141–180.Google ScholarGoogle Scholar
  7. [7] Bühlmann Peter, Peters Jonas, and Ernest Jan. 2014. CAM: Causal additive models, high-dimensional order search and penalized regression. The Annals of Statistics 42, 6 (2014), 25262556.Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Cai Ruichu, Chen Wei, Zhang Kun, and Hao Zhifeng. 2017. A survey on non-temporal series observational data based causal discovery. Chinese Journal of Computers 40, 6 (2017), 14701490.Google ScholarGoogle Scholar
  9. [9] Cai Ruichu, Qiao Jie, Zhang Kun, Zhang Zhenjie, and Hao Zhifeng. 2019. Causal discovery with cascade nonlinear additive noise model. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 16091615.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Chang Billy, Krüger Uwe, Kustra Rafal, and Zhang Junping. 2013. Canonical correlation analysis based on Hilbert–Schmidt independence criterion and centered kernel target alignment. In Proceedings of the 30th International Conference on Machine Learning.316324.Google ScholarGoogle Scholar
  11. [11] Chickering David Maxwell. 2002. Optimal structure identification with greedy search. Journal of Machine Learning Research 3, Nov. (2002), 507554.Google ScholarGoogle Scholar
  12. [12] Colombo Diego and Maathuis Marloes H.. 2014. Order-independent constraint-based causal structure learning. Journal of Machine Learning Research 15, 1 (2014), 37413782.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Colombo Diego and Maathuis Marloes H.. 2014. Order-independent constraint-based causal structure learning. Journal of Machine Learning Research 15, 1 (2014), 37413782.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Colombo Diego, Maathuis Marloes H., Kalisch Markus, and Richardson Thomas S.. 2011. Learning high-dimensional DAGs with latent and selection variables. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence. 850.Google ScholarGoogle Scholar
  15. [15] Cooper Gregory F. and Yoo Changwon. 1999. Causal discovery from a mixture of experimental and observational data. In Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence. 116125.Google ScholarGoogle Scholar
  16. [16] Daniusis Povilas, Janzing Dominik, Mooij Joris M., Zscheischler Jakob, Steudel Bastian, Zhang Kun, and Schölkopf Bernhard. 2010. Inferring deterministic causal relations. In Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence. 143150.Google ScholarGoogle Scholar
  17. [17] Fonollosa José A. R.. 2019. Conditional distribution variability measures for causality detection. In Proceedings of the Cause Effect Pairs in Machine Learning. Springer, 339347.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Guvenir H. Altay, Acar Burak, Demiroz Gulsen, and Cekin Ayhan. 1997. A supervised machine learning algorithm for arrhythmia analysis. In Proceedings of the Computers in Cardiology 1997. 433436.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Hauffa Jan, Bräu Wolfgang, and Groh Georg. 2019. Detection of topical influence in social networks via granger-causal inference: A Twitter case study. In Proceedings of the International Conference on Advances in Social Networks Analysis and Mining. 969977.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. [20] Hauser Alain and Bühlmann Peter. 2012. Characterization and greedy learning of interventional Markov equivalence classes of directed acyclic graphs. Journal of Machine Learning Research 13, 1 (2012), 24092464.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] Hoyer Patrik O., Janzing Dominik, Mooij Joris M., Peters Jonas, and Schölkopf Bernhard. 2008. Nonlinear causal discovery with additive noise models. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems. 689696.Google ScholarGoogle Scholar
  22. [22] Hu Shoubo, Chen Zhitang, Nia Vahid Partovi, Chan Lai-Wan, and Geng Yanhui. 2018. Causal inference and mechanism clustering of a mixture of additive noise models. In Proceedings of the Annual Conference on Neural Information Processing Systems. 52125222.Google ScholarGoogle Scholar
  23. [23] Janzing Dominik, Mooij Joris M., Zhang Kun, Lemeire Jan, Zscheischler Jakob, Daniusis Povilas, Steudel Bastian, and Schölkopf Bernhard. 2012. Information-geometric approach to inferring causal directions. Artificial Intelligence 182-183 (2012), 131.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Kalainathan Diviyan, Goudet Olivier, and Dutta Ritik. 2020. Causal discovery toolbox: Uncovering causal relationships in python. Journal of Machine Learning Research 21, 1 (2020), 37:1–37:5.Google ScholarGoogle Scholar
  25. [25] Kalisch Markus and Bühlman Peter. 2007. Estimating high-dimensional directed acyclic graphs with the PC-algorithm. Journal of Machine Learning Research 8, 3 (2007), 613636.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Kingma Diederik P. and Welling Max. 2014. Auto-encoding variational bayes. In Proceedings of the 2nd International Conference on Learning Representations.Google ScholarGoogle Scholar
  27. [27] Le Thuc Duy, Hoang Tao, Li Jiuyong, Liu Lin, Liu Huawen, and Hu Shu. 2019. A fast PC algorithm for high dimensional causal discovery with multi-core PCs. IEEE/ACM Transactions on Computational Biology and Bioinformatics 16, 5 (2019), 14831495.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Li Honghao, Cabeli Vincent, Sella Nadir, and Isambert Hervé. 2019. Constraint-based causal structure learning with consistent separating sets. In Proceedings of the Annual Conference on Neural Information Processing Systems 2019. 1425714266.Google ScholarGoogle Scholar
  29. [29] Mooij Joris M. and Janzing Dominik. 2010. Distinguishing between cause and effect. In Proceedings of the Causality: Objectives and Assessment.147156.Google ScholarGoogle Scholar
  30. [30] Mooij Joris M., Peters Jonas, Janzing Dominik, Zscheischler Jakob, and Schölkopf Bernhard. 2016. Distinguishing cause from effect using observational data: Methods and benchmarks. Journal of Machine Learning Research 17, 1 (2016), 32:1–32:102.Google ScholarGoogle Scholar
  31. [31] Nash Warwick J., Sellers Tracy L., Talbot Simon R., Cawthorn Andrew J., and Ford Wes B.. 1994. The population biology of abalone (Haliotis species) in Tasmania. i. blacklip abalone (h. rubra) from the north coast and islands of bass strait. Sea Fisheries Division, Technical Report 48 (1994), p411.Google ScholarGoogle Scholar
  32. [32] Pearl Judea. 2000. Causality: Models, Reasoning, and Inference. Cambridge University Press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. [33] Pearl Judea. 2014. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Elsevier.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Driessens Kurt and Dzeroski Saso. 2005. Combining model-based and instance-based learning for first order regression. Machine Learning, Proceedings of the Twenty-Second International Conference (ICML’05, Bonn, Germany, August 7-11, 2005), ACM International Conference Proceeding Series, Vol. 119, ACM, 193–200.Google ScholarGoogle Scholar
  35. [35] Sachs Karen, Perez Omar, Pe’er Dana, Lauffenburger Douglas A., and Nolan Garry P.. 2005. Causal protein-signaling networks derived from multiparameter single-cell data. Science 308, 5721 (2005), 523529.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Shimizu Shohei, Hoyer Patrik O., Hyvärinen Aapo, and Kerminen Antti J.. 2006. A linear non-gaussian acyclic model for causal discovery. Journal of Machine Learning Research 7, 10 (2006), 20032030.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. [37] Spirtes Peter, Glymour Clark N., Scheines Richard, and Heckerman David. 2000. Causation, Prediction, and Search. MIT Press.Google ScholarGoogle Scholar
  38. [38] Spirtes Peter, Meek Christopher, and Richardson Thomas. 1999. An algorithm for causal inference in the presence of latent variables and selection bias. Computation, Causation, and Discovery 21 (1999), 1252.Google ScholarGoogle Scholar
  39. [39] Stoyan Helga and Jansen Uwe. 2013. Umweltstatistik: Statistische Verarbeitung und Analyse Von Umweltdaten. Springer-Verlag.Google ScholarGoogle Scholar
  40. [40] Sun Xiaohai, Janzing Dominik, Schölkopf Bernhard, and Fukumizu Kenji. 2007. A kernel-based causal learning algorithm. In Proceedings of the 24th International Conference on Machine Learning.855862.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Erven Tim Van and Harremos Peter. 2014. Rényi divergence and kullback-leibler divergence. IEEE Transactions on Information Theory 60, 7 (2014), 37973820.Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] VanderWeele Tyler J. and An Weihua. 2013. Social networks and causal inference. Handbook of Causal Analysis for Social Research (2013), 353374.Google ScholarGoogle ScholarCross RefCross Ref
  43. [43] Verma Thomas and Pearl Judea. 1990. Equivalence and synthesis of causal models. In Proceedings of the 6th Annual Conference on Uncertainty in Artificial Intelligence. 255270.Google ScholarGoogle Scholar
  44. [44] Wang Yixin and Blei David M.. 2019. The blessings of multiple causes. Journal of the American Statistical Association 114, 528 (2019), 15741596.Google ScholarGoogle ScholarCross RefCross Ref
  45. [45] Yeh I-Cheng. 2006. Analysis of strength of concrete using design of experiments and neural networks. Journal of Materials in Civil Engineering 18, 4 (2006), 597604.Google ScholarGoogle ScholarCross RefCross Ref
  46. [46] Zhang Kun and Hyvärinen Aapo. 2009. On the identifiability of the post-nonlinear causal model. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. 647655.Google ScholarGoogle Scholar
  47. [47] Zhang Kun, Peters Jonas, Janzing Dominik, and Schölkopf Bernhard. 2011. Kernel-based conditional independence test and application in causal discovery. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence. 804813.Google ScholarGoogle Scholar
  48. [48] Zhang Kun, Wang Zhikun, Zhang Jiji, and Schölkopf Bernhard. 2016. On estimation of functional causal models: General results and application to the post-nonlinear causal model. ACM Transactions on Intelligent Systems and Technology 7, 2 (2016), 13:1–13:22.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. [49] Zhang Xiangzhou, Hu Yong, Xie Kang, Wang Shouyang, Ngai EWT, and Liu Mei. 2014. A causal feature selection algorithm for stock prediction modeling. Neurocomputing 142 (2014), 4859.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. [50] Zhang Xinhua, Song Le, Gretton Arthur, and Smola Alexander J.. 2008. Kernel measures of independence for non-iid data. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems. 19371944.Google ScholarGoogle Scholar

Index Terms

  1. Causal Discovery via Causal Star Graphs

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Knowledge Discovery from Data
        ACM Transactions on Knowledge Discovery from Data  Volume 17, Issue 7
        August 2023
        319 pages
        ISSN:1556-4681
        EISSN:1556-472X
        DOI:10.1145/3589018
        Issue’s Table of Contents

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 14 April 2023
        • Online AM: 6 March 2023
        • Accepted: 27 February 2023
        • Revised: 21 September 2022
        • Received: 15 February 2022
        Published in tkdd Volume 17, Issue 7

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
      • Article Metrics

        • Downloads (Last 12 months)406
        • Downloads (Last 6 weeks)36

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      View Full Text

      HTML Format

      View this article in HTML Format .

      View HTML Format