Abstract
This paper addresses causal inference and modeling over event streams where data have high throughput and are unbounded. The availability of large amount of data along with the high data throughput present several new challenges related to causal modeling, such as the need for fast causal inference operations while ensuring consistent and valid results. There is no existing work specifically for such a streaming environment. We meet the challenges by introducing a time-centric causal inference strategy that leverages temporal precedence information to decrease the number of conditional independence tests required to establish the dependencies between the variables in a causal network. Dependency and temporal precedence of cause over effect are the two properties of a causal relationship. We also present the Temporal Network Inference algorithm to model the temporal precedence relations into a temporal network. Then, we propose the Fast Causal Network Inference algorithm for faster learning of causal network using the temporal network. Experiments using synthetic and real datasets demonstrate the efficacy of the proposed algorithms.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Heckerman, D.: A Bayesian approach to learning causal networks. In: UAI, pp. 285–295 (1995)
Ellis, B., Wong, W.H.: Learning causal Bayesian network structures from experimental data. J. American Statistics Association 103(482), 778–789 (2008)
Li, G., Leong, T.-Y.: Active learning for causal Bayesian network structure with non-symmetrical entropy. In: Theeramunkong, T., Kijsirikul, B., Cercone, N., Ho, T.-B. (eds.) PAKDD 2009. LNCS, vol. 5476, pp. 290–301. Springer, Heidelberg (2009)
Meganck, S., Leray, P., Manderick, B.: Learning causal Bayesian networks from observations and experiments: a decision theoretic approach. In: Torra, V., Narukawa, Y., Valls, A., Domingo-Ferrer, J. (eds.) MDAI 2006. LNCS (LNAI), vol. 3885, pp. 58–69. Springer, Heidelberg (2006)
Pearl, J.: Causality: Models, Reasoning and Inference, 2nd edn. Cambridge University Press (2009)
Spirtes, P., Glymour, C.N., Scheines, R.: Causality from probability. In: ACSS (1990)
Spirtes, P., Glymour, C., Scheines, R.: Causation, Prediction, and Search. MIT Press (2000)
Cheng, J., Greiner, R., Kelly, J., Bell, D., Liu, W.: Learning Bayesian networks from data: an information-theory based approach. Artificial Intelligence 137(1-2), 43–90 (2002)
Pearl, J.: Causal diagrams for empirical research. Biometrika 82, 669–688 (1995)
Popper, K.: The Logic of Scientific Discovery, Reprint edn. Routledge (October 1992)
Chickering, D.M.: Learning equivalence classes of Bayesian-network structures. J. Machine Learning Research 2, 445–498 (2002)
de Campos, L.M.: A scoring function for learning Bayesian networks based on mutual information and conditional independence tests. J. Machine Learning Research 7, 2149–2187 (2006)
Bishop, Y.M., Fienberg, S.E., Holland, P.W.: Discrete Multivariate Analysis: Theory and Practice. MIT Press (1975)
Kullback, S.: Information Theory and Statistics, 2nd edn. Dover Publication (1968)
Spirtes, P., Meek, C.: Learning Bayesian networks with discrete variables from data. In: KDD, pp. 294–299 (1995)
Pearl, J.: Graphs, causality, and structural equation models. Sociological Methods and Research 27, 226–284 (1998)
Tsamardinos, I., Brown, L.E., Aliferis, C.F.: The max-min hill-climbing Bayesian network structure learning algorithm. Mach. Learn. 65(1), 31–78 (2006)
Frank, A., Asuncion, A.: UCI machine learning repository (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag GmbH Berlin Heidelberg
About this paper
Cite this paper
Acharya, S., Lee, B.S. (2013). Fast Causal Network Inference over Event Streams. In: Bellatreche, L., Mohania, M.K. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2013. Lecture Notes in Computer Science, vol 8057. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40131-2_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-40131-2_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40130-5
Online ISBN: 978-3-642-40131-2
eBook Packages: Computer ScienceComputer Science (R0)