Abstract
We present an algorithm for learning correlations among link types and node attributes in relational data that represent complex networks. The link correlations are represented in a Bayes net structure. This provides a succinct graphical way to display relational statistical patterns and support powerful probabilistic inferences. The current state of the art algorithm for learning relational Bayes nets captures only correlations among entity attributes given the existence of links among entities. The models described in this paper capture a wider class of correlations that involve uncertainty about the link structure. Our base line method learns a Bayes net from join tables directly. This is a statistically powerful procedure that finds many correlations, but does not scale well to larger datasets. We compare join table search with a hierarchical search strategy.
Keywords
- Descriptive Attribute
- Markov Logic Network
- Link Correlation
- Heterogeneous Information Network
- Hierarchical Search
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chickering, D.: Optimal structure identification with greedy search. Journal of Machine Learning Research 3, 507–554 (2003)
Domingos, P., Lowd, D.: Markov Logic: An Interface Layer for Artificial Intelligence. Morgan and Claypool Publishers (2009)
Getoor, L., Friedman, N., Koller, D., Pfeffer, A., Taskar, B.: Probabilistic relational models. In: Introduction to Statistical Relational Learning, ch. 5, pp. 129–173. MIT Press (2007)
Getoor, L., Taskar, B., Koller, D.: Selectivity estimation using probabilistic models. ACM SIGMOD Record 30(2), 461–472 (2001)
Heckerman, D., Geiger, D., Chickering, D.: Learning Bayesian networks: The combination of knowledge and statistical data. Machine Learning 20, 197–243 (1995)
Khosravi, H., Man, T., Hu, J., Gao, E., Schulte, O.: Learn and join algorithm code, http://www.cs.sfu.ca/~oschulte/jbn/
Khosravi, H., Schulte, O., Man, T., Xu, X., Bina, B.: Structure learning for Markov logic networks with many descriptive attributes. In: AAAI, pp. 487–493 (2010)
Khot, T., Natarajan, S., Kersting, K., Shavlik, J.W.: Learning Markov logic networks via functional gradient boosting. In: ICDM, pp. 320–329. IEEE Computer Society (2011)
Khot, T., Shavlik, J., Natarajan, S.: Boostr (2013), http://pages.cs.wisc.edu/~tushar/Boostr/
Natarajan, S., Khot, T., Kersting, K., Gutmann, B., Shavlik, J.W.: Gradient-based boosting for statistical relational learning: The relational dependency network case. Machine Learning 86(1), 25–56 (2012)
Neapolitan, R.E.: Learning Bayesian Networks. Pearson Education (2004)
Poole, D.: First-order probabilistic inference. In: IJCAI, pp. 985–991 (2003)
Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach. Prentice Hall (2010)
Schulte, O.: A tractable pseudo-likelihood function for Bayes nets applied to relational data. In: SIAM SDM, pp. 462–473 (2011)
Schulte, O.: Challenge paper: Marginal probabilities for instances and classes. In: ICML-SRL Workshop on Statistical Relational Learning (June 2012)
Schulte, O., Khosravi, H.: Learning graphical models for relational data via lattice search. Machine Learning 88(3), 331–368 (2012)
Schulte, O., Khosravi, H., Kirkpatrick, A., Gao, T., Zhu, Y.: Modelling relational statistics with bayes nets. In: Inductive Logic Programming, ILP (2012)
Schulte, O., Khosravi, H., Kirkpatrick, A., Gao, T., Zhu, Y.: Modelling relational statistics with bayes nets. Machine Learning (2013) (forthcoming)
Sun, Y., Han, J.: Mining Heterogeneous Information Networks: Principles and Methodologies, vol. 3. Morgan & Claypool Publishers (2012)
The Tetrad Group. The Tetrad project (2008), http://www.phil.cmu.edu/projects/tetrad/
Ullman, J.D.: Principles of Database Systems, 2nd edn. W. H. Freeman & Co. (1982)
Yin, X., Han, J., Yang, J., Yu, P.S.: Crossmine: Efficient classification across multiple database relations. In: ICDE, pp. 399–410. IEEE Computer Society (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Qian, Z., Schulte, O. (2014). Learning Bayes Nets for Relational Data with Link Uncertainty. In: Croitoru, M., Rudolph, S., Woltran, S., Gonzales, C. (eds) Graph Structures for Knowledge Representation and Reasoning. Lecture Notes in Computer Science(), vol 8323. Springer, Cham. https://doi.org/10.1007/978-3-319-04534-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-04534-4_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04533-7
Online ISBN: 978-3-319-04534-4
eBook Packages: Computer ScienceComputer Science (R0)