Generative Clausal Networks: Relational Decision Trees as Probabilistic Circuits

Ventola, Fabrizio; Dhami, Devendra Singh; Kersting, Kristian

doi:10.1007/978-3-030-97454-1_18

Generative Clausal Networks: Relational Decision Trees as Probabilistic Circuits

Fabrizio Ventola¹⁰,
Devendra Singh Dhami¹⁰ &
Kristian Kersting^10,11

Conference paper
First Online: 24 February 2022

397 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13191))

Abstract

In many real-world applications, the i.i.d. assumption does not hold and thus capturing the interactions between instances is essential for the task at hand. Recently, a clear connection between predictive modelling such as decision trees and probabilistic circuits, a form of deep probabilistic model, has been established although it is limited to propositional data. We introduce the first connection between relational rule models and probabilistic circuits, obtaining tractable inference from discriminative rule models while operating on the relational domain. Specifically, given a relational rule model, we make use of Mixed Sum-Product Networks (MSPNs)—a deep probabilistic architecture for hybrid domains—to equip them with a full joint distribution over the class and how (often) the rules fire. Our empirical evaluation shows that we can answer a wide range of probabilistic queries on relational data while being robust to missing, out-of-domain data and partial counts. We show that our method generalizes to different distributions outperforming strong baselines. Moreover, due to the clear probabilistic semantics of MSPNs we have informative model interpretations.

F. Ventola and D. S. Dhami—Equal contribution.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 69.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
We are not strict on “density” vs. “distribution”.
2.
A set of random variables is finitely exchangeable with respect to a joint distribution P, if all permutations of the variables result in the same joint probabilities. Note that finite exchangeable does not require independence; the random variables can have strong dependencies.
3.
In the figure and in the following text \(\boldsymbol{1}()\) represents an indicator function.
4.
www.drugbank.ca.
5.
https://github.com/SPFlow/SPFlow.
6.
Both Scikit-learn implementation with default hyperparameters.

References

Altman, N.S.: An introduction to kernel and nearest-neighbor nonparametric regression. The Am. Stat. (1992)
Google Scholar
Austin, P.C., White, I.R., Lee, D.S., van Buuren, S.: Missing data in clinical research: a tutorial on multiple imputation. Can. J. Cardiol. (2020)
Google Scholar
Blockeel, H., de Raedt, L.: Top-down induction of first-order logical decision trees. AI (1998)
Google Scholar
van Buuren, S., Groothuis-Oudshoorn, K.: mice: Multivariate imputation by chained equations in r. J. Stat. Softw. (2011)
Google Scholar
Cawley, G.C., Talbot, N.L.: On over-fitting in model selection and subsequent selection bias in performance evaluation. JMLR (2010)
Google Scholar
Chattopadhyay, P., Balaji, Y., Hoffman, J.: Learning to balance specificity and invariance for in and out of domain generalization. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12354, pp. 301–318. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58545-7_18
Chapter Google Scholar
Cheng, W., Kok, S., Pham, H.V., Chieu, H.L., Chai, K.M.A.: Language modeling with sum-product networks. In: INTERSPEECH (2014)
Google Scholar
Choi, Y., Vergari, A., Van den Broeck, G.: Probabilistic circuits: a unifying framework for tractable probabilistic models (2020)
Google Scholar
Correia, A.H.C., Peharz, R., de Campos, C.P.: Joints in random forests. In: NeurIPS (2020)
Google Scholar
Darwiche, A.: A differential approach to inference in Bayesian networks. JACM (2003)
Google Scholar
De Raedt, L., Kersting, K., Natarajan, S., Poole, D.: Statistical relational artificial intelligence: logic, probability, and computation (2016)
Google Scholar
Dhami, D.S., Kunapuli, G., Das, M., Page, D., Natarajan, S.: Drug-drug interaction discovery: kernel learning from heterogeneous similarities. Smart Health (2018)
Google Scholar
Dhami, D.S., Yen, S., Kunapuli, G., Natarajan, S.: Non-parametric learning of Gaifman models. arXiv preprint arXiv:2001.00528 (2020)
Flach, P.A., Lachiche, N.: Naive Bayesian classification of structured data. Mach. Learn. (2004)
Google Scholar
Gangal, V., Arora, A., Einolghozati, A., Gupta, S.: Likelihood ratios and generative classifiers for unsupervised out-of-domain detection in task oriented dialog. In: AAAI (2020)
Google Scholar
Gens, R., Domingos, P.: Learning the structure of sum-product networks. In: ICML (2013)
Google Scholar
Getoor, L., Taskar, B.: Statistical relational learning (2007)
Google Scholar
Khosravi, P., Vergari, A., Choi, Y., Liang, Y., den Broeck, G.V.: Handling missing data in decision trees: a probabilistic approach. arXiv preprint arXiv:2006.16341 (2020)
Khot, T., Natarajan, S., Kersting, K., Shavlik, J.: Learning Markov logic networks via functional gradient boosting. In: ICDM (2011)
Google Scholar
Kok, S., et al.: The alchemy system for statistical relational AI (2005)
Google Scholar
Landwehr, N., Kersting, K., De Raedt, L.: nfoil: Integrating Naıve Bayes and foil. In: AAAI (2005)
Google Scholar
Landwehr, N., Kersting, K., De Raedt, L.: Integrating Naive Bayes and foil. JMLR (2007)
Google Scholar
Lee, K., Lee, K., Lee, H., Shin, J.: A simple unified framework for detecting out-of-distribution samples and adversarial attacks. In: NeurIPS (2018)
Google Scholar
Mitchell, T., et al.: Never-ending learning. Commun. ACM (2018)
Google Scholar
Molina, A., Natarajan, S., Kersting, K.: Poisson sum-product networks: a deep architecture for tractable multivariate Poisson distributions. In: AAAI (2017)
Google Scholar
Molina, A., Vergari, A., Mauro, N.D., Natarajan, S., Esposito, F., Kersting, K.: Mixed sum-product networks: a deep architecture for hybrid domains. In: AAAI (2018)
Google Scholar
Natarajan, S., Khot, T., Kersting, K., Gutmann, B., Shavlik, J.: Gradient-based boosting for statistical relational learning: the relational dependency network case. Mach. Learn. (2012)
Google Scholar
Nath, A., Domingos, P.: Learning relational sum-product networks. In: AAAI (2015)
Google Scholar
Neville, J., Jensen, D.: Relational dependency networks. JMLR (2007)
Google Scholar
Peharz, R., Gens, R., Pernkopf, F., Domingos, P.M.: On the latent variable interpretation in sum-product networks. TPAMI (2017)
Google Scholar
Peharz, R., Kapeller, G., Mowlaee, P., Pernkopf, F.: Modeling speech with sum-product networks: application to bandwidth extension. In: ICASSP (2014)
Google Scholar
Peharz, R., et al.: Random sum-product networks: a simple and effective approach to probabilistic deep learning. In: UAI (2019)
Google Scholar
Poon, H., Domingos, P.M.: Sum-product networks: a new deep architecture. In: UAI (2011)
Google Scholar
Richardson, M., Domingos, P.: Markov logic networks. Mach. Learn. (2006)
Google Scholar
Tan, M., et al.: Out-of-domain detection for low-resource text classification tasks. In: EMNLP-IJCNLP (2019)
Google Scholar
Tran, D., Vafa, K., Agrawal, K.K., Dinh, L., Poole, B.: Discrete flows: invertible generative models of discrete data. In: NeurIPS 2019 (2019)
Google Scholar
Vapnik, V., Vashist, A.: A new learning paradigm: learning using privileged information. Neural Netw. (2009)
Google Scholar
Vergari, A., Di Mauro, N., Esposito, F.: Visualizing and understanding sum-product networks. Mach. Learn. (2019)
Google Scholar
Yang, E., Ravikumar, P.K., Allen, G.I., Liu, Z.: On Poisson graphical models. In: NIPS (2013)
Google Scholar
Yu, Z., Ventola, F., Kersting, K.: Whittle networks: a deep likelihood model for time series. In: ICML (2021)
Google Scholar
Yuan, Z., Wang, H., Wang, L., Lu, T., Palaiahnakote, S., Tan, C.L.: Modeling spatial layout for scene image understanding via a novel multiscale sum-product network. Expert Syst. Appl. (2016)
Google Scholar

Download references

Acknowledgments

This work was supported by the ICT-48 Network of AI Research Excellence Center “TAILOR” (EU Horizon 2020, GA No 952215), the Federal Ministry of Education and Research (BMBF; Competence Center for AI and Labour; “kompAKI”, FKZ 02L19C150), the German Science Foundation (DFG, German Research Foundation; GRK 1994/1 “AIPHES”), the Hessian Ministry of Higher Education, Research, Science and the Arts (HMWK; projects “The Third Wave of AI” and “The Adaptive Mind”), the Hessian research priority programme LOEWE within the project “WhiteBox”, and the Collaboration Lab “AI in Construction” (AICO).

Author information

Authors and Affiliations

Department of Computer Science, TU Darmstadt, Darmstadt, Germany
Fabrizio Ventola, Devendra Singh Dhami & Kristian Kersting
Hessian Center for AI and Centre for Cognitive Science, Darmstadt, Germany
Kristian Kersting

Authors

Fabrizio Ventola
View author publications
You can also search for this author in PubMed Google Scholar
Devendra Singh Dhami
View author publications
You can also search for this author in PubMed Google Scholar
Kristian Kersting
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fabrizio Ventola .

Editor information

Editors and Affiliations

National Cener for Scientific Research Demokritos, Athens, Greece
Nikos Katzouris
University of Piraeus, Athens, Greece
Alexander Artikis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ventola, F., Dhami, D.S., Kersting, K. (2022). Generative Clausal Networks: Relational Decision Trees as Probabilistic Circuits. In: Katzouris, N., Artikis, A. (eds) Inductive Logic Programming. ILP 2021. Lecture Notes in Computer Science(), vol 13191. Springer, Cham. https://doi.org/10.1007/978-3-030-97454-1_18

Download citation

DOI: https://doi.org/10.1007/978-3-030-97454-1_18
Published: 24 February 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-97453-4
Online ISBN: 978-3-030-97454-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics