Pruning strategies for the efficient traversal of the search space in PILP environments

Côrte-Real, Joana; Dutra, Inês; Rocha, Ricardo

doi:10.1007/s10115-021-01620-1

Pruning strategies for the efficient traversal of the search space in PILP environments

Regular Paper
Published: 19 November 2021

Volume 63, pages 3183–3215, (2021)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

143 Accesses
1 Altmetric
Explore all metrics

Abstract

Probabilistic inductive logic programming (PILP) is a statistical relational learning technique which extends inductive logic programming by considering probabilistic data. The ability to use probabilities to represent uncertainty comes at the cost of an exponential evaluation time when composing theories to model the given problem. For this reason, PILP systems rely on various pruning strategies in order to reduce the search space. However, to the best of the authors’ knowledge, there has been no systematic analysis of the different pruning strategies, how they impact the search space and how they interact with one another. This work presents a unified representation for PILP pruning strategies which enables end-users to understand how these strategies work both individually and combined and to make an informed decision on which pruning strategies to select so as to best achieve their goals. The performance of pruning strategies is evaluated both time and quality-wise in two state-of-the-art PILP systems with datasets from three different domains. Besides analysing the performance of the pruning strategies, we also illustrate the utility of PILP in one of the application domains, which is a real-world application.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 2

Probabilistic inductive constraint logic

Article Open access 10 November 2020

Estimation-Based Search Space Traversal in PILP Environments

Inductive Logic Programming Meets Relational Databases: Efficient Learning of Markov Logic Networks

Notes

Depending on the language bias (mechanism employed to constrain the search space), it might also be the case that the same literal with different arguments can occur multiple times in the rule. However, for the description of the search space, repeated literals in rules can be mimicked by introducing auxiliary predicates (for instance). Therefore, for the sake of simplicity, this case is not considered in what follows.
Remember that a theory is evaluated probabilistically against the set of all examples.
Another option would be to use a probabilistic inference method which determines the prediction values of a theory using an approximation, but this is outside of the scope of this work.
http://www.cs.wisc.edu/~dpage/kddcup2001.
http://rtw.ml.cmu.edu.

References

Bellodi E, Riguzzi F ( 2012) , Learning the structure of probabilistic logic programs. In: Inductive logic programming, Springer, pp 61–75
Bellodi E, Riguzzi F (2015) Structure learning of probabilistic logic programs by searching the clause space. Theory Pract Logic Program 15(02):169–212
Article Google Scholar
Berg WA, Hruban RH, Kumar D, Singh HR, Brem RF, Gatewood OM (1996) Lessons from mammographic histopathologic correlation of large-core needle breast biopsy. Radiographics 16(5):1111–1130
Article Google Scholar
Brancato B, Crocetti E, Bianchi S, Catarzi S, Risso GG, Bulgaresi P, Piscioli F, Scialpi M, Ciatto S, Houssami N (2012) Accuracy of needle biopsy of breast lesions visible on ultrasound: audit of fine needle versus core needle biopsy in 3233 consecutive samplings with ascertained outcomes. Breast 21(4):449–454
Article Google Scholar
Burbank F (1997) Stereotactic breast biopsy: comparison of 14- and 11-gauge mammotome probe performance and complication rates. Am Surg 63(11):988–995
Google Scholar
Cook D, Holder L (2006) Mining graph data. Wiley, London
Book Google Scholar
Côrte-Real J, Dries A, Dutra I, Rocha R ( 2018) Improving candidate quality of probabilistic logic models. In: dal Palù A, Tarau P (eds) Technical communications of the 34th international conference on logic programming (ICLP 2018), Oxford, UK
Côrte-Real J, Dutra I, Rocha R (2016) Estimation-based search space traversal in PILP environments. In: Russo A, Cussens J (eds) Proceedings of the 26th international conference on inductive logic programming (ILP 2016), LNAI, Springer, London, UK. Published in 2017
Côrte-Real J, Mantadelis T, Dutra I, Rocha R, Burnside E (2015) SkILL: a stochastic inductive logic learner. In: International conference on machine learning and applications, Miami, Florida, USA
Costa VS, Rocha R, Damas L (2012) The YAP prolog system. J Theory Pract Logic Program 12(1 & 2):5–34
Article MathSciNet Google Scholar
De Raedt L, Dries A, Thon I, Van den Broeck G, Verbeke M ( 2015) Inducing probabilistic relational rules from probabilistic examples. In: International joint conference on artificial intelligence, AAAI Press, pp 1835–1843
De Raedt L, Kersting K ( 2004) Probabilistic inductive logic programming. In: International conference on algorithmic learning theory. Springer, Berlin, pp 19–36
De Raedt L, Kimmig A (2015) Probabilistic (logic) programming concepts. Mach Learn 100(1):5–47
De Raedt L, Thon I ( 2011) Probabilistic rule learning. In: Inductive logic programming. Springer, pp 47–58
Džeroski S (2010) Relational data mining. Springer, Berlin
MATH Google Scholar
Džeroski S, De Raedt L, Driessens K (2001) Relational reinforcement learning. Mach Learn 43(1–2):7–52
Article Google Scholar
Gonçalves AV, Thuler LC, Kestelman FP, Carmo PA, Lima CF, Cipolotti R (2011) Underestimation of malignancy of core needle biopsy for nonpalpable breast lesions. Rev Bras Ginecol Obstet 33(7):123–131
Google Scholar
Halpern J (1990) An analysis of first-order logics of probability. Artif Intell 46(3):311–350
Article MathSciNet Google Scholar
Kersting K, De Raedt L, Kramer S ( 2000) Interpreting Bayesian logic programs. In: AAAI workshop on learning statistical models from relational data, pp 29–35
Kimmig A, Demoen B, Raedt LD, Costa VS, Rocha R (2011) On the implementation of the probabilistic logic programming language ProbLog. Theory Pract Logic Program 11(2 & 3):235–262
Article MathSciNet Google Scholar
Kok S, Domingos P ( 2005) Learning the structure of Markov logic networks. In: International conference on machine learning. ACM, pp 441–448
Liberman L (2000) Percutaneous imaging-guided core breast biopsy: state of the art at the millennium. Am J Roentgenol 174(5):1191–1199
Article Google Scholar
Liberman L, Drotman M, Morris EA et al (2000) Imaging-histologic discordance at percutaneous breast biopsy. Cancer 89(12):2538–2546
Article Google Scholar
Muggleton S (1996) Stochastic logic programs. Adv Inductive Logic Program 32:254–264
MathSciNet Google Scholar
Muggleton S, Raedt LD (1994) Inductive logic programming: theory and methods. J Log Program 19(20):629–679
Article MathSciNet Google Scholar
Muggleton S, Santos J, Almeida C, Tamaddoni-Nezhad A ( 2008) TopLog: ILP using a logic program declarative bias. In: International conference on logic programming, Springer, Berlin, pp 687–692
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830
MathSciNet MATH Google Scholar
Poole D (1997) The independent choice logic for modelling multiple agents under uncertainty. Artif Intell 94(1):7–56
Article MathSciNet Google Scholar
Richardson M, Domingos P (2006) Markov logic networks. Mach Learn 62(1–2):107–136
Article Google Scholar
Santos Costa V, Page D, Qazi M, Cussens J ( 2002) CLP(BN): constraint logic programming for probabilistic knowledge. In: Conference on uncertainty in artificial intelligence, pp 517–524
Sato T (1995) A statistical learning method for logic programs with distribution semantics. In: Proceedings of the 12th international conference on logic programming (ICLP 95), Citeseer
Sato T, Kameya Y ( 1997) PRISM: A language for symbolic-statistical modeling. In: International joint conference on artificial intelligence, vol 97, Morgan Kaufmann, pp 1330–1339
Vennekens J, Verbaeten S, Bruynooghe M (2004) Logic programs with annotated disjunctions. In: Logic programming. Springer, pp 431–445

Download references

Acknowledgements

This work was financed by National Funds through the Portuguese funding agency, FCT—Fundação para a Ciência e a Tecnologia, within project UIDB/50014/2020. Joana Côrte-Real was financed by the FCT grant SFRH/BD/52235/2013.

Author information

Authors and Affiliations

Department of Computer Science, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 1021, 4169-007, Porto, Portugal
Joana Côrte-Real, Inês Dutra & Ricardo Rocha
CRACS & INESC TEC, Porto, Portugal
Joana Côrte-Real & Ricardo Rocha
CINTESIS, Porto, Portugal
Inês Dutra

Authors

Joana Côrte-Real
View author publications
You can also search for this author in PubMed Google Scholar
Inês Dutra
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo Rocha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Inês Dutra.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Côrte-Real, J., Dutra, I. & Rocha, R. Pruning strategies for the efficient traversal of the search space in PILP environments. Knowl Inf Syst 63, 3183–3215 (2021). https://doi.org/10.1007/s10115-021-01620-1

Download citation

Received: 23 July 2020
Revised: 06 October 2021
Accepted: 16 October 2021
Published: 19 November 2021
Issue Date: December 2021
DOI: https://doi.org/10.1007/s10115-021-01620-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Pruning strategies for the efficient traversal of the search space in PILP environments

Abstract

Access this article

Similar content being viewed by others

Probabilistic inductive constraint logic

Estimation-Based Search Space Traversal in PILP Environments

Inductive Logic Programming Meets Relational Databases: Efficient Learning of Markov Logic Networks

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Pruning strategies for the efficient traversal of the search space in PILP environments

Abstract

Access this article

Similar content being viewed by others

Probabilistic inductive constraint logic

Estimation-Based Search Space Traversal in PILP Environments

Inductive Logic Programming Meets Relational Databases: Efficient Learning of Markov Logic Networks

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation