Skip to main content
Log in

Discovering disjoint object property pairs in knowledge graphs using Probabilistic Soft Logic

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Although Knowledge Graphs (KGs) have turned out to become a popular and powerful tool in the industry world, the major focus of most researchers has been only on adding more and more triples to the A-Boxes of the KGs. An often overlooked but important part of a KG is its T-Box. If the T-Box contains incorrect statements or if certain correct statements are absent in it, it can lead to inconsistent knowledge in the KG or to information loss respectively. In this paper, we propose a novel system, DOPLEX, based on Probabilistic Soft Logic (PSL) to detect disjointness between pairs of object properties present in the KG. Current approaches mainly rely on checking the absence of common triples and miss out on exploiting the semantics of property names. In the proposed system, in addition to checking common triples, PSL is used to determine if property names imply disjointness. We particularly focus on knowledge graphs that are auto-extracted from large text corpora. Our evaluation demonstrates that the proposed approach discovers disjoint property pairs with better precision when compared to the state-of-the-art system without compromising much on the number of disjoint pairs discovered. Towards the end of the paper, we discuss the disjointness of properties in the context of time and propose a new notion called temporal-non-disjointness and discuss its importance and characteristics. We also present an approach for the discovery of property pairs that are potentially temporally non-disjoint.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Data availability

The data generated during and/or analysed during the current study are available in the project web page: https://sites.google.com/site/ontoworks/projects.

Notes

  1. The actual property name in NELL is countryleadbyperson. We use camel case for property names in NELL to improve readability.

  2. Insight obtained through NELL’s discussion forum.

  3. http://rtw.ml.cmu.edu/rtw/kbbrowser/pred:writerwrotebook.

  4. http://rtw.ml.cmu.edu/rtw/kbbrowser/pred:riverflowsthroughcity.

  5. http://rtw.ml.cmu.edu/rtw/kbbrowser/pred:personbornincountry.

  6. http://rtw.ml.cmu.edu/rtw/kbbrowser/pred:countryleadbyperson.

  7. http://rtw.ml.cmu.edu/rtw/kbbrowser/pred:countryhascitizen.

  8. https://conceptnet.io/.

  9. https://sites.google.com/site/ontoworks/projects.

  10. https://pypi.org/project/language-tool-python/.

References

  1. Agresti A, Coull BA (1998) Approximate is better than “exact’’ for interval estimation of binomial proportions. Am Stat 52(2):119–126

    Google Scholar 

  2. Bach S.H, Broecheler M, Huang B, Getoor L (2017) Hinge-Loss Markov random fields and probabilistic soft logic. J Mach Learn Res 18(1):3846–3912

    MATH  Google Scholar 

  3. Bühmann L, Lehmann J (2012) Universal OWL axiom enrichment for large knowledge bases. In: Knowledge engineering and knowledge management—18th international conference, EKAW 2012, Galway City, Ireland, October 8–12, 2012. Proceedings, volume 7603 of lecture notes in computer science. Springer, pp 57–71

  4. Bühmann L, Lehmann J (2013) Pattern based knowledge base enrichment. In: The semantic web—ISWC 2013—12th international semantic web conference, Sydney, NSW, Australia, October 21–25, 2013, Proceedings, Part I, volume 8218 of lecture notes in computer science. Springer, pp 33–48

  5. Cao S, Yang Q, Li Z, Liu G, Zhang D, Xu J (2020) Memtimes: temporal scoping of facts with memory network. In: Database systems for advanced applications—25th international conference, DASFAA 2020, Jeju, South Korea, September 24–27, 2020, Proceedings, Part III, volume 12114 of lecture notes in computer science. Springer, pp 70–86 (2020)

  6. Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG (2001) A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform 34(5):301–310

    Article  Google Scholar 

  7. Enger M, Velldal E, Øvrelid L (2017) An open-source tool for negation detection: a maximum-margin approach. In: Proceedings of the workshop computational semantics beyond events and roles. pp 64–69

  8. Fleischhacker D, Völker J (2011) Inductive learning of disjointness axioms. In: On the move to meaningful internet systems: OTM. pp 680–697

  9. Fleischhacker D, Völker J, Stuckenschmidt H (2012) Mining RDF data for property axioms. In: OTM confederated international conferences. On the move to meaningful internet systems. Springer, pp 718–735

  10. Fleiss JL (1971) Measuring nominal scale agreement among many raters. Psychol Bull 76(5):378

    Article  Google Scholar 

  11. Getoor L, Taskar B (2007) Introduction to statistical relational learning, 1st edn. MIT Press, Cambridge

    Book  MATH  Google Scholar 

  12. Han Z, Chen P, Ma Y, Tresp V (2020) Dyernie: dynamic evolution of Riemannian manifold embeddings for temporal knowledge graph completion. In: Proceedings of the 2020 conference on empirical methods in natural language processing, EMNLP 2020, Online, November 16–20, 2020. Association for Computational Linguistics, pp 7301–7316

  13. Hur A, Janjua N, Ahmed M (2021) A survey on state-of-the-art techniques for knowledge graphs construction and challenges ahead. CoRR arXiv:2110.08012

  14. Kouki P, Fakhraei S, Foulds JR, Eirinaki M, Getoor L (2015) Hyper: a flexible and extensible probabilistic framework for hybrid recommender systems. In: Proceedings of the 9th ACM conference on recommender systems, RecSys 2015, Vienna, Austria, September 16–20, 2015. ACM, pp 99–106

  15. Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 159–174

  16. Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, Mendes PN, Hellmann S, Morsey M, van Kleef P, Auer S, Bizer C (2015) Dbpedia—a large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web 6(2):167–195

    Article  Google Scholar 

  17. Liao S, Liang S, Meng Z, Zhang Q (2021) Learning dynamic embeddings for temporal knowledge graphs. In: WSDM ’21, the fourteenth ACM international conference on web search and data mining, virtual event, Israel, March 8–12, 2021. ACM, pp 535–543

  18. Ma Y, Gao H, Wu T, Qi G (2014) Learning disjointness axioms with association rule mining and its application to inconsistency detection of linked data. In: The semantic web and web science— 8th Chinese conference, CSWS 2014, Wuhan, China, August 8–12, 2014, Revised Selected Papers, volume 480 of communications in computer and information science. Springer, pp 29–41 (2014)

  19. McInnes L, Healy J, Astels S (2017) hdbscan: Hierarchical density based clustering. J Open Source Softw 2

  20. Meilicke C, Völker J, Stuckenschmidt H (2008) Learning disjointness for debugging mappings between lightweight ontologies. In: Knowledge engineering: practice and patterns, 16th international conference, EKAW 2008, Acitrezza, Italy, September 29–October 2, 2008. Proceedings, volume 5268 of lecture notes in computer science. Springer, pp 93–108 (2008)

  21. Memon M, Ting H, Cheah J, Thurasamy R, Chuah F, Cham T (2020) Sample size for survey research: review and recommendations. J Appl Struct Equ Model 4(2):1–20

    Google Scholar 

  22. Messner J, Abboud R, Ceylan İİ (2021) Temporal knowledge graph completion using box embeddings. CoRR arXiv:2109.08970

  23. Mihalcea R, Corley C, Strapparava C (2006) Corpus-based and knowledge-based measures of text semantic similarity. In: Proceedings of the 21st national conference on artificial intelligence—volume 1, AAAI’06. AAAI Press, pp 775–780 (2006)

  24. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems 26: 27th annual conference on neural information processing systems 2013. Proceedings of a meeting held December 5–8, 2013, Lake Tahoe, Nevada, United States. pp 3111–3119

  25. Mitchell T, Cohen W, Hruschka E, Talukdar P, Betteridge J, Carlson A, Dalvi B, Gardner M, Kisiel B, Krishnamurthy J, Lao N, Mazaitis K, Mohamed T, Nakashole N, Platanios E, Ritter A, Samadi M, Settles B, Wang R, Wijaya D, Gupta A, Chen X, Saparov A, Greaves M, Welling J (2015) Never-ending learning. In: Proceedings of the twenty-ninth AAAI conference on artificial intelligence (AAAI-15)

  26. Montella S, Rojas Barahona LM, Heinecke J (2021) Hyperbolic temporal knowledge graph embeddings with relational and time curvatures. In: Findings of the association for computational linguistics: ACL-IJCNLP 2021. Association for Computational Linguistics

  27. Nakashole N, Weikum G, Suchanek FM (2012) PATTY: a taxonomy of relational patterns with semantic types. In: Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning, EMNLP-CoNLL 2012, July 12–14, 2012, Jeju Island, Korea. ACL, pp 1135–1145

  28. Nguyen TH, Tettamanzi AGB (2019) Learning class disjointness axioms using grammatical evolution. In: Genetic programming—22nd European conference, EuroGP 2019, held as part of EvoStar 2019, Leipzig, Germany, April 24–26, 2019, Proceedings, volume 11451 of lecture notes in computer science. Springer, pp 278–294

  29. Pujara J, Miao H, Getoor L, Cohen WW (2013) Knowledge graph identification. In: The semantic web—ISWC 2013—12th international semantic web conference, Sydney, NSW, Australia, October 21–25, 2013, Proceedings, Part I, volume 8218 of lecture notes in computer science. Springer, pp 542–557

  30. Rula A, Palmonari M, Rubinacci S, Ngomo AN, Lehmann J, Maurino A, Esteves D (2019) TISCO: temporal scoping of facts. J Web Semant 54:72–86

    Article  Google Scholar 

  31. Takaku Y, Kaji N, Yoshinaga N, Toyoda M (2012) Identifying constant and unique relations by using time-series text. In: Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning, EMNLP-CoNLL 2012, July 12–14, 2012, Jeju Island, Korea. ACL, pp 883–892

  32. Talukdar PP, Wijaya D, Mitchell TM (2012) Coupled temporal scoping of relational facts. In: Proceedings of the fifth international conference on web search and web data mining, WSDM 2012, Seattle, WA, USA, February 8–12, 2012. ACM, pp 73–82

  33. Tanon TP, Weikum G, Suchanek FM (2020) YAGO 4: a reason-able knowledge base. In: The semantic web—17th international conference, ESWC 2020, Heraklion, Crete, Greece, May 31–June 4, 2020, Proceedings, volume 12123 of lecture notes in computer science. Springer, pp 583–596

  34. Völker J, Fleischhacker D, Stuckenschmidt H (2015) Automatic acquisition of class disjointness. J Web Semant 35:124–139

    Article  Google Scholar 

  35. Völker J, Vrandecic D, Sure Y, Hotho A (2007) Learning disjointness. In: The semantic web: research and applications, 4th European semantic web conference, ESWC 2007, Innsbruck, Austria, June 3–7, 2007, Proceedings, volume 4519 of lecture notes in computer science. Springer, pp 175–189

  36. Xu C, Chen Y, Nayyeri M, Lehmann J (2021) Temporal knowledge graph completion using a linear temporal regularizer and multivector embeddings. In: Proceedings of the 2021 conference of the North American chapter of the association for computational linguistics: human language technologies, NAACL-HLT 2021, June 6–11, 2021. Association for Computational Linguistics, pp 2569–2578

  37. Zhang F, Li Z, Peng D, Cheng J (2021) RDF for temporal data management: a survey. Earth Sci Inform 14(2):563–599

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Subhashree.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Subhashree, S., Kumar, P.S. Discovering disjoint object property pairs in knowledge graphs using Probabilistic Soft Logic. Knowl Inf Syst 65, 899–919 (2023). https://doi.org/10.1007/s10115-022-01773-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-022-01773-7

Keywords

Navigation