Skip to main content

Tab2Onto: Unsupervised Semantification with Knowledge Graph Embeddings

  • Conference paper
  • First Online:
  • 842 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13384))

Abstract

A large amount of data is generated every day by different systems and applications. In many cases, this data comes in a tabular format that lacks semantic representation and poses new challenges in data modelling. For semantic applications, it then becomes necessary to lift the data to a richer representation, such as a knowledge graph that adheres to a semantic ontology. We propose Tab2Onto, an unsupervised approach for learning ontologies from tabular data using knowledge graph embeddings, clustering, and a human in the loop. We conduct a set of experiments to investigate our approach on a benchmarking dataset from a medical domain and learn the ontology of diseases. Our code and datasets are provided at https://tab2onto.dice-research.org/.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    In case of multiple CSV files, they are joined into a single file.

  2. 2.

    https://github.com/dice-group/Vectograph.

  3. 3.

    https://github.com/pwin/owlready2.

  4. 4.

    https://archive.ics.uci.edu/ml/datasets/Lymphography.

References

  1. Demir, C., Moussallem, D., Heindorf, S., Ngomo, A.C.N.: Convolutional hypercomplex embeddings for link prediction. In: Asian Conference on Machine Learning, pp. 656–671. PMLR (2021)

    Google Scholar 

  2. Ermilov, I., Auer, S., Stadler, C.: User-driven semantic mapping of tabular data. In: I-SEMANTICS, pp. 105–112. ACM (2013)

    Google Scholar 

  3. Heindorf, S., et al.: EvoLearner: learning description logics with evolutionary algorithms. In: WWW (2022)

    Google Scholar 

  4. Nederstigt, L.J., Aanen, S.S., Vandic, D., Frasincar, F.: FLOPPIES: a framework for large-scale ontology population of product information from tabular data in e-commerce stores. Decis. Support Syst. 59, 296–311 (2014)

    Article  Google Scholar 

  5. Ngonga Ngomo, A.C., et al.: LIMES: a framework for link discovery on the semantic web. KI-Künstliche Intell. 35(3), 413–423 (2021)

    Article  Google Scholar 

  6. Sharma, K., Marjit, U., Biswas, U.: Automatically converting tabular data to RDF: an ontological approach. Int. J. Web Semant. Technol. 6(3) (2015)

    Google Scholar 

  7. Westphal, P., Bühmann, L., Bin, S., Jabeen, H., Lehmann, J.: SML-bench - a benchmarking framework for structured machine learning. Semantic Web 10(2), 231–245 (2019)

    Article  Google Scholar 

  8. Wong, W., Liu, W., Bennamoun, M.: Ontology learning from text: a look back and into the future. ACM Comput. Surv. 44(4), 20:1–20:36 (2012)

    Google Scholar 

Download references

Acknowledgements

This work has been supported by the German Federal Ministry of Education and Research (BMBF) within the project DAIKIRI under grant number 01IS19085B and the German Federal Ministry for Economic Affairs and Climate Action (BMWK) within the project RAKI under grant number 01MD19012B.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hamada M. Zahera .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zahera, H.M. et al. (2022). Tab2Onto: Unsupervised Semantification with Knowledge Graph Embeddings. In: Groth, P., et al. The Semantic Web: ESWC 2022 Satellite Events. ESWC 2022. Lecture Notes in Computer Science, vol 13384. Springer, Cham. https://doi.org/10.1007/978-3-031-11609-4_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-11609-4_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-11608-7

  • Online ISBN: 978-3-031-11609-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics