ABSTRACT
For Ontology Alignment (OA), the task is to align semantically equivalent concepts and relations from different ontologies. This task plays a crucial role in many downstream tasks and applications in academia and industry. Since manually aligning ontologies is inefficient and costly, numerous approaches exist to do this automatically. However, most approaches are tailored to specific domains, are rule-based systems or based on feature engineering, and require external knowledge. The most recent advances in the field of OA rely on the widely proven effectiveness of pre-trained language models to represent the human-generated language that describes the entities in an ontology. However, these approaches additionally require sophisticated algorithms or Graph Neural Networks to exploit an ontology’s graphical structure to achieve state-of-the-art performance. In this work, we present NLFOA, or Natural Language Focused Ontology Alignment, which purely focuses on the natural language contained in ontologies to process the ontology’s semantics as well as graphical structure. An evaluation of our approach on common OA datasets shows superior results when finetuning with only a small number of training samples. Additionally, it demonstrates strong results in a zero-shot setting which could be employed in an active learning setup to reduce human labor when manually aligning ontologies significantly.
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics. Minneapolis, MN, USA, 4171–4186.Google Scholar
- Sefika Efeoglu. 2023. GraphMatcher: A Graph Representation Learning Approach for Ontology Matching: Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 174–180.Google Scholar
- Vadim Ermolayev and Maxim Davidovsky. 2012. Agent-based Ontology Alignment: Basics, Applications, Theoretical foundations, and Demonstration. In Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics. Craiova, Romania, 1–12.Google ScholarDigital Library
- Daniel Faria, Catia Pesquita, Emanuel Santos, Matteo Palmonari, Isabel F Cruz, and Francisco M Couto. 2013. The AgreementMakerLight Ontology Matching system. In OTM Confederated International Conferences ’On the Move to Meaningful Internet Systems’. 527–541.Google ScholarCross Ref
- Daniel Faria, Marta Contreiras Silva, Pedro Cotovio, Patrícia Eugénio, and Catia Pesquita. 2022. Matcha and Matcha-DL Results for OAEI 2022. In CEUR Workshop Proceedings.Google Scholar
- J Gargantilla and A Gomez-Perez. 2004. OntoWeb: A Survey on Ontology-based Applications. OntoWeb Consortium IST Project IST-2000-29243, Deliverable 1 (2004).Google Scholar
- Francis Gosselin and Amal Zouaq. 2023. SEBMatcher Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 202–209.Google Scholar
- Junheng Hao, Chuan Lei, Vasilis Efthymiou, Abdul Quamar, Fatma Özcan, Yizhou Sun, and Wei Wang. 2021. Medto: Medical Data to Ontology Matching using Hybrid Graph Neural Networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Washington DC, USA, 2946–2954.Google ScholarDigital Library
- Terry F Hayamizu, Sherri de Coronado, Gilberto Fragoso, Nicholas Sioutos, James A Kadin, and Martin Ringwald. 2012. The Mouse-Human Anatomy Ontology Mapping Project. Database: The Journal of Biological Databases and Curation 2012 (2012), bar066.Google ScholarCross Ref
- Yuan He, Jiaoyan Chen, Denvar Antonyrajah, and Ian Horrocks. 2022. BERTMap: A BERT-based Ontology Alignment System. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 5684–5691.Google ScholarCross Ref
- Yuan He, Jiaoyan Chen, Hang Dong, Ernesto Jiménez-Ruiz, Ali Hadian, and Ian Horrocks. 2022. Machine Learning-Friendly Biomedical Datasets for Equivalence and Subsumption Ontology Matching. In The Semantic Web – ISWC 2022. Hangzhou, China (virtual), 575–591.Google ScholarDigital Library
- Sven Hertling and Heiko Paulheim. 2021. ATBox Results for OAEI 2021. In CEUR Workshop Proceedings, Vol. 3063. 137–143.Google Scholar
- Sven Hertling, Jan Portisch, and Heiko Paulheim. 2022. KERMIT - A Transformer-Based Approach for Knowledge Graph Matching. CoRR abs/2204.13931 (2022).Google Scholar
- Vivek Iyer, Arvind Agarwal, and Harshit Kumar. 2021. VeeAlign: Multifaceted Context Representation Using Dual Attention for Ontology Alignment. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP). Online and Punta Cana, Dominican Republic, 10780–10792.Google ScholarCross Ref
- Ernesto Jiménez-Ruiz and Bernardo Cuenca Grau. 2011. LogMap: Logic-based and Scalable Ontology Matching. In Proceedings of the 10th International Conference on The Semantic Web (ISWC) – Volume Part I. Bonn, Germany, 273–288.Google ScholarCross Ref
- Jeff Johnson, Matthijs Douze, and Hervé Jégou. 2019. Billion-scale similarity search with GPUs. IEEE Transactions on Big Data (2019).Google ScholarCross Ref
- Peter Kardos, Zsolt Szántó, and Richárd Farkas. 2023. WomboCombo Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 216–219.Google Scholar
- Leon Knorr and Jan Portisch. 2022. Fine-TOM Matcher Results for OAEI 2021. In CEUR Workshop Proceedings, Vol. 3063. online, 144–151.Google Scholar
- Prodromos Kolyvakis, Alexandros Kalousis, and Dimitris Kiritsis. 2018. DeepAlignment: Unsupervised Ontology Matching with Refined Word Vectors. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), Volume 1 (Long Papers). New Orleans, LA, USA, 787–798.Google ScholarCross Ref
- Daniel Kossack, Niklas Borg, Leon Knorr, and Jan Portisch. 2022. TOM Matcher Results for OAEI 2021. In CEUR Workshop Proceedings, Vol. 3063. online, 193–198.Google Scholar
- Weiwei Lin and Reiko Haga. 2021. Matching Cyber Security Ontologies through Genetic Algorithm-Based Ontology Alignment Technique. Security and Communication Networks 2021 (2021).Google Scholar
- Ikechukwu Nkisi-Orji, Nirmalie Wiratunga, Stewart Massie, Kit-Ying Hui, and Rachel Heaven. 2019. Ontology Alignment based on Word Embedding and Random Forest Classification. In Joint European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD). Dublin, Ireland, 557–572.Google ScholarDigital Library
- Mina Abd Nikooie Pour, Alsayed Algergawy, Patrice Buche, Leyla J Castro, Jiaoyan Chen, Hang Dong, Omaima Fallatah, Daniel Faria, Irini Fundulaki, Sven Hertling, Yuan He, Ian Horrocks, Martin Huschka, Liliana Ibanescu, Ernesto Jimenez-Ruiz, Naouel Karam, Amir Laadhar, Patrick Lambrix, Huanyu Li, Ying Li, Franck Michel, Engy Nasr, Heiko Paulheim, Catia Pesquita, Tzanina Saveta, Pavel Shvaiko, Cassia Trojahn, Chantelle Verhey, Mingfang Wu, Beyza Yaman, Ondrej Zamazal, and Lu Zhou. 2023. Results of the Ontology Alignment Evaluation Initiative 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 84–128.Google Scholar
- Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Hong Kong, China, 3973–3983.Google ScholarCross Ref
- Pavel Shvaiko and Jérôme Euzenat. 2011. Ontology Matching: state of the art and future challenges. IEEE Transactions on Knowledge and Data Engineering 25, 1 (2011), 158–176.Google ScholarDigital Library
- Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, and Tie-Yan Liu. 2020. MPNet: Masked and Permuted Pre-Training for Language Understanding. In Advances in Neural Information Processing Systems (NeurIPS), Vol. 33. online, 16857–16867.Google Scholar
- Xiaobin Tang, Jing Zhang, Bo Chen, Yang Yang, Hong Chen, and Cuiping Li. 2020. BERT-INT:A BERT-based Interaction Model For Knowledge Graph Alignment. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI). Yokohama, Japan, 3174–3180.Google ScholarCross Ref
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention Is All You Need. In Advances in Neural Information Processing Systems (NIPS), Vol. 30. Long Beach, CA, USA, 5998–6008.Google Scholar
- Javier Vela and Jorge Gracia. 2023. Cross-Lingual Ontology Matching with CIDER-LM: Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 158–165.Google Scholar
- Lucy Lu Wang, Chandra Bhagavatula, Mark Neumann, Kyle Lo, Chris Wilhelm, and Waleed Ammar. 2018. Ontology Alignment in the Biomedical Domain Using Entity Definitions and Context. In Proceedings of the BioNLP 2018 Workshop. Melbourne, Australia, 47–55.Google ScholarCross Ref
- Zhu Wang. 2023. AMD Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 145–152.Google Scholar
- Jifang Wu, Jianghua Lv, Haoming Guo, and Shilong Ma. 2020. DAEOM: A Deep Attentional Embedding Approach for Biomedical Ontology Matching. Applied Sciences 10, 21 (2020), 7909.Google ScholarCross Ref
- Ondej Zamazal and Vojtch Svtek. 2017. The Ten-Year OntoFarm and its Fertilization within the Onto-Sphere. Web Semantics: Science, Services and Agents on the World Wide Web 43, C (2017), 46–53.Google Scholar
Index Terms
- NLFOA: Natural Language Focused Ontology Alignment
Recommendations
Ontology alignment design patterns
Interoperability between heterogeneous ontological descriptions can be performed through ontology mediation techniques. At the heart of ontology mediation lies the alignment: a specification of correspondences between ontology entities. Ontology ...
Ontology Alignment Quality: A Framework and Tool for Validation
Recently semantic web technologies, such as ontologies, have been proposed as key enablers for integrating heterogeneous data schemas in business and governmental systems. Algorithms designed to align different but related ontologies have become ...
Comments