research-article

Matching Large Biomedical Ontologies Using Symbolic Regression

Authors:

Jorge Martinez-Gil,

Franck MorvanAuthors Info & Claims

iiWAS2021: The 23rd International Conference on Information Integration and Web Intelligence

Pages 162 - 167

https://doi.org/10.1145/3487664.3487781

Published: 30 December 2021 Publication History

Abstract

The problem of ontology matching consists of finding the semantic correspondences between two ontologies that, although belonging to the same domain, have been developed separately. Matching methods are of great importance since they allow us to find the pivot points from which an automatic data integration process can be established. Unlike the most recent developments based on deep learning, this study presents our research on the development of new methods for ontology matching that are accurate and interpretable at the same time. For this purpose, we rely on a symbolic regression model specifically trained to find the mathematical expression that can solve the ground truth accurately, with the possibility of being understood by a human operator and forcing the processor to consume as little energy as possible. The experimental evaluation results show that our approach seems to be promising.

References

[1]

Olivier Bodenreider. 2004. The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res. 32, Database-Issue (2004), 267–270. https://doi.org/10.1093/nar/gkh061

[2]

Olivier Bodenreider, Terry F. Hayamizu, Martin Ringwald, Sherri de Coronado, and Songmao Zhang. 2005. Of Mice and Men: Aligning Mouse and Human Anatomies. In AMIA 2005, American Medical Informatics Association Annual Symposium, Washington, DC, USA, October 22-26, 2005. AMIA. http://knowledge.amia.org/amia-55142-a2005a-1.613296/t-001-1.616182/f-001-1.616183/a-012-1.616655/a-013-1.616652

[3]

Sherri de Coronado, Margaret W. Haber, Nicholas Sioutos, Mark S. Tuttle, and Lawrence W. Wright. 2004. NCI Thesaurus: Using Science-Based Terminology to Integrate Cancer Research Results. In MEDINFO 2004 - Proceedings of the 11th World Congress on Medical Informatics, San Francisco, California, USA, September 7-11, 2004(Studies in Health Technology and Informatics, Vol. 107), Marius Fieschi, Enrico W. Coiera, and Jack Yu-Chan Li (Eds.). IOS Press, 33–37. https://doi.org/10.3233/978-1-60750-949-3-33

[4]

Kalyanmoy Deb, Samir Agrawal, Amrit Pratap, and T. Meyarivan. 2002. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6, 2 (2002), 182–197. https://doi.org/10.1109/4235.996017

Digital Library

[5]

Kevin Donnelly. 2006. SNOMED-CT: The advanced terminology and coding system for eHealth. Studies in health technology and informatics 121 (2006), 279.

[6]

Finale Doshi-Velez and Been Kim. 2017. Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608(2017).

[7]

Daniel Faria, Catia Pesquita, Emanuel Santos, Matteo Palmonari, Isabel F. Cruz, and Francisco M. Couto. 2013. The AgreementMakerLight Ontology Matching System. In On the Move to Meaningful Internet Systems: OTM 2013 Conferences - Confederated International Conferences: CoopIS, DOA-Trusted Cloud, and ODBASE 2013, Graz, Austria, September 9-13, 2013. Proceedings(Lecture Notes in Computer Science, Vol. 8185), Robert Meersman, Hervé Panetto, Tharam S. Dillon, Johann Eder, Zohra Bellahsene, Norbert Ritter, Pieter De Leenheer, and Dejing Dou (Eds.). Springer, 527–541. https://doi.org/10.1007/978-3-642-41030-7_38

[8]

Terry F Hayamizu, Mary Mangan, John P Corradi, James A Kadin, and Martin Ringwald. 2005. The Adult Mouse Anatomical Dictionary: a tool for annotating and integrating data. Genome biology 6, 3 (2005), 1–8.

[9]

Ernesto Jiménez-Ruiz and Bernardo Cuenca Grau. 2011. LogMap: Logic-Based and Scalable Ontology Matching. In The Semantic Web - ISWC 2011 - 10th International Semantic Web Conference, Bonn, Germany, October 23-27, 2011, Proceedings, Part I(Lecture Notes in Computer Science, Vol. 7031), Lora Aroyo, Chris Welty, Harith Alani, Jamie Taylor, Abraham Bernstein, Lalana Kagal, Natasha Fridman Noy, and Eva Blomqvist(Eds.). Springer, 273–288. https://doi.org/10.1007/978-3-642-25073-6_18

Digital Library

[10]

Prodromos Kolyvakis, Alexandros Kalousis, and Dimitris Kiritsis. 2018. DeepAlignment: Unsupervised Ontology Matching with Refined Word Vectors. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, New Orleans, Louisiana, USA, June 1-6, 2018, Volume 1 (Long Papers), Marilyn A. Walker, Heng Ji, and Amanda Stent (Eds.). Association for Computational Linguistics, 787–798. https://doi.org/10.18653/v1/n18-1072

[11]

Prodromos Kolyvakis, Alexandros Kalousis, Barry Smith, and Dimitris Kiritsis. 2018. Biomedical ontology alignment: an approach based on representation learning. J. Biomedical Semantics 9, 1 (2018), 21:1–21:20. https://doi.org/10.1186/s13326-018-0187-8

[12]

Amir Laadhar, Faiza Ghozzi, Imen Megdiche, Franck Ravat, Olivier Teste, and Faïez Gargouri. 2019. POMap++ Results for OAEI 2019: Fully Automated Machine Learning Approach for Ontology Matching. In Proceedings of the 14th International Workshop on Ontology Matching co-located with the 18th International Semantic Web Conference (ISWC 2019), Auckland, New Zealand, October 26, 2019(CEUR Workshop Proceedings, Vol. 2536), Pavel Shvaiko, Jérôme Euzenat, Ernesto Jiménez-Ruiz, Oktie Hassanzadeh, and Cássia Trojahn (Eds.). CEUR-WS.org, 169–174. http://ceur-ws.org/Vol-2536/oaei19_paper13.pdf

[13]

Yoonkyong Lee, Mayssam Sayyadian, AnHai Doan, and Arnon Rosenthal. 2007. eTuner: tuning schema matching software using synthetic scenarios. VLDB J. 16, 1 (2007), 97–122. https://doi.org/10.1007/s00778-006-0024-z

Digital Library

[14]

Vladimir I Levenshtein. 1966. Binary codes capable of correcting deletions, insertions, and reversals. In Soviet physics doklady, Vol. 10. 707–710.

[15]

Jorge Martinez-Gil, Enrique Alba, and José Francisco Aldana-Montes. 2008. Optimizing Ontology Alignments by Using Genetic Algorithms. In Proceedings of the First International Workshop on Nature Inspired Reasoning for the Semantic Web, Karlsruhe, Germany, October 27, 2008(CEUR Workshop Proceedings, Vol. 419), Christophe Guéret, Pascal Hitzler, and Stefan Schlobach(Eds.). CEUR-WS.org.

Digital Library

[16]

Jorge Martinez-Gil and José Francisco Aldana-Montes. 2011. Evaluation of two heuristic approaches to solve the ontology meta-matching problem. Knowl. Inf. Syst. 26, 2 (2011), 225–247. https://doi.org/10.1007/s10115-009-0277-0

Digital Library

[17]

Jorge Martinez-Gil and José Francisco Aldana-Montes. 2012. An overview of current ontology meta-matching solutions. Knowl. Eng. Rev. 27, 4 (2012), 393–412. https://doi.org/10.1017/S0269888912000288

Digital Library

[18]

Jorge Martinez-Gil and Jose M. Chaves-Gonzalez. 2020. A novel method based on symbolic regression for interpretable semantic similarity measurement. Expert Syst. Appl. 160(2020), 113663. https://doi.org/10.1016/j.eswa.2020.113663

[19]

Natalya Fridman Noy, Mark A. Musen, José L. V. Mejino Jr., and Cornelius Rosse. 2004. Pushing the envelope: challenges in a frame-based representation of human anatomy. Data Knowl. Eng. 48, 3 (2004), 335–359. https://doi.org/10.1016/j.datak.2003.06.002

Digital Library

[20]

John W Ratcliff and David E Metzener. 1988. Pattern-matching-the gestalt approach. Dr Dobbs Journal 13, 7 (1988), 46.

[21]

Dominique Ritze and Heiko Paulheim. 2011. Towards an automatic parameterization of ontology matching tools based on example mappings. In Proceedings of the 6th International Workshop on Ontology Matching, Bonn, Germany, October 24, 2011(CEUR Workshop Proceedings, Vol. 814), Pavel Shvaiko, Jérôme Euzenat, Tom Heath, Christoph Quix, Ming Mao, and Isabel F. Cruz (Eds.). CEUR-WS.org. http://ceur-ws.org/Vol-814/om2011_Tpaper4.pdf

Digital Library

[22]

Gizem Sogancioglu, Hakime Öztürk, and Arzucan Özgür. 2017. BIOSSES: a semantic sentence similarity estimation system for the biomedical domain. Bioinform. 33, 14 (2017), i49–i58. https://doi.org/10.1093/bioinformatics/btx238

[23]

Junli Wang, Zhijun Ding, and Changjun Jiang. 2006. GAOM: Genetic Algorithm Based Ontology Matching. In Proceedings of The 1st IEEE Asia-Pacific Services Computing Conference, APSCC 2006, December 12-15, 2006, Guangzhou, China. IEEE Computer Society, 617–620. https://doi.org/10.1109/APSCC.2006.59

Digital Library

[24]

Jifang Wu, Jianghua Lv, Haoming Guo, and Shilong Ma. 2020. DAEOM: A Deep Attentional Embedding Approach for Biomedical Ontology Matching. Applied Sciences 10, 21 (2020), 7909.

[25]

Mengyi Zhao, Songmao Zhang, Weizhuo Li, and Guowei Chen. 2018. Matching biomedical ontologies based on formal concept analysis. J. Biomed. Semant. 9, 1 (2018), 11:1–11:27. https://doi.org/10.1186/s13326-018-0178-9

Cited By

Sun DLv QTsai PXue XZhang K(2025)Anchor-based ontology partitioning and Genetic Programming with Relevance Reasoning for large-scale biomedical ontology matchingExpert Systems with Applications10.1016/j.eswa.2025.126445270(126445)Online publication date: Apr-2025
https://doi.org/10.1016/j.eswa.2025.126445
Xue XLin JSu T(2024)Efficient ontology matching through compact linear genetic programming with surrogate-assisted local searchSwarm and Evolutionary Computation10.1016/j.swevo.2024.10175891(101758)Online publication date: Dec-2024
https://doi.org/10.1016/j.swevo.2024.101758
Xue X(2024)Automatic Knowledge Graph matching via Self-adaptive Designed Genetic ProgrammingKnowledge-Based Systems10.1016/j.knosys.2024.111628293:COnline publication date: 7-Jun-2024
https://dl.acm.org/doi/10.1016/j.knosys.2024.111628
Show More Cited By

Index Terms

Matching Large Biomedical Ontologies Using Symbolic Regression
1. Applied computing
  1. Life and medical sciences
2. Computing methodologies
  1. Artificial intelligence
  2. Machine learning
    1. Machine learning approaches

Index terms have been assigned to the content through auto-classification.

Recommendations

Matching large ontologies: A divide-and-conquer approach

Ontologies proliferate with the progress of the Semantic Web. Ontology matching is an important way of establishing interoperability between (Semantic) Web applications that use different but related ontologies. Due to their sizes and monolithic nature, ...
Semantic enrichment in ontologies for matching
AOW '06: Proceedings of the second Australasian workshop on Advances in ontologies - Volume 72

Matching (or mapping) between heterogeneous ontologies becomes crucial for interoperability in distributed and intelligent environments. Although many efforts in ontology mapping have already been conducted, most of them rely heavily on the meaning of ...
Semantic matching in hierarchical ontologies

Hierarchical ontologies play a key role in organizing documents in a repository. While matching the ontologies, the relationships among the concepts are considered to be a major aspect. In hierarchical ontologies, the concepts are associated with one ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

iiWAS2021: The 23rd International Conference on Information Integration and Web Intelligence

November 2021

658 pages

ISBN:9781450395564

DOI:10.1145/3487664

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 December 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

French Ministry of Foreign and European Affairs
French Ministry of Higher Education and Research
Austrian Agency for International Cooperation in Education and Research

Conference

iiWAS2021

iiWAS2021: The 23rd International Conference on Information Integration and Web Intelligence

November 29 - December 1, 2021

Linz, Austria

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
89
Total Downloads

Downloads (Last 12 months)13
Downloads (Last 6 weeks)0

Reflects downloads up to 20 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Sun DLv QTsai PXue XZhang K(2025)Anchor-based ontology partitioning and Genetic Programming with Relevance Reasoning for large-scale biomedical ontology matchingExpert Systems with Applications10.1016/j.eswa.2025.126445270(126445)Online publication date: Apr-2025
https://doi.org/10.1016/j.eswa.2025.126445
Xue XLin JSu T(2024)Efficient ontology matching through compact linear genetic programming with surrogate-assisted local searchSwarm and Evolutionary Computation10.1016/j.swevo.2024.10175891(101758)Online publication date: Dec-2024
https://doi.org/10.1016/j.swevo.2024.101758
Xue X(2024)Automatic Knowledge Graph matching via Self-adaptive Designed Genetic ProgrammingKnowledge-Based Systems10.1016/j.knosys.2024.111628293:COnline publication date: 7-Jun-2024
https://dl.acm.org/doi/10.1016/j.knosys.2024.111628
Xue XSun DShankar AViriyasitavat WSiarry P(2024)Efficient large-scale biomedical ontology matching with anchor-based biomedical ontology partitioning and compact geometric semantic genetic programmingJournal of Industrial Information Integration10.1016/j.jii.2024.10063741(100637)Online publication date: Sep-2024
https://doi.org/10.1016/j.jii.2024.100637

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten