Abstract
Ontology-based data integration attempts to overcome the semantic heterogeneity problem in data integration. Semantic heterogeneity refers to an ambiguous interpretation of terms that describes the meaning of data in heterogeneous resources. However, the presence of semantic duplicates such as similar attributes in the integrated ontologies can lead to incomplete query results. This paper proposes to use the Apriori algorithm from market basket analysis to find similar attributes in an ontology.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Astrova, I., Koschel, A.: Automatic detection of duplicated attributes in ontology. In: Cordeiro, J., Filipe, J. (Eds.) ICEIS 2009: Proceedings of the 11th International Conference on Enterprise Information Systems, Volume DISI. INSTICC, 2009, pp. 283–286 (2009)
Astrova, I.: Improving query results with automatic duplicate detection. In: Ioannidis, Y., Manghi, P., Pagano, P. (Eds.) Proceedings of the Second Workshop on Very Large Digital Libraries, VLDL 2009: A Workshop in conjunction with the European Conference on Digital Libraries 2009. Institute of Information Science and Technology; DELOS Association (2009)
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules In: Stonebraker, M., Hellerstein, J.M. (Eds.) Readings in database systems (3rd ed.). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, pp. 580–592 (1998)
Kargiot, E., Kontopoulos, E.: OntoLife: an ontology for semantically managing personal information. http://lpis.csd.auth.gr/ontologies/ontolist.html#ontolife
Person Ontology. http://ebiquity.umbc.edu/ontology/person.owl
Friend of a Friend (FOAF) Ontology. http://xmlns.com/foaf/spec/
Family Tree Ontology. http://users.auth.gr/elkar/thesis/FamilyTree.owl
Relationship Ontology. http://purl.org/vocab/relationship/
ISO lists for Countries and Languages Ontology. http://psi.oasis-open.org/iso/639/#
Project Ontology. http://ebiquity.umbc.edu/ontology/project.owl
Research Ontology. http://ebiquity.umbc.edu/ontology/research.owl
Publication Ontology. http://ebiquity.umbc.edu/ontology/publication.owl
PersonProjectAssociation Ontology. http://ebiquity.umbc.edu/ontology/association.owl
Biography Ontology. http://users.auth.gr/elkar/thesis/Biography.owl
El Sayed, A., et al.: A new context-aware measure for semantic distance using a taxonomy and a text corpus. In: Proceedings of IRI, pp. 279–284 (2007)
Barbar, A., Collard, M.: A distance-based approach for database re-engineering. In: Proceedings of the ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 2001). IEEE Computer Society, Washington, DC, USA, pp. 188–190 (2001)
Barbar, A., Collard, M.: Semantic extraction: a user-driven method (2001). http://www.fit.vutbr.cz/events/ism/2001/pdf/barbar.pdf
Khan, Z.C., Keet, C.M.: SUGOI: automated ontology interchangeability. In: Knowledge Engineering and Knowledge Management, pp. 150–153 (2015)
Mascardi, V., Locoro, A., Rosso, P.: Automatic ontology matching via upper ontologies: a systematic evaluation. IEEE Trans. Knowl. Data Eng. 22(5), 609–623 (2010)
Cavique, L.: Graph-based structures for the market baskets analysis (2004). http://lcavique.no.sapo.pt/publicacoes/Similis%20APDIO.pdf
Acknowledgement
Irina Astrova’s work was supported by the Estonian Ministry of Education and Research institutional research grant IUT33-13.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Astrova, I., Koschel, A., Lee, S.L. (2020). How the Apriori Algorithm Can Help to Find Semantic Duplicates in Ontology. In: Virvou, M., Nakagawa, H., C. Jain, L. (eds) Knowledge-Based Software Engineering: 2020. JCKBSE 2020. Learning and Analytics in Intelligent Systems, vol 19. Springer, Cham. https://doi.org/10.1007/978-3-030-53949-8_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-53949-8_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-53948-1
Online ISBN: 978-3-030-53949-8
eBook Packages: Computer ScienceComputer Science (R0)