Abstract
Ontologies, as representation of shared conceptualization for variety of specific domains, are the heart of the Semantic Web. In order to facilitate interoperability across multiple ontologies, we need an automatic mechanism to align ontologies. Therefore, many methods to measure similarity between concepts existing in two different ontologies are proposed. In this paper, we will enumerate these methods along with their shortcomings in each case. In information content (IC) based similarity measures, the process of IC computation for concepts is so challenging and in many cases with failing. We will propose our new approach that is based on concepts’ definitions. These definitions would help us to compute reliable and easy to calculate information contents for concepts. Applying these methods to the biomedical domain, using MEDLINE as corpus, International Classification of Diseases, Ninth Revision, Clinical Modification (ICD9CM) as thesaurus, and available reference standard, we will find our method outperforms other similarity measures.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Muthaiyah, S., Kerschberg, L.: A Hybrid Ontology Mediation Approach for the Semantic Web. International Journal of E-Business Research 4, 79–91 (2008)
Chen, B., Foster, G., Kuhn, R.: Bilingual Sense Similarity for Statistical Machine Translation. In: Proceedings of the ACL, pp. 834–843 (2010)
Pucher, M.: WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings. In: Proceedings of the ACL, pp. 129–132 (2007)
Firth, J.R.: A Synopsis of Linguistic Theory 1930-1955. In: Studies in Linguistic Analysis, pp. 1–32 (1957)
Landauer, T.K., Dumais, S.T.: A Solution to Plato’s Problem: The Latent Semantic Analysis Theory of the Acquisition. Induction and Representation of Knowledge. Psychological Review 104, 211–240 (1997)
Lesk, M.: Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice-cream Cone. In: Proceedings of the 5th Annual International Conference on Systems Documentation, New York, USA, pp. 24–26 (1986)
Banerjee, S., Pedersen, T.: An Adapted Lesk Algorithm for Word Sense Disambiguation using WordNet. In: Proceedings of the Third International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City (2002)
Patwardhan, S., Pedersen, T.: Using WordNet-based Context Vectors to Estimate the Semantic Relatedness of Concepts. In: Proceedings of the EACL 2006 Workshop, Making Sense of Sense: Bringing Computational Linguistics and Psycholinguistics Together, Trento, Italy, pp. 1–8 (2006)
Islam, A., Inkpen, D.: Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words. In: Proceedings of the International Conference on Language Resources and Evaluation, Genoa, Italy (2006)
Matveeva, I., Levow, G., Farahat, A., Royer, C.: Terms Representation with Generalized Latent Semantic Analysis. In: Proceedings of the Recent Advances in Natural Language Processing Conference (2005)
Evgeniy, G., Shaul, M.: Computing Semantic Relatedness using Wikipedia based Explicit Semantic Analysis. IJCAI, 1606–1611 (2007)
Rada, R., Mili, H., Bicknell, E., Blettner, M.: Development and Application of a Metric on Semantic Nets. IEEE Transactions on Systems, Man and Cybernetics 19, 17–30 (1989)
Caviedes, J., Cimino, J.: Towards the Development of a ConceptualDistance Metric for the UMLS. Journal of Biomedical Informatic 372, 77–85 (2004)
Wu, Z., Palmer, M.: Verb Semantics and Lexical Selections. In: Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics (1994)
Leacock, C., Chodorow, M.: Combining Local Context and WordNet Similarity for Word Sense Identification in WordNet: An Electronic Lexical Database, pp. 265–283 (1998)
Zhong, J., Zhu, H., Li, J., Yu, Y.: Conceptual Graph Matching for Semantic Search. In: Proceedings of the 10th International Conference on Conceptual Structures, p. 92 (2002)
Nguyen, H.A., Al-Mubaid, H.: New Ontology-based Semantic Similarity Measure for the Biomedical Domain. In: Proceedings of IEEE International Conference on Granular Computing, GrC 2006, pp. 623–628 (2006)
Resnik, P.: Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, pp. 448–453 (1995)
Jiang, J.J., Conrath, D.W.: Semantic Similarity based on Corpus Statistics and Lexical Taxonomy. In: International Conference on Research in Computational Linguistics (1997)
Lin, D.: An Information-theoretic Definition of Similarity. In: 15th International Conference on Machine Learning, Madison, USA (1998)
Pakhomov, S., McInnes, B., Adam, T., Liu, Y., Pedersen, T., Melton, G.: Semantic Similarity and Relatedness between Clinical Terms: An Experimental Study. In: Proceedings of AMIA, pp. 572–576 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pesaranghader, A., Muthaiyah, S. (2013). Definition-based Information Content Vectors for Semantic Similarity Measurement. In: Noah, S.A., et al. Soft Computing Applications and Intelligent Systems. M-CAIT 2013. Communications in Computer and Information Science, vol 378. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40567-9_23
Download citation
DOI: https://doi.org/10.1007/978-3-642-40567-9_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40566-2
Online ISBN: 978-3-642-40567-9
eBook Packages: Computer ScienceComputer Science (R0)