Definition-based Information Content Vectors for Semantic Similarity Measurement

  • Conference paper
Soft Computing Applications and Intelligent Systems (M-CAIT 2013)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 378)

Abstract

Ontologies, as representations of shared conceptualizations for a variety of specific domains, are at the heart of the Semantic Web. To facilitate interoperability across multiple ontologies, an automatic mechanism for aligning them is needed. Many methods have therefore been proposed to measure the similarity between concepts in two different ontologies. In this paper, we enumerate these methods along with the shortcomings of each. In information content (IC) based similarity measures, computing the IC of concepts is challenging and in many cases fails. We propose a new approach based on concepts’ definitions, which help us compute information content that is reliable and easy to calculate. Applying these methods to the biomedical domain, using MEDLINE as the corpus, the International Classification of Diseases, Ninth Revision, Clinical Modification (ICD9CM) as the thesaurus, and an available reference standard, we find that our method outperforms other similarity measures.
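
For orientation, the conventional corpus-based IC that such measures build on can be sketched as follows. This is a minimal illustration of the standard formulation, IC(c) = -log p(c), with Resnik and Lin similarity on top of it; it is not the definition-based method proposed in the paper. The taxonomy, the concept names, and the counts are hypothetical and made up for the example.

```python
import math

# Hypothetical toy taxonomy (child -> parent); not taken from ICD9CM.
PARENT = {
    "disease": None,
    "heart disease": "disease",
    "lung disease": "disease",
    "myocardial infarction": "heart disease",
    "angina": "heart disease",
}

# Made-up corpus counts of direct mentions of each concept.
COUNTS = {
    "disease": 10,
    "heart disease": 20,
    "lung disease": 30,
    "myocardial infarction": 25,
    "angina": 15,
}


def ancestors(concept):
    """Return the concept followed by its ancestors up to the root."""
    path = []
    while concept is not None:
        path.append(concept)
        concept = PARENT[concept]
    return path


def information_content(concept):
    """IC(c) = -log p(c), where p(c) counts c and all of its descendants."""
    total = sum(COUNTS.values())
    freq = sum(n for c, n in COUNTS.items() if concept in ancestors(c))
    return -math.log(freq / total)


def lcs(c1, c2):
    """Least common subsumer: the deepest ancestor shared by both concepts."""
    seen = set(ancestors(c1))
    return next(a for a in ancestors(c2) if a in seen)


def resnik(c1, c2):
    """Resnik similarity: the IC of the least common subsumer."""
    return information_content(lcs(c1, c2))


def lin(c1, c2):
    """Lin similarity: shared IC normalised by the concepts' own IC."""
    denom = information_content(c1) + information_content(c2)
    return 2 * resnik(c1, c2) / denom if denom else 0.0


if __name__ == "__main__":
    print(lcs("myocardial infarction", "angina"))
    print(resnik("myocardial infarction", "angina"))
    print(lin("myocardial infarction", "angina"))
```

The sketch depends on reliable per-concept corpus counts, which is the step the abstract describes as challenging and prone to failure.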

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pesaranghader, A., Muthaiyah, S. (2013). Definition-based Information Content Vectors for Semantic Similarity Measurement. In: Noah, S.A., et al. Soft Computing Applications and Intelligent Systems. M-CAIT 2013. Communications in Computer and Information Science, vol 378. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40567-9_23

  • DOI: https://doi.org/10.1007/978-3-642-40567-9_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40566-2

  • Online ISBN: 978-3-642-40567-9

  • eBook Packages: Computer Science, Computer Science (R0)
