Skip to main content

Advertisement

Log in

An information Content-Based Approach for Measuring Concept Semantic Similarity in WordNet

  • Published:
Wireless Personal Communications Aims and scope Submit manuscript

    We’re sorry, something doesn't seem to be working properly.

    Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Computing information content (IC) of a concept is a core issue for semantic similarity measures of IC-based. So far, little works focused on calculating the IC of multiple inheritance nodes. So in this paper, a new IC computing model is proposed to calculate the IC of node (including single inheritance node and multiple inheritances node) in WordNet. This model calculates the IC of the concept through the parameters hypernyms, hyponyms, relative depth, maximum nodes and siblings. Experimental results of “poison” snippet in WordNet taxonomy shows that this model can effectively deal with the cases of single inheritance and multiple inheritance. Meanwhile, the results indicate this model is sensitive to distinguish the IC value of nodes while one of relative depth, hyponym, hypernym or sibling is different. Based on proposed model, a taxonomical semantic similarity measure is proposed to compute the semantic similarity of multiple inheritance nodes. Finally, this paper compares proposed approach with other similarity measures based IC in the given fragment of WordNet classification tree, and the results show that the proposed similarity approach acquires good performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  1. Adhikari, A., Singh, S., Dutta, A., & Dutta, B. (2015). A novel information theoretic approach for finding semantic similarity in WordNet (pp. 1–6). At Macau, China: IEEE TENCON.

    Google Scholar 

  2. Cai, Y., Zhang, Q., Lu, W., et al. (2017). A hybrid approach for measuring semantic similarity based on IC-weighted path distance in WordNet. Journal of Intelligent Information Systems, 1, 1–25.

    Article  Google Scholar 

  3. Lofi, C. (2016). Measuring semantic similarity and relatedness with distributional and knowledge-based approaches. Information and Media Technologies, 10(3), 493–501.

    Google Scholar 

  4. Hadj Taieb, M. A., Ben Aouicha, M., & Ben Hamadou, A. (2014). Ontology-based approach for measuring semantic similarity. Engineering Applications of Artificial Intelligence, 36(C), 238–261.

    Article  Google Scholar 

  5. Lin, D. (1999). WordNet: an electronic lexical database. Computational Linguistics, 25(2), 292–296.

    Google Scholar 

  6. Hadj Taieb, M. A., Ben Aouicha, M., & Ben Hamadou, A. (2013). Computing semantic relatedness using wikipedia features. Knowledge-Based Systems, 50(50), 260–278.

    Article  Google Scholar 

  7. Aouicha, M. B., & Taieb, M. A. (2015). Computing semantic similarity between biomedical concepts using new information content approach. Journal of Biomedical Informatics, 59(1), 258–275.

    Google Scholar 

  8. Lu, W., Cai, Y., Che, X., et al. (2016). Joint semantic similarity assessment with raw corpus and structured ontology for semantic-oriented service discovery. Personal and Ubiquitous Computing, 20(3), 311–323.

    Article  Google Scholar 

  9. Pirró*, G., & Euzenat, J. (2010). A feature and information theoretic framework for semantic similarity and relatedness. The semantic Web—ISWC 2010 (pp. 615–630). Berlin Heidelberg: Springer.

    Google Scholar 

  10. Sánchez, D., Solé-Ribalta, A., Batet, M., & Serratosa, F. (2012). Enabling semantic similarity estimation acrossmultiple ontologies: an evaluation in the biomedical domain. Journal of Biomedical Informatics, 45(1), 141–155.

    Article  Google Scholar 

  11. Zhang, Y., Shang, L., Huang, L., et al. (2016). A hybrid similarity measure method for patent portfolio analysis. Journal of Informetrics, 10(4), 1108–1130.

    Article  Google Scholar 

  12. Jiang, J. J., & Conrath, D. W. (1997). Semantic similarity based on corpus statistics and lexical taxonomy. In Proceedings of international conference on research in computational linguistics (pp. 22–24). Taipei, Taiwan.

  13. Seco, N., Veale, T., & Hayes, J. (2004). An intrinsic information content metric for semantic similarity in WordNet. In Eureopean conference on artificial intelligence, Ecai’2004, including prestigious applicants of intelligent systems, Pais 2004, Valencia, Spain (pp. 1089–1090).

  14. Zhou, Z., Wang, Y., & Gu, J. (2008). A new model of information content for semantic similarity in WordNet. In International conference on future generation communication and NETWORKING symposia (pp. 85–89). IEEE.

  15. Sebti, A., & Barfroush, A. A. (2008). A new word sense similarity measure in WordNet. In IEEE international multi conference on computer science and information technology (pp. 369–373).

  16. Meng, L., Gu, J., & Zhou, Z. (2012). A new model of information content based on concepts topology for measuring semantic similarity in WordNet. International Journal of Grid and Distributed Computing, 5(3), 81–94.

    Google Scholar 

  17. David Sanchez and Montserrat Batet. (2012). A new model to compute the information content of concepts from taxonomic knowledge. International Journal on Semantic Web Information Systems Archive, 8(2), 34–50.

    Article  Google Scholar 

  18. Devitt, A., and Vogel, C. (2004). The topology of WordNet: Some metrics. In Proceedings of GWC-04, 2nd global WordNet conference (pp. 106–111).

  19. Pirró*, G. (2009). A semantic similarity metric combining features and intrinsic information content. Data & Knowledge Engineering, 68(11), 1289–1308.

    Article  Google Scholar 

  20. Resnik, P. (1995). Using information content to evaluate semantic similarity in a taxonomy. In International joint conference on artificial intelligence (pp. 448–453). Morgan Kaufmann Publishers Inc.

  21. Tversky, A. (1977). Features of similarity. Psychological Review, 84(4), 290–3052.

    Article  Google Scholar 

  22. Rada, R., Mili, H., Bicknell, E., et al. (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems Man and Cybernetics, 19(1), 17–30.

    Article  Google Scholar 

  23. Wu, Z., & Palmer, M. (1995). Verb semantics and lexical selection. In ACL proceedings of annual meeting on association for computational linguistics (pp. 133–138).

  24. Leacock, C., & Chodorow, M. (1998). Combining local context and WordNet similarity for word sense identification. WordNet: An Electronic Lexical Database, 49(2), 265–283.

    Google Scholar 

Download references

Acknowledgements

The authors acknowledge the National Natural Science Foundation of China (Grant No.: 61562072).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaogang Zhang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, X., Sun, S. & Zhang, K. An information Content-Based Approach for Measuring Concept Semantic Similarity in WordNet. Wireless Pers Commun 103, 117–132 (2018). https://doi.org/10.1007/s11277-018-5429-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11277-018-5429-7

Keywords