Skip to main content
Log in

Semantic similarity assessment of words using weighted WordNet

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

Word and concept similarity assessment is one of the most important elements in natural language processing and information and knowledge retrieval. WordNet, as a popular concept hierarchy, is used in many such applications. Similarity of words in WordNet is also considered in recent researches. Many researches that use WordNet, have calculated similarity between each pair-word by considering depth of subsumer of the words and shortest path between them. In this paper, three novel models to make better semantic word similarity measure have been presented and it was improved by giving weights to the edges of WordNet hierarchy. It was considered that the nearer an edge is to the root in the hierarchy, the less effect it has in calculating the similarity. Therefore, we have offered a new formula for weighting the edges of hierarchy and based on that, we calculated the distance between two words and depth of words; and then tuned parameters of the transfer functions using particle swarm optimization. Experimental results on a common benchmark, created by human judgment, show that the resultant correlation improved; furthermore our formulae were applied to a more realistic application called sentence similarity assessment and it led to the better results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Matsuoka J, Lepage Y (2011) Ambiguity spotting using WordNet semantic similarity in support to recommended practice for software requirements specifications. In Proceedings of the IEEE international conference on Natural Language Processing and Knowledge Engineering, Tokushima, pp 479–484

    Google Scholar 

  2. Liu PY, Zhao TJ, Yu XF (2006) Application-oreinted comparison and evaluation of six semantic similarity measures based on WordNet. In Proceedings of the Fifth International Conference on Machine Learning and Cybernetics, Dalian, pp 2605–2610

    Google Scholar 

  3. Resnik P (1999) Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. Artif Intell Res 11:95–130

    MATH  Google Scholar 

  4. Seco N, Veale T, Hayes J. (2004) An intrinsic information content metric for semantic similarity in WordNet, Technical report, University College Dublin, pp. 1089–1090

  5. Andrea Rodríguez M, Egenhofer Max J (2003) Determining semantic similarity among entity classes from different ontologies. IEEE Trans Knowl Data Eng 15(2):442–456

    Article  Google Scholar 

  6. Altintas E, Karsligil E, Coskun V (2006) A new semantic similarity measure evaluated in word sense disambiguation. In: S. Werner (ed) Proceedings of the 15th NODALIDA conference, pp 8–11

  7. Hao D, Zuo W, Peng T, He F (2011) An approach for calculating semantic similarity between words using WordNet. In: Second International Conference on digital manufacturing and automation, pp 177–180

  8. Church KW, Hanks P (1989) Word association norms, mutual information, and lexicography. In: Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics, ACL27′89, pp76–83

  9. Hindle D (1990) Noun classification from predicate-argument structures. In: In proceedings of the 28th Annual Meeting of the Association for Computational Linguistics, ACL28′90, pp 268–275

  10. Grefenstette G (1992) Use of syntactic context to produce term association lists for text retrieval. In: Proceedings of the 15th Annual International Conference on Research and Development in Information Retrieval, SIGIR’92, pp 89–97

  11. Liu XY, Zhou YM, Zheng RS (2007) Measuring semantic similarity in WordNet. In: Proceedings of the Sixth International Conference on Machine Learning and Cybernetics, Hong Kong, pp 3431–3435

  12. Resnik P (1995) Using information content to evaluate semantic similarity. In: Proceeding of the 14th International Joint Conference on Artificial Intelligence, Montreal, pp 448–453

  13. Jiang JJ, Conrath DW (1997) Semantic similarity based on corpus statistics and lexical taxonomy. In: Proceeding of International Conference on Research in Computational Linguistics, Taiwan, pp 19–33

  14. Rada R, Mili H, Bicknell E, Bletner M (1989) Development and application of a metric on semantic nets. IEEE Trans Syst Man Cybern 19(1):17–30

    Article  Google Scholar 

  15. Wu Z, Palmer M (1994) Verb semantics and lexical selection. In: 32nd Annual Meeting of the Association for Computational Linguistics, pp 133–138

  16. Leacock C, Chodorow M (1998) Combining local context and WordNet similarity for word sense identification. In: Proceeding of Fellbaum 1998, pp 265–283

  17. Lin D (1998) An information-theoretic definition of similarity. In: Proceeding of the 15th International Conference on Machine Learning, Madison, pp 296–304

  18. Yang D, Powers David MW (2005) Measuring semantic similarity in the taxonomy of WordNet. In: Proceeding of the 28th Australasian Computer Science Conference, Australia, pp 315–332

  19. Li H, Tian Y, Ye B, Cai Q (2010) Comparison of Current Semantic Similarity Methods in WordNet. In: Proceedings of the IEEE International Conference on computer application and system modeling (ICCASM2010), pp 408–411

  20. Wan S, Angryk RA (2007) Measuring semantic similarity using WordNet-based context vectors. In: Proceedings of the IEEE International Conference on systems, man and cybernetics (SMC-IEEE’07), Montreal, Canada, pp 908–913

  21. Li Y, Zuhair AB, McLean D (2003) An approach for measuring semantic similarity between words using multiple information sources. IEEE Trans Knowl Data Eng 15(4):871–882

    Article  Google Scholar 

  22. Knappe R, Bulskov H, Andreasen T (2004) Perspectives on ontology-based querying. Int J Intell Sys 22(7):739–761

    Article  Google Scholar 

  23. Ghazizadeh AM, Naghibzadeh M, Yasrebi E (2010) Using Wordnet to determine semantic similarity of words (IST 2010). IEEE, Tehran, pp 1019–1027

    Google Scholar 

  24. Rubenstein H, Goodenough John B (1965) Contextual correlates of synonymy. Commun ACM 8(10):627–633

    Article  Google Scholar 

  25. Miller G, Beckwith R, Fellbaum C, Gross D, Miller K (1990) Introduction to WordNet: an online lexical database. Int J Lexicogr 3(4):235–244

    Article  Google Scholar 

  26. Fellbaum C (1998) WordNet: an electronic lexical database. Bradford Books, Bradford

    MATH  Google Scholar 

  27. Manna S, Sumudu B, Mendis U (2010) Fuzzy word similarity: a semantic approach using WordNet. In: Proceedings of the IEEE International Conference Fuzzy Systems, pp 1–8

  28. http://wordnet.princeton.edu/wordnet/man2.1/wnstats.7WN.html

  29. Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of the IEEE International Conference on neural networks, pp 1942–1948

  30. Achananuparp P, Hu X, Shen X (2008) The evaluation of sentence similarity measures. In: Proceedings of the 10th International Conference on data warehousing and knowledge discovery. Springer, Berlin, Heidelberg, pp 305–316

  31. http://cogcomp.cs.illinois.edu/page/software_view/12

  32. Pourgholamali F, Kahani M (2012) Semantic role based sentence compression. In: 2nd International eConference on computer and knowledge engineering (ICCKE-2012) (Accepted and ready to be published)

  33. Castillo JJ (2011) A WordNet-based semantic approach to textual entailment and cross-lingual textual entailment. Int J Mach Learn Cybern 2(3):177–189

    Article  Google Scholar 

Download references

Acknowledgments

This research is partially supported by research chancellor, Ferdowsi University of Mashhad, Mashhad, Iran under the contract no. 13203.

We would like to thank F. Pourgholamali for helping us to evaluate our similarity formulae in the sentence similarity application.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mostafa Ghazizadeh Ahsaee.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ghazizadeh Ahsaee, M., Naghibzadeh, M. & Yasrebi Naeini, S.E. Semantic similarity assessment of words using weighted WordNet. Int. J. Mach. Learn. & Cyber. 5, 479–490 (2014). https://doi.org/10.1007/s13042-012-0135-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-012-0135-3

Keywords

Navigation