Abstract
Assessing the similarity of words and concepts is one of the most important tasks in natural language processing and in information and knowledge retrieval. WordNet, a popular concept hierarchy, is used in many such applications, and word similarity in WordNet has been the subject of recent research. Many WordNet-based studies calculate the similarity of a word pair from the depth of the words' subsumer and the shortest path between the words. In this paper, we present three novel models that improve semantic word similarity measurement by assigning weights to the edges of the WordNet hierarchy. We assume that the nearer an edge is to the root of the hierarchy, the less effect it has on the similarity. We therefore propose a new formula for weighting the edges of the hierarchy, use it to calculate the distance between two words and the depth of words, and then tune the parameters of the transfer functions using particle swarm optimization. Experimental results on a common benchmark built from human judgments show that the resulting correlation improved. Furthermore, our formulae were applied to a more realistic application, sentence similarity assessment, where they also led to better results.
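The idea of depth-dependent edge weights can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not the paper's formula: the toy hierarchy, the weight function `edge_weight`, the transfer function (an exponential decay in distance multiplied by a hyperbolic tangent in subsumer depth, a form used in earlier WordNet similarity work), and the placeholder values of `alpha` and `beta`. The particle swarm tuning step is omitted.

```python
import math

# Hypothetical toy is-a hierarchy (child -> parent); not taken from WordNet.
PARENT = {
    "entity": None,
    "animal": "entity",
    "vehicle": "entity",
    "dog": "animal",
    "cat": "animal",
    "car": "vehicle",
}


def path_to_root(node):
    """Return [node, parent, ..., root]."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def edge_weight(child_depth):
    """Assumed weight of the edge entering a node at the given depth.

    The weight grows with depth, so edges near the root contribute less.
    """
    return 1.0 - 1.0 / (child_depth + 1)


def weighted_depth(node):
    """Sum of edge weights on the path from the root down to node."""
    depth = len(path_to_root(node)) - 1
    return sum(edge_weight(d) for d in range(1, depth + 1))


def subsumer(a, b):
    """Deepest common ancestor of a and b."""
    ancestors_a = set(path_to_root(a))
    for node in path_to_root(b):
        if node in ancestors_a:
            return node
    raise ValueError("no common ancestor")


def weighted_distance(a, b):
    """Weighted length of the path between a and b through their subsumer."""
    s = subsumer(a, b)
    return weighted_depth(a) + weighted_depth(b) - 2 * weighted_depth(s)


def similarity(a, b, alpha=0.2, beta=0.6):
    """Assumed transfer function; alpha and beta are untuned placeholders."""
    distance = weighted_distance(a, b)
    depth = weighted_depth(subsumer(a, b))
    return math.exp(-alpha * distance) * math.tanh(beta * depth)


if __name__ == "__main__":
    print(similarity("dog", "cat"))
    print(similarity("dog", "car"))
```

In this toy setting, "dog" and "cat" share the subsumer "animal" and score higher than "dog" and "car", whose only common ancestor is the root. In a real system the parameters would be fitted against human similarity judgments, for example with particle swarm optimization as the abstract describes.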
Acknowledgments
This research is partially supported by the research chancellor of Ferdowsi University of Mashhad, Mashhad, Iran, under contract no. 13203.
We would like to thank F. Pourgholamali for helping us to evaluate our similarity formulae in the sentence similarity application.
Cite this article
Ghazizadeh Ahsaee, M., Naghibzadeh, M. & Yasrebi Naeini, S.E. Semantic similarity assessment of words using weighted WordNet. Int. J. Mach. Learn. & Cyber. 5, 479–490 (2014). https://doi.org/10.1007/s13042-012-0135-3
DOI: https://doi.org/10.1007/s13042-012-0135-3