Abstract
In this paper, we firstly describe a novel approach to calculate the Chinese sememe similarity based on the HowNet hierarchical sememe tree. When we calculate the sememe similarity, we not only take Semantic Distance, Node Depth and Semantic Coincidence Degree into consideration, but also propose two impact factors named Node Environment Dense (NED) and Node Layer Ratio (NLR) to optimize the calculation process. Secondly, quite a few words described by identical concept definition in HowNet should have a certain discrimination according to human perception, so we propose a hybrid modification algorithm integrated of TongYiCi CiLin (hereinafter, CiLin) to deal with this case. Experiment results of the HowNet-based multi-factor similarity hybrid algorithm shows that this approach improves the similarity of independent sememe words and the words having identical concept descriptions in HowNet, while no large bias influence on the similarity of other words.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Liu, Y.J., Xu, Y.: Automatic question answering system based on weighted semantic similarity model. Journal of Southeast University 34(5), 609–612 (2004)
Liu, Q., Li, S.J.: Word similarity computing based on How-net. Computational Linguistics and Chinese Language Processing 17(2), 59–74 (2002)
Li, F., Li, F.: An new approach measuring semantic similarity in HowNet 2000. Journal of Chinese Information Processing 21(3), 99–105 (2007)
Dong, Zh.D., Dong, Q.: HowNet (1999), http://www.keenage.com
Jiang, M., Xia, S.B., Wang, H.W.: An improved word similarity computing method based on HowNet. Journal of Chinese Information Processing 22(5), 84–88 (2008)
Xia, T.: Study on Chinese words semantic similarity computation. Computer Engineering 33(6), 191–194 (2007)
Shi, B., Yan, J.Z., Wang, P.: Ontology-based measure of semantic similarity between concepts. Computer Engineering 35(19), 83–85 (2009)
Resik, P.: Using information content to evaluate semantic similarity. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, pp. 448–453. IEEE Press, Montreal (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, B., Yang, J., He, L. (2012). Chinese HowNet-Based Multi-factor Word Similarity Algorithm Integrated of Result Modification. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7667. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34500-5_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-34500-5_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34499-2
Online ISBN: 978-3-642-34500-5
eBook Packages: Computer ScienceComputer Science (R0)