Skip to main content

Chinese HowNet-Based Multi-factor Word Similarity Algorithm Integrated of Result Modification

  • Conference paper
Neural Information Processing (ICONIP 2012)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7667))

Included in the following conference series:

Abstract

In this paper, we firstly describe a novel approach to calculate the Chinese sememe similarity based on the HowNet hierarchical sememe tree. When we calculate the sememe similarity, we not only take Semantic Distance, Node Depth and Semantic Coincidence Degree into consideration, but also propose two impact factors named Node Environment Dense (NED) and Node Layer Ratio (NLR) to optimize the calculation process. Secondly, quite a few words described by identical concept definition in HowNet should have a certain discrimination according to human perception, so we propose a hybrid modification algorithm integrated of TongYiCi CiLin (hereinafter, CiLin) to deal with this case. Experiment results of the HowNet-based multi-factor similarity hybrid algorithm shows that this approach improves the similarity of independent sememe words and the words having identical concept descriptions in HowNet, while no large bias influence on the similarity of other words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Liu, Y.J., Xu, Y.: Automatic question answering system based on weighted semantic similarity model. Journal of Southeast University 34(5), 609–612 (2004)

    Google Scholar 

  2. Liu, Q., Li, S.J.: Word similarity computing based on How-net. Computational Linguistics and Chinese Language Processing 17(2), 59–74 (2002)

    Google Scholar 

  3. Li, F., Li, F.: An new approach measuring semantic similarity in HowNet 2000. Journal of Chinese Information Processing 21(3), 99–105 (2007)

    Google Scholar 

  4. Dong, Zh.D., Dong, Q.: HowNet (1999), http://www.keenage.com

  5. Jiang, M., Xia, S.B., Wang, H.W.: An improved word similarity computing method based on HowNet. Journal of Chinese Information Processing 22(5), 84–88 (2008)

    Google Scholar 

  6. Xia, T.: Study on Chinese words semantic similarity computation. Computer Engineering 33(6), 191–194 (2007)

    Google Scholar 

  7. Shi, B., Yan, J.Z., Wang, P.: Ontology-based measure of semantic similarity between concepts. Computer Engineering 35(19), 83–85 (2009)

    Google Scholar 

  8. Resik, P.: Using information content to evaluate semantic similarity. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, pp. 448–453. IEEE Press, Montreal (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wu, B., Yang, J., He, L. (2012). Chinese HowNet-Based Multi-factor Word Similarity Algorithm Integrated of Result Modification. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7667. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34500-5_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-34500-5_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34499-2

  • Online ISBN: 978-3-642-34500-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics