A Similarity Algorithm Based on the Generality and Individuality of Words

Zou, Yinfeng; Ouyang, Chunping; Liu, Yongbin; Yang, Xiaohua; Yu, Ying

doi:10.1007/978-3-319-50496-4_48


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 10102)


Abstract

HowNet is a popular resource for computing Chinese text similarity. This study finds that HowNet's architecture, vocabulary organization, and concept descriptions still have shortcomings that affect word similarity measurement. Hence, on the basis of an analysis of the generality and individuality of words in HowNet, a similarity algorithm based on the generality and individuality of words is proposed. The experimental data come from the NLPCC-ICCPOL 2016 Chinese word similarity evaluation task data set. Experimental results show that the algorithm is feasible and stable, and that it performs better than some other classic algorithms. Moreover, the size of the experimental data set has little influence on the results. Across all experiments, the Pearson correlation coefficient and the Spearman coefficient stably reached 0.460 and 0.440, respectively.
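The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation script: the function names and the sample scores below are hypothetical, and the code only shows how Pearson and Spearman correlations between system similarity scores and human ratings are commonly computed. Spearman is computed here as the Pearson correlation of average ranks, which handles tied scores.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the block of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical data: human ratings and system scores for five word pairs.
human = [9.1, 7.5, 3.2, 1.0, 5.5]
system = [0.95, 0.60, 0.40, 0.05, 0.70]

print("Pearson: ", round(pearson(human, system), 3))
print("Spearman:", round(spearman(human, system), 3))
```

Pearson measures linear agreement between the raw scores, while Spearman depends only on the ordering of the word pairs, so a system that ranks pairs correctly on a different scale can score higher on Spearman than on Pearson.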



Acknowledgement

This research is supported by the National Science Foundation of China (No. 61402220, No. 61502221), the Scientific Research Fund of Hunan Provincial Education Department (No. 14B153, No. 16C1378, No. 15C1186), and the Philosophy and Social Science Foundation of Hunan Province (No. 14YBA335).

Author information

Authors and Affiliations

School of Computer Science and Technology, University of South China, Hengyang, 421001, China
Yinfeng Zou, Chunping Ouyang, Yongbin Liu, Xiaohua Yang & Ying Yu


Corresponding author

Correspondence to Chunping Ouyang.

Editor information

Editors and Affiliations

Microsoft Research Asia, Beijing, China
Chin-Yew Lin
Brandeis University, Waltham, Massachusetts, USA
Nianwen Xue
Peking University, Beijing, China
Dongyan Zhao
Fudan University, Shanghai, China
Xuanjing Huang
Peking University, Beijing, China
Yansong Feng


About this paper

Cite this paper

Zou, Y., Ouyang, C., Liu, Y., Yang, X., Yu, Y. (2016). A Similarity Algorithm Based on the Generality and Individuality of Words. In: Lin, C.-Y., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL NLPCC 2016. Lecture Notes in Computer Science, vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_48


DOI: https://doi.org/10.1007/978-3-319-50496-4_48
Published: 02 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50495-7
Online ISBN: 978-3-319-50496-4
eBook Packages: Computer Science, Computer Science (R0)
