Abstract
Sentiment analysis has grown in importance with the rising demand for Natural Language Processing (NLP). This paper proposes a novel Chinese Sentiment Words Polarity (CSWP) analysis method that combines sentiment morpheme matching with an existing word embedding method. The combination retains the flexibility and timeliness of unsupervised methods while improving on the performance of the original word embedding method. CSWP first uses the word embedding method to calculate a polarity score for each candidate sentiment word, then applies sentiment morpheme matching to further analyze the word's polarity. Finally, to address the low recognition ratio of sentiment morpheme matching, a synonym expansion step is added to the matching method, which significantly improves its recognition ratio. CSWP is evaluated through extensive experiments on 20,000 users’ comments. The experimental results show that CSWP achieves desirable performance compared with two baseline methods.
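The three stages named in the abstract (embedding-based polarity score, sentiment morpheme matching, synonym expansion) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy two-dimensional vectors, the seed words, the morpheme sets, the synonym table, all function names, and in particular the rule that morpheme evidence overrides the embedding score when it is available.

```python
import math

# Hypothetical toy data; a real system would load embeddings trained on a
# large corpus and use curated morpheme and synonym resources.
EMBEDDINGS = {
    "好": (0.9, 0.1),
    "坏": (-0.8, 0.2),
    "优秀": (0.85, 0.3),
    "糟糕": (-0.9, 0.1),
    "给力": (0.7, 0.4),
}
POSITIVE_SEEDS = ["好"]
NEGATIVE_SEEDS = ["坏"]
POSITIVE_MORPHEMES = {"好", "优", "美"}
NEGATIVE_MORPHEMES = {"坏", "糟", "恶"}
SYNONYMS = {"给力": ["优秀"]}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 for a zero vector)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def embedding_score(word):
    """Mean similarity to positive seeds minus mean similarity to negative seeds."""
    vec = EMBEDDINGS.get(word)
    if vec is None:
        return 0.0
    pos = sum(cosine(vec, EMBEDDINGS[s]) for s in POSITIVE_SEEDS) / len(POSITIVE_SEEDS)
    neg = sum(cosine(vec, EMBEDDINGS[s]) for s in NEGATIVE_SEEDS) / len(NEGATIVE_SEEDS)
    return pos - neg


def morpheme_score(word):
    """Count of positive morphemes minus count of negative morphemes in the word."""
    pos = sum(ch in POSITIVE_MORPHEMES for ch in word)
    neg = sum(ch in NEGATIVE_MORPHEMES for ch in word)
    return pos - neg


def morpheme_score_with_synonyms(word):
    """Fall back to the word's synonyms when the word itself matches no morpheme."""
    score = morpheme_score(word)
    if score != 0:
        return score
    return sum(morpheme_score(s) for s in SYNONYMS.get(word, []))


def polarity(word):
    """Assumed combination rule: morpheme evidence decides when it exists,
    otherwise the sign of the embedding score is used."""
    e = embedding_score(word)
    m = morpheme_score_with_synonyms(word)
    signal = m if m != 0 else e
    if signal > 0:
        return "positive"
    if signal < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    for w in ["优秀", "糟糕", "给力", "桌子"]:
        print(w, polarity(w))
```

In this sketch the word 给力 contains none of the listed morphemes, so it is only recognized by morpheme matching through its hypothetical synonym 优秀. That is the kind of recognition gap the synonym expansion step is described as addressing.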
Acknowledgment
This work was supported by the National Natural Science Foundation of China (61572060, U1536107, 61472024) and the CERNET Innovation Project (NGII20151104, NGII20160316).
Copyright information
© 2018 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Niu, J., Sun, M., Mo, S. (2018). Sentiment Analysis of Chinese Words Using Word Embedding and Sentiment Morpheme Matching. In: Romdhani, I., Shu, L., Takahiro, H., Zhou, Z., Gordon, T., Zeng, D. (eds) Collaborative Computing: Networking, Applications and Worksharing. CollaborateCom 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 252. Springer, Cham. https://doi.org/10.1007/978-3-030-00916-8_1
DOI: https://doi.org/10.1007/978-3-030-00916-8_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00915-1
Online ISBN: 978-3-030-00916-8
eBook Packages: Computer Science, Computer Science (R0)