Skip to main content

Sentiment Analysis of Chinese Words Using Word Embedding and Sentiment Morpheme Matching

  • Conference paper
  • First Online:
Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom 2017)

Abstract

Sentiment analysis has become significantly important with the increasing demand of Natural Language Processing (NLP). A novel Chinese Sentiment Words Polarity (CSWP) analyzing method, which is based on sentiment morpheme matching method and word embedding method, is proposed in this paper. In the CSWP, the sentiment morpheme matching method is creatively combined with existing word embedding method, it not only successfully retained the advantages of flexibility and timeliness of the unsupervised methods, but also improved the performance of the original word embedding method. Firstly, the CSWP uses word embedding method to calculate the polarity score for candidate sentiment words, then the sentiment morpheme matching method is applied to make further analysis for the polarity of words. Finally, to deal with the low recognition ratio in the sentiment morpheme matching method, a synonym expanding step is added into the morpheme matching method, which can significantly improve the recognition ratio of the sentiment morpheme matching method. The performance of CSWP is evaluated through extensive experiments on 20000 users’ comments. Experimental results show that the proposed CSWP method has achieved a desirable performance when compared with other two baseline methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/hankcs/HanLP.

  2. 2.

    https://www.istarshine.com/index.php/Data/dataSurvey#platform.

  3. 3.

    http://www.datatang.com/data/44317.

  4. 4.

    http://www.ltp-cloud.com/download/.

References

  1. Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1(2009):12 (2009)

    Google Scholar 

  2. Fei, X., Wang, H., Zhu, J.: Sentiment word identification using the maximum entropy model. In: 2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE), pp. 1–4. IEEE (2010)

    Google Scholar 

  3. Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association For Computational Linguistics, pp. 417–424. Association for Computational Linguistics (2002)

    Google Scholar 

  4. Fan, X., Li, X., Du, F., Li, X., Wei, M.: Apply word vectors for sentiment analysis of app reviews. In: 2016 3rd International Conference on Systems and Informatics (ICSAI), pp. 1062–1066. IEEE (2016)

    Google Scholar 

  5. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-2002 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)

    Google Scholar 

  6. Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, pp. 174–181. Association for Computational Linguistics (1997)

    Google Scholar 

  7. Kanayama, H., Nasukawa, T.: Fully automatic lexicon expansion for domain-oriented sentiment analysis. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pp. 355–363. Association for Computational Linguistics (2006)

    Google Scholar 

  8. Ku, L.-W., Liang, Y.-T., Chen, H.-H.: Opinion extraction, summarization and tracking in news and blog corpora. In: Proceedings of AAAI, pp. 100–107 (2006)

    Google Scholar 

  9. Jansen, B.J., Zhang, M., Sobel, K., Chowdury, A.: Twitter power: Tweets as electronic word of mouth. J. Assoc. Inf. Sci. Technol. 60(11), 2169–2188 (2009)

    Article  Google Scholar 

  10. Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: LREc, vol. 10 (2010)

    Google Scholar 

  11. Kouloumpis, E., Wilson, T., Moore, J.D.: Twitter sentiment analysis: the good the bad and the omg! Icwsm, 11(538-541), 164 (2011)

    Google Scholar 

  12. Huang, S., Han, W., Que, X., Wang, W.: Polarity identification of sentiment words based on emoticons. In: 2013 9th International Conference on Computational Intelligence and Security (CIS), pp. 134–138. IEEE (2013)

    Google Scholar 

  13. Gauch, S., Wang, J.: Corpus analysis for TREC 5 query expansion. In: TREC (1996)

    Google Scholar 

  14. Wiebe, J.: Learning subjective adjectives from corpora. AAAI/IAAI 20 (2000)

    Google Scholar 

  15. Geng, H.T., Cai, Q.S., Kun, Y., Zhao, P.: A kind of automatic text keyphrase extraction method based on word co-occurrence. J. Nanjing Univ. 42(2), 156–162 (2006)

    Google Scholar 

Download references

Acknowledgment

This work was supported by the National Natural Science Foundation of China (61572060, U1536107, 61472024), and CERNET Innovation Project (NGII20151104, NGII20160316).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianwei Niu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Niu, J., Sun, M., Mo, S. (2018). Sentiment Analysis of Chinese Words Using Word Embedding and Sentiment Morpheme Matching. In: Romdhani, I., Shu, L., Takahiro, H., Zhou, Z., Gordon, T., Zeng, D. (eds) Collaborative Computing: Networking, Applications and Worksharing. CollaborateCom 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 252. Springer, Cham. https://doi.org/10.1007/978-3-030-00916-8_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00916-8_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00915-1

  • Online ISBN: 978-3-030-00916-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics