Skip to main content

An Approach to Constructing Sentiment Collocation Dictionary for Chinese Short Text Based on Word2Vec

  • Conference paper
  • First Online:
Emerging Technologies for Education (SETE 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10676))

Included in the following conference series:

Abstract

The sentiment analysis of short texts is an important research hotspot in natural language processing. Based on the word features, this paper constructs a binary sentiment dictionary for a Chinese short text corpus using statistical methods. Then we calculate the sentiment value of the dictionary by Word2Vec algorithm and seed words. To evaluate the effectiveness of the dictionary, we manually annotated sentiment of the dictionary and compared with the calculation result. We also compared the performance effects of using different emotional dictionaries for the sentiment classification. The results show that the sentiment collocation dictionary is performed well in the emotional classification of Chinese short texts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Redman, S., Ellis, R.: A way with words. Book 1. Cambridge University Press, Cambridge (1989)

    Google Scholar 

  2. Esuli, A., Sebastoamo, F.: Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of LREC Genoa-Italy: LREC, pp. 417–422 (2006)

    Google Scholar 

  3. Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: International Conference on Language Resources and Evaluation, pp. 83–90 (2010)

    Google Scholar 

  4. Tang, D.: Nation Taiwan University: simplified Chinese emotional dictionary (2013). http://www.datatang.com/data/11837

  5. Xu, L., Lin, H.F., Pan, Y., et al.: Constructing the affective lexicon ontology. J. China Soc. Sci. Tech. Inf. 27(2), 180–185 (2008)

    Google Scholar 

  6. Yang, A.M., Lin, J.H., Zhou, Y.M.: Method on building Chinese text sentiment lexicon. J. Front. Comput. Sci. Technol. 7(11), 1033–1039 (2013)

    Google Scholar 

  7. Zhou, Y.M., Yang, A.M., Lin, J.H.: Construction method of sentiment lexicon for news reviews. J. Shandong Univ. (Eng. Sci.) 44(3), 36–40 (2014)

    Google Scholar 

  8. Zhou, Y.M., Yang, A.M., Yang, J.N.: A method of building Chinese microblog sentiment lexicon. Comput. Sci. 41(8), 67–70 (2014)

    Google Scholar 

  9. Zhou, J.F., Yang, A.M., et al.: Micro-bolg sentimental feature selection based on bigram collocation. Comput. Eng. 40(6), 162–165 (2014)

    Google Scholar 

  10. Hamdan, H., Bellot, P., Bechet, F.: Sentiment lexicon-based features for sentiment analysis in short text. In: International Conference on Intelligent Text Processing and Computational Linguistics (2015)

    Google Scholar 

  11. Zhang, C., Zeng, D., Li, J., Wang, F.Y., Zuo, W.: Sentiment analysis of Chinese documents. J. Assoc. Inf. Sci. Technol. 60(12), 2474–2487 (2009)

    Article  Google Scholar 

  12. Tomas, M.: Word2Vec project (2014). https://code.google.com/p/Word2Vec/

  13. Zhang, D., Xu, H., Su, Z., Xu, Y.: Chinese comments sentiment classification based on Word2Vec and SVM perf. Expert Syst. Appl. 42(4), 1857–1863 (2015)

    Article  Google Scholar 

  14. Joseph, T., Lev, R., Yoshua, B.: Word representations: a simple and general method for semi-supervised learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 384–394 (2010)

    Google Scholar 

  15. Chen, Y.J.: The Automatic Extraction Method of Collocation of Words in Modern Chinese. East China Normal University (2005)

    Google Scholar 

  16. Wang, S., Yang, A.: A method of collocation orientation identification based on hybrid language information. J. Chin. Motion Process. 24(3), 69–74 (2012)

    MathSciNet  Google Scholar 

  17. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: Proceedings of Workshop at ICLR (2013)

    Google Scholar 

  18. Su, J.: Incredible Word2Vec.trained models (2017). http://spaces.ac.cn/archives/4304/comment-page-1

  19. Řehůřek, R.: Gensim: Topic modelling for humans (2017). http://radimrehurek.com/genism

Download references

Acknowledgments

The work was supported by the University Innovative Talent Project of Guangdong Province (2013).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yangqing Lin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhou, J., Chen, B., Lin, Y. (2017). An Approach to Constructing Sentiment Collocation Dictionary for Chinese Short Text Based on Word2Vec. In: Huang, TC., Lau, R., Huang, YM., Spaniol, M., Yuen, CH. (eds) Emerging Technologies for Education. SETE 2017. Lecture Notes in Computer Science(), vol 10676. Springer, Cham. https://doi.org/10.1007/978-3-319-71084-6_64

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-71084-6_64

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-71083-9

  • Online ISBN: 978-3-319-71084-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics