Construction of Microblog-Specific Chinese Sentiment Lexicon Based on Representation Learning

Kong, Li; Li, Chuanyi; Ge, Jidong; Yang, Yufan; Zhang, Feifei; Luo, Bin

doi:10.1007/978-3-319-97304-3_16

Li Kong^15,16,
Chuanyi Li^15,16,
Jidong Ge^15,16,
Yufan Yang^15,16,
Feifei Zhang^15,16 &
…
Bin Luo^15,16

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11012))

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

3484 Accesses
2 Citations

Abstract

Sentiment analysis is a research hotspot in Nature Language Processing, and high-quality sentiment lexicon plays an important part in sentiment analysis. In this paper, we explore an approach to build a microblog-specific Chinese sentiment lexicon from massive microblog data. In feature learning, in order to enhance the quality of word embedding, we build a neural architecture to train a sentiment-aware word embedding by integrating three kinds of knowledge, including the context words and their composing characters, the polarity of sentences and the polarity of labeled words. Experiments conducted on several public datasets show that in both unsupervised and supervised microblog sentiment classification, the lexicon generated by our approach achieves the state-of-the-art performance compared to several existing Chinese sentiment lexicons and our feature learning method successfully catches both semantics and sentiment information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Chinese Microblog Sentiment Analysis Based on Sentiment Features

SVM-Based Sentiment Analysis Algorithm of Chinese Microblog Under Complex Sentence Pattern

RETRACTED ARTICLE: A combination of TEXTCNN model and Bayesian classifier for microblog sentiment analysis

Article 11 May 2023

Notes

References

Liu, B.: Sentiment Analysis and Opinion Mining. University of Illinois at Chicago, Chicago (2012)
Google Scholar
Chen, X., Xu, L., Liu, Z.: Joint learning of character and word embeddings. In: Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, pp. 1236–1242 (2015)
Google Scholar
Yu, J., Jian, X., Xin, H.: Joint embeddings of chinese words, characters, and fine-grained subcharacter components. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 286–291 (2017)
Google Scholar
Feng, S., Song, K., Wang, D.: A word-emoticon mutual reinforcement ranking model for building sentiment lexicon from massive collection of microblogs. World Wide Web 18, 949–967 (2015)
Article Google Scholar
Wu, F., Huang, Y., Song, Y.: Towards building a high-quality microblog-specific Chinese sentiment lexicon. Decis. Support Syst. 87, 39–49 (2016)
Article Google Scholar
Tan, J., Xu, M., Shang, L., Jia, X.: Sentiment analysis for images on microblogging by integrating textual information with multiple kernel learning. In: Booth, R., Zhang, M.-L. (eds.) PRICAI 2016. LNCS, vol. 9810, pp. 496–506. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-42911-3_41
Chapter Google Scholar
Tang, D., Wei, F., Qin, B.: Building large-scale Twitter-specific sentiment lexicon: a representation learning approach. In: Proceedings of the 25th International Conference on Computational Linguistics: Technical Papers, pp. 172–182 (2014)
Google Scholar
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177 (2004)
Google Scholar
Heerschop, B., Hogenboom, A., Frasincar, F.: Sentiment lexicon creation from lexical resources. In: Abramowicz, W. (ed.) BIS 2011. LNBIP, vol. 87, pp. 185–196. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-21863-7_16
Chapter Google Scholar
Esuli, A., Sebastiani, F.: PageRanking WordNet synsets: an application to opinion mining. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, ACL, pp. 424–431 (2010)
Google Scholar
Hatzivassiloglou, V., McKeown, K.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, pp. 174–181 (1997)
Google Scholar
Kanayama, H., Nasukawa, T.: Fully automatic lexicon expansion for domain-oriented sentiment analysis. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pp. 355–363 (2006)
Google Scholar
Qiu, G., Liu, B., Bu, J., Chen, C.: Opinion word expansion and target extraction through double propagation. Comput. Linguist. 37(1), 9–27 (2011)
Article Google Scholar
Turney, P.: Thumbs up or thumbs down?: Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424 (2002)
Google Scholar
Hamilton, W., Clark, K., Leskovec, J.: Inducing domain-specific sentiment lexicons from unlabeled corpora. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 595–605 (2016)
Google Scholar
Islam, M., Inkpen, D.: Second order co-occurrence PMI for determining the semantic similarity of words. In: Language Resources and Evaluation, pp. 1033–1038 (2006)
Google Scholar
Vo, D., Zhang, Y.: Don’t count, predict! An automatic approach to learning sentiment lexicons for short text. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 219–224 (2016)
Google Scholar
Mikolov, T., Chen, K., Corrado, G.: Efficient estimation of word representations in vector space. http://arxiv.org/abs/1309.4168 (2013)
Wang, L., Xia, R.: Sentiment lexicon construction with representation learning based on hierarchical sentiment supervision. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 513–521 (2017)
Google Scholar
Su, T., Lee, H.: Learning Chinese word representations from glyphs of characters. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 264–273 (2017)
Google Scholar
Xu, J., Liu, J., Zhang, L.: Improve Chinese word embeddings by exploiting internal structure. In: Proceedings of NAACL-HLT, pp. 1041–1050 (2016)
Google Scholar
Yin, R., Wang, Q., Liu, R.: Multi-granularity Chinese word embedding. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 981–986 (2016)
Google Scholar
Sun, Y., Lin, L., Yang, N., Ji, Z., Wang, X.: Radical-enhanced Chinese character embedding. In: Loo, C.K., Yap, K.S., Wong, K.W., Teoh, A., Huang, K. (eds.) ICONIP 2014. LNCS, vol. 8835, pp. 279–286. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-12640-1_34
Chapter Google Scholar
Mikolov, T., Sutskever, L., Chen, K.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Mohammad, S.: Building the state-of-the-art in sentiment analysis of tweets. In: Proceedings of the Seventh International Workshop on Semantic Evaluation Exercises, SemEval 2013, pp. 321–327 (2013)
Google Scholar
Myers, J., Well, A., Lorch, R.: Research Design and Statistical Analysis, 2nd edn. Routledge, London (2010)
Google Scholar

Download references

Acknowledgment

This work was supported by the National Key R&D Program of China (2016YFC0800803).

Author information

Authors and Affiliations

State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
Li Kong, Chuanyi Li, Jidong Ge, Yufan Yang, Feifei Zhang & Bin Luo
Software Institute, Nanjing University, Nanjing, China
Li Kong, Chuanyi Li, Jidong Ge, Yufan Yang, Feifei Zhang & Bin Luo

Authors

Li Kong
View author publications
You can also search for this author in PubMed Google Scholar
Chuanyi Li
View author publications
You can also search for this author in PubMed Google Scholar
Jidong Ge
View author publications
You can also search for this author in PubMed Google Scholar
Yufan Yang
View author publications
You can also search for this author in PubMed Google Scholar
Feifei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Bin Luo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Chuanyi Li or Jidong Ge .

Editor information

Editors and Affiliations

Southeast University, Nanjing, China
Xin Geng
University of Tasmania, Hobart, Tasmania, Australia
Byeong-Ho Kang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kong, L., Li, C., Ge, J., Yang, Y., Zhang, F., Luo, B. (2018). Construction of Microblog-Specific Chinese Sentiment Lexicon Based on Representation Learning. In: Geng, X., Kang, BH. (eds) PRICAI 2018: Trends in Artificial Intelligence. PRICAI 2018. Lecture Notes in Computer Science(), vol 11012. Springer, Cham. https://doi.org/10.1007/978-3-319-97304-3_16

Download citation

DOI: https://doi.org/10.1007/978-3-319-97304-3_16
Published: 27 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97303-6
Online ISBN: 978-3-319-97304-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics