Research on Fine-Grained Sentiment Classification

Wang, Zhihui; Wang, Xiaodong; Chang, Tao; Lv, Shaohe; Guo, Xiaoting

doi:10.1007/978-3-030-32236-6_39

Zhihui Wang (ORCID: orcid.org/0000-0003-4612-7632), Xiaodong Wang, Tao Chang, Shaohe Lv & Xiaoting Guo

Conference paper
First Online: 30 September 2019

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11839)

Abstract

This paper analyzes in depth SST-1, a commonly used dataset for fine-grained sentiment classification, the task of distinguishing degrees of emotional intensity. The analysis finds that the dataset suffers from serious problems, notably class imbalance and a small overall size, which severely limit classification performance. To address these problems, data augmentation is used to improve the dataset: IMDB and other data that are relatively homologous to the original dataset are annotated, with a focus on expanding the categories that have fewer examples. This effectively alleviates the class imbalance and enlarges the original dataset. A benchmark classification model is then built on Bidirectional Encoder Representations from Transformers (BERT), a model with good overall performance on natural language processing tasks. Multiple comparison experiments on the original dataset and the enhanced data verify that the deficiencies of the original dataset affect classification performance. They also demonstrate that the enhanced data effectively improves the test results and largely resolves the large performance differences between categories.
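The balancing idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the function names, the five-class label encoding (0 to 4), the rule of topping up each class to the size of the largest class, and the toy data are all assumptions made for illustration.

```python
from collections import Counter


def class_counts(labels, num_classes=5):
    """Return the number of examples per class, indexed by class id."""
    counts = Counter(labels)
    return [counts.get(c, 0) for c in range(num_classes)]


def imbalance_ratio(counts):
    """Largest class size divided by smallest class size."""
    smallest = min(counts)
    return float("inf") if smallest == 0 else max(counts) / smallest


def augment_minority(train, pool, num_classes=5):
    """Add annotated pool examples only to classes below the largest class.

    `train` and `pool` are lists of (text, label) pairs. The target size
    (the current largest class) is an assumption of this sketch.
    """
    counts = class_counts([label for _, label in train], num_classes)
    target = max(counts)
    added = []
    for text, label in pool:
        if counts[label] < target:
            added.append((text, label))
            counts[label] += 1
    return train + added


# Toy data (hypothetical, not drawn from SST-1 or IMDB).
train = ([("a", 2)] * 4 + [("b", 0)] * 1 + [("c", 1)] * 2
         + [("d", 3)] * 2 + [("e", 4)] * 1)
pool = [("p", 0)] * 5 + [("q", 4)] * 2 + [("r", 2)] * 3

before = class_counts([y for _, y in train])
augmented = augment_minority(train, pool)
after = class_counts([y for _, y in augmented])
print(before, imbalance_ratio(before))
print(after, imbalance_ratio(after))
```

In this toy run the pool has no examples for classes 1 and 3, so those classes stay small; that is why the sketch reports the ratio before and after rather than assuming perfect balance.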

Supported by National Key Laboratory of Parallel and Distributed Processing, College of Computer Science and Technology, National University of Defence Technology.

Author information

Authors and Affiliations

National Key Laboratory of Parallel and Distributed Processing, College of Computer Science and Technology, National University of Defence Technology, Changsha, 410073, China
Zhihui Wang, Xiaodong Wang, Tao Chang, Shaohe Lv & Xiaoting Guo

Corresponding author

Correspondence to Zhihui Wang.

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Jie Tang
National University of Singapore, Singapore, Singapore
Min-Yen Kan
Peking University, Beijing, China
Dongyan Zhao
Peking University, Beijing, China
Sujian Li
Zhengzhou University, Zhengzhou, China
Hongying Zan

Cite this paper

Wang, Z., Wang, X., Chang, T., Lv, S., Guo, X. (2019). Research on Fine-Grained Sentiment Classification. In: Tang, J., Kan, MY., Zhao, D., Li, S., Zan, H. (eds) Natural Language Processing and Chinese Computing. NLPCC 2019. Lecture Notes in Computer Science, vol 11839. Springer, Cham. https://doi.org/10.1007/978-3-030-32236-6_39

DOI: https://doi.org/10.1007/978-3-030-32236-6_39
Published: 30 September 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32235-9
Online ISBN: 978-3-030-32236-6
eBook Packages: Computer Science, Computer Science (R0)
