An Empirical Study and Comparison for Tweet Sentiment Analysis

Yan, Leiming; Tao, Hao

doi:10.1007/978-3-319-48674-1_55

Leiming Yan^17,18 &
Hao Tao¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10040))

Included in the following conference series:

International Conference on Cloud Computing and Security

Abstract

Tweet sentiment analysis has been an effective and valuable technique in the sentiment analysis domain. We conduct a systematic and thorough empirical study on traditional machine learning algorithms and two deep learning approaches for tweet sentiment analysis, and expect to provide a guideline for choosing which efficient classification algorithms. Based on our experiments, we found that the Support Vector Machine and the Random Forest work better statistically than other methods. Although deep learning approaches have achieved many successes in image and voice processing, simple RNN and LSTM networks do not outweigh SVM and RF in our experiments. Moreover, for the tweet feature selection, the combination of bi-grams, SentiWordNet and Stop words removal shows more effectiveness in accuracy improving.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Liu, B.: Opinion mining and sentiment analysis. In: Liu, B. (ed.) Web data mining: exploring hyperlinks, contents, and usage data. Data-Centric Systems and Applications, pp. 459–526. Springer, Heidelberg (2011)
Chapter Google Scholar
Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. In: Cs224n Project Report, Stanford, vol. 1, p. 12 (2009)
Google Scholar
Gu, B., Sheng, V.S.: A Robust regularization path algorithm for ν-support vector classification. IEEE Trans. Neural Netw. Learn. Syst. (2016). doi:10.1109/TNNLS.2016.2527796
Google Scholar
Bin, Gu, Sheng, V.S., Wang, Z., Ho, D., Osman, S., Li, S.: Incremental learning for ν-support vector regression. Neural Netw. 67, 140–150 (2015)
Article Google Scholar
Wen, X., Shao, L., Xue, Y., Fang, W.: A rapid learning algorithm for vehicle classification. Inf. Sci. 295(1), 395–406 (2015)
Article Google Scholar
Ravi, K., Ravi, V.: A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl. Based Syst. 89.C, 14–46 (2015)
Article Google Scholar
Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: International Conference on Language Resources and Evaluation, LREC 2010, 17–23 May 2010, Valletta, Malta, pp. 83–90 (2010)
Google Scholar
Mikolov, T, et al.: Recurrent neural network based language model. In: INTERSPEECH 2010, Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 2010, pp. 1045–1048
Google Scholar
Cheng, J., et al.: Exploring sentiment parsing of microblogging texts for opinion polling on chinese public figures. Appl. Intell. 45(2), 1–14 (2016)
Article Google Scholar
Sundermeyer, M., Schlüter, R., Ney, H.: LSTM neural networks for language modeling. In: Interspeech, pp. 601–608 (2012)
Google Scholar
Wang, X., et al.: Predicting polarities of tweets by composing word embeddings with long short-term memory. In: Meeting of the Association for Computational Linguistics and the, International Joint Conference on Natural Language Processing (2015)
Google Scholar
Anupama, R., Rajeswar, S., Chaudhury, S.: A hypothesize-and-verify framework for text recognition using deep recurrent neural networks. In: International Conference on Document Analysis and Recognition IEEE Computer Society, pp. 936–940 (2015)
Google Scholar
Saif, H., et al.: Evaluation datasets for twitter sentiment analysis. a survey and a new dataset, the STS-Gold. In: Workshop: Emotion and Sentiment in Social and Expressive Media: Approaches and Perspectives From Ai (2013)
Google Scholar

Download references

Acknowledgements

This work is supported by the NSFC (61272421, 41271410), the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD)

Author information

Authors and Affiliations

Jiangsu Engineering Center of Network Monitoring of Nanjing University Information Science Technology, Nanjing, China
Leiming Yan
School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing, China
Leiming Yan
Faculty of Computer Science, University of New Brunswick, Fredericton, Canada
Hao Tao

Authors

Leiming Yan
View author publications
You can also search for this author in PubMed Google Scholar
Hao Tao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Leiming Yan .

Editor information

Editors and Affiliations

University of Information Science and Technology, Nanjing, China
Xingming Sun
Michigan State University , EAST LANSING, Michigan, USA
Alex Liu
National Dong Hwa University , Shoufeng, Taiwan
Han-Chieh Chao
Purdue University , West Lafayette, Indiana, USA
Elisa Bertino

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yan, L., Tao, H. (2016). An Empirical Study and Comparison for Tweet Sentiment Analysis. In: Sun, X., Liu, A., Chao, HC., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2016. Lecture Notes in Computer Science(), vol 10040. Springer, Cham. https://doi.org/10.1007/978-3-319-48674-1_55

Download citation

DOI: https://doi.org/10.1007/978-3-319-48674-1_55
Published: 01 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48673-4
Online ISBN: 978-3-319-48674-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics