Abstract
Tweet sentiment analysis has been an effective and valuable technique in the sentiment analysis domain. We conduct a systematic and thorough empirical study on traditional machine learning algorithms and two deep learning approaches for tweet sentiment analysis, and expect to provide a guideline for choosing which efficient classification algorithms. Based on our experiments, we found that the Support Vector Machine and the Random Forest work better statistically than other methods. Although deep learning approaches have achieved many successes in image and voice processing, simple RNN and LSTM networks do not outweigh SVM and RF in our experiments. Moreover, for the tweet feature selection, the combination of bi-grams, SentiWordNet and Stop words removal shows more effectiveness in accuracy improving.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Liu, B.: Opinion mining and sentiment analysis. In: Liu, B. (ed.) Web data mining: exploring hyperlinks, contents, and usage data. Data-Centric Systems and Applications, pp. 459–526. Springer, Heidelberg (2011)
Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. In: Cs224n Project Report, Stanford, vol. 1, p. 12 (2009)
Gu, B., Sheng, V.S.: A Robust regularization path algorithm for ν-support vector classification. IEEE Trans. Neural Netw. Learn. Syst. (2016). doi:10.1109/TNNLS.2016.2527796
Bin, Gu, Sheng, V.S., Wang, Z., Ho, D., Osman, S., Li, S.: Incremental learning for ν-support vector regression. Neural Netw. 67, 140–150 (2015)
Wen, X., Shao, L., Xue, Y., Fang, W.: A rapid learning algorithm for vehicle classification. Inf. Sci. 295(1), 395–406 (2015)
Ravi, K., Ravi, V.: A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl. Based Syst. 89.C, 14–46 (2015)
Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: International Conference on Language Resources and Evaluation, LREC 2010, 17–23 May 2010, Valletta, Malta, pp. 83–90 (2010)
Mikolov, T, et al.: Recurrent neural network based language model. In: INTERSPEECH 2010, Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 2010, pp. 1045–1048
Cheng, J., et al.: Exploring sentiment parsing of microblogging texts for opinion polling on chinese public figures. Appl. Intell. 45(2), 1–14 (2016)
Sundermeyer, M., Schlüter, R., Ney, H.: LSTM neural networks for language modeling. In: Interspeech, pp. 601–608 (2012)
Wang, X., et al.: Predicting polarities of tweets by composing word embeddings with long short-term memory. In: Meeting of the Association for Computational Linguistics and the, International Joint Conference on Natural Language Processing (2015)
Anupama, R., Rajeswar, S., Chaudhury, S.: A hypothesize-and-verify framework for text recognition using deep recurrent neural networks. In: International Conference on Document Analysis and Recognition IEEE Computer Society, pp. 936–940 (2015)
Saif, H., et al.: Evaluation datasets for twitter sentiment analysis. a survey and a new dataset, the STS-Gold. In: Workshop: Emotion and Sentiment in Social and Expressive Media: Approaches and Perspectives From Ai (2013)
Acknowledgements
This work is supported by the NSFC (61272421, 41271410), the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Yan, L., Tao, H. (2016). An Empirical Study and Comparison for Tweet Sentiment Analysis. In: Sun, X., Liu, A., Chao, HC., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2016. Lecture Notes in Computer Science(), vol 10040. Springer, Cham. https://doi.org/10.1007/978-3-319-48674-1_55
Download citation
DOI: https://doi.org/10.1007/978-3-319-48674-1_55
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48673-4
Online ISBN: 978-3-319-48674-1
eBook Packages: Computer ScienceComputer Science (R0)