Skip to main content

An Empirical Study and Comparison for Tweet Sentiment Analysis

  • Conference paper
  • First Online:
Cloud Computing and Security (ICCCS 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10040))

Included in the following conference series:

Abstract

Tweet sentiment analysis has been an effective and valuable technique in the sentiment analysis domain. We conduct a systematic and thorough empirical study on traditional machine learning algorithms and two deep learning approaches for tweet sentiment analysis, and expect to provide a guideline for choosing which efficient classification algorithms. Based on our experiments, we found that the Support Vector Machine and the Random Forest work better statistically than other methods. Although deep learning approaches have achieved many successes in image and voice processing, simple RNN and LSTM networks do not outweigh SVM and RF in our experiments. Moreover, for the tweet feature selection, the combination of bi-grams, SentiWordNet and Stop words removal shows more effectiveness in accuracy improving.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Liu, B.: Opinion mining and sentiment analysis. In: Liu, B. (ed.) Web data mining: exploring hyperlinks, contents, and usage data. Data-Centric Systems and Applications, pp. 459–526. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  2. Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. In: Cs224n Project Report, Stanford, vol. 1, p. 12 (2009)

    Google Scholar 

  3. Gu, B., Sheng, V.S.: A Robust regularization path algorithm for ν-support vector classification. IEEE Trans. Neural Netw. Learn. Syst. (2016). doi:10.1109/TNNLS.2016.2527796

    Google Scholar 

  4. Bin, Gu, Sheng, V.S., Wang, Z., Ho, D., Osman, S., Li, S.: Incremental learning for ν-support vector regression. Neural Netw. 67, 140–150 (2015)

    Article  Google Scholar 

  5. Wen, X., Shao, L., Xue, Y., Fang, W.: A rapid learning algorithm for vehicle classification. Inf. Sci. 295(1), 395–406 (2015)

    Article  Google Scholar 

  6. Ravi, K., Ravi, V.: A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl. Based Syst. 89.C, 14–46 (2015)

    Article  Google Scholar 

  7. Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: International Conference on Language Resources and Evaluation, LREC 2010, 17–23 May 2010, Valletta, Malta, pp. 83–90 (2010)

    Google Scholar 

  8. Mikolov, T, et al.: Recurrent neural network based language model. In: INTERSPEECH 2010, Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 2010, pp. 1045–1048

    Google Scholar 

  9. Cheng, J., et al.: Exploring sentiment parsing of microblogging texts for opinion polling on chinese public figures. Appl. Intell. 45(2), 1–14 (2016)

    Article  Google Scholar 

  10. Sundermeyer, M., Schlüter, R., Ney, H.: LSTM neural networks for language modeling. In: Interspeech, pp. 601–608 (2012)

    Google Scholar 

  11. Wang, X., et al.: Predicting polarities of tweets by composing word embeddings with long short-term memory. In: Meeting of the Association for Computational Linguistics and the, International Joint Conference on Natural Language Processing (2015)

    Google Scholar 

  12. Anupama, R., Rajeswar, S., Chaudhury, S.: A hypothesize-and-verify framework for text recognition using deep recurrent neural networks. In: International Conference on Document Analysis and Recognition IEEE Computer Society, pp. 936–940 (2015)

    Google Scholar 

  13. Saif, H., et al.: Evaluation datasets for twitter sentiment analysis. a survey and a new dataset, the STS-Gold. In: Workshop: Emotion and Sentiment in Social and Expressive Media: Approaches and Perspectives From Ai (2013)

    Google Scholar 

Download references

Acknowledgements

This work is supported by the NSFC (61272421, 41271410), the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Leiming Yan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Yan, L., Tao, H. (2016). An Empirical Study and Comparison for Tweet Sentiment Analysis. In: Sun, X., Liu, A., Chao, HC., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2016. Lecture Notes in Computer Science(), vol 10040. Springer, Cham. https://doi.org/10.1007/978-3-319-48674-1_55

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-48674-1_55

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-48673-4

  • Online ISBN: 978-3-319-48674-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics