Skip to main content

Opinion Classification via Word and Emoji Embedding Models with LSTM

  • Conference paper
  • First Online:
Speech and Computer (SPECOM 2021)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12997))

Included in the following conference series:

Abstract

As social networks are rapidly growing, the content created in them is also growing. Mining the emotional tendency of comments on this content through opinion classification technologies is very useful for the timely understanding of public opinion on social media, monitoring of brands, and customer support. Deep learning methods have shown good results in opinion classification. In this paper, we analyze the opinion classification in Uzbek movie reviews taken from YouTube using various pre-trained word embedding models and a classification model based on long short-term memory. Users often use emojis along with text to express their opinions and feelings. Therefore, we also investigated the importance of emojis in opinion classification of Uzbek texts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Xu, G., Meng, Y., Qiu, X., Yu, Z., Wu, X.: Sentiment analysis of comment texts based on BiLSTM. IEEE Access 7, 51522–51532 (2019)

    Article  Google Scholar 

  2. Koumpouri, A., Mporas, I., Megalooikonomou, V.: Evaluation of four approaches for “Sentiment analysis on movie reviews”: the Kaggle competition. In: Proceedings of the 16th International Conference on Engineering Applications of Neural Networks (INNS), pp. 1–5. ACM, New York (2015)

    Google Scholar 

  3. Yang, L., Li, Y., Wang, J., Sherratt, R.: Sentiment analysis for E-commerce product reviews in Chinese based on sentiment lexicon and deep learning. IEEE Access 8, 23522–23530 (2020)

    Article  Google Scholar 

  4. Simaki, V., Paradis, C., Skeppstedt, M., Sahlgren, M., Kucher, K., Kerren, A.: Annotating speaker stance in discourse: the Brexit blog corpus. Corpus Linguist. Linguist. Theory 1 (2017)

    Google Scholar 

  5. Ranathunga, S., Liyanage, I.: Sentiment analysis of Sinhala news comments. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 20, 1–23 (2021)

    Article  Google Scholar 

  6. Pecore S., Villaneau J.: Complex and precise movie and book annotations in french language for aspect based sentiment analysis. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Japan, pp. 2647–2652. ELRA (2018)

    Google Scholar 

  7. Novak, P.K., Smailović, J., Sluban, B., Mozetič, I.: Sentiment of emojis. PLOS ONE 10(12), e0144296 (2015)

    Article  Google Scholar 

  8. Rabbimov, I., Mporas, I., Simaki, V., Kobilov, S.: Investigating the effect of emoji in opinion classification of Uzbek movie review comments. In: Karpov, A., Potapova, R. (eds.) SPECOM 2020. LNCS (LNAI), vol. 12335, pp. 435–445. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60276-5_42

    Chapter  Google Scholar 

  9. Felbo, B., Mislove, A., Søgaard, A., Rahwan, I., Lehmann, S.: Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Denmark, pp. 1615–1625. Association for Computational Linguistics (2017)

    Google Scholar 

  10. Al-Azani, S., El-Alfy, E.: Combining emojis with Arabic textual features for sentiment classification. In 2018 9th International Conference on Information and Communication Systems (ICICS), Jordan, pp. 139–144. IEEE (2018)

    Google Scholar 

  11. Tang, D., Qin, B., Liu, T.: Deep learning for sentiment analysis: successful approaches and future challenges. WIREs Data Min. Knowl. Discov. 5, 292–303 (2015)

    Article  Google Scholar 

  12. Mikolov T., Sutskever I., Chen K., Corrado G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 3111–3119 (2013)

    Google Scholar 

  13. Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Qatar, pp. 1532–1543. Association for Computational Linguistics (2014)

    Google Scholar 

  14. Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. Trans. Assoc. Computat. Linguist. 5, 135–146 (2017)

    Article  Google Scholar 

  15. Senarath, Y., Thayasivam, U.: DataSEARCH at IEST 2018: multiple word embedding based models for implicit emotion classification of tweets with deep learning. In: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Belgium, pp. 211–216. Association for Computational Linguistics (2018)

    Google Scholar 

  16. Huang, S., Zhao, Q., Xu, X., Zhang, B., Wang, D.: Emojis-based recurrent neural network for Chinese microblogs sentiment analysis. In: 2019 IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI), China, pp. 59–64. IEEE (2019)

    Google Scholar 

  17. Subramanian, J., Sridharan, V., Shu, K., Liu, H.: Exploiting Emojis for sarcasm detection. In: Thomson, R., Bisgin, H., Dancy, C., Hyder, A. (eds.) SBP-BRiMS 2019. LNCS, vol. 11549, pp. 70–80. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-21741-9_8

    Chapter  Google Scholar 

  18. Barbieri, F., Ronzano, F., Saggion, H.: What does this emoji mean? a vector space skip-gram model for twitter emojis. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Slovenia, pp. 3967–3972. ELRA (2016)

    Google Scholar 

  19. Eisner, B., Rocktäschel, T., Augenstein, I., Bosnjak, M., Riedel, S.: emoji2vec: learning emoji representations from their description. In: Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media, USA, pp. 48–54. Association for Computational Linguistics (2016)

    Google Scholar 

  20. Velioğlu, R., Yıldız, T., Yıldırım, S.: Sentiment analysis using learning approaches over emojis for Turkish tweets. In: 2018 3rd International Conference on Computer Science and Engineering (UBMK), Bosnia and Herzegovina, pp. 303–307. IEEE (2018)

    Google Scholar 

  21. Sakenovich, N.S., Zharmagambetov, A.S.: On one approach of solving sentiment analysis task for Kazakh and Russian languages using deep learning. In: Nguyen, N.-T., Manolopoulos, Y., Iliadis, L., Trawiński, B. (eds.) ICCCI 2016. LNCS (LNAI), vol. 9876, pp. 537–545. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45246-3_51

    Chapter  Google Scholar 

  22. Singh, A., Blanco, E., Jin, W.: Incorporating emoji descriptions improves tweet classification. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minnesota, pp. 2096–2101. Association for Computational Linguistics (2019)

    Google Scholar 

  23. Shamal, A.J., Pemathilake, R.G.H., Karunathilake, S.P., Ganegoda, G.U.: Sentiment analysis using Token2Vec and LSTMs: user review analyzing module. In: 2018 18th International Conference on Advances in ICT for Emerging Regions (ICTer), Sri Lanka, pp. 48–53. IEEE (2019).

    Google Scholar 

  24. Zhu, P., Yang, Y., Liu, Y.: Sentiment analysis based on hybrid bi-attention mechanism in mobile application. In: Aiello, M., Yang, Y., Zou, Y., Zhang, L.-J. (eds.) AIMS 2018. LNCS, vol. 10970, pp. 157–171. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-94361-9_12

    Chapter  Google Scholar 

  25. Rabbimov, I., Kobilov, S.: Multi-class text classification of Uzbek News articles using machine learning. J. Phys. Conf. Ser. 1546, 012097 (2020)

    Article  Google Scholar 

  26. Rabbimov, I., Kobilov, S., Mporas, I.: Uzbek news categorization using word embeddings and convolutional neural networks. In: 2020 IEEE 14th International Conference on Application of Information and Communication Technologies (AICT), Uzbekistan, pp. 1–5. IEEE (2020)

    Google Scholar 

  27. Mukhamedieva, D.T., Zhuraev, Z., Bakaev, I.I.: Approaches to the thematic classification of literary works. Probl. Comput. Appl. Math. 4(22), 111–117 (2019)

    Google Scholar 

  28. Mukhamedieva, D.K., Jurayev, Z.: Classification of content of works. Probl. Comput. Appl. Math. 2(26), 108–117 (2020)

    Google Scholar 

  29. Babomuradov, O.J., Mamatov, N.S., Boboyev, L.B., Otaxonova, B.I.: Classification of texts using decision trees algorithms. Descendants of Muhammad al-Khwarizmi 4(10) (2019)

    Google Scholar 

  30. Babomuradov, O.J., Boboev, L.B., Otaxonova, B.I.: A comparison of naïve bayes models for text classification. Probl. Comput. Appl. Math. 1(19), 39–43 (2019)

    Google Scholar 

  31. Tuliev, U.: Cluster analyse of text documents according to their relation of connectedness. Probl. Comput. Appl. Math. 6(24), 102–109 (2019)

    Google Scholar 

  32. Kuriyozov, E., Matlatipov, S.: Building a new sentiment analysis dataset for Uzbek language and creating baseline models. Multidiscip. Dig. Publ. Inst. Proc. 21(1), 37 (2019)

    Google Scholar 

  33. Kuriyozov, E., Matlatipov, S., Alonso, M., Gómez-Rodríguez, C.: Deep learning vs. classic models on a new Uzbek sentiment analysis dataset. In: Proceedings of the Human Language Technologies as a Challenge for Computer Science and Linguistics, pp. 258–262 (2019)

    Google Scholar 

  34. Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)

    MathSciNet  MATH  Google Scholar 

  35. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9, 1735–1780 (1997)

    Article  Google Scholar 

  36. Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov T.: Learning word vectors for 157 languages. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Japan. ELRA (2018)

    Google Scholar 

  37. Kuriyozov, E., Doval, Y., Gómez-Rodríguez, C.: Cross-lingual word embeddings for turkic languages. In: Proceedings of the 12th Language Resources and Evaluation Conference, France, pp. 4054–4062. ELRA (2020)

    Google Scholar 

  38. Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous systems (2015). https://www.tensorflow.org. Accessed 21 May 2021

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ilyos Rabbimov .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rabbimov, I., Kobilov, S., Mporas, I. (2021). Opinion Classification via Word and Emoji Embedding Models with LSTM. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2021. Lecture Notes in Computer Science(), vol 12997. Springer, Cham. https://doi.org/10.1007/978-3-030-87802-3_53

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-87802-3_53

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-87801-6

  • Online ISBN: 978-3-030-87802-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics