Abstract
As social networks are rapidly growing, the content created in them is also growing. Mining the emotional tendency of comments on this content through opinion classification technologies is very useful for the timely understanding of public opinion on social media, monitoring of brands, and customer support. Deep learning methods have shown good results in opinion classification. In this paper, we analyze the opinion classification in Uzbek movie reviews taken from YouTube using various pre-trained word embedding models and a classification model based on long short-term memory. Users often use emojis along with text to express their opinions and feelings. Therefore, we also investigated the importance of emojis in opinion classification of Uzbek texts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Xu, G., Meng, Y., Qiu, X., Yu, Z., Wu, X.: Sentiment analysis of comment texts based on BiLSTM. IEEE Access 7, 51522–51532 (2019)
Koumpouri, A., Mporas, I., Megalooikonomou, V.: Evaluation of four approaches for “Sentiment analysis on movie reviews”: the Kaggle competition. In: Proceedings of the 16th International Conference on Engineering Applications of Neural Networks (INNS), pp. 1–5. ACM, New York (2015)
Yang, L., Li, Y., Wang, J., Sherratt, R.: Sentiment analysis for E-commerce product reviews in Chinese based on sentiment lexicon and deep learning. IEEE Access 8, 23522–23530 (2020)
Simaki, V., Paradis, C., Skeppstedt, M., Sahlgren, M., Kucher, K., Kerren, A.: Annotating speaker stance in discourse: the Brexit blog corpus. Corpus Linguist. Linguist. Theory 1 (2017)
Ranathunga, S., Liyanage, I.: Sentiment analysis of Sinhala news comments. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 20, 1–23 (2021)
Pecore S., Villaneau J.: Complex and precise movie and book annotations in french language for aspect based sentiment analysis. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Japan, pp. 2647–2652. ELRA (2018)
Novak, P.K., Smailović, J., Sluban, B., Mozetič, I.: Sentiment of emojis. PLOS ONE 10(12), e0144296 (2015)
Rabbimov, I., Mporas, I., Simaki, V., Kobilov, S.: Investigating the effect of emoji in opinion classification of Uzbek movie review comments. In: Karpov, A., Potapova, R. (eds.) SPECOM 2020. LNCS (LNAI), vol. 12335, pp. 435–445. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60276-5_42
Felbo, B., Mislove, A., Søgaard, A., Rahwan, I., Lehmann, S.: Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Denmark, pp. 1615–1625. Association for Computational Linguistics (2017)
Al-Azani, S., El-Alfy, E.: Combining emojis with Arabic textual features for sentiment classification. In 2018 9th International Conference on Information and Communication Systems (ICICS), Jordan, pp. 139–144. IEEE (2018)
Tang, D., Qin, B., Liu, T.: Deep learning for sentiment analysis: successful approaches and future challenges. WIREs Data Min. Knowl. Discov. 5, 292–303 (2015)
Mikolov T., Sutskever I., Chen K., Corrado G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 3111–3119 (2013)
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Qatar, pp. 1532–1543. Association for Computational Linguistics (2014)
Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. Trans. Assoc. Computat. Linguist. 5, 135–146 (2017)
Senarath, Y., Thayasivam, U.: DataSEARCH at IEST 2018: multiple word embedding based models for implicit emotion classification of tweets with deep learning. In: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Belgium, pp. 211–216. Association for Computational Linguistics (2018)
Huang, S., Zhao, Q., Xu, X., Zhang, B., Wang, D.: Emojis-based recurrent neural network for Chinese microblogs sentiment analysis. In: 2019 IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI), China, pp. 59–64. IEEE (2019)
Subramanian, J., Sridharan, V., Shu, K., Liu, H.: Exploiting Emojis for sarcasm detection. In: Thomson, R., Bisgin, H., Dancy, C., Hyder, A. (eds.) SBP-BRiMS 2019. LNCS, vol. 11549, pp. 70–80. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-21741-9_8
Barbieri, F., Ronzano, F., Saggion, H.: What does this emoji mean? a vector space skip-gram model for twitter emojis. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Slovenia, pp. 3967–3972. ELRA (2016)
Eisner, B., Rocktäschel, T., Augenstein, I., Bosnjak, M., Riedel, S.: emoji2vec: learning emoji representations from their description. In: Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media, USA, pp. 48–54. Association for Computational Linguistics (2016)
Velioğlu, R., Yıldız, T., Yıldırım, S.: Sentiment analysis using learning approaches over emojis for Turkish tweets. In: 2018 3rd International Conference on Computer Science and Engineering (UBMK), Bosnia and Herzegovina, pp. 303–307. IEEE (2018)
Sakenovich, N.S., Zharmagambetov, A.S.: On one approach of solving sentiment analysis task for Kazakh and Russian languages using deep learning. In: Nguyen, N.-T., Manolopoulos, Y., Iliadis, L., Trawiński, B. (eds.) ICCCI 2016. LNCS (LNAI), vol. 9876, pp. 537–545. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45246-3_51
Singh, A., Blanco, E., Jin, W.: Incorporating emoji descriptions improves tweet classification. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minnesota, pp. 2096–2101. Association for Computational Linguistics (2019)
Shamal, A.J., Pemathilake, R.G.H., Karunathilake, S.P., Ganegoda, G.U.: Sentiment analysis using Token2Vec and LSTMs: user review analyzing module. In: 2018 18th International Conference on Advances in ICT for Emerging Regions (ICTer), Sri Lanka, pp. 48–53. IEEE (2019).
Zhu, P., Yang, Y., Liu, Y.: Sentiment analysis based on hybrid bi-attention mechanism in mobile application. In: Aiello, M., Yang, Y., Zou, Y., Zhang, L.-J. (eds.) AIMS 2018. LNCS, vol. 10970, pp. 157–171. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-94361-9_12
Rabbimov, I., Kobilov, S.: Multi-class text classification of Uzbek News articles using machine learning. J. Phys. Conf. Ser. 1546, 012097 (2020)
Rabbimov, I., Kobilov, S., Mporas, I.: Uzbek news categorization using word embeddings and convolutional neural networks. In: 2020 IEEE 14th International Conference on Application of Information and Communication Technologies (AICT), Uzbekistan, pp. 1–5. IEEE (2020)
Mukhamedieva, D.T., Zhuraev, Z., Bakaev, I.I.: Approaches to the thematic classification of literary works. Probl. Comput. Appl. Math. 4(22), 111–117 (2019)
Mukhamedieva, D.K., Jurayev, Z.: Classification of content of works. Probl. Comput. Appl. Math. 2(26), 108–117 (2020)
Babomuradov, O.J., Mamatov, N.S., Boboyev, L.B., Otaxonova, B.I.: Classification of texts using decision trees algorithms. Descendants of Muhammad al-Khwarizmi 4(10) (2019)
Babomuradov, O.J., Boboev, L.B., Otaxonova, B.I.: A comparison of naïve bayes models for text classification. Probl. Comput. Appl. Math. 1(19), 39–43 (2019)
Tuliev, U.: Cluster analyse of text documents according to their relation of connectedness. Probl. Comput. Appl. Math. 6(24), 102–109 (2019)
Kuriyozov, E., Matlatipov, S.: Building a new sentiment analysis dataset for Uzbek language and creating baseline models. Multidiscip. Dig. Publ. Inst. Proc. 21(1), 37 (2019)
Kuriyozov, E., Matlatipov, S., Alonso, M., Gómez-Rodríguez, C.: Deep learning vs. classic models on a new Uzbek sentiment analysis dataset. In: Proceedings of the Human Language Technologies as a Challenge for Computer Science and Linguistics, pp. 258–262 (2019)
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9, 1735–1780 (1997)
Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov T.: Learning word vectors for 157 languages. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Japan. ELRA (2018)
Kuriyozov, E., Doval, Y., Gómez-Rodríguez, C.: Cross-lingual word embeddings for turkic languages. In: Proceedings of the 12th Language Resources and Evaluation Conference, France, pp. 4054–4062. ELRA (2020)
Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous systems (2015). https://www.tensorflow.org. Accessed 21 May 2021
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Rabbimov, I., Kobilov, S., Mporas, I. (2021). Opinion Classification via Word and Emoji Embedding Models with LSTM. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2021. Lecture Notes in Computer Science(), vol 12997. Springer, Cham. https://doi.org/10.1007/978-3-030-87802-3_53
Download citation
DOI: https://doi.org/10.1007/978-3-030-87802-3_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87801-6
Online ISBN: 978-3-030-87802-3
eBook Packages: Computer ScienceComputer Science (R0)