A Deep Learning Emotion Classification Framework for Low Resource Languages

Manisha; Clifford, William; McLaughlin, Eugene; Stynes, Paul

doi:10.1007/978-3-031-49601-1_8

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14418))

Included in the following conference series:

International Conference on Big Data Analytics

622 Accesses

Abstract

Emotion classification from text is the process of identifying and classifying emotions expressed in textual data. Emotions can be feelings such as anger, joy, suspense, sadness and neutral. Developing a machine learning model to identify emotions in a low-resourced language with a limited set of linguistic resources and annotated corpora is a challenge. This research proposes a Deep Learning Emotion Classification Framework to identify and classify emotions in low-resourced languages such as Hindi. The proposed framework combines a classification model and a low resource optimization technique in a novel way. An annotated corpus of Hindi short stories consisting of 20,304 sentences is used to train the models for predicting five categories of emotions: anger, joy, suspense, sadness, and neutral talk. To resolve the class imbalance in the dataset SMOTE technique is applied. The optimal classification model is selected through experimentation that compares machine learning models and pre-trained models. Machine learning and deep learning models are SVM, Logistic Regression, Random Forest, CNN, BiLSTM, and CNN+BiLSTM. The pre-trained models, mBERT, IndicBERT, and a hybrid model, mBERT+BiLSTM. The models are evaluated based on macro average recall, macro average precision, and macro average F1 score. Results demonstrate that the hybrid model mBERT+BiLSTM out perform other models with a test accuracy of 57%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 69.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://zenodo.org/record/3457467.

References

Acheampong, F.A., Wenyu, C., Nunoo-Mensah, H.: Text-based emotion detection: advances, challenges, and opportunities. Eng. Rep. 2(7), e12189 (2020)
Google Scholar
Das, A., Sharif, O., Hoque, M.M., Sarker, I.H.: Emotion classification in a resource constrained language using transformer-based approach (2021). arXiv preprint arXiv:2104.08613
Alam, T., Khan, A., Alam, F.: Bangla text classification using transformers (2020). arXiv preprint arXiv:2011.04446
Bharti, S.K., et al.: Text-based emotion recognition using deep learning approach. Comput. Intell. Neurosci. (2022)
Google Scholar
Midhan, T.M., Selvaraj, P., Raju, M.H.K., Reddy, M.B.P., Bhaskar, T.: Classification of mental health and emotion of human from text using machine learning approaches. In: 2023 6th International Conference on Information Systems and Computer Networks (ISCON), pp. 1–7. IEEE, March 2023
Google Scholar
Xu, D., Tian, Z., Lai, R., Kong, X., Tan, Z., Shi, W.: Deep learning based emotion analysis of microblog texts. Inf. Fusion 64, 1–11 (2020)
Article Google Scholar
Kannan, E., Kothamasu, L.A.: Fine-tuning BERT based approach for multi-class sentiment analysis on twitter emotion data. Ingénierie des Systémes d’Information 27(1) (2022)
Google Scholar
Sonu, S., Haque, R., Hasanuzzaman, M., Stynes, P., Pathak, P.: Identifying emotions in code mixed Hindi-English tweets. In: Proceedings of the WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference, pp. 35–41. European Language Resources Association, Marseille, France (2022)
Google Scholar
Li, A., Yi, S.: Emotion analysis model of microblog comment text based on CNN-BiLSTM. Comput. Intell. Neurosci. (2022)
Google Scholar
Gou, Z., Li, Y.: Integrating BERT embeddings and BiLSTM for emotion analysis of dialogue. Comput. Intell. Neurosci. (2023)
Google Scholar
Li, X., Lei, Y., Ji, S.: BERT-and BiLSTM-based sentiment analysis of online Chinese buzzwords. Future Internet 14(11), 332 (2022)
Article Google Scholar
Ozturk, O., Ozcan, A.: Sentiment analysis in turkish using transformer-based deep learning models. In: Hemanth, D.J., Yigit, T., Kose, U., Guvenc, U. (eds.) The International Conference on Artificial Intelligence and Applied Mathematics in Engineering, vol. 7, pp. 1–15. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-31956-3_1
Ranathunga, S., Liyanage, I.U.: Sentiment analysis of Sinhala news comments. Trans. Asian Low-Resource Lang. Inf. Process. 20(4), 1–23 (2021)
Article Google Scholar
Ucan, A., Dörterler, M., Akçapinar Sezer, E.: A study of Turkish emotion classification with pretrained language models. J. Inf. Sci. 48(6), 857–865 (2022)
Google Scholar
Kumar, Y., Mahata, D., Aggarwal, S., Chugh, A., Maheshwari, R., Shah, R.R.: BHAAV-A text corpus for emotion analysis from Hindi stories (2019). arXiv preprint arXiv:1910.04073
Kannan, R.R., Rajalakshmi, R., Kumar, L.: IndicBERT based approach for sentiment analysis on code-mixed Tamil tweets (2021)
Google Scholar
Fischer, M., Haque, R., Stynes, P., Pathak, P.: Identifying fake news in Brazilian Portuguese. In: Rosso, P., Basile, V., Martínez, R., Métais, E., Meziane, F. (eds.) International Conference on Applications of Natural Language to Information Systems, vol. 13286, pp. 111–118. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-08473-7_10

Download references

Author information

Authors and Affiliations

National College of Ireland, Dublin, Ireland
Manisha, William Clifford, Eugene McLaughlin & Paul Stynes

Authors

Manisha
View author publications
You can also search for this author in PubMed Google Scholar
William Clifford
View author publications
You can also search for this author in PubMed Google Scholar
Eugene McLaughlin
View author publications
You can also search for this author in PubMed Google Scholar
Paul Stynes
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Manisha , William Clifford , Eugene McLaughlin or Paul Stynes .

Editor information

Editors and Affiliations

Indraprastha Institute of Information Technology, Delhi, India
Vikram Goyal
University of Delhi, Delhi, India
Naveen Kumar
Nanyang Technological University, Singapore, Singapore
Sourav S. Bhowmick
Indian Institute of Technology, Kharagpur, India
Pawan Goyal
Birla Institute of Technology and Science, Pilani, India
Navneet Goyal
Indraprastha Institute of Information Technology, Delhi, India
Dhruv Kumar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Manisha, Clifford, W., McLaughlin, E., Stynes, P. (2023). A Deep Learning Emotion Classification Framework for Low Resource Languages. In: Goyal, V., Kumar, N., Bhowmick, S.S., Goyal, P., Goyal, N., Kumar, D. (eds) Big Data and Artificial Intelligence. BDA 2023. Lecture Notes in Computer Science, vol 14418. Springer, Cham. https://doi.org/10.1007/978-3-031-49601-1_8

Download citation

DOI: https://doi.org/10.1007/978-3-031-49601-1_8
Published: 04 December 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-49600-4
Online ISBN: 978-3-031-49601-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Deep Learning Emotion Classification Framework for Low Resource Languages