Skip to main content

A Deep Learning Emotion Classification Framework for Low Resource Languages

  • Conference paper
  • First Online:
Big Data and Artificial Intelligence (BDA 2023)

Abstract

Emotion classification from text is the process of identifying and classifying emotions expressed in textual data. Emotions can be feelings such as anger, joy, suspense, sadness and neutral. Developing a machine learning model to identify emotions in a low-resourced language with a limited set of linguistic resources and annotated corpora is a challenge. This research proposes a Deep Learning Emotion Classification Framework to identify and classify emotions in low-resourced languages such as Hindi. The proposed framework combines a classification model and a low resource optimization technique in a novel way. An annotated corpus of Hindi short stories consisting of 20,304 sentences is used to train the models for predicting five categories of emotions: anger, joy, suspense, sadness, and neutral talk. To resolve the class imbalance in the dataset SMOTE technique is applied. The optimal classification model is selected through experimentation that compares machine learning models and pre-trained models. Machine learning and deep learning models are SVM, Logistic Regression, Random Forest, CNN, BiLSTM, and CNN+BiLSTM. The pre-trained models, mBERT, IndicBERT, and a hybrid model, mBERT+BiLSTM. The models are evaluated based on macro average recall, macro average precision, and macro average F1 score. Results demonstrate that the hybrid model mBERT+BiLSTM out perform other models with a test accuracy of 57%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://zenodo.org/record/3457467.

References

  1. Acheampong, F.A., Wenyu, C., Nunoo-Mensah, H.: Text-based emotion detection: advances, challenges, and opportunities. Eng. Rep. 2(7), e12189 (2020)

    Google Scholar 

  2. Das, A., Sharif, O., Hoque, M.M., Sarker, I.H.: Emotion classification in a resource constrained language using transformer-based approach (2021). arXiv preprint arXiv:2104.08613

  3. Alam, T., Khan, A., Alam, F.: Bangla text classification using transformers (2020). arXiv preprint arXiv:2011.04446

  4. Bharti, S.K., et al.: Text-based emotion recognition using deep learning approach. Comput. Intell. Neurosci. (2022)

    Google Scholar 

  5. Midhan, T.M., Selvaraj, P., Raju, M.H.K., Reddy, M.B.P., Bhaskar, T.: Classification of mental health and emotion of human from text using machine learning approaches. In: 2023 6th International Conference on Information Systems and Computer Networks (ISCON), pp. 1–7. IEEE, March 2023

    Google Scholar 

  6. Xu, D., Tian, Z., Lai, R., Kong, X., Tan, Z., Shi, W.: Deep learning based emotion analysis of microblog texts. Inf. Fusion 64, 1–11 (2020)

    Article  Google Scholar 

  7. Kannan, E., Kothamasu, L.A.: Fine-tuning BERT based approach for multi-class sentiment analysis on twitter emotion data. Ingénierie des Systémes d’Information 27(1) (2022)

    Google Scholar 

  8. Sonu, S., Haque, R., Hasanuzzaman, M., Stynes, P., Pathak, P.: Identifying emotions in code mixed Hindi-English tweets. In: Proceedings of the WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference, pp. 35–41. European Language Resources Association, Marseille, France (2022)

    Google Scholar 

  9. Li, A., Yi, S.: Emotion analysis model of microblog comment text based on CNN-BiLSTM. Comput. Intell. Neurosci. (2022)

    Google Scholar 

  10. Gou, Z., Li, Y.: Integrating BERT embeddings and BiLSTM for emotion analysis of dialogue. Comput. Intell. Neurosci. (2023)

    Google Scholar 

  11. Li, X., Lei, Y., Ji, S.: BERT-and BiLSTM-based sentiment analysis of online Chinese buzzwords. Future Internet 14(11), 332 (2022)

    Article  Google Scholar 

  12. Ozturk, O., Ozcan, A.: Sentiment analysis in turkish using transformer-based deep learning models. In: Hemanth, D.J., Yigit, T., Kose, U., Guvenc, U. (eds.) The International Conference on Artificial Intelligence and Applied Mathematics in Engineering, vol. 7, pp. 1–15. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-31956-3_1

  13. Ranathunga, S., Liyanage, I.U.: Sentiment analysis of Sinhala news comments. Trans. Asian Low-Resource Lang. Inf. Process. 20(4), 1–23 (2021)

    Article  Google Scholar 

  14. Ucan, A., Dörterler, M., Akçapinar Sezer, E.: A study of Turkish emotion classification with pretrained language models. J. Inf. Sci. 48(6), 857–865 (2022)

    Google Scholar 

  15. Kumar, Y., Mahata, D., Aggarwal, S., Chugh, A., Maheshwari, R., Shah, R.R.: BHAAV-A text corpus for emotion analysis from Hindi stories (2019). arXiv preprint arXiv:1910.04073

  16. Kannan, R.R., Rajalakshmi, R., Kumar, L.: IndicBERT based approach for sentiment analysis on code-mixed Tamil tweets (2021)

    Google Scholar 

  17. Fischer, M., Haque, R., Stynes, P., Pathak, P.: Identifying fake news in Brazilian Portuguese. In: Rosso, P., Basile, V., Martínez, R., Métais, E., Meziane, F. (eds.) International Conference on Applications of Natural Language to Information Systems, vol. 13286, pp. 111–118. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-08473-7_10

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Manisha , William Clifford , Eugene McLaughlin or Paul Stynes .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Manisha, Clifford, W., McLaughlin, E., Stynes, P. (2023). A Deep Learning Emotion Classification Framework for Low Resource Languages. In: Goyal, V., Kumar, N., Bhowmick, S.S., Goyal, P., Goyal, N., Kumar, D. (eds) Big Data and Artificial Intelligence. BDA 2023. Lecture Notes in Computer Science, vol 14418. Springer, Cham. https://doi.org/10.1007/978-3-031-49601-1_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-49601-1_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-49600-4

  • Online ISBN: 978-3-031-49601-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics