skip to main content
10.1145/3589883.3589907acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmltConference Proceedingsconference-collections
research-article

Multi-Supervised LSTM for Bengali Text Sentiment Analysis

Authors Info & Claims
Published:27 June 2023Publication History

ABSTRACT

Bangla text sentiment analysis is one of the challenges of recent years. Various machine learning and deep learning approaches have been made to further improve the classification models’ performance to achieve state-of-the art results. Since text sentiment analysis is a sequential task, Long Short-term memory have provided great results in these cases. In this paper, we propose a methodology to tweak an existing LSTM model to gain further improvement in results. As Bengali Text Sentiment analysis is a sequence-to-one task, we discuss how using the features of the entire sequence generated from LSTM along with the last hidden state’s output can improve the results. We compare with a single LSTM layer where the last hidden state is used for producing the final result with our methodology where also a single LSTM layer is used but with additional feature engineering and multi-supervision is applied on the generated sequence and final hidden state to produce the results. Our final results show that adding these tweaks to an existing LSTM layer can increase the overall performance of the model.

References

  1. Abien Fred Agarap. 2018. Deep learning using rectified linear units (relu). arXiv preprint arXiv:1803.08375 (2018).Google ScholarGoogle Scholar
  2. Corinna Cortes, Mehryar Mohri, and Afshin Rostamizadeh. 2012. L2 regularization for learning kernels. arXiv preprint arXiv:1205.2653 (2012).Google ScholarGoogle Scholar
  3. Zhiyong Cui, Ruimin Ke, Ziyuan Pu, and Yinhai Wang. 2018. Deep bidirectional and unidirectional LSTM recurrent neural network for network-wide traffic speed prediction. arXiv preprint arXiv:1801.02143 (2018).Google ScholarGoogle Scholar
  4. Hrayr Harutyunyan, Hrant Khachatrian, David C Kale, Greg Ver Steeg, and Aram Galstyan. 2019. Multitask learning and benchmarking with clinical time series data. Scientific data 6, 1 (2019), 96.Google ScholarGoogle Scholar
  5. Khan Md Hasib, Nurul Akter Towhid, and Md Golam Rabiul Alam. 2021. Online review based sentiment classification on bangladesh airline service using supervised learning. In 2021 5th International Conference on Electrical Engineering and Information Communication Technology (ICEEICT). IEEE, 1–6.Google ScholarGoogle ScholarCross RefCross Ref
  6. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735–1780.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Naimul Hossain, Md Rafiuzzaman Bhuiyan, Zerin Nasrin Tumpa, and Syed Akhter Hossain. 2020. Sentiment analysis of restaurant reviews using combined CNN-LSTM. In 2020 11th International conference on computing, communication and networking technologies (ICCCNT). IEEE, 1–5.Google ScholarGoogle ScholarCross RefCross Ref
  8. Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7132–7141.Google ScholarGoogle ScholarCross RefCross Ref
  9. A. Khatuun. [n. d.]. Bangla wikipedia dataset.Google ScholarGoogle Scholar
  10. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google ScholarGoogle Scholar
  11. Chen-Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, and Zhuowen Tu. 2015. Deeply-supervised nets. In Artificial intelligence and statistics. PMLR, 562–570.Google ScholarGoogle Scholar
  12. Chenbin Li, Guohua Zhan, and Zhihua Li. 2018. News text classification based on improved Bi-LSTM-CNN. In 2018 9th International conference on information technology in medicine and education (ITME). IEEE, 890–893.Google ScholarGoogle ScholarCross RefCross Ref
  13. Yuandong Luan and Shaofu Lin. 2019. Research on text classification based on CNN and LSTM. In 2019 IEEE international conference on artificial intelligence and computer applications (ICAICA). IEEE, 352–355.Google ScholarGoogle ScholarCross RefCross Ref
  14. Md Humaion Kabir Mehedi, Kazi Omar Faruk, Anika Rahman, Iffatun Nessa, Benozir Zabin, Khairun Nahar, Shadab Iqbal, Md Sabbir Hossain, and Annajiat Alim Rasel. 2022. Automatic Bangla Article Content Categorization using a Hybrid Deep Learning Model. In 2022 IEEE 10th Region 10 Humanitarian Technology Conference (R10-HTC). IEEE, 19–25.Google ScholarGoogle ScholarCross RefCross Ref
  15. Rakin Mostafa, Md Humaion Kabir Mehedi, MD Mustakin Alam, and Annajiat Alim Rasel. 2023. Bidirectional LSTM and NLP Based Sentiment Analysis of Tweets. In Proceedings of the 14th International Conference on Soft Computing and Pattern Recognition (SoCPaR 2022). Springer, 647–655.Google ScholarGoogle ScholarCross RefCross Ref
  16. S. Sazzad. 2021. Bangla (bengali) sentiment analysis classification benchmark dataset corpus. Mendeley.Google ScholarGoogle Scholar
  17. Qi Wang, Xiangyue Meng, Ting Sun, and Xiangde Zhang. 2021. A light iris segmentation network. The Visual Computer (2021), 1–11.Google ScholarGoogle Scholar

Index Terms

  1. Multi-Supervised LSTM for Bengali Text Sentiment Analysis

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICMLT '23: Proceedings of the 2023 8th International Conference on Machine Learning Technologies
      March 2023
      293 pages
      ISBN:9781450398329
      DOI:10.1145/3589883

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 June 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)31
      • Downloads (Last 6 weeks)4

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format