ABSTRACT
Bangla text sentiment analysis is one of the challenges of recent years. Various machine learning and deep learning approaches have been made to further improve the classification models’ performance to achieve state-of-the art results. Since text sentiment analysis is a sequential task, Long Short-term memory have provided great results in these cases. In this paper, we propose a methodology to tweak an existing LSTM model to gain further improvement in results. As Bengali Text Sentiment analysis is a sequence-to-one task, we discuss how using the features of the entire sequence generated from LSTM along with the last hidden state’s output can improve the results. We compare with a single LSTM layer where the last hidden state is used for producing the final result with our methodology where also a single LSTM layer is used but with additional feature engineering and multi-supervision is applied on the generated sequence and final hidden state to produce the results. Our final results show that adding these tweaks to an existing LSTM layer can increase the overall performance of the model.
- Abien Fred Agarap. 2018. Deep learning using rectified linear units (relu). arXiv preprint arXiv:1803.08375 (2018).Google Scholar
- Corinna Cortes, Mehryar Mohri, and Afshin Rostamizadeh. 2012. L2 regularization for learning kernels. arXiv preprint arXiv:1205.2653 (2012).Google Scholar
- Zhiyong Cui, Ruimin Ke, Ziyuan Pu, and Yinhai Wang. 2018. Deep bidirectional and unidirectional LSTM recurrent neural network for network-wide traffic speed prediction. arXiv preprint arXiv:1801.02143 (2018).Google Scholar
- Hrayr Harutyunyan, Hrant Khachatrian, David C Kale, Greg Ver Steeg, and Aram Galstyan. 2019. Multitask learning and benchmarking with clinical time series data. Scientific data 6, 1 (2019), 96.Google Scholar
- Khan Md Hasib, Nurul Akter Towhid, and Md Golam Rabiul Alam. 2021. Online review based sentiment classification on bangladesh airline service using supervised learning. In 2021 5th International Conference on Electrical Engineering and Information Communication Technology (ICEEICT). IEEE, 1–6.Google ScholarCross Ref
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735–1780.Google ScholarDigital Library
- Naimul Hossain, Md Rafiuzzaman Bhuiyan, Zerin Nasrin Tumpa, and Syed Akhter Hossain. 2020. Sentiment analysis of restaurant reviews using combined CNN-LSTM. In 2020 11th International conference on computing, communication and networking technologies (ICCCNT). IEEE, 1–5.Google ScholarCross Ref
- Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7132–7141.Google ScholarCross Ref
- A. Khatuun. [n. d.]. Bangla wikipedia dataset.Google Scholar
- Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google Scholar
- Chen-Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, and Zhuowen Tu. 2015. Deeply-supervised nets. In Artificial intelligence and statistics. PMLR, 562–570.Google Scholar
- Chenbin Li, Guohua Zhan, and Zhihua Li. 2018. News text classification based on improved Bi-LSTM-CNN. In 2018 9th International conference on information technology in medicine and education (ITME). IEEE, 890–893.Google ScholarCross Ref
- Yuandong Luan and Shaofu Lin. 2019. Research on text classification based on CNN and LSTM. In 2019 IEEE international conference on artificial intelligence and computer applications (ICAICA). IEEE, 352–355.Google ScholarCross Ref
- Md Humaion Kabir Mehedi, Kazi Omar Faruk, Anika Rahman, Iffatun Nessa, Benozir Zabin, Khairun Nahar, Shadab Iqbal, Md Sabbir Hossain, and Annajiat Alim Rasel. 2022. Automatic Bangla Article Content Categorization using a Hybrid Deep Learning Model. In 2022 IEEE 10th Region 10 Humanitarian Technology Conference (R10-HTC). IEEE, 19–25.Google ScholarCross Ref
- Rakin Mostafa, Md Humaion Kabir Mehedi, MD Mustakin Alam, and Annajiat Alim Rasel. 2023. Bidirectional LSTM and NLP Based Sentiment Analysis of Tweets. In Proceedings of the 14th International Conference on Soft Computing and Pattern Recognition (SoCPaR 2022). Springer, 647–655.Google ScholarCross Ref
- S. Sazzad. 2021. Bangla (bengali) sentiment analysis classification benchmark dataset corpus. Mendeley.Google Scholar
- Qi Wang, Xiangyue Meng, Ting Sun, and Xiangde Zhang. 2021. A light iris segmentation network. The Visual Computer (2021), 1–11.Google Scholar
Index Terms
- Multi-Supervised LSTM for Bengali Text Sentiment Analysis
Recommendations
Sentiment Analysis in the Light of LSTM Recurrent Neural Networks
Long short-term memory LSTM is a special type of recurrent neural network RNN architecture that was designed over simple RNNs for modeling temporal sequences and their long-range dependencies more accurately. In this article, the authors work with ...
Sentiment analysis of commodity reviews based on multilayer LSTM network
AIIPCC '19: Proceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud ComputingWith the rapid development of network technology and the popularization and application of e-commerce, comments with subjective sentiments on various mainstream e-commerce platforms play a crucial role in improving product quality. This paper takes the ...
Joint sentiment/topic model for sentiment analysis
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementSentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework based on Latent Dirichlet ...
Comments