Abstract
In the past few years, sentiment analysis (SA) of online content has gained more attention in the research area due to the enormous increase of online content from various sources like websites, social blogs, etc. Many organizations use SA techniques to determine the opinion of users and to ensure their satisfaction. Numerous techniques are suggested by many researchers to identify the sentiments of online content. Among them, hybrid of deep learning and lexicon-based SA techniques are gaining more attention due to their outstanding performance than other approaches. Though the lexicon-based SA approaches integrated with deep learning SA approaches possess more advantages they suffer from lack of accuracy and scalability issues due to the high-dimensional features. To eliminate this issue, a hybrid SA approach is proposed in this paper with a bio-inspired feature selection technique. The Valence Aware Dictionary for Sentiment Reasoning (VADER) approach is integrated with the hybrid deep learning approach of attention-based bidirectional long short-term memory and variable pooling convolutional neural network (VPCNN-ABiLSTM) for SA. The optimal features are selected to minimize the scalability issue by integrating the chimp optimization algorithm with the opposition-based learning technique. The performance of the proposed approach is evaluated for four types of benchmark datasets in terms of precision, accuracy, recall, and F1 score. The proposed approach with OBL-CHOA based feature selection technique achieved higher accuracy of 97.1% with the reduction of 13.6% features. The accuracy of the proposed approach with the feature selection technique is 6.9% higher than the existing BiLSTM-CNN based SA approach.
Similar content being viewed by others
Data Availability
Data will be shared on the reasonable request.
References
Alarifi A, Tolba A, Al-Makhadmeh Z, Said W (2018) A big data approach to sentiment analysis using greedy feature selection with cat swarm optimization-based long short-term memory neural networks. J Supercomput 76:4414–4429. https://doi.org/10.1007/s11227-018-2398-2
Boiy E, Moens M -F (2008) A machine learning approach to sentiment analysis in multilingual Web texts. Inf Retr 12(5):526–558. https://doi.org/10.1007/s10791-008-9070-z
Bojanowski P, Grave E, Joulin A, Mikolov T (2017) En-riching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135–146. https://doi.org/10.1162/tacl-a-00051
Borg A, Boldt M (2020) Using VADER sentiment and SVM for predicting customer response sentiment. Expert Syst Appl 162:113746. https://doi.org/10.1016/j.eswa.2020.113746
Cahyadi A, Khodra ML (2018) Aspect-based sentiment analysis using convolutional neural network and bidirectional long short-term memory. In: 2018 5th international conference on advanced informatics: concept theoryand applications (ICAICTA), pp 124–129. https://doi.org/10.1109/ICAICTA.2018.8541300
Cai Y, Huang Q, Lin Z, Xu J, Chen Z, Li Q (2020) Recurrent neural network with pooling operation and attention mechanism for sentiment analysis: a multi-task learning approach. Knowl-Based Syst 203:105856. https://doi.org/10.1016/j.knosys.2020.105856
Cho H, Kim S, Lee J, Lee J-S (2014) Data-driven integration of multiple sentiment dictionaries for lexicon-based sentiment classification of product reviews. Knowl-Based Syst 71:61–71. https://doi.org/10.1016/j.knosys.2014.06.001
Da Silva NFF, Coletta LFS, Hruschka ER, Hruschka ERJ (2016) Using unsupervised information to improve semi-supervised tweet sentiment classification. Inform Sci 355–356:348–365. https://doi.org/10.1016/j.ins.2016.02.002
Darwich M, Noah SAM, Omar N, Osman N (2019) Corpus-Based Techniques for sentiment lexicon generation: a review. J Digit Inf Manag 17(5):296–305. https://doi.org/10.6025/jdim/2019/17/5/296-305
Dong M, Li Y, Tang X, Xu J, Bi S, Cai Y (2020) Variable convolution and pooling convolutional neural network for text sentiment classification. IEEE Access 8:16174–16186. https://doi.org/10.1109/ACCESS.2020.2966726
Ducange P, Fazzolari M, Petrocchi M, Vecchio M (2019) An effective Decision Support System for social media listening based on cross-source sentiment analysis models. Eng Appl Artif Intell 78:71–85. https://doi.org/10.1016/j.engappai.2018.10.014
Eberhart R, Kennedy J (n.d.) A new optimizer using particle swarm theory. MHS’95. In: Proceedings of the Sixth international symposium on micro machine and human science, pp 39-43. https://doi.org/10.1109/mhs.1995.494215
Fathi M, Haghi Kashani M, Jameii SM, Mahdipour E (2021) Big data analytics in weather forecasting: a systematic review. Archives of Computational Methods in Engineering 27:1247–1275. https://doi.org/10.1007/s11831-021-09616-4
Fu X, Yang J, Li J, Fang M, Wang H (2018) Lexicon-enhanced LSTM with attention for general sentiment analysis. IEEE Access 6:71884–71891. https://doi.org/10.1109/access.2018.2878425
Gang L, Jiabao G (2019) Bidirectional LSTM with attention mechanism and convolutional layer for text classification. Neurocomputing 337:325–338. https://doi.org/10.1016/j.neucom.2019.01.078
Gautam G, Yadav D (2014) Sentiment analysis of twitter data using machine learning approaches and semantic analysis. In: 2014 Seventh international conference on contemporary computing (IC3), pp 437–442. https://doi.org/10.1109/ic3.2014.6897213
Geetha MP, Renuka D (2021) Improving the performance of aspect based sentiment analysis using fine-tuned Bert Base Uncased model. Int J Intell Netw 2:64–69. https://doi.org/10.1016/j.ijin.2021.06.005
Govindarajan M (2013) Sentiment analysis of movie reviews using hybrid method of naive Bayes and genetic algorithm. Int J Adv Comput Res 3(4):139–145
Gupta I, Joshi N (2019) Enhanced twitter sentiment analysis using hybrid approach and by accounting local contextual semantic. J Intell Syst 29 (1):1611–1625. https://doi.org/10.1515/jisys-2019-0106
Haque TU, Saber NN, Shah FM (2018) Sentiment analysis on large scale Amazon product reviews. In: 2018 IEEE international conference on innovative research and development (ICIRD), pp 1–6. https://doi.org/10.1109/icird.2018.8376299
Hassan A, Mahmood A (2018) Convolutional recurrent deep learning model for sentence classification. IEEE Access 6:13949–13957. https://doi.org/10.1109/ACCESS.2018.2814818
Hemmatian F, Sohrabi MK (2017) A survey on classification techniques for opinion mining and sentiment analysis. Artif Intell Rev 52(3):1495–1545. https://doi.org/10.1007/s10462-017-9599-6.
Holland JH (1975) Adaptation in natural and artificial systems: an introductory analysis with applications to biology. Control, and Artificial Intelligence, The MIT Press
Huang W, Rao G, Feng Z, Cong Q (2018) LSTM with sentence representations for document-level sentiment classification. Neurocomputing 308:49–57. https://doi.org/10.1016/j.neucom.2018.04.045
Hussain A, Cambria E (2018) Semi-supervised learning for big social data analysis. Neurocomputing 275:1662–1673. https://doi.org/10.1016/j.neucom.2017.10.010
Hutto CJ, Gilbert E (2015) VADER: A parsimonious rule-based model for sentiment analysis of social media text. Proceedings of the 8th International Conference on Weblogs and Social Media 8(1):216–225. https://doi.org/10.1609/icwsm.v8i1.14550
Iqbal F, Maqbool J, Fung BCM, Batool R, Khattak AM, Aleem S, Hung PCK (2019) A hybrid framework for sentiment analysis using genetic algorithm based feature reduction. IEEE Access 7:14637–14652. https://doi.org/10.1109/access.2019.2892852
Kalarani P, Selva Brunda S (2018) Sentiment analysis by POS and joint sentiment topic features using SVM and ANN. Soft Comput 23:7067–7079. https://doi.org/10.1007/s00500-018-3349-9
Kang H, Yoo SJ, Han D (2012) Senti-lexicon and improved naïve Bayes algorithms for sentiment analysis of restaurant reviews. Expert Syst Appl 39(5):6000–6010. https://doi.org/10.1016/j.eswa.2011.11.107
Karimi Y, Haghi Kashani M, Akbari M, Mahdipour E (2021) Leveraging big data in smart cities: a systematic review. Concurrency and Computation: Practice and Experience 33(21):e6379. https://doi.org/10.1002/cpe.6379
Katz G, Ofek N, Shapira B (2015) Consent: context-based sentiment analysis. Knowl-Based Syst 84:162–178. https://doi.org/10.1016/j.knosys.2015.04.009
Kausar S, Huahu X, Shabir MY, Ahmad W (2020) A sentiment polarity categorization technique for online product reviews. IEEE Access 8:3594–3605. https://doi.org/10.1109/access.2019.2963020
Khishe M, Mosavi MR (2020) Chimp optimization algorithm. Expert Syst Appl 149:113338. https://doi.org/10.1016/j.eswa.2020.113338
Kristiyanti DA, Wahyudi M (2017) Feature selection based on Genetic algorithm, particle swarm optimization and principal component analysis for opinion mining cosmetic product review. In: 2017 5th international conference on cyber and it service management (CITSM), pp 1–6. https://doi.org/10.1109/citsm.2017.8089278
Kumar SS, Kumar MA, Soman KP (2017) Sentiment analysis of tweets in malayalam using long Short-TermMemory units and convolutional neural nets. In: Ghosh A, Pal R, Prasath R (eds) Mining intelligence and knowledgeexploration. Springer, Cham
Lee G, Jeong J, Seo S, Kim CY, Kang P (2018) Sentiment classification with word localization based on weakly supervised learning with a convolutional neural network. Knowl-Based Syst 152:70–82. https://doi.org/10.1016/j.knosys.2018.04.006
Li G, Liu F (2014) Sentiment analysis based on clustering: a framework in improving accuracy and recognizing neutral opinions. Appl Intell 40 (3):441–452. https://doi.org/10.1007/s10489-013-0463-3
Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. ICLR. arXiv:301.3781
Minaee S, Azimi E, Abdolrashidi A (2019) Deep-sentiment: sentiment analysis using ensemble of cnn and bi-lstm models. https://doi.org/10.48550/arXiv.1904.04206
Mishra RM, Urolagin S, Angel AJJ, Neogi A, Nawaz N (2021) Deep learning-based sentiment analysis and topic modeling on tourism during Covid-19 pandemic. Frontiers in Computer Science 3. https://doi.org/10.3389/fcomp.2021.775368
Mujahid M, Lee E, Rustam F, Washington PB, Ullah S, Reshi AA, Ashraf I (2021) Sentiment analysis and topic modeling on tweets about online education during COVID-19. Appl Sci 11(18):8438. https://doi.org/10.3390/app11188438
Mumtaz D, Ahuja B (2017) A lexical and machine Learning-Based hybrid system for sentiment analysis. Studies in Computational Intelligence, pp 165–175. https://doi.org/10.1007/978-981-10-4555-4-11
Nabaei A, Hamian M, Parsaei MR, Safdari R, Samad-Soltani T, Zarrabi H, Ghassemi A (2016) Topologies and performance of intelligent algorithms: a comprehensive review. Artif Intell Rev 49 (1):79–103. https://doi.org/10.1007/s10462-016-9517-3
Nandi V, Agrawal S (2016) Political sentiment analysis using hybrid approach. Int Res J Eng Technol (IRJET) 3(5):1621–1627
Nogueira CSD, Gatti M (2014) Deep convolutional neural networks for sentiment analysis of short texts. In: Proceedings of 25th international conference on computational linguistics, pp 69–78
Ortigosa A, Martín JM, Carro RM (2014) Sentiment analysis in Facebook and its application to e-learning. Comput Hum Behav 31:527–541. https://doi.org/10.1016/j.chb.2013.05.024
Ortigosa A, Martín JM, Carro RM (2014) Sentiment analysis in Facebook and its application to e-learning. Comput Hum Behav 31:527–541. https://doi.org/10.1016/j.chb.2013.05.024
Peng H, Ma Y, Li Y, Cambria E (2018) Learning multi-grained aspect target sequence for Chinese sentiment analysis. Knowl-Based Syst 148:167–176. https://doi.org/10.1016/j.knosys.2018.02.034
Pennington J, Socher R, Manning C (2014) Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp 1532–1543. https://doi.org/10.3115/v1/D14-1162
Rajganesh N, Asha C, Keerthana AT, Suriya K (2018) A hybrid feedback-based book recommendation system using sentiment analysis. IJSRCSEIT 3 (3):2456–3307
Rana R (2016) Gated recurrent unit (GRU) for emotion classification from noisy speech. arXiv:1612.07778
Rehman AU, Malik AK, Raza B, Ali W (2019) A hybrid CNN-LSTM model for improving accuracy of movie reviews sentiment analysis. Multimed Tools Appl 78:26597–26613. https://doi.org/10.1007/s11042-019-07788-7
Shang L, Zhou Z, Liu X (2016) Particle swarm optimization-based feature selection in sentiment classification. Soft Comput 20(10):3821–3834. https://doi.org/10.1007/s00500-016-2093-2
Singhal P, Bhattacharyya P (2016) Sentiment analysis and deep learning: a survey. In: Center for Indian Language Technology, Indian Institute of Technology, Bombay. https://code.google.com/archive/p/word2vec/
Taboada M, Brooke J, Tofiloski M, Voll K, Stede M (2011) Lexicon-Based Methods for sentiment analysis. Comput Linguist 37 (2):267–307. https://doi.org/10.1162/coli-a-0049
Umer M, Ashraf I, Mehmood A, Kumari S, Ullah S, Choi GS (2021) Sentiment analysis of tweets using a unified convolutional neural network-long short-term memory network model. Comput Intell 37(1):409–434. https://doi.org/10.1111/coin.12415
Verma S, Saini M, Sharan A (2018) Deep sequential model for review rating prediction. In: 10th international conference on contemporary computing (IC3), pp 1–6. https://doi.org/10.1109/IC3.2017.8284318.
Wu C, Wu F, Wu S, Yuan Z, Liu J, Huang Y (2019) Semi-supervised dimensional sentiment analysis with variational autoencoder. Knowl-Based Syst 165:30–39. https://doi.org/10.1016/j.knosys.2018.11.018
Xia R, Zong C, Li S (2011) Ensemble of feature sets and classification algorithms for sentiment classification. Inf Sci 181(6):1138–1152. https://doi.org/10.1016/j.ins.2010.11.023
Yadav A, Vishwakarma DK (2020) A comparative study on bio-inspired algorithms for sentiment analysis. Clust Comput 23:2969–2989. https://doi.org/10.1007/s10586-020-03062-w
Yuan Z, Wu S, Wu F, Liu J, Huang Y (2018) Domain attention model for multi-domain sentiment classification. Knowl-Based Syst 155:1–10. https://doi.org/10.1016/j.knosys.2018.05.004
Zou Q, Xie S, Lin Z, Wu M, Ju Y (2016) Finding the best classification threshold in imbalanced classification. Big Data Res 5:2–8. https://doi.org/10.1016/j.bdr.2015.12.001
Acknowledgments
I confirm that all authors listed on the title page have contributed significantly to the work, have read the manuscript, attest to the validity and legitimacy of the data and its interpretation, and agree to its submission.
Funding
Funding information is not applicable because no funding was received
Author information
Authors and Affiliations
Contributions
I confirm that all authors listed on the title page have contributed significantly to the work, have read the manuscript, attest to the validity and legitimacy of the data and its interpretation, and agree to its submission.
Corresponding author
Ethics declarations
Ethics approval
Funding information is not applicable because no funding was received, so he name of the approving body and the approval numberID not applicable to this manuscript.
Consent for Publication
I, give my consent for the publication of above Article.I declare that I shall not submit the paper for publication in any other Journal or Magazine till the decision is made by journal editors.
Conflict of Interests
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Daniel, D.A.J., Meena, M.J. Deep learning-based hybrid sentiment analysis with feature selection using optimization algorithm. Multimed Tools Appl 82, 43273–43296 (2023). https://doi.org/10.1007/s11042-023-14767-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-14767-6