Skip to main content
Log in

Deep learning-based hybrid sentiment analysis with feature selection using optimization algorithm

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In the past few years, sentiment analysis (SA) of online content has gained more attention in the research area due to the enormous increase of online content from various sources like websites, social blogs, etc. Many organizations use SA techniques to determine the opinion of users and to ensure their satisfaction. Numerous techniques are suggested by many researchers to identify the sentiments of online content. Among them, hybrid of deep learning and lexicon-based SA techniques are gaining more attention due to their outstanding performance than other approaches. Though the lexicon-based SA approaches integrated with deep learning SA approaches possess more advantages they suffer from lack of accuracy and scalability issues due to the high-dimensional features. To eliminate this issue, a hybrid SA approach is proposed in this paper with a bio-inspired feature selection technique. The Valence Aware Dictionary for Sentiment Reasoning (VADER) approach is integrated with the hybrid deep learning approach of attention-based bidirectional long short-term memory and variable pooling convolutional neural network (VPCNN-ABiLSTM) for SA. The optimal features are selected to minimize the scalability issue by integrating the chimp optimization algorithm with the opposition-based learning technique. The performance of the proposed approach is evaluated for four types of benchmark datasets in terms of precision, accuracy, recall, and F1 score. The proposed approach with OBL-CHOA based feature selection technique achieved higher accuracy of 97.1% with the reduction of 13.6% features. The accuracy of the proposed approach with the feature selection technique is 6.9% higher than the existing BiLSTM-CNN based SA approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Algorithm 1
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data Availability

Data will be shared on the reasonable request.

References

  1. Alarifi A, Tolba A, Al-Makhadmeh Z, Said W (2018) A big data approach to sentiment analysis using greedy feature selection with cat swarm optimization-based long short-term memory neural networks. J Supercomput 76:4414–4429. https://doi.org/10.1007/s11227-018-2398-2

    Article  Google Scholar 

  2. Boiy E, Moens M -F (2008) A machine learning approach to sentiment analysis in multilingual Web texts. Inf Retr 12(5):526–558. https://doi.org/10.1007/s10791-008-9070-z

    Article  Google Scholar 

  3. Bojanowski P, Grave E, Joulin A, Mikolov T (2017) En-riching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135–146. https://doi.org/10.1162/tacl-a-00051

    Article  Google Scholar 

  4. Borg A, Boldt M (2020) Using VADER sentiment and SVM for predicting customer response sentiment. Expert Syst Appl 162:113746. https://doi.org/10.1016/j.eswa.2020.113746

    Article  Google Scholar 

  5. Cahyadi A, Khodra ML (2018) Aspect-based sentiment analysis using convolutional neural network and bidirectional long short-term memory. In: 2018 5th international conference on advanced informatics: concept theoryand applications (ICAICTA), pp 124–129. https://doi.org/10.1109/ICAICTA.2018.8541300

  6. Cai Y, Huang Q, Lin Z, Xu J, Chen Z, Li Q (2020) Recurrent neural network with pooling operation and attention mechanism for sentiment analysis: a multi-task learning approach. Knowl-Based Syst 203:105856. https://doi.org/10.1016/j.knosys.2020.105856

    Article  Google Scholar 

  7. Cho H, Kim S, Lee J, Lee J-S (2014) Data-driven integration of multiple sentiment dictionaries for lexicon-based sentiment classification of product reviews. Knowl-Based Syst 71:61–71. https://doi.org/10.1016/j.knosys.2014.06.001

    Article  Google Scholar 

  8. Da Silva NFF, Coletta LFS, Hruschka ER, Hruschka ERJ (2016) Using unsupervised information to improve semi-supervised tweet sentiment classification. Inform Sci 355–356:348–365. https://doi.org/10.1016/j.ins.2016.02.002

    Article  Google Scholar 

  9. Darwich M, Noah SAM, Omar N, Osman N (2019) Corpus-Based Techniques for sentiment lexicon generation: a review. J Digit Inf Manag 17(5):296–305. https://doi.org/10.6025/jdim/2019/17/5/296-305

    Article  Google Scholar 

  10. Dong M, Li Y, Tang X, Xu J, Bi S, Cai Y (2020) Variable convolution and pooling convolutional neural network for text sentiment classification. IEEE Access 8:16174–16186. https://doi.org/10.1109/ACCESS.2020.2966726

    Article  Google Scholar 

  11. Ducange P, Fazzolari M, Petrocchi M, Vecchio M (2019) An effective Decision Support System for social media listening based on cross-source sentiment analysis models. Eng Appl Artif Intell 78:71–85. https://doi.org/10.1016/j.engappai.2018.10.014

    Article  Google Scholar 

  12. Eberhart R, Kennedy J (n.d.) A new optimizer using particle swarm theory. MHS’95. In: Proceedings of the Sixth international symposium on micro machine and human science, pp 39-43. https://doi.org/10.1109/mhs.1995.494215

  13. Fathi M, Haghi Kashani M, Jameii SM, Mahdipour E (2021) Big data analytics in weather forecasting: a systematic review. Archives of Computational Methods in Engineering 27:1247–1275. https://doi.org/10.1007/s11831-021-09616-4

    Article  Google Scholar 

  14. Fu X, Yang J, Li J, Fang M, Wang H (2018) Lexicon-enhanced LSTM with attention for general sentiment analysis. IEEE Access 6:71884–71891. https://doi.org/10.1109/access.2018.2878425

    Article  Google Scholar 

  15. Gang L, Jiabao G (2019) Bidirectional LSTM with attention mechanism and convolutional layer for text classification. Neurocomputing 337:325–338. https://doi.org/10.1016/j.neucom.2019.01.078

    Article  Google Scholar 

  16. Gautam G, Yadav D (2014) Sentiment analysis of twitter data using machine learning approaches and semantic analysis. In: 2014 Seventh international conference on contemporary computing (IC3), pp 437–442. https://doi.org/10.1109/ic3.2014.6897213

  17. Geetha MP, Renuka D (2021) Improving the performance of aspect based sentiment analysis using fine-tuned Bert Base Uncased model. Int J Intell Netw 2:64–69. https://doi.org/10.1016/j.ijin.2021.06.005

    Article  Google Scholar 

  18. Govindarajan M (2013) Sentiment analysis of movie reviews using hybrid method of naive Bayes and genetic algorithm. Int J Adv Comput Res 3(4):139–145

    Google Scholar 

  19. Gupta I, Joshi N (2019) Enhanced twitter sentiment analysis using hybrid approach and by accounting local contextual semantic. J Intell Syst 29 (1):1611–1625. https://doi.org/10.1515/jisys-2019-0106

    Article  Google Scholar 

  20. Haque TU, Saber NN, Shah FM (2018) Sentiment analysis on large scale Amazon product reviews. In: 2018 IEEE international conference on innovative research and development (ICIRD), pp 1–6. https://doi.org/10.1109/icird.2018.8376299

  21. Hassan A, Mahmood A (2018) Convolutional recurrent deep learning model for sentence classification. IEEE Access 6:13949–13957. https://doi.org/10.1109/ACCESS.2018.2814818

    Article  Google Scholar 

  22. Hemmatian F, Sohrabi MK (2017) A survey on classification techniques for opinion mining and sentiment analysis. Artif Intell Rev 52(3):1495–1545. https://doi.org/10.1007/s10462-017-9599-6.

    Article  Google Scholar 

  23. Holland JH (1975) Adaptation in natural and artificial systems: an introductory analysis with applications to biology. Control, and Artificial Intelligence, The MIT Press

  24. Huang W, Rao G, Feng Z, Cong Q (2018) LSTM with sentence representations for document-level sentiment classification. Neurocomputing 308:49–57. https://doi.org/10.1016/j.neucom.2018.04.045

    Article  Google Scholar 

  25. Hussain A, Cambria E (2018) Semi-supervised learning for big social data analysis. Neurocomputing 275:1662–1673. https://doi.org/10.1016/j.neucom.2017.10.010

    Article  Google Scholar 

  26. Hutto CJ, Gilbert E (2015) VADER: A parsimonious rule-based model for sentiment analysis of social media text. Proceedings of the 8th International Conference on Weblogs and Social Media 8(1):216–225. https://doi.org/10.1609/icwsm.v8i1.14550

    Article  Google Scholar 

  27. Iqbal F, Maqbool J, Fung BCM, Batool R, Khattak AM, Aleem S, Hung PCK (2019) A hybrid framework for sentiment analysis using genetic algorithm based feature reduction. IEEE Access 7:14637–14652. https://doi.org/10.1109/access.2019.2892852

    Article  Google Scholar 

  28. Kalarani P, Selva Brunda S (2018) Sentiment analysis by POS and joint sentiment topic features using SVM and ANN. Soft Comput 23:7067–7079. https://doi.org/10.1007/s00500-018-3349-9

    Article  Google Scholar 

  29. Kang H, Yoo SJ, Han D (2012) Senti-lexicon and improved naïve Bayes algorithms for sentiment analysis of restaurant reviews. Expert Syst Appl 39(5):6000–6010. https://doi.org/10.1016/j.eswa.2011.11.107

    Article  Google Scholar 

  30. Karimi Y, Haghi Kashani M, Akbari M, Mahdipour E (2021) Leveraging big data in smart cities: a systematic review. Concurrency and Computation: Practice and Experience 33(21):e6379. https://doi.org/10.1002/cpe.6379

    Article  Google Scholar 

  31. Katz G, Ofek N, Shapira B (2015) Consent: context-based sentiment analysis. Knowl-Based Syst 84:162–178. https://doi.org/10.1016/j.knosys.2015.04.009

    Article  Google Scholar 

  32. Kausar S, Huahu X, Shabir MY, Ahmad W (2020) A sentiment polarity categorization technique for online product reviews. IEEE Access 8:3594–3605. https://doi.org/10.1109/access.2019.2963020

    Article  Google Scholar 

  33. Khishe M, Mosavi MR (2020) Chimp optimization algorithm. Expert Syst Appl 149:113338. https://doi.org/10.1016/j.eswa.2020.113338

    Article  Google Scholar 

  34. Kristiyanti DA, Wahyudi M (2017) Feature selection based on Genetic algorithm, particle swarm optimization and principal component analysis for opinion mining cosmetic product review. In: 2017 5th international conference on cyber and it service management (CITSM), pp 1–6. https://doi.org/10.1109/citsm.2017.8089278

  35. Kumar SS, Kumar MA, Soman KP (2017) Sentiment analysis of tweets in malayalam using long Short-TermMemory units and convolutional neural nets. In: Ghosh A, Pal R, Prasath R (eds) Mining intelligence and knowledgeexploration. Springer, Cham

  36. Lee G, Jeong J, Seo S, Kim CY, Kang P (2018) Sentiment classification with word localization based on weakly supervised learning with a convolutional neural network. Knowl-Based Syst 152:70–82. https://doi.org/10.1016/j.knosys.2018.04.006

    Article  Google Scholar 

  37. Li G, Liu F (2014) Sentiment analysis based on clustering: a framework in improving accuracy and recognizing neutral opinions. Appl Intell 40 (3):441–452. https://doi.org/10.1007/s10489-013-0463-3

    Article  MathSciNet  Google Scholar 

  38. Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. ICLR. arXiv:301.3781

  39. Minaee S, Azimi E, Abdolrashidi A (2019) Deep-sentiment: sentiment analysis using ensemble of cnn and bi-lstm models. https://doi.org/10.48550/arXiv.1904.04206

  40. Mishra RM, Urolagin S, Angel AJJ, Neogi A, Nawaz N (2021) Deep learning-based sentiment analysis and topic modeling on tourism during Covid-19 pandemic. Frontiers in Computer Science 3. https://doi.org/10.3389/fcomp.2021.775368

  41. Mujahid M, Lee E, Rustam F, Washington PB, Ullah S, Reshi AA, Ashraf I (2021) Sentiment analysis and topic modeling on tweets about online education during COVID-19. Appl Sci 11(18):8438. https://doi.org/10.3390/app11188438

    Article  Google Scholar 

  42. Mumtaz D, Ahuja B (2017) A lexical and machine Learning-Based hybrid system for sentiment analysis. Studies in Computational Intelligence, pp 165–175. https://doi.org/10.1007/978-981-10-4555-4-11

  43. Nabaei A, Hamian M, Parsaei MR, Safdari R, Samad-Soltani T, Zarrabi H, Ghassemi A (2016) Topologies and performance of intelligent algorithms: a comprehensive review. Artif Intell Rev 49 (1):79–103. https://doi.org/10.1007/s10462-016-9517-3

    Article  Google Scholar 

  44. Nandi V, Agrawal S (2016) Political sentiment analysis using hybrid approach. Int Res J Eng Technol (IRJET) 3(5):1621–1627

    Google Scholar 

  45. Nogueira CSD, Gatti M (2014) Deep convolutional neural networks for sentiment analysis of short texts. In: Proceedings of 25th international conference on computational linguistics, pp 69–78

  46. Ortigosa A, Martín JM, Carro RM (2014) Sentiment analysis in Facebook and its application to e-learning. Comput Hum Behav 31:527–541. https://doi.org/10.1016/j.chb.2013.05.024

    Article  Google Scholar 

  47. Ortigosa A, Martín JM, Carro RM (2014) Sentiment analysis in Facebook and its application to e-learning. Comput Hum Behav 31:527–541. https://doi.org/10.1016/j.chb.2013.05.024

    Article  Google Scholar 

  48. Peng H, Ma Y, Li Y, Cambria E (2018) Learning multi-grained aspect target sequence for Chinese sentiment analysis. Knowl-Based Syst 148:167–176. https://doi.org/10.1016/j.knosys.2018.02.034

    Article  Google Scholar 

  49. Pennington J, Socher R, Manning C (2014) Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp 1532–1543. https://doi.org/10.3115/v1/D14-1162

  50. Rajganesh N, Asha C, Keerthana AT, Suriya K (2018) A hybrid feedback-based book recommendation system using sentiment analysis. IJSRCSEIT 3 (3):2456–3307

    Google Scholar 

  51. Rana R (2016) Gated recurrent unit (GRU) for emotion classification from noisy speech. arXiv:1612.07778

  52. Rehman AU, Malik AK, Raza B, Ali W (2019) A hybrid CNN-LSTM model for improving accuracy of movie reviews sentiment analysis. Multimed Tools Appl 78:26597–26613. https://doi.org/10.1007/s11042-019-07788-7

    Article  Google Scholar 

  53. Shang L, Zhou Z, Liu X (2016) Particle swarm optimization-based feature selection in sentiment classification. Soft Comput 20(10):3821–3834. https://doi.org/10.1007/s00500-016-2093-2

    Article  Google Scholar 

  54. Singhal P, Bhattacharyya P (2016) Sentiment analysis and deep learning: a survey. In: Center for Indian Language Technology, Indian Institute of Technology, Bombay. https://code.google.com/archive/p/word2vec/

  55. Taboada M, Brooke J, Tofiloski M, Voll K, Stede M (2011) Lexicon-Based Methods for sentiment analysis. Comput Linguist 37 (2):267–307. https://doi.org/10.1162/coli-a-0049

    Article  Google Scholar 

  56. Umer M, Ashraf I, Mehmood A, Kumari S, Ullah S, Choi GS (2021) Sentiment analysis of tweets using a unified convolutional neural network-long short-term memory network model. Comput Intell 37(1):409–434. https://doi.org/10.1111/coin.12415

    Article  MathSciNet  Google Scholar 

  57. Verma S, Saini M, Sharan A (2018) Deep sequential model for review rating prediction. In: 10th international conference on contemporary computing (IC3), pp 1–6. https://doi.org/10.1109/IC3.2017.8284318.

  58. Wu C, Wu F, Wu S, Yuan Z, Liu J, Huang Y (2019) Semi-supervised dimensional sentiment analysis with variational autoencoder. Knowl-Based Syst 165:30–39. https://doi.org/10.1016/j.knosys.2018.11.018

    Article  Google Scholar 

  59. Xia R, Zong C, Li S (2011) Ensemble of feature sets and classification algorithms for sentiment classification. Inf Sci 181(6):1138–1152. https://doi.org/10.1016/j.ins.2010.11.023

    Article  Google Scholar 

  60. Yadav A, Vishwakarma DK (2020) A comparative study on bio-inspired algorithms for sentiment analysis. Clust Comput 23:2969–2989. https://doi.org/10.1007/s10586-020-03062-w

    Article  Google Scholar 

  61. Yuan Z, Wu S, Wu F, Liu J, Huang Y (2018) Domain attention model for multi-domain sentiment classification. Knowl-Based Syst 155:1–10. https://doi.org/10.1016/j.knosys.2018.05.004

    Article  Google Scholar 

  62. Zou Q, Xie S, Lin Z, Wu M, Ju Y (2016) Finding the best classification threshold in imbalanced classification. Big Data Res 5:2–8. https://doi.org/10.1016/j.bdr.2015.12.001

    Article  Google Scholar 

Download references

Acknowledgments

I confirm that all authors listed on the title page have contributed significantly to the work, have read the manuscript, attest to the validity and legitimacy of the data and its interpretation, and agree to its submission.

Funding

Funding information is not applicable because no funding was received

Author information

Authors and Affiliations

Authors

Contributions

I confirm that all authors listed on the title page have contributed significantly to the work, have read the manuscript, attest to the validity and legitimacy of the data and its interpretation, and agree to its submission.

Corresponding author

Correspondence to D. Anand Joseph Daniel.

Ethics declarations

Ethics approval

Funding information is not applicable because no funding was received, so he name of the approving body and the approval numberID not applicable to this manuscript.

Consent for Publication

I, give my consent for the publication of above Article.I declare that I shall not submit the paper for publication in any other Journal or Magazine till the decision is made by journal editors.

Conflict of Interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Daniel, D.A.J., Meena, M.J. Deep learning-based hybrid sentiment analysis with feature selection using optimization algorithm. Multimed Tools Appl 82, 43273–43296 (2023). https://doi.org/10.1007/s11042-023-14767-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-14767-6

Keywords

Navigation