Skip to main content
Log in

MuCon: Multi-channel convolution for targeted sentiment classification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Targeted Sentiment Analysis goes beyond general sentiment classification tasks by aiming to identify the sentiment of a specific target aspect within a given text. Previous studies have predominantly utilized recurrent neural networks (RNN) or their variants to predict target-specific sentiment polarity. However, the sequential processing nature of RNN restricts parallelization and fails to leverage the potential of modern multicore architectures. Additionally, these models often overlook the inherent linguistic perspective embedded in the text. This paper proposes a novel approach called MuCon (Multi-channel Convolution), which employs a simple yet effective convolutional neural network (CNN) model. MuCon incorporates multiple channels dedicated to linguistic and statistical features to determine aspect-specific sentiment polarity accurately. By incorporating linguistic knowledge into a statistical model, MuCon performs better and achieves comparable results to sophisticated state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. https://www.yelp.com/dataset/challenge

  2. https://cseweb.ucsd.edu/~jmcauley/datasets/amazon_v2/

  3. We have used Spacy for dependency relation extraction https://spacy.io/

References

  1. Akhtar MS, Garg T, Ekbal A (2020) Multi-task learning for aspect term extraction and aspect sentiment classification. Neurocomputing. https://doi.org/10.1016/j.neucom.2020.02.093

  2. Basiri ME, Nemati S, Abdar M, Cambria E, Acharrya UR (2020) Abcdm: An attention-based bidirectional cnn-rnn deep model for sentiment analysis. Future Gener Comput Syst. https://doi.org/10.1016/j.future.2020.08.005

  3. Benamara F, Cesarano C, Picariello A, Recupero DR, Subrahmanian VS (2007) Sentiment analysis: Adjectives and adverbs are better than adjectives alone. ICWSM 7:203–206

    Google Scholar 

  4. Chaturvedi A, Pandit O, Garain U (2018) CNN for text-based multiple choice question answering. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 272–277. Association for Computational Linguistics, Melbourne, Australia. https://doi.org/10.18653/v1/P18-2044. https://www.aclweb.org/anthology/P18-2044

  5. Chen P, Sun Z, Bing L, Yang W (2017) Recurrent attention network on memory for aspect sentiment analysis. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 452–461. Association for Computational Linguistics, Copenhagen, Denmark. https://doi.org/10.18653/v1/D17-1047. https://www.aclweb.org/anthology/D17-1047

  6. Devlin J, Chang MW, Lee K, Toutanova K (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics, Minneapolis, Minnesota. https://doi.org/10.18653/v1/N19-1423. https://www.aclweb.org/anthology/N19-1423

  7. Ding X, Liu B, Yu PS (2008) A holistic lexicon-based approach to opinion mining. In: Proceedings of the 2008 international conference on web search and data mining, pp. 231–240. https://doi.org/10.1145/1341531.1341561

  8. Dragut E, Fellbaum C (2014) The role of adverbs in sentiment analysis. In: Proceedings of Frame Semantics in NLP: A Workshop in Honor of Chuck Fillmore (1929-2014), pp. 38–41

  9. Elman JL (1990) Finding structure in time. Cognitive science 14(2):179–211

    Article  Google Scholar 

  10. Gamon M (2004) Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis. In: COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics, pp. 841–847. https://aclanthology.org/C04-1121

  11. Gers FA, Schmidhuber JA, Cummins FA (2000) Learning to forget: Continual prediction with lstm. Neural Comput 12(10):2451–2471. https://doi.org/10.1162/089976600300015015

    Article  CAS  PubMed  Google Scholar 

  12. Goodfellow I, Bengio Y, Courville A, Bengio Y (2016) Deep learning, vol. 1. MIT press Cambridge

  13. Gu S, Zhang L, Hou Y, Song Y (2018) A position-aware bidirectional attention network for aspect-level sentiment analysis. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 774–784. Association for Computational Linguistics, Santa Fe, New Mexico, USA. https://www.aclweb.org/anthology/C18-1066

  14. Gu X, Gu Y, Wu H (2017) Cascaded convolutional neural networks for aspect-based opinion summary. Neural Process Lett 46(2):581–594. https://doi.org/10.1007/s11063-017-9605-7

  15. Hai Z, Cong G, Chang K, Liu W, Cheng P (2014) Coarse-to-fine review selection via supervised joint aspect and sentiment model. In: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval, pp. 617–626. https://doi.org/10.1145/2600428.2609570

  16. Hatzivassiloglou V, McKeown K (1997) Predicting the semantic orientation of adjectives. In: 35th annual meeting of the association for computational linguistics and 8th conference of the european chapter of the association for computational linguistics, pp. 174–181. https://doi.org/10.3115/976909.979640. https://aclanthology.org/P97-1023

  17. Hochreiter S, Bengio Y, Frasconi P, Schmidhuber J (2001) Gradient flow in recurrent nets: the difficulty of learning long-term dependencies

  18. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article  CAS  PubMed  Google Scholar 

  19. Joshi M, Rosé C (2009) Generalizing dependency features for opinion mining. In: Proceedings of the ACL-IJCNLP 2009 conference short papers, pp. 313–316. https://aclanthology.org/P09-2079

  20. Kalchbrenner N, Grefenstette E, Blunsom P (2014) A convolutional neural network for modelling sentences. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 655–665. Association for Computational Linguistics, Baltimore, Maryland. https://doi.org/10.3115/v1/P14-1062. https://aclanthology.org/P14-1062

  21. Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1746–1751. Association for Computational Linguistics, Doha, Qatar. https://doi.org/10.3115/v1/D14-1181. https://aclanthology.org/D14-1181

  22. Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980

  23. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp. 1097–1105. https://doi.org/10.1145/3065386

  24. LeCun Y et al (1989) Generalization and network design strategies. Connectionism in perspective 19:143–155

    Google Scholar 

  25. Li F, Huang M, Zhu X (2010) Sentiment analysis with global topics and local dependency. Proceedings of the AAAI conference on artificial intelligence 24:1371–1376. https://doi.org/10.1609/aaai.v24i1.7523

  26. Li Z, Sun Y, Zhang L, Tang J (2021) Ctnet: Context-based tandem network for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 44(12):9904–9917. https://doi.org/10.1109/TPAMI.2021.3132068

  27. Liu B (2012) Sentiment analysis and opinion mining. Synthesis lectures on human language technologies 5(1):1–167. https://doi.org/10.1007/978-3-031-02145-9

  28. Liu N, Shen B (2020) Aspect-based sentiment analysis with gated alternate neural network. Knowledge-Based Systems 188:105010. https://doi.org/10.1016/j.knosys.2019.105010. https://www.sciencedirect.com/science/article/pii/S0950705119304228

  29. Liu N, Shen B (2020) Rememnn: A novel memory neural network for powerful interaction in aspect-based sentiment analysis. Neurocomputing 395:66–77. https://doi.org/10.1016/j.neucom.2020.02.018. http://www.sciencedirect.com/science/article/pii/S0925231220301934

  30. Liu X, Jing X, He Y, Mu J (2020) Multi-level attentional network for aspect-based sentiment analysis. In: 2020 IEEE 5th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA), pp. 17–22. IEEE. https://doi.org/10.1109/ICCCBDA49378.2020.9095571

  31. Ma, D., Li, S., Zhang, X., Wang, H.: Interactive attention networks for aspect-level sentiment classification. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI-17, pp. 4068–4074 (2017). https://doi.org/10.24963/ijcai.2017/568. https://doi.org/10.24963/ijcai.2017/568

  32. Matsumoto S, Takamura H, Okumura M (2005) Sentiment classification using word sub-sequences and dependency sub-trees. In: Pacific-Asia conference on knowledge discovery and data mining, pp. 301–311. Springer. https://doi.org/10.1007/11430919_37

  33. Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. In: Y. Bengio, Y. LeCun (eds.) 1st International Conference on Learning Representations, ICLR 2013, Scottsdale, Arizona, USA, May 2-4, 2013, Workshop Track Proceedings. http://arxiv.org/abs/1301.3781

  34. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp. 3111–3119

  35. Mitchell TM, et al. (2007) Machine learning, vol. 1. McGraw-hill New York

  36. Mullen T, Collier N (2004) Sentiment analysis using support vector machines with diverse information sources. In: Proceedings of the 2004 conference on empirical methods in natural language processing, pp. 412–418. https://aclanthology.org/W04-3253

  37. Pang B, Lee L (2008) Opinion mining and sentiment analysis foundations and trends in information retrieval vol. 2

  38. Peng Z, Li Z, Zhang J, Li Y, Qi GJ, Tang J (2019) Few-shot image recognition with knowledge transfer. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 441–449. https://doi.org/10.1109/ICCV.2019.00053

  39. Pennington J, Socher R, Manning CD (2014) Glove: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp. 1532–1543. https://doi.org/10.3115/v1/D14-1162. https://aclanthology.org/D14-1162

  40. Peters M, Neumann M, Iyyer M, Gardner M, Clark C, Lee K, Zettlemoyer L (2018) Deep contextualized word representations. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 2227–2237. Association for Computational Linguistics, New Orleans, Louisiana. https://doi.org/10.18653/v1/N18-1202. https://www.aclweb.org/anthology/N18-1202

  41. Pontiki M, Galanis D, Pavlopoulos J, Papageorgiou H, Androutsopoulos I, Manandhar S (2014) Semeval-2014 task 4: Aspect based sentiment analysis. In: Proceedings of the 8th international workshop on semantic evaluation (SemEval 2014), pp. 27–35. ACL

  42. Poria S, Cambria E, Gelbukh A (2016) Aspect extraction for opinion mining with a deep convolutional neural network. Knowledge-Based Syst 108:42 – 49 https://doi.org/10.1016/j.knosys.2016.06.009. http://www.sciencedirect.com/science/article/pii/S0950705116301721. New Avenues in Knowledge Bases for Natural Language Processing

  43. Poria S, Cambria E, Winterstein G, Huang GB (2014) Sentic patterns: Dependency-based rules for concept-level sentiment analysis. Knowledge-Based Syst 69:45–63. https://doi.org/10.1016/j.knosys.2014.05.005

  44. Qiang Y, Li X, Zhu D (2020) Toward tag-free aspect based sentiment analysis: A multiple attention network approach. arXiv preprint arXiv:2003.09986

  45. Rehurek R, Sojka P (2010) Software framework for topic modelling with large corpora. In: In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. Citeseer

  46. Shuang K, Yang Q, Loo J, Li R, Gu M (2020) Feature distillation network for aspect-based sentiment analysis. Information Fusion. https://doi.org/10.1016/j.inffus.2020.03.003

  47. Subrahmanian VS, Reforgiato D (2008) Ava: Adjective-verb-adverb combinations for sentiment analysis. IEEE Intelligent Systems 23(4):43–50. https://doi.org/10.1109/MIS.2008.57

  48. Tang D, Qin B, Feng X, Liu T (2015) Target-dependent sentiment classification with long short term memory. CoRR abs/1512.01100

  49. Tang, D., Qin, B., Feng, X., Liu, T (2016) Effective LSTMs for target-dependent sentiment classification. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 3298–3307. The COLING 2016 Organizing Committee, Osaka, Japan. https://www.aclweb.org/anthology/C16-1311

  50. Tang D, Qin B, Liu T (2016) Aspect level sentiment classification with deep memory network. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 214–224. Association for Computational Linguistics, Austin, Texas. https://doi.org/10.18653/v1/D16-1021. https://aclanthology.org/D16-1021

  51. Turney, P (2002) Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 417–424. Association for Computational Linguistics, Philadelphia, Pennsylvania, USA. https://doi.org/10.3115/1073083.1073153. https://aclanthology.org/P02-1053

  52. Wang Y, Huang M, Zhu X, Zhao L (2016) Attention-based LSTM for aspect-level sentiment classification. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 606–615. Association for Computational Linguistics, Austin, Texas. https://doi.org/10.18653/v1/D16-1058. https://www.aclweb.org/anthology/D16-1058

  53. Wilson T, Wiebe J, Hwa R (2004) Just how mad are you? finding strong and weak opinion clauses. In: aaai, vol. 4, pp. 761–769

  54. Xu H, Liu B, Shu L, Yu PS (2018) Double embeddings and CNN-based sequence labeling for aspect extraction. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 592–598. Association for Computational Linguistics, Melbourne, Australia. https://doi.org/10.18653/v1/P18-2094. https://aclanthology.org/P18-2094

  55. Xue W, Li T (2018) Aspect based sentiment analysis with gated convolutional networks. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 2514–2523. Association for Computational Linguistics, Melbourne, Australia. https://doi.org/10.18653/v1/P18-1234. https://aclanthology.org/P18-1234

  56. Yang C, Zhang H, Jiang B, Li K (2019) Aspect-based sentiment analysis with alternating coattention networks. Inf Process Manage 56(3):463–478. https://doi.org/10.1016/j.ipm.2018.12.004

  57. Yin W, Schütze H, Xiang B, Zhou B (2016) Abcnn: Attention-based convolutional neural network for modeling sentence pairs. Transactions of the Association for Computational Linguistics 4:259–272. https://doi.org/10.1162/tacl_a_00097. https://aclanthology.org/Q16-1019

  58. Yin W, Yu M, Xiang B, Zhou B, Schütze H (2016) Simple question answering by attentive convolutional neural network. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 1746–1756. The COLING 2016 Organizing Committee, Osaka, Japan. https://aclanthology.org/C16-1164

Download references

Funding

This work was partially financially supported by TEQIP-III, REC Ambedkar Nagar, UP, India.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ashish Kumar.

Ethics declarations

Ethics approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Financial interests

The authors have no relevant financial or non-financial interests to disclose.

Conflicts of interests/Competing interests

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Verma, S., Kumar, A. & Sharan, A. MuCon: Multi-channel convolution for targeted sentiment classification. Multimed Tools Appl 83, 28615–28633 (2024). https://doi.org/10.1007/s11042-023-16586-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-16586-1

Keywords

Navigation