Abstract
Sentiment analysis is a crucial step in the social media data analysis. The majority of research works on sentiment analysis focus on sentiment polarity detection which identifies whether an input text is positive, negative or neutral. In this paper, we have implemented a stacked ensemble approach to sentiment polarity detection in Bengali tweets. The basic concept of stacked generalization is to fuse the outputs of the first level base classifiers using a second-level Meta classifier in an ensemble. In our ensemble method, we have used two types of base classifiers- multinomial Naïve Bayes classifiers and SVM that make use of a diverse set of features. Our proposed approach shows an improvement over some existing Bengali sentiment analysis approaches reported in the literature.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bowker, J.: The Oxford Dictionary of World Religions. Oxford University Press, Oxford (1997)
Zhao, J., Liu, K., Wang, G.: Adding redundant features for CRFs-based sentence sentiment classification. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 117–126. Association for Computational Linguistics (2008)
Joachims, T.: Making large scale SVM learning practical. In: Schölkopf, B., Burges, C.J.C., der Smola, A. (eds.) Advances in Kernel Methods-Support Vector Learning. MITPress, Cambridge (1999)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)
Dave, K., Lawrence, S., Pennock, D.M.: Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: Proceedings of the 12th International Conference on World Wide Web, pp. 519–528. ACM (2003)
Mullen, T., Collier, N.: Sentiment analysis using support vector machines with diverse information sources. In: EMNLP, vol. 4, pp. 412–418 (2004)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008)
Goldberg, A.B., Zhu, X.: Seeing stars when there aren’t many stars: graph-based semi-supervised learning for sentiment categorization. In: Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing, pp. 45–52. Association for Computational Linguistics (2006)
Miao, Q., Li, Q., Zeng, D.: Fine grained opinion mining by integrating multiple review sources. J. Am. Soc. Inform. Sci. Technol. 61(11), 2288–2299 (2010)
Riloff, E., Wiebe, J.: Learning extraction patterns for subjective expressions. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pp. 105–112. Association for Computational Linguistics (2003)
Prabowo, R., Thelwall, M.: Sentiment analysis: a combined approach. J. Inf. 3(2), 143–157 (2009)
Narayanan, R., Liu, B., Choudhary, A.: Sentiment analysis of conditional sentences. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol. 1, pp. 180–189. Association for Computational Linguistics (2009)
Wiegand, M., Balahur, A., Roth, B., Klakow, D., Montoyo, A.: A survey on the role of negation in sentiment analysis. In: Proceedings of the Workshop on Negation and Speculation in Natural Language Processing, pp. 60–68. Association for Computational Linguistics (2010)
Ku, L.-W., Liang, Y.T., Chen, H-H.: Opinion extraction, summarization and tracking in news and blog corpora. In: AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs (2006)
Kim, J., Chern, G., Feng, D., Shaw, E., Hovy, E.: Mining and assessing discussions on the web through speech act analysis. In: Proceedings of the Workshop on Web Content Mining with Human Language Technologies at the 5th International Semantic Web Conference (2006)
Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 271 (2004)
Zhu, F., Zhang, X.: Impact of online consumer reviews on sales: the moderating role of product and consumer characteristics. J. Mark. 74(2), 133–148 (2010)
Melville, P., Gryc, W., Lawrence, R.D.: Sentiment analysis of blogs by combining lexical knowledge with text classification. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1275–1284. ACM (2009)
Ramakrishnan, G., Jadhav, A., Joshi, A., Chakrabarti, S., Bhattacharyya, P.: Question answering via Bayesian inference on lexical relations. In: Proceedings of the ACL 2003 Workshop on Multilingual Summarization and Question Answering, vol. 12, pp. 1–10. Association for Computational Linguistics (2003)
Jiao, J., Zhou, Y.: Sentiment Polarity Analysis based multi-dictionary. Phys. Procedia 22, 590–596 (2011)
Macdonald, C., Ounis, I.: The TREC Blogs06 collection: creating and analysing a blog test collection. Department of Computer Science, University of Glasgow Technical report TR-2006-224, 1, 3-1, (2006)
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, pp. 174–181. Association for Computational Linguistics, July 1997
Wiebe, J.: Learning subjective adjectives from corpora. In: AAAI/IAAI, pp. 735–740, July 2000
Yu, H., Hatzivassiloglou, V.: Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pp. 129–136. Association for Computational Linguistics, July 2003
Riloff, E., Wiebe, J.: Learning extraction patterns for subjective expressions. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pp. 105–112. Association for Computational Linguistics, July 2003
Esuli, A., Sebastiani, F.: SENTIWORDNET: a publicly available lexical resource for opinion mining. In: Proceedings of LREC, vol. 6, pp. 417–422, May 2006
Fellbaum, C.: WordNet. Blackwell Publishing Ltd., Hoboken (1999)
Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: ACL (2004)
Chen, C.C., Tseng, Y.D.: Quality evaluation of product reviews using an information quality framework. Decis. Support Syst. 50(4), 755–768 (2011)
Kang, H., Yoo, S.J., Han, D.: Senti-lexicon and improved Naïve Bayes algorithms for sentiment analysis of restaurant reviews. Expert Syst. Appl. 39(5), 6000–6010 (2012)
Clarke, D., Lane, P., Hender, P.: Developing robust models for favourability analysis. In: Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, pp. 44–52. Association for Computational Linguistics (2011)
Reyes, A., Rosso, P.: Making objective decisions from subjective data: detecting irony in customer reviews. Decis. Support Syst. 53(4), 754–760 (2012)
Moraes, R., Valiati, J.F., Neto, W.P.G.: Document-level sentiment classification: an empirical comparison between SVM and ANN. Expert Syst. Appl. 40(2), 621–633 (2013)
Martín-Valdivia, M.T., Martínez-Cámara, E., Perea-Ortega, J.M., Ureña-López, L.A.: Sentiment polarity detection in Spanish reviews combining supervised and unsupervised approaches. Expert Syst. Appl. 40(10), 3934–3942 (2013)
Sarkar, K., Chakraborty, S.: A sentiment analysis system for Indian Language Tweets. In: Prasath, R., Vuppala, A.K., Kathirvalavakumar, T. (eds.) MIKE 2015. LNCS (LNAI), vol. 9468, pp. 694–702. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-26832-3_66
Li, Y.M., Li, T.Y.: Deriving market intelligence from microblogs. Decis. Support Syst. 55(1), 206–217 (2013)
Patra, B.G., Das, D., Das, A., Prasath, R.: Shared task on Sentiment Analysis in Indian Languages (SAIL) tweets - an overview. In: Prasath, R., Vuppala, A.K., Kathirvalavakumar, T. (eds.) MIKE 2015. LNCS (LNAI), vol. 9468, pp. 650–655. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-26832-3_61
Sarkar, K., Bhowmik, M.: Sentiment polarity detection in bengali tweets using multinomial Naïve Bayes and support vector machines. In: CALCON 2017, Kolkata. IEEE (2017)
Sarkar, K.: Sentiment polarity detection in Bengali tweets using deep convolutional neural networks. J. Intell. Syst. 28(3), 377–386 (2018). https://doi.org/10.1515/jisys-2017-0418. Accessed 7 July 2019
Sarkar, K.: Using character N gram features and multinomial Naïve Bayes for sentiment polarity detection in Bengali tweets. In: Proceedings of Fifth International Conference on Emerging Applications of Information Technology (EAIT), Kolkata. IEEE (2018)
Sarkar, K.: Sentiment polarity detection in Bengali tweets using LSTM recurrent neural networks. In: Proceedings of Second International Conference on Advanced Computational and Communication Paradigms (ICACCP), Sikkim, India, 25–28 February 2019. IEEE (2019)
Das, A., Bandyopadhyay, S.: SentiWordNet for Indian languages. In: Proceedings of 8th Workshop on Asian Language Resources (COLING 2010), Beijing, China, pp. 56–63 (2010)
Vapnik, V.: Estimation of Dependences Based on Empirical Data, vol. 40. Springer-Verlag, New York (1982). https://doi.org/10.1007/0-387-34239-7
Acknowledgments
This research work has received support from the project entitled ‘‘Indian Social Media Sensor: an Indian Social Media Text Mining System for Topic Detection, Topic Sentiment Analysis and Opinion Summarization’’ funded by the Department of Science and Technology, Government of India under the SERB scheme.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Sarkar, K. (2020). A Stacked Ensemble Approach to Bengali Sentiment Analysis. In: Tiwary, U., Chaudhury, S. (eds) Intelligent Human Computer Interaction. IHCI 2019. Lecture Notes in Computer Science(), vol 11886. Springer, Cham. https://doi.org/10.1007/978-3-030-44689-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-44689-5_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-44688-8
Online ISBN: 978-3-030-44689-5
eBook Packages: Computer ScienceComputer Science (R0)