Ensemble Learning for Sentiment Classification

Su, Ying; Zhang, Yong; Ji, Donghong; Wang, Yibing; Wu, Hongmiao

doi:10.1007/978-3-642-36337-5_10

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7717)

Included in the following conference series:

Workshop on Chinese Lexical Semantics

Abstract

This paper presents an ensemble learning method for sentiment classification of reviews. The diversity among machine learning algorithms for sentiment classification under different settings (different features, different weighting measures, and the modeling of negation) is investigated in three domains; this diversity leaves room for improving performance. An ensemble learning framework, stacked generalization (stacking), is then built on the different algorithms and settings and compared with majority voting. Based on the characteristics of reviews, an opinion summary of a review is also proposed, composed of the first two and last two sentences of the review. Results show that stacking is consistently effective across all three domains and outperforms majority voting, and that using the opinion summary improves performance further.
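The abstract names three ideas that a small example can make concrete: the opinion summary (first two and last two sentences), majority voting, and stacking. The sketch below is only an illustration under stated assumptions, not the authors' implementation. The function names, the naive regex sentence splitter, the `pos`/`neg` labels, and the lookup-table level-1 learner are all hypothetical choices; the abstract does not say which base algorithms or which meta-learner the paper uses. In stacked generalization, the base predictions used to fit the level-1 learner should come from held-out or cross-validated data.

```python
import re
from collections import Counter, defaultdict


def opinion_summary(review, head=2, tail=2):
    """Return the first `head` and last `tail` sentences of a review."""
    # Naive splitter (an assumption): break after ., ! or ? followed by whitespace.
    parts = re.split(r"(?<=[.!?])\s+", review.strip())
    sentences = [s.strip() for s in parts if s.strip()]
    if len(sentences) <= head + tail:
        return " ".join(sentences)  # short review: keep everything once
    return " ".join(sentences[:head] + sentences[-tail:])


def majority_vote(labels):
    """Combine base-classifier labels by simple plurality."""
    return Counter(labels).most_common(1)[0][0]


def fit_stacker(base_predictions, gold_labels):
    """Fit a toy level-1 learner: map each pattern of base predictions
    to the gold label seen most often with that pattern."""
    table = defaultdict(Counter)
    for preds, gold in zip(base_predictions, gold_labels):
        table[tuple(preds)][gold] += 1
    return {preds: counts.most_common(1)[0][0] for preds, counts in table.items()}


def stack_predict(stacker, preds):
    """Look up the learned label; fall back to majority voting if unseen."""
    return stacker.get(tuple(preds), majority_vote(preds))


# Hypothetical held-out predictions from three base classifiers.
# Classifier 0 happens to be right whenever the other two outvote it.
held_out = [
    ("pos", "neg", "neg"),
    ("pos", "neg", "neg"),
    ("neg", "pos", "pos"),
    ("pos", "pos", "pos"),
]
gold = ["pos", "pos", "neg", "pos"]
stacker = fit_stacker(held_out, gold)

pattern = ("pos", "neg", "neg")
print(majority_vote(pattern))           # follows the two weaker classifiers
print(stack_predict(stacker, pattern))  # follows what the held-out data showed

review = ("Great phone. Battery lasts long. The screen is sharp. "
          "Shipping was slow. Overall I love it. Would buy again.")
print(opinion_summary(review))
```

On this toy data, majority voting follows the two classifiers that agree, while the level-1 table has learned from held-out examples that the lone dissenting classifier is the reliable one for that pattern. This is one way to see why a learned combiner can do better than a fixed voting rule when base classifiers are diverse. A practical system would replace the lookup table with a trained classifier over the base outputs.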

Author information

Authors and Affiliations

Department of Computer and Electronic, Huazhong University of Science and Technology Wuchang Branch, Wuhan, P.R. China
Ying Su
Computer School, Wuhan University, Wuhan, 430072, P.R. China
Yong Zhang & Donghong Ji
Department of Computer Science, Huazhong Normal University, Wuhan, P.R. China
Yong Zhang
Third Faculty, Second Artillery Command College, P.R. China
Yibing Wang
School of Foreign Languages and Literature, Wuhan University, Wuhan, 430072, P.R. China
Hongmiao Wu

Editor information

Editors and Affiliations

Computer School, Wuhan University, 430072, Wuhan, China
Donghong Ji
College of Chinese Language and Literature, Wuhan University, 430072, Wuhan, China
Guozheng Xiao

About this paper

Cite this paper

Su, Y., Zhang, Y., Ji, D., Wang, Y., Wu, H. (2013). Ensemble Learning for Sentiment Classification. In: Ji, D., Xiao, G. (eds) Chinese Lexical Semantics. CLSW 2012. Lecture Notes in Computer Science, vol 7717. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36337-5_10

DOI: https://doi.org/10.1007/978-3-642-36337-5_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36336-8
Online ISBN: 978-3-642-36337-5
eBook Packages: Computer Science, Computer Science (R0)