Skip to main content

Assembling the Optimal Sentiment Classifiers

  • Conference paper
Web Information Systems Engineering - WISE 2012 (WISE 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7651))

Included in the following conference series:

Abstract

Sentiment classification aims to classify documents according to their overall sentiment orientation, which plays an important role in many web applications, such as electronic commerce. Machine learning is an effective method for such tasks. In general, a classifier is determined by a feature type, a weighting function and a classification algorithm for a given training set. Thus, users are required to predetermine which ones should be applied, that is a troublesome problem for them, because each classifier always achieves different performance for different domains. To deal with this problem, we develop a three phase framework based on assembling multiple classifiers. In order to choose the optimal combination of classifiers, we propose a criterion for estimating the quality of the combination based on sentiment classification accuracy and diversity of the results generated by these classifiers. Moreover, we study the effect of the number of classifiers selected experimentally. With our solution, users can achieve a good performance without making a choice among plentiful combinations of different classifiers. We perform extensive experiments to demonstrate the effectiveness of our solution for different domains.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Martineau, J., Finin, T., Joshi, A., Patel, S.: Improve binary classification on text problems using differential word features. In: CIKM 2009, pp. 2019–2024 (2009)

    Google Scholar 

  2. Dasgupta, S., Ng, V.: Mine the easy, classify the hard: a semi-supervised approach to automatic sentiment classification. In: Proc. of the 47th ACL and the 4th IJCNLP of the AFNLP, pp. 701–709 (2009)

    Google Scholar 

  3. Wei, W., Gulla, J.A.: Sentiment Learning on product reviews via sentiment ontology tree. In: Proc. of the 48th ACL, pp. 404–413 (2010)

    Google Scholar 

  4. Tan, C., Lee, L., Tang, J., Jiang, L., Zhou, M., Li, P.: User-level sentiment analysis incorporating social networks. In: KDD 2011, pp. 1397–1405 (2011)

    Google Scholar 

  5. Lin, Y., Zhang, J., Wang, X., Zhou, A.: Sentiment classification via integrating multiple feature presentation. In: WWW 2012, pp. 569–570 (2012)

    Google Scholar 

  6. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment Classification Using Maching Learning Technique. In: Proc. of the 7th EMNLP, pp. 79–86 (2002)

    Google Scholar 

  7. Lin, Y., Zhang, J., Wang, X., Zhou, A.: An information theoretic approach to sentiment polarity classification. In: WebQuarity 2012, pp. 35–40 (2012)

    Google Scholar 

  8. Matsumoto, S., Takamura, H., Okumura, M.: Sentiment Classification Using Word Sub-sequences and Dependency Sub-trees. In: Ho, T.-B., Cheung, D., Liu, H. (eds.) PAKDD 2005. LNCS (LNAI), vol. 3518, pp. 301–311. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  9. Alm, C.O., Roth, D., Sproat, R.: Emotions from text: machine learning for text-based emotion prediction. In: HLT/EMNLP, pp. 579–586 (2005)

    Google Scholar 

  10. Alec Go, Richa Bhayani, Lei Huang: Twitter sentiment classification using distant supervison. Technical report, Stanford (2009)

    Google Scholar 

  11. Agarwal, A., Xie, B., Vovsha, I., Rambow, O., Passonneau, R.: Sentiment analysis of Twotter data. In: LSM, pp. 30–38 (2011)

    Google Scholar 

  12. Paltoglou, G., Thelwall, M.: A study of informationretrieval weigheing schemes for sentiment analysis. In: Proc. of the 48th ACL, pp. 1386–1395 (2010)

    Google Scholar 

  13. Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: Introduction to wordnet: An on-line lexical database. International Journal of Lexicography 3(4), 235–312 (1990)

    Article  Google Scholar 

  14. Yang, Y., Pedersen, J.O.: A comparative study on feature selection in text categorization. In: Proc. 14th ICML, pp. 412–420 (1997)

    Google Scholar 

  15. Hassan, A., Radev, D.: Identify text polarity using random walks. In: Proc. of the 48th ACL, pp. 395–403 (2010)

    Google Scholar 

  16. Kamps, J., Marx, M., Mokken, R.J., Rijke, M.D.: Using wordnet to measure semantic orientations of adjectives. In: Proc. of the 4th LREC, pp. 1115–1118 (2004)

    Google Scholar 

  17. Tan, C., Lee, L., Tang, J., Jiang, L., Zhou, M., Li, P.: User-level sentiment analysis incorporating social networks. In: KDD 2011, pp. 1397–1405 (2011)

    Google Scholar 

  18. Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boo-boxes and blenders: Domain adaptation for sentiment classification. In: Proc. of the 45th ACL, pp. 440–447 (2007)

    Google Scholar 

  19. Džeroski, S., Ženko, B.: Is Combining Classifiers with Stacking Better than Selecting the Best One? Machine Learning 54(3), 255–273 (2004)

    Article  MATH  Google Scholar 

  20. Fleiss, J.L., Levin, B.: Statistical Methods for Rates and Proportions, 3rd edn. Wiley, New York (2003)

    Book  MATH  Google Scholar 

  21. Ženko, B., Todorovski, L., Džeroski, S.: A comparison of stacking with MDTs to bagging, boosting, and other stacking methods. In: ICDM 2001, pp. 669–670 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lin, Y., Wang, X., Zhang, J., Zhou, A. (2012). Assembling the Optimal Sentiment Classifiers. In: Wang, X.S., Cruz, I., Delis, A., Huang, G. (eds) Web Information Systems Engineering - WISE 2012. WISE 2012. Lecture Notes in Computer Science, vol 7651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35063-4_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35063-4_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35062-7

  • Online ISBN: 978-3-642-35063-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics