Skip to main content

A Two-Level Classifier Model for Sentiment Analysis

  • Conference paper
  • First Online:
Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom 2017)

Abstract

This paper proposes a fast and high performance classifier model for sentiment analysis of textual reviews. The key contribution is three fold. First, a two-level classifier model consists of three base classifiers is proposed, and theory proves that the model could be better than the strongest classifier among the base classifiers in both classification performance and time cost of predict. Second, this paper proposes a lexicon-based classifier as a base classifier using a new part of speech (POS) which is called “weaken words”. Finally, we implemented several two-level classifiers by combining the lexicon-based classifier with several machine learning classifiers. Experiments on Chinese reviews dataset show that the two-level classifier model is effective and efficient.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Breiman, L.: Bagging predictors. Mach. Learn. 24(2), 123–140 (1996)

    MathSciNet  MATH  Google Scholar 

  2. Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)

    Article  MathSciNet  Google Scholar 

  3. Pang, B., Lee, L.: Thumbs up? Sentiment classification using machine learning. In: Proceedings of EMNLP, pp. 79–86 (2002)

    Google Scholar 

  4. Turney, P.D.: Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Meeting on Association for Computational Linguistics Association for Computational Linguistics, pp. 417–424 (2002)

    Google Scholar 

  5. Zou, H., et al.: Sentiment classification using machine learning techniques with syntax features. In: International Conference on Computational Science and Computational Intelligence, pp. 175–179. IEEE (2016)

    Google Scholar 

  6. Kim, S.M., Hovy, E.: Determining the sentiment of opinions (2004)

    Google Scholar 

  7. Ding, X., Liu, B.: The utility of linguistic rules in opinion mining. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 811–812. ACM (2007)

    Google Scholar 

  8. Ding, X., Liu, B., Yu, P.S.: A holistic lexicon-based approach to opinion mining. In: International Conference on Web Search and Data Mining, pp. 231–240. ACM (2008)

    Google Scholar 

  9. Taboada, M., et al.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)

    Article  Google Scholar 

  10. Wang, H., Zaniolo, C.: CMP: a fast decision tree classifier using multivariate predictions. In: International Conference on Data Engineering, p. 449. IEEE Computer Society (2000)

    Google Scholar 

  11. Alsabti, K., Ranka, S., Singh, V.: CLOUDS: a decision tree classifier for large datasets. In: Knowledge Discovery and Data Mining (1998)

    Google Scholar 

  12. Shi-Ming, X.U., et al.: Efficient SVM Chinese Web page classifier based on pre-classification. In: Computer Engineering and Applications (2006)

    Google Scholar 

  13. Wu, F., Zheng, Y., Cheng, W.: Adaptive normalized weighted KNN text classification based on PSO. Scientific Bulletin of National Mining University (2016)

    Google Scholar 

  14. Dong, Z., Dong, Q.: HowNet—a hybrid language and knowledge resource. In: International Conference on Natural Language Processing and Knowledge Engineering Proceedings, pp. 820–824 (2003)

    Google Scholar 

  15. http://www.datatang.com/data/44317

  16. Liu, Q., Li, S.: Word similarity computing based on how-net. In: Computational Linguistics and Chinese Language Processing (2002)

    Google Scholar 

  17. Yang, K., Duan, Q.: Analysis of emotional tendency based on emotional dictionary. Comput. Era 3, 10–13 (2017)

    Google Scholar 

Download references

Acknowledgements

This work was supported by the National Science Foundation of China (Grant Nos. 61370059, 61772053 and 61232009 (key project)), Beijing Natural Science Foundation (Grant No. 4152030), the fund of the State Key Laboratory of Computer Architecture (CARCH201507), the fund of the State Key Laboratory of Software Development Environment (SKLSDE-2016ZX-15), the State Administration of Science Technology and Industry for National Defense, the major projects of high resolution earth observation system under Grant No. Y20A-E03, and the National Key R&D Program of China under Grant No. 2017YFB0202004.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Li Ruan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hao, H. et al. (2018). A Two-Level Classifier Model for Sentiment Analysis. In: Romdhani, I., Shu, L., Takahiro, H., Zhou, Z., Gordon, T., Zeng, D. (eds) Collaborative Computing: Networking, Applications and Worksharing. CollaborateCom 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 252. Springer, Cham. https://doi.org/10.1007/978-3-030-00916-8_64

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00916-8_64

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00915-1

  • Online ISBN: 978-3-030-00916-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics