Skip to main content

An Efficient Hybrid Model for Vietnamese Sentiment Analysis

  • Conference paper
  • First Online:
Intelligent Information and Database Systems (ACIIDS 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10191))

Included in the following conference series:

Abstract

Sentiment analysis from the text is an exciting and challenging task which can be useful in many applications of exploiting people interests for improving the quality of services. Especially, text collected from social networks, websites or forums is usually represented by spoken language that is unstructured and difficult to handle. In this paper, we present a novel hybrid model that is based on Hierarchical Dirichlet Process (HDP) and adopts a combination of lexicon-based and Support Vector Machine (SVM) methods in the task of topic-based sentiment classification for Vietnamese text. The proposed model has been evaluated on five different topic-datasets, and the experimental results show the efficiency of our proposed model when the average accuracy is nearly 87%. Although this proposed model is initially designed for Vietnamese language, it is applicable and adaptable to other languages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Khan, F.H., Qamar, U., Bashir, S.: SentiMI: introducing point-wise mutual information with SentiWordNet to improve sentiment polarity detection. Appl. Soft Comput. J. 39, 140–153 (2016)

    Article  Google Scholar 

  2. Al-Rowaily, K., Abulaish, M., Haldar, N.A.-H., Al-Rubaian, M.: BiSAL - a bilingual sentiment analysis lexicon to analyze Dark Web forums for cyber security. Digital Invest. 14, 53–62 (2015)

    Article  Google Scholar 

  3. Ko, Y.: A study of term weighting schemes using class information for text classification. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2012 (2012)

    Google Scholar 

  4. Khan, F.H., Bashir, S., Qamar, U.: TOM: twitter opinion mining framework using hybrid classification scheme. Decis. Support Syst. 57, 245–257 (2014)

    Article  Google Scholar 

  5. Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)

    MATH  Google Scholar 

  6. Scott, D., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41, 391–407 (1990)

    Article  Google Scholar 

  7. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  8. Liang, J., Liu, P., Tan, J., Ba, S.: Sentiment classification based on AS-LDA model. In: Proceeding of 2nd International Conference on Information Technology and Quantitative Management, vol. 31, pp. 511–516 (2014)

    Google Scholar 

  9. Li, F., Huang, M., Zhu, X.: Sentiment analysis with global topics and local dependency. In: Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (2010)

    Google Scholar 

  10. Raja Mohana, S.P., Umamaheswari, K., Karthiga, R.: Sentiment classification based on latent Dirichlet allocation. Int. J. Comput. Appl. 14–16 (2015). International Conference on Innovations in Computing Techniques

    Google Scholar 

  11. Bin, L., Ott, M., Cardie, C., Tsou, B.: Multi-aspect sentiment analysis with topic models. In: Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshop, pp. 81–88 (2011)

    Google Scholar 

  12. Bagheri, A., Saraee, M.: Latent Dirichlet Markov allocation for sentiment analysis. In: Fifth European Conference on Intelligent Management Systems in Operations, Salford, UK, pp. 90–96 (2013)

    Google Scholar 

  13. Whye, T.Y., Jordan Michael, I., Beal Matthew, J., Blei, D.M.: Hierarchical Dirichlet processes. J. Am. Stat. Assoc. 101, 1566–1581 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  14. Hông Phuong, L., Thi Minh Huyên, N., Roussanaly, A., Vinh, H.T.: A hybrid approach to word segmentation of Vietnamese texts. In: Martín-Vide, C., Otto, F., Fernau, H. (eds.) LATA 2008. LNCS, vol. 5196, pp. 240–249. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88282-4_23

    Chapter  Google Scholar 

Download references

Acknowledgments

We would like to thank TIS Inc. (www.tis.com) for supporting and funding this research.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Hoang Anh Pham or Thanh Van Le .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Vo, T.H., Nguyen, T.T., Pham, H.A., Le, T.V. (2017). An Efficient Hybrid Model for Vietnamese Sentiment Analysis. In: Nguyen, N., Tojo, S., Nguyen, L., Trawiński, B. (eds) Intelligent Information and Database Systems. ACIIDS 2017. Lecture Notes in Computer Science(), vol 10191. Springer, Cham. https://doi.org/10.1007/978-3-319-54472-4_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-54472-4_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-54471-7

  • Online ISBN: 978-3-319-54472-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics