Abstract
Sentiment analysis from the text is an exciting and challenging task which can be useful in many applications of exploiting people interests for improving the quality of services. Especially, text collected from social networks, websites or forums is usually represented by spoken language that is unstructured and difficult to handle. In this paper, we present a novel hybrid model that is based on Hierarchical Dirichlet Process (HDP) and adopts a combination of lexicon-based and Support Vector Machine (SVM) methods in the task of topic-based sentiment classification for Vietnamese text. The proposed model has been evaluated on five different topic-datasets, and the experimental results show the efficiency of our proposed model when the average accuracy is nearly 87%. Although this proposed model is initially designed for Vietnamese language, it is applicable and adaptable to other languages.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Khan, F.H., Qamar, U., Bashir, S.: SentiMI: introducing point-wise mutual information with SentiWordNet to improve sentiment polarity detection. Appl. Soft Comput. J. 39, 140–153 (2016)
Al-Rowaily, K., Abulaish, M., Haldar, N.A.-H., Al-Rubaian, M.: BiSAL - a bilingual sentiment analysis lexicon to analyze Dark Web forums for cyber security. Digital Invest. 14, 53–62 (2015)
Ko, Y.: A study of term weighting schemes using class information for text classification. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2012 (2012)
Khan, F.H., Bashir, S., Qamar, U.: TOM: twitter opinion mining framework using hybrid classification scheme. Decis. Support Syst. 57, 245–257 (2014)
Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)
Scott, D., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41, 391–407 (1990)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Liang, J., Liu, P., Tan, J., Ba, S.: Sentiment classification based on AS-LDA model. In: Proceeding of 2nd International Conference on Information Technology and Quantitative Management, vol. 31, pp. 511–516 (2014)
Li, F., Huang, M., Zhu, X.: Sentiment analysis with global topics and local dependency. In: Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (2010)
Raja Mohana, S.P., Umamaheswari, K., Karthiga, R.: Sentiment classification based on latent Dirichlet allocation. Int. J. Comput. Appl. 14–16 (2015). International Conference on Innovations in Computing Techniques
Bin, L., Ott, M., Cardie, C., Tsou, B.: Multi-aspect sentiment analysis with topic models. In: Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshop, pp. 81–88 (2011)
Bagheri, A., Saraee, M.: Latent Dirichlet Markov allocation for sentiment analysis. In: Fifth European Conference on Intelligent Management Systems in Operations, Salford, UK, pp. 90–96 (2013)
Whye, T.Y., Jordan Michael, I., Beal Matthew, J., Blei, D.M.: Hierarchical Dirichlet processes. J. Am. Stat. Assoc. 101, 1566–1581 (2006)
Hông Phuong, L., Thi Minh Huyên, N., Roussanaly, A., Vinh, H.T.: A hybrid approach to word segmentation of Vietnamese texts. In: MartÃn-Vide, C., Otto, F., Fernau, H. (eds.) LATA 2008. LNCS, vol. 5196, pp. 240–249. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88282-4_23
Acknowledgments
We would like to thank TIS Inc. (www.tis.com) for supporting and funding this research.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Vo, T.H., Nguyen, T.T., Pham, H.A., Le, T.V. (2017). An Efficient Hybrid Model for Vietnamese Sentiment Analysis. In: Nguyen, N., Tojo, S., Nguyen, L., Trawiński, B. (eds) Intelligent Information and Database Systems. ACIIDS 2017. Lecture Notes in Computer Science(), vol 10191. Springer, Cham. https://doi.org/10.1007/978-3-319-54472-4_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-54472-4_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54471-7
Online ISBN: 978-3-319-54472-4
eBook Packages: Computer ScienceComputer Science (R0)