Abstract
In this work, we conduct a comparison study of the online review sentiment clustering problem from a combined perspective of data preprocessing, VSM modeling and clustering algorithm. To that end, we first introduce some methods for data preprocessing. Then, we explore the impacts of the term weighting models for review representation. Finally, we present detailed experiment results of some review clustering techniques. The conclusions would be valuable for both the study and usage of clustering methods in online review sentiment analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhu, F., (Michael) Zhang, X.: Impact of online consumer reviews on sales: The moderating role of product and consumer characteristics. Journal of Marketing 74(2), 133–148 (2010)
Yi, J., Nasukawa, T., Niblack, W., Bunescu, R.: Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In: Proceedings of the ICDM 2003, Florida, USA, pp. 427–434 (2003)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval 2(1-2), 1–135 (2008)
Prabowo, R., Thelwall, M.: Sentiment analysis: A combined approach. Journal of Informetrics 3, 143–157 (2009)
Anick, P., Vaithyanathan, S.: Exploiting Clustering and Phrases for Context-Based Information Retrieval. In: Proceedings of the 20th ACM SIGIR, pp. 314–323 (1997)
Salton, G., Wong, A., et al.: A vector space model for automatic indexing. Communication of the ACM 18(11), 613–620 (1975)
Liu, B.: Sentiment Analysis and Opinion Mining. Morgan and Claypool Publishers (May 2012)
Li, G., Liu, F.: Application of a clustering method on sentiment analysis. Journal of Information Science 38(2), 127–139 (2012)
Zhai, Z., Liu, B., Xu, H., Jia, P.: Clustering product features for opinion mining. In: Proceedings of the WSDM 2011, New York, USA, pp. 347–354 (2011)
Wang, D., Feng, S., Yan, C., Yu, G.: An approach of semi-automatic public sentiment analysis for opinion and district. In: Wang, L., Jiang, J., Lu, J., Hong, L., Liu, B. (eds.) WAIM 2011. LNCS, vol. 7142, pp. 210–222. Springer, Heidelberg (2012)
Whissell, J.S., Clarke, C.L.: Improving document clustering using Okapi BM25 feature weighting. Information Retrieval 14(5), 466–487 (2011)
Steinbach, M., Karypis, G., Kumar, V.: A comparison of document clustering techniques. In: KDD Workshop on Text Mining (2000)
Toutanova, K., Manning, C.D.: Enriching the knowledge sources used in a maximum entropy part-of-speech tagger. In: Proceedings of the EMNLP/VLC 2000, Hong Kong, pp. 63–70 (2000)
Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ma, B., Yuan, H., Wei, Q. (2013). A Comparison Study of Clustering Models for Online Review Sentiment Analysis. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds) Web-Age Information Management. WAIM 2013. Lecture Notes in Computer Science, vol 7923. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38562-9_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-38562-9_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38561-2
Online ISBN: 978-3-642-38562-9
eBook Packages: Computer ScienceComputer Science (R0)