Abstract
Microblog has become a popular platform for people to share their ideas, information and opinions. In addition to textual content data, social relations and user behaviors in microblog provide us additional link information, which can be used to improve the performance of sentiment analysis. However, traditional sentiment analysis approaches either focus on the plain text, or make simple use of links without distinguishing different effects of different types of links. As a result, the performance of sentiment analysis on microblog can not achieve obvious improvement. In this paper, we are the first to divide the links between microblogs into three classes. We further propose an unsupervised model called Content and Link Unsupervised Sentiment Model (CLUSM). CLUSM focuses on microblog sentiment analysis by incorporating the above three types of links. Comprehensive experiments were conducted to investigate the performance of our method. Experimental results showed that our proposed model outperformed the state of the art.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424. Association for Computational Linguistics (2002)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? sentiment classification using machine learning techniques. In: EMNLP, pp. 79–86 (2002)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177. ACM (2004)
Godbole, N., Srinivasaiah, M., Skiena, S.: Large-scale sentiment analysis for news and blogs. In: ICWSM, vol. 7 (2007)
Devitt, A., Ahmad, K.: Sentiment polarity identification in financial news: A cohesion-based approach. In: ACL (2007)
Wilson, T., Wiebe, J., Hoffmann, P.: Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 347–354. Association for Computational Linguistics (2005)
Ou, G., Chen, W., Liu, P., Wang, T., Yang, D., Lei, K., Liu, Y.: Aspect-specific polarity-aware summarization of online reviews. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds.) WAIM 2013. LNCS, vol. 7923, pp. 289–300. Springer, Heidelberg (2013)
Liu, B.: Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies 5(1), 1–167 (2012)
Hu, X., Tang, L., Tang, J., Liu, H.: Exploiting social relations for sentiment analysis in microblogging. In: WSDM, pp. 537–546 (2013)
Speriosu, M., Sudan, N., Upadhyay, S., Baldridge, J.: Twitter polarity classification with label propagation over lexical links and the follower graph. In: EMNLP, pp. 53–63 (2011)
Tan, C., Lee, L., Tang, J., Jiang, L., Zhou, M., Li, P.: User-level sentiment analysis incorporating social networks. In: KDD, pp. 1397–1405 (2011)
Jiang, L., Yu, M., Zhou, M., Liu, X., Zhao, T.: Target-dependent twitter sentiment classification. In: ACL, pp. 151–160 (2011)
Davidov, D., Tsur, O., Rappoport, A.: Enhanced sentiment learning using twitter hashtags and smileys. In: COLING (Posters), pp. 241–249 (2010)
Liu, K.L., Li, W.J., Guo, M.: Emoticon smoothed language models for twitter sentiment analysis. In: AAAI (2012)
Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision, Stanford. CS224N Project Report, pp. 1–12 (2009)
Agarwal, A., Xie, B., Vovsha, I., Rambow, O., Passonneau, R.: Sentiment analysis of twitter data. In: Proceedings of the Workshop on Languages in Social Media, pp. 30–38. Association for Computational Linguistics (2011)
Bermingham, A., Smeaton, A.F.: Classifying sentiment in microblogs: is brevity an advantage? In: CIKM, pp. 1833–1836 (2010)
Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: LREC (2010)
Mukherjee, S., Bhattacharyya, P.: Sentiment analysis in twitter with lightweight discourse analysis. In: COLING, pp. 1847–1864 (2012)
Kim, J., Yoo, J.B., Lim, H., Qiu, H., Kozareva, Z., Galstyan, A.: Sentiment prediction using collaborative filtering. In: ICWSM (2013)
Lu, Y., Wang, H., Zhai, C., Roth, D.: Unsupervised discovery of opposing opinion networks from forum discussions. In: CIKM, pp. 1642–1646 (2012)
Murakami, A., Raymond, R.: Support or oppose? classifying positions in online debates from reply activities and opinion expressions. In: COLING (Posters), pp. 869–875 (2010)
Gelfand, A.E., Smith, A.F.: Sampling-based approaches to calculating marginal densities. Journal of the American Statistical Association 85(410), 398–409 (1990)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 1–38 (1977)
Qi, G.J., Aggarwal, C.C., Huang, T.S.: On clustering heterogeneous social media objects with outlier links. In: WSDM, pp. 553–562 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Ou, G., Chen, W., Li, B., Wang, T., Yang, D., Wong, KF. (2014). CLUSM: An Unsupervised Model for Microblog Sentiment Analysis Incorporating Link Information. In: Bhowmick, S.S., Dyreson, C.E., Jensen, C.S., Lee, M.L., Muliantara, A., Thalheim, B. (eds) Database Systems for Advanced Applications. DASFAA 2014. Lecture Notes in Computer Science, vol 8421. Springer, Cham. https://doi.org/10.1007/978-3-319-05810-8_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-05810-8_32
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05809-2
Online ISBN: 978-3-319-05810-8
eBook Packages: Computer ScienceComputer Science (R0)