ABSTRACT
This paper aims to find blog feeds having a principal inclination towards making opinionated comments on the given topic, so that we can subscribe to them to track influential and interesting opinions in the blogosphere. One major challenge is assigning topic-related opinion scores to blog feeds, which is embodied in two aspects. Firstly, we should identify whether the blog feed has a principal on-topic opinionated inclination. This inclination should be collectively revealed by all posts of the feed. We should fully consider evidences from all the posts of the feed to identify salient information among many posts of the feed. Secondly, we should capture topic-related opinions in the blog feed while ignoring irrelevant opinions.
In this paper, we propose a unified framework for opinionated blog feed retrieval, which combines topic relevance and opinion scores with a generative model. Furthermore, we propose a language modeling approach to estimating opinion scores that is seamlessly integrated into the framework, where two language models, Topic-specific Opinion Model (TOM) and Topic-biased Feed Model (TFM), work collectively to reflect whether the blog feed shows a principal on-topic opinionated inclination. To estimate TFM, we propose a topic-biased random walk to exploit both content and structural information to capture topic-biased salient information in the feed. As for TOM estimation, we propose to use a generative mixture model with prior guidance to effectively capture topic-specific opinion expressing language usage. The conducted experiments in the context of the TREC 2009-2010 Blog Track show the effectiveness of our proposed approaches.
- Balog, K., Rijke, M., and Weerkamp, W. 2008. Bloggers as Experts Feed Distillation using Expert Retrieval Models. In Proceedings of SIGIR 2008. Google ScholarDigital Library
- Elsas, J. L. , Arguello, J., Callan, J., and Carbonell, G. J. 2008. Retrieval and feedback models for blog feed search. In Proceedings of SIGIR '08. Google ScholarDigital Library
- Esuli, A., and Sebastiani, F.2005. Determining the semantic orientation of terms through gloss classification. In Proceedings of CIKM 2005. Google ScholarDigital Library
- Gerani, S., Carman, M. J., and Crestani, F. 2010. Proximity-based opinion retrieval. In Proceeding of SIGIR '10. Google ScholarDigital Library
- He, B., Macdonald, C., He, J., and Ounis, I. 2008. An effective statistical approach to blog post opinion retrieval. In Proceeding of CIKM '08. Google ScholarDigital Library
- Huang, X., and Croft, W. B. 2009. A unified relevance model for opinion retrieval. In Proceedings of CIKM 2009, pages 947--956. Google ScholarDigital Library
- Jia, L., Yu, C. 2010. UIC at TREC 2010 Faceted Blog Distillation. In Proceedings of TREC 2010.Google Scholar
- Jiang, P., Zhang, C., Yang, Q., and Niu, Z. 2010. Blog Opinion Retrieval Based on Topic-Opinion Mixture Model. In Proceedings of PAKDD 2010. Google ScholarDigital Library
- Keikha, M., Mahdabi, P., Gerani, S., Inches, G., Carman, M., Crestani, F., and Parapar, J. 2010. University of Lugano at TREC 2010. In Proceedings of TREC 2010.Google Scholar
- Li, S., Gao, H., Sun, H., Chen, F., Feng, O., Gao, S., Zhang, H., Li, X., Tan, C., Xu, W., Chen, G., and Guo, J. 2009. A Study of Faceted Blog Distillation - PRIS at TREC 2009 Blog Track. In Proceedings of TREC 2009.Google Scholar
- Macdonald, C., Ounis, I., and Soboroff, I. 2007. Overview of the TREC 2007 Blog track. In Proceedings of TREC 2007.Google Scholar
- Macdonald, C., and Ounis, I. 2008. Key blog distillation: ranking aggregates. In Proceedings CIKM 2008. Google ScholarDigital Library
- Macdonald, C., Ounis, I., and Soboroff, I. 2009. Overview of the TREC-2009 Blog Track. In Proceedings of TREC 2009.Google Scholar
- Macdonald, C., Santos, R. L. T., Ounis, I., and Soboroff, I. 2010. Blog track research at TREC. SIGIR Forum 44, 58--75. Google ScholarDigital Library
- McCreadie, R., Macdonald, C., Ounis, I., Peng, J., and Santos, R. 2009. University of Glasgow at TREC 2009: Experiments with Terrier. In Proceedings of TREC 2009.Google Scholar
- Mei, Q., Ling, X., Wondra, M., Su, H., and Zhai, C. 2007. Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of WWW '07, 171--180. Google ScholarDigital Library
- Na, S.-H., Lee, Y. , Nam, S.-H., and Lee, J.-H. 2009. Improving opinion retrieval based on query-specific sentiment lexicon. In Proceedings of ECIR 2009, pages 734--738. Google ScholarDigital Library
- Ounis, I., Macdonald, C., Rijke, M. de., Mishne, G., and Soboroff, I. 2006. Overview of the TREC 2006 Blog track. In Proceedings of the 15th Text REtrieval Conference.Google Scholar
- Ounis, I., Macdonald, C., and Soboroff, I. 2008. Overview of the TREC-2008 Blog Track. In Proceedings of TREC'08.Google Scholar
- Ounis, I., Macdonald, C., and Soboroff, I. 2010. Overview of the TREC-2010 Blog Track (Preliminary). In Proceedings of TREC 2010.Google Scholar
- Sanderson, J. 2008. The Blog is Serving Its Purpose: Self-Presentation Strategies on 38pitches.com. Journal of Computer-Mediated Communication. Jul 2008, Vol. 13, No. 4: 912--936.Google ScholarCross Ref
- Santos, R. L. T. , He, B., Macdonald, C., and Ounis, I. 2009. Integrating proximity to subjective sentences for blog opinion retrieval. In Proceedings of ECIR 2009, pages 325--336. Google ScholarDigital Library
- Seki, K., and Uehara, K. 2009. Adaptive subjective triggers for opinionated document retrieval. In Proceedings WSDM '09, 25--33. Google ScholarDigital Library
- Song, L., Cheng, X., Guo, Y., Liu, L., and Ding, G. 2009. ContentEx: A Framework for Automatic Content Extraction Programs. In Proceedings of ISI'2009. Google ScholarDigital Library
- Vechtomova, O. 2010. Facet-based opinion retrieval from blogs. Information Processing and Management, 46(1):71--88. Google ScholarDigital Library
- Wei, F., Li, W., Lu, Q., and He, Y. 2008. Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization. In Proceedings of SIGIR '08. Google ScholarDigital Library
- Xu, X., Meng, T., Cheng, X., and Liu, Y. 2011. A probabilistic model for opinionated blog feed retrieval. In Proceedings of the 20th international conference companion on World wide web (WWW '11). Google ScholarDigital Library
- Zhai C., and Lafferty, J. 2001. Model-based feedback in the language modeling approach to information retrieval. In Proceedings of CIKM 2001. Google ScholarDigital Library
- Zhai, C., and Lafferty, J. 2004. A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems, Vol. 22, No. 2, 179--214. Google ScholarDigital Library
- Zhai, C., Velivelli, A., and Yu, B. 2004. A Cross-Collection Mixture Model for Comparative Text Mining. In Proceedings of KDD 2004. Google ScholarDigital Library
- Zhang, W., Yu, C., and Meng, W. 2007. Opinion retrieval from blogs. In Proceedings of CIKM 2007, pages 831--840. Google ScholarDigital Library
- Zhang, M., and Ye, X. 2008. A generation model to unify topic relevance and lexicon-based sentiment for opinion retrieval. In Proceedings of SIGIR 2008, pages 411--418. Google ScholarDigital Library
- Zhou, Z., Zhang, X.,Vines, P. 2010. RMIT at TREC 2010 Blog Track: Faceted Blog Distillation Task. Online Proceedings of TREC 2010.Google Scholar
Index Terms
- Find me opinion sources in blogosphere: a unified framework for opinionated blog feed retrieval
Recommendations
Identifying opinion leaders in the blogosphere
CIKM '07: Proceedings of the sixteenth ACM conference on Conference on information and knowledge managementOpinion leaders are those who bring in new information, ideas, and opinions, then disseminate them down to the masses, and thus influence the opinions and decisions of others by a fashion of word of mouth. Opinion leaders capture the most representative ...
A probabilistic model for opinionated blog feed retrieval
WWW '11: Proceedings of the 20th international conference companion on World wide webIn this poster, we study the problem of Opinionated Blog Feed Retrieval which can be considered as a particular type of the faceted blog distillation introduced by TREC 2009. It is a task of finding blogs not only having a principle and recurring ...
Topic sentiment mixture: modeling facets and opinions in weblogs
WWW '07: Proceedings of the 16th international conference on World Wide WebIn this paper, we define the problem of topic-sentiment analysis on Weblogs and propose a novel probabilistic model to capture the mixture of topics and sentiments simultaneously. The proposed Topic-Sentiment Mixture (TSM) model can reveal the latent ...
Comments