Abstract
Microblog, as a way of online communication, can generate large amounts of information in a very short period. Therefore, how to retrieve the latest relevant information becomes a hot research area. Different from traditional information retrieval (IR), the microblog retrieval emphasizes fresh contents of the information. In order to solve this problem, we extend the traditional IR methods by taking into account the posting time. We propose a time-sensitive retrieval model, which takes the time factor as a prior probability. In the retrieval model, we introduce the pseudo relevance feedback technology as a query expansion approach to improve retrieval performance. Furthermore, we introduce a strategy to filter the initial retrieval results, which takes post quality factors into account including entropy and link features. Experiments on Twitter corpus show that our algorithm is effective to improve the retrieval performance, and the retrieval results can meet the real time retrieval need well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Efron, M.: Information search and retrieval in microblogs. Journal of the American Society for Information Science and Technology 62(6), 996–1008 (2011)
Cheong, M., Lee, V.: Integrating web-based intelligence retrieval and decision-making from the twitter trends knowledge base. In: Proceeding of the 2nd ACM Workshop on Social web Search and Mining, pp. 1–8 (2009)
Dong, A., Zhang, R., Kolari, P., et al.: Time is of the essence: improving recency ranking using Twitter data. In: Proceedings of the 19th International Conference on World Wide Web, pp. 331–340 (2010)
Efron, M.: Hashtag Retrieval in a microblogging environment. In: Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 787–788 (2010)
Evans, M., Chi, E.H.: Towards a model of understanding social search. In: Proceedings of the 2008 ACM Conference on Computer Supported Cooperative Work, pp. 485–494 (2008)
Geer, D.: Is It Really Time for Real-Time Search? Computer 43(3), 16–19 (2010)
Horowitz, D., Kamvar, S.D.: The anatomy of a large-scale social search engine. In: Proceedings of the 19th International Conference on World Wide Web, pp. 431–440 (2010)
Teevan, J., Ramage, D., Morris, M.R.: TwitterSearch: A Comparison of Microblog Search and Web Search. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 35–44 (2011)
Weng, J., Lim, E., Jiang, J., et al.: TwitterRank: finding topic-sensitive influential twitterers. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining, pp. 261–270 (2010)
Chen, C., Li, F., et al.: TI: An efficient indexing mechanism for real-time search on tweets. In: Proceedings of the 2011 International Conference on Management of Data, pp. 648–660 (2011)
Younus, A., Qureshi, M.A., Ghazi, A.N., et al.: Ins and Outs of News: Twitter as a Real-Time News Analysis Service. In: Proceedings of the Workshop on Visual Interfaces to the Social and Semantic Web (2011)
Massoudi, K., Tsagkias, M., Rijke, M.D., et al.: Incorporating Query Expansion and Quality Indicators in Searching Microblog Posts. In: The 33rd European Conference on Information Retrieval, pp. 362–367 (2011)
Nagmoti, R., Teredesai, A., Cock, M.D.: Ranking Approaches for Microblog Search. In: International Conference on Web Intelligence and Intelligent Agent Technology, pp. 153–157 (2010)
Meredith Ringel, M., Scott, C., Asta, R., Aaron, H., Julia, S.: Tweeting is Believing? Understanding Microblog Credibility Perceptions. In: The 12th Computer Supported Cooperative Work, pp. 441–450 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shi, C., Xu, B., Lin, H., Guo, Q. (2013). A Time-Sensitive Model for Microblog Retrieval. In: Zhou, G., Li, J., Zhao, D., Feng, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2013. Communications in Computer and Information Science, vol 400. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41644-6_39
Download citation
DOI: https://doi.org/10.1007/978-3-642-41644-6_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41643-9
Online ISBN: 978-3-642-41644-6
eBook Packages: Computer ScienceComputer Science (R0)