Abstract
An effective information retrieval system must satisfy different users search intentions expecting a variety of queries categories, comprising recency sensitive queries where fresh content is the major user’s requirement. However, using temporal features of documents to measure their freshness remains a hard task since these features may not be accurately represented in recent documents. In this paper, we propose a language model which estimates the topical relevance and freshness of documents with respect to real-time sensitive queries. In order to improve recency ranking, our approach models freshness by exploiting terms extracted from recently posted tweets topically relevant to each real-time sensitive query. In our experiments, we use these fresh terms to re-rank initial search results. Then, we compare our model with two baseline approaches which integrate temporal relevance in their language models. Our results show that there is a clear advantage of using microblogs platforms, such as Twitter, to extract fresh keywords.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bai, J., Nie, J.Y., Cao, G.: Context-dependent term relations for information retrieval. In: Proc. Empirical Methods in Natural Language Processing, EMNLP 2006, pp. 551–559 (2006)
Ben Jabeur, L., Tamine, L., Boughanem, M.: Featured tweet search: Modeling time and social influence for microblog retrieval. In: Proceedings of International Conference on Web Intelligence, China, pp. 166–173 (2012)
Dai, N., Shokouhi, M., Davison, B.: Learning to rank for freshness and relevance. In: Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 95–104 (2011)
Dakka, W., Gravano, L., Ipeirotis, P.G.: Answering general time-sensitive queries. Proceedings of the IEEE Transactions on Knowledge and Data Engeneering 24, 220–235 (2012)
Dong, A., Chang, Y., Zheng, Z., Mishne, G., Bai, J., Zhang, R., Buchner, K., Liao, C., Diaz, F.: Towards recency ranking in web search. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining, WSDM 2010, pp. 11–20. ACM, New York (2010)
Efron, M., Golovchinsky, G.: Estimation methods for ranking recent information. In: Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 495–504 (2011)
Huo, W., Tsotras, V.J.: Temporal top-k search in social tagging sites using multiple social networks. In: Kitagawa, H., Ishikawa, Y., Li, Q., Watanabe, C. (eds.) DASFAA 2010. LNCS, vol. 5981, pp. 498–504. Springer, Heidelberg (2010)
Viera, A.J., Garrett, J.M.: Understanding interobserver agreement: The kappa statistic. Family Medecine Research Series 37(5), 360–363 (2005)
Karkali, M., Plachouras, V., Vazirgiannis, M., Stefanatos, C.: Keeping keywords fresh: A bm25 variation for personalized keyword extraction. In: Proceedings of the 2nd Temporal Web Analytics Workshop, pp. 17–24 (2012)
Li, X., Croft, W.B.: Time-based language models. In: ACM (ed.) Proceedings of the Twelfth International Conference on Information and Knowledge Management, New York, NY, USA (2003)
Massoudi, K., Tsagkias, M., Rijke, M., Weerkamp, M.: Incorporating query expansion and quality indicators in searching microblog posts. In: Proceedings of the 33rd European Conference on IR Research, Dublin, Ireland, pp. 362–367 (2011)
Moon, T., Chu, W., Lihong, L., Zheng, Z., Chang, Y.: Online learning for recency search ranking using real-time user feedback. ACM Transactions on Information Systems 30(4), 20 (2010)
Ponte, J., Croft, W.: A language modeling approach to information retrieval. In: Proceedings of the 21st annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 275–281 (1998)
Wang, H., Dong, A., Li, L., Chang, Y.: Joint relevance and freshness learning from clickthroughs for news search. In: Proceedings of the 21st International Conference on World Wide Web, pp. 579–588 (2012)
Wang, H., Dong, A., Li, L., Chang, Y.: Joint relevance and freshness learning from clickthroughs for news search. In: Proceedings of the 21st International World Wide Web Conference Committee, pp. 579–588 (2012)
Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems 2(2), 179–214 (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Bambia, M., Faiz, R. (2015). FRel: A Freshness Language Model for Optimizing Real-Time Web Search. In: Silhavy, R., Senkerik, R., Oplatkova, Z., Prokopova, Z., Silhavy, P. (eds) Intelligent Systems in Cybernetics and Automation Theory. CSOC 2015. Advances in Intelligent Systems and Computing, vol 348. Springer, Cham. https://doi.org/10.1007/978-3-319-18503-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-18503-3_21
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18502-6
Online ISBN: 978-3-319-18503-3
eBook Packages: EngineeringEngineering (R0)