FRel: A Freshness Language Model for Optimizing Real-Time Web Search

Bambia, Mariem; Faiz, Rim

doi:10.1007/978-3-319-18503-3_21

Mariem Bambia⁷ &
Rim Faiz⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 348))

Included in the following conference series:

Computer Science On-line Conference

886 Accesses
1 Citations

Abstract

An effective information retrieval system must satisfy different users search intentions expecting a variety of queries categories, comprising recency sensitive queries where fresh content is the major user’s requirement. However, using temporal features of documents to measure their freshness remains a hard task since these features may not be accurately represented in recent documents. In this paper, we propose a language model which estimates the topical relevance and freshness of documents with respect to real-time sensitive queries. In order to improve recency ranking, our approach models freshness by exploiting terms extracted from recently posted tweets topically relevant to each real-time sensitive query. In our experiments, we use these fresh terms to re-rank initial search results. Then, we compare our model with two baseline approaches which integrate temporal relevance in their language models. Our results show that there is a clear advantage of using microblogs platforms, such as Twitter, to extract fresh keywords.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bai, J., Nie, J.Y., Cao, G.: Context-dependent term relations for information retrieval. In: Proc. Empirical Methods in Natural Language Processing, EMNLP 2006, pp. 551–559 (2006)
Google Scholar
Ben Jabeur, L., Tamine, L., Boughanem, M.: Featured tweet search: Modeling time and social influence for microblog retrieval. In: Proceedings of International Conference on Web Intelligence, China, pp. 166–173 (2012)
Google Scholar
Dai, N., Shokouhi, M., Davison, B.: Learning to rank for freshness and relevance. In: Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 95–104 (2011)
Google Scholar
Dakka, W., Gravano, L., Ipeirotis, P.G.: Answering general time-sensitive queries. Proceedings of the IEEE Transactions on Knowledge and Data Engeneering 24, 220–235 (2012)
Article Google Scholar
Dong, A., Chang, Y., Zheng, Z., Mishne, G., Bai, J., Zhang, R., Buchner, K., Liao, C., Diaz, F.: Towards recency ranking in web search. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining, WSDM 2010, pp. 11–20. ACM, New York (2010)
Google Scholar
Efron, M., Golovchinsky, G.: Estimation methods for ranking recent information. In: Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 495–504 (2011)
Google Scholar
Huo, W., Tsotras, V.J.: Temporal top-k search in social tagging sites using multiple social networks. In: Kitagawa, H., Ishikawa, Y., Li, Q., Watanabe, C. (eds.) DASFAA 2010. LNCS, vol. 5981, pp. 498–504. Springer, Heidelberg (2010)
Chapter Google Scholar
Viera, A.J., Garrett, J.M.: Understanding interobserver agreement: The kappa statistic. Family Medecine Research Series 37(5), 360–363 (2005)
Google Scholar
Karkali, M., Plachouras, V., Vazirgiannis, M., Stefanatos, C.: Keeping keywords fresh: A bm25 variation for personalized keyword extraction. In: Proceedings of the 2nd Temporal Web Analytics Workshop, pp. 17–24 (2012)
Google Scholar
Li, X., Croft, W.B.: Time-based language models. In: ACM (ed.) Proceedings of the Twelfth International Conference on Information and Knowledge Management, New York, NY, USA (2003)
Google Scholar
Massoudi, K., Tsagkias, M., Rijke, M., Weerkamp, M.: Incorporating query expansion and quality indicators in searching microblog posts. In: Proceedings of the 33rd European Conference on IR Research, Dublin, Ireland, pp. 362–367 (2011)
Google Scholar
Moon, T., Chu, W., Lihong, L., Zheng, Z., Chang, Y.: Online learning for recency search ranking using real-time user feedback. ACM Transactions on Information Systems 30(4), 20 (2010)
Google Scholar
Ponte, J., Croft, W.: A language modeling approach to information retrieval. In: Proceedings of the 21st annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 275–281 (1998)
Google Scholar
Wang, H., Dong, A., Li, L., Chang, Y.: Joint relevance and freshness learning from clickthroughs for news search. In: Proceedings of the 21st International Conference on World Wide Web, pp. 579–588 (2012)
Google Scholar
Wang, H., Dong, A., Li, L., Chang, Y.: Joint relevance and freshness learning from clickthroughs for news search. In: Proceedings of the 21st International World Wide Web Conference Committee, pp. 579–588 (2012)
Google Scholar
Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems 2(2), 179–214 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

LARODEC, ISG University of Tunis, Le Bardo, Tunisia
Mariem Bambia
LARODEC, IHEC University of Carthage, Carthage Presidency, Tunisia
Rim Faiz

Authors

Mariem Bambia
View author publications
You can also search for this author in PubMed Google Scholar
Rim Faiz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mariem Bambia .

Editor information

Editors and Affiliations

Faculty of Applied Informatics, Tomas Bata University in Zlín, Zlín, Czech Republic
Radek Silhavy
Faculty of Applied Informatics, Tomas Bata University in Zlín, Zlín, Czech Republic
Roman Senkerik
Faculty of Applied Informatics, Tomas Bata University in Zlín, Zlín, Czech Republic
Zuzana Kominkova Oplatkova
Faculty of Applied Informatics, Tomas Bata University in Zlín, Zlín, Czech Republic
Zdenka Prokopova
Faculty of Applied Informatics, Tomas Bata University in Zlín, Zlín, Czech Republic
Petr Silhavy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bambia, M., Faiz, R. (2015). FRel: A Freshness Language Model for Optimizing Real-Time Web Search. In: Silhavy, R., Senkerik, R., Oplatkova, Z., Prokopova, Z., Silhavy, P. (eds) Intelligent Systems in Cybernetics and Automation Theory. CSOC 2015. Advances in Intelligent Systems and Computing, vol 348. Springer, Cham. https://doi.org/10.1007/978-3-319-18503-3_21

Download citation

DOI: https://doi.org/10.1007/978-3-319-18503-3_21
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18502-6
Online ISBN: 978-3-319-18503-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics