Abstract
Getting an overview of a historic entity or event can be difficult in search results, especially if important dates concerning the entity or event are not known beforehand. For such information needs, users benefit if returned results covered diverse dates, thus giving an overview of what has happened throughout history. Such a method can be a building block for applications, for instance, in digital humanities. We describe an approach to diversify search results using temporal expressions (e.g., 1990s) from their contents. Our approach first identifies time intervals of interest to the given keyword query based on pseudo-relevant documents. It then re-ranks query results so as to maximize the coverage of identified time intervals. We present a novel and objective evaluation for our proposed approach. We test the effectiveness of our methods on The New York Times Annotated corpus and the Living Knowledge corpus, collectively consisting of around 6 million documents. Using history-oriented queries and encyclopedic resources we show that our method is able to present search results diversified along time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, R., et al.: Diversifying search results. In: WSDM (2009)
Berberich, K., Bedathur, S.: Temporal diversification of search results. In: TAIA (2013)
Berberich, K., Bedathur, S., Alonso, O., Weikum, G.: A language modeling approach for temporal information needs. In: Gurrin, C., He, Y., Kazai, G., Kruschwitz, U., Little, S., Roelleke, T., Rüger, S., van Rijsbergen, K. (eds.) ECIR 2010. LNCS, vol. 5993, pp. 13–25. Springer, Heidelberg (2010)
Campos, R.: Survey of temporal information retrieval, related applications. ACM Comput. Surv. 47(2), 15:1–15:41 (2014)
Carbonell, J.G., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: SIGIR (1998)
Chang, A.X., Manning, C.D: A library for recognizing and normalizing time expressions. In: LREC, SUTIME (2012)
Gupta, D., Berberich, K.: Identifying time intervals of interest to queries. In: CIKM (2014)
Joho, H., et al.: NTCIR temporalia: A test collection for temporal information access research. In: WWW (2014)
Mazur, P.P., Dale, R.: A new corpus for research on temporal expressions. In: EMNLP, Wikiwars (2010)
Nguyen, T.N., Kanhabua, N.: Leveraging dynamic query subtopics for time-aware search result diversification. In: de Rijke, M., Kenter, T., de Vries, A.P., Zhai, C.X., de Jong, F., Radinsky, K., Hofmann, K. (eds.) ECIR 2014. LNCS, vol. 8416, pp. 222–234. Springer, Heidelberg (2014)
Gupta, D., Berberich, K.: Diversifying search results using time. Research Report MPI-I–5-001 (2016)
Lin, C.Y.: Rouge: A package for automatic evaluation of summaries. In: ACL (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Gupta, D., Berberich, K. (2016). Diversifying Search Results Using Time. In: Ferro, N., et al. Advances in Information Retrieval. ECIR 2016. Lecture Notes in Computer Science(), vol 9626. Springer, Cham. https://doi.org/10.1007/978-3-319-30671-1_69
Download citation
DOI: https://doi.org/10.1007/978-3-319-30671-1_69
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-30670-4
Online ISBN: 978-3-319-30671-1
eBook Packages: Computer ScienceComputer Science (R0)