Skip to main content

Diversifying Search Results Using Time

An Information Retrieval Method for Historians

  • Conference paper
Advances in Information Retrieval (ECIR 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9626))

Included in the following conference series:

Abstract

Getting an overview of a historic entity or event can be difficult in search results, especially if important dates concerning the entity or event are not known beforehand. For such information needs, users benefit if returned results covered diverse dates, thus giving an overview of what has happened throughout history. Such a method can be a building block for applications, for instance, in digital humanities. We describe an approach to diversify search results using temporal expressions (e.g., 1990s) from their contents. Our approach first identifies time intervals of interest to the given keyword query based on pseudo-relevant documents. It then re-ranks query results so as to maximize the coverage of identified time intervals. We present a novel and objective evaluation for our proposed approach. We test the effectiveness of our methods on The New York Times Annotated corpus and the Living Knowledge corpus, collectively consisting of around 6 million documents. Using history-oriented queries and encyclopedic resources we show that our method is able to present search results diversified along time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://en.wikipedia.org/.

  2. 2.

    http://livingknowledge.europarchive.org/.

  3. 3.

    https://catalog.ldc.upenn.edu/LDC2008T19.

  4. 4.

    https://www.elastic.co/.

  5. 5.

    http://usatoday30.usatoday.com/news/top25-influential.htm.

  6. 6.

    http://www.berouge.com/Pages/default.aspx.

References

  1. Agrawal, R., et al.: Diversifying search results. In: WSDM (2009)

    Google Scholar 

  2. Berberich, K., Bedathur, S.: Temporal diversification of search results. In: TAIA (2013)

    Google Scholar 

  3. Berberich, K., Bedathur, S., Alonso, O., Weikum, G.: A language modeling approach for temporal information needs. In: Gurrin, C., He, Y., Kazai, G., Kruschwitz, U., Little, S., Roelleke, T., Rüger, S., van Rijsbergen, K. (eds.) ECIR 2010. LNCS, vol. 5993, pp. 13–25. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  4. Campos, R.: Survey of temporal information retrieval, related applications. ACM Comput. Surv. 47(2), 15:1–15:41 (2014)

    Article  Google Scholar 

  5. Carbonell, J.G., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: SIGIR (1998)

    Google Scholar 

  6. Chang, A.X., Manning, C.D: A library for recognizing and normalizing time expressions. In: LREC, SUTIME (2012)

    Google Scholar 

  7. Gupta, D., Berberich, K.: Identifying time intervals of interest to queries. In: CIKM (2014)

    Google Scholar 

  8. Joho, H., et al.: NTCIR temporalia: A test collection for temporal information access research. In: WWW (2014)

    Google Scholar 

  9. Mazur, P.P., Dale, R.: A new corpus for research on temporal expressions. In: EMNLP, Wikiwars (2010)

    Google Scholar 

  10. Nguyen, T.N., Kanhabua, N.: Leveraging dynamic query subtopics for time-aware search result diversification. In: de Rijke, M., Kenter, T., de Vries, A.P., Zhai, C.X., de Jong, F., Radinsky, K., Hofmann, K. (eds.) ECIR 2014. LNCS, vol. 8416, pp. 222–234. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  11. Gupta, D., Berberich, K.: Diversifying search results using time. Research Report MPI-I–5-001 (2016)

    Google Scholar 

  12. Lin, C.Y.: Rouge: A package for automatic evaluation of summaries. In: ACL (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Klaus Berberich .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Gupta, D., Berberich, K. (2016). Diversifying Search Results Using Time. In: Ferro, N., et al. Advances in Information Retrieval. ECIR 2016. Lecture Notes in Computer Science(), vol 9626. Springer, Cham. https://doi.org/10.1007/978-3-319-30671-1_69

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-30671-1_69

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-30670-4

  • Online ISBN: 978-3-319-30671-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics