skip to main content
10.1145/3487553.3524668acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Semantic Modelling of Document Focus-Time for Temporal Information Retrieval

Published: 16 August 2022 Publication History

Abstract

An accurate understanding of the temporal dynamics of Web content and user behaviors plays a crucial role during the interactive process between search engine and users. In this work, we focus on how to improve the retrieval performance via a better understanding of the time factor. On the one hand, we proposed a novel method to estimate the focus-time of documents leveraging their semantic information. On the other hand, we introduced query trend time for understanding the temporal intent underlying a search query based on Google Trend. Furthermore, we applied the proposed methods to two search scenarios: temporal information retrieval and temporal diversity retrieval. Our experimental results based on NTCIR Temporalia test collections show that: (1) Semantic information can be used to predict the temporal tendency of documents. (2) The semantic-based model works effectively even when few temporal expressions and entity names are available in documents. (3) The effectiveness of the estimated focus-time was comparable to that of the article’s publication time in relevance modelling, and thus, our method can be used as an alternative or supplementary tool when reliable publication dates are not available. (4) The trend time can improve the representation of temporal intents behind queries over query issue time.

References

[1]
Ashutosh Adhikari, Achyudh Ram, Raphael Tang, and Jimmy Lin. 2019. Docbert: Bert for document classification. arXiv preprint arXiv:1904.08398(2019).
[2]
Axel Almquist and Adam Jatowt. 2019. Towards Content Expiry Date Determination: Predicting Validity Periods of Sentences. In Advances in Information Retrieval, Leif Azzopardi, Benno Stein, Norbert Fuhr, Philipp Mayr, Claudia Hauff, and Djoerd Hiemstra (Eds.). Springer International Publishing, Cham, 86–101.
[3]
Omar Alonso, Jannik Strötgen, Ricardo Baeza-Yates, and Michael Gertz. 2011. Temporal Information Retrieval: Challenges and Opportunities.Twaw 11(2011), 1–8.
[4]
Ricardo Campos, Gaël Dias, Alípio Jorge, and Célia Nunes. 2016. GTE-Rank: A Time-aware Search Engine to Answer Time-sensitive Queries. Information Processing & Management 52, 2 (2016), 273–298. https://doi.org/10.1016/j.ipm.2015.07.006
[5]
Ricardo Campos, Gaël Dias, Alípio M. Jorge, and Adam Jatowt. 2014. Survey of Temporal Information Retrieval and Related Applications. ACM Comput. Surv. 47, 2, Article 15 (aug 2014), 41 pages. https://doi.org/10.1145/2619088
[6]
Ricardo Campos, Gaël Dias, Alípio Mário Jorge, and Célia Nunes. 2017. Identifying Top Relevant Dates for Implicit Time Sensitive Queries. Inf. Retr. 20, 4 (aug 2017), 363–398. https://doi.org/10.1007/s10791-017-9302-1
[7]
Na Dai, Milad Shokouhi, and Brian Davison. 2011. Learning to rank for freshness and relevance. 95–104. https://doi.org/10.1145/2009916.2009933
[8]
Gaël Harry Dias, Mohammed Hasanuzzaman, Stéphane Ferrari, and Yann Mathet. 2014. TempoWordNet for Sentence Time Tagging. In Proceedings of the 23rd International Conference on World Wide Web (Seoul, Korea) (WWW ’14 Companion). Association for Computing Machinery, New York, NY, USA, 833–838. https://doi.org/10.1145/2567948.2579042
[9]
Mathías Etcheverry and Dina Wonsever. 2017. Time Expressions Recognition with Word Vectors and Neural Networks. In TIME.
[10]
Adam Jatowt, Ching-Man Au Yeung, and Katsumi Tanaka. 2013. Estimating Document Focus Time. In Proceedings of the 22nd ACM International Conference on Information & Knowledge Management (San Francisco, California, USA) (CIKM ’13). Association for Computing Machinery, 2273–2278.
[11]
Hideo Joho, Adam Jatowt, Roi Blanco, Hajime Naka, and Shuhei Yamamoto. 2014. Overview of NTCIR-11 Temporal Information Access (Temporalia) Task. In NTCIR.
[12]
Hideo Joho, Adam Jatowt, Roi Blanco, Haitao Yu, and Shuhei Yamamoto. 2016. Overview of NTCIR-12 Temporal Information Access (Temporalia-2) Task. In NTCIR.
[13]
Sanjay Kairam, Meredith Morris, Jaime Teevan, Dan Liebling, and Susan Dumais. 2013. Towards Supporting Search over Trending Events with Social Media. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 7.
[14]
Nattiya Kanhabua and Avishek Anand. 2016. Temporal Information Retrieval. In Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval (Pisa, Italy) (SIGIR ’16). Association for Computing Machinery, 1235–1238.
[15]
Seyed Mehran Kazemi, Rishab Goel, Sepehr Eghbali, Janahan Ramanan, Jaspreet Sahota, Sanjay Thakur, Stella Wu, Cathal Smyth, Pascal Poupart, and Marcus Brubaker. 2019. Time2Vec: Learning a Vector Representation of Time. arxiv:1907.05321 [cs.LG]
[16]
Donald Metzler, Rosie Jones, Fuchun Peng, and Ruiqiang Zhang. 2009. Improving Search Relevance for Implicitly Temporal Queries. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval(Boston, MA, USA) (SIGIR ’09). Association for Computing Machinery, 700–701.
[17]
Christian Morbidoni, Alessandro Cucchiarelli, and Domenico Ursino. 2018. Leveraging Linked Entities to Estimate Focus Time of Short Texts. In Proceedings of the 22nd International Database Engineering & Applications Symposium (Villa San Giovanni, Italy) (IDEAS 2018). Association for Computing Machinery, 282–286.
[18]
Sérgio Nunes, Cristina Ribeiro, and Gabriel David. 2008. Use of Temporal Expressions in Web Search. 580–584. https://doi.org/10.1007/978-3-540-78646-7_59
[19]
Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. arxiv:1908.10084 [cs.CL]
[20]
Jannik Strötgen, Omar Alonso, and Michael Gertz. 2012. Identification of top relevant temporal expressions in documents. In Proceedings of the 2nd Temporal Web Analytics Workshop. 33–40.
[21]
Xujian Zhao, Peiquan Jin, and Lihua Yue. 2015. Discovering Topic Time from Web News. 51, 6 (Nov. 2015), 869–890.

Cited By

View all
  • (2023)Report on the 12th Temporal Web Analytics Workshop (TempWeb 2022) at WWW 2022ACM SIGIR Forum10.1145/3582900.358290956:2(1-6)Online publication date: 31-Jan-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '22: Companion Proceedings of the Web Conference 2022
April 2022
1338 pages
ISBN:9781450391306
DOI:10.1145/3487553
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 August 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Document Focus-Time
  2. Temporal Information Retrieval
  3. Time-Aware Ranking

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

WWW '22
Sponsor:
WWW '22: The ACM Web Conference 2022
April 25 - 29, 2022
Virtual Event, Lyon, France

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)1
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Report on the 12th Temporal Web Analytics Workshop (TempWeb 2022) at WWW 2022ACM SIGIR Forum10.1145/3582900.358290956:2(1-6)Online publication date: 31-Jan-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media