short-paper

Query-Task Mapping

Authors:

Michael Völske,

Ehsan Fatehifar,

Matthias HagenAuthors Info & Claims

SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 969 - 972

https://doi.org/10.1145/3331184.3331286

Published: 18 July 2019 Publication History

Abstract

Several recent task-based search studies aim at splitting query logs into sets of queries for the same task or information need. We address the natural next step: mapping a currently submitted query to an appropriate task in an already task-split log. This query-task mapping can, for instance, enhance query suggestions---rendering efficiency of the mapping, besides accuracy, a key objective. Our main contributions are three large benchmark datasets and preliminary experiments with four query-task mapping approaches: (1) a Trie-based approach, (2) MinHash~LSH, (3) word movers distance in a Word2Vec setup, and (4) an inverted index-based approach. The experiments show that the fast and accurate inverted index-based method forms a strong baseline.

References

[1]

Ahmed Hassan Awadallah, Ryen W. White, Patrick Pantel, Susan T. Dumais, and Yi-Min Wang. 2014. Supporting complex search tasks. In Proceedings of CIKM 2014, 829--838.

Digital Library

[2]

Mayank Bawa, Tyson Condie, and Prasanna Ganesan. 2005. LSH Forest: Self-tuning indexes for similarity search. In Proceedings of WWW 2005, 651--660.

Digital Library

[3]

Paolo Boldi, Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis, and Sebastiano Vigna. 2008. The query-flow graph: Model and applications. In Proceedings of CIKM 2008, 609--618.

Digital Library

[4]

Ben Carterette, Evangelos Kanoulas, Mark M. Hall, and Paul D. Clough. 2014. Overview of the TREC 2014 Session track. In Proceedings of TREC 2014.

[5]

Rene De La Briandais. 1959. File searching using variable length keys. In Proceedings of IRE-AIEE-ACM 1959, 295--298.

Digital Library

[6]

Debora Donato, Francesco Bonchi, Tom Chi, and Yoëlle S. Maarek. 2010. Do you want to take notes?: Identifying research missions in Yahoo! search pad. In Proceedings of WWW 2010, 321--330.

Digital Library

[7]

Daniel Gayo-Avello. 2009. A survey on session detection methods in query logs and a proposal for future evaluation. Information Sciences, Vol. 179, 12 (2009), 1822--1843.

Digital Library

[8]

Matthias Hagen, Jakob Gomoll, Anna Beyer, and Benno Stein. 2013. From search session detection to search mission detection. In Proceedings of OAIR 2013, 85--92.

Digital Library

[9]

Matthias Hagen, Martin Potthast, Michael Völske, Jakob Gomoll, and Benno Stein. 2016. How writers search: Analyzing the search and writing logs of non-fictional essays. In Proceedings of CHIIR 2016, 193--202.

Digital Library

[10]

Daqing He, Ayse Göker, and David J. Harper. 2002. Combining evidence for automatic web session identification. Information Processing & Management, Vol. 38, 5 (2002), 727--742.

Digital Library

[11]

Wen Hua, Yangqiu Song, Haixun Wang, and Xiaofang Zhou. 2013. Identifying users' topical tasks in web search. In Proceedings of WSDM 2013, 93--102.

Digital Library

[12]

Bernard J. Jansen, Amanda Spink, Chris Blakely, and Sherry Koshman. 2007. Defining a session on web search engines. JASIST, Vol. 58, 6 (2007), 862--871.

Digital Library

[13]

Rosie Jones and Kristina Lisa Klinkner. 2008. Beyond the session timeout: Automatic hierarchical segmentation of search topics in query logs. In Proceedings of CIKM 2008, 699--708.

Digital Library

[14]

Evangelos Kanoulas, Emine Yilmaz, Rishabh Mehrotra, Ben Carterette, Nick Craswell, and Peter Bailey. 2017. TREC 2017 Tasks track overview. In Proceedings of TREC 2017 .

[15]

Alexander Kotov, Paul N. Bennett, Ryen W. White, Susan T. Dumais, and Jaime Teevan. 2011. Modeling and analysis of cross-session search tasks. In Proceedings of SIGIR 2011, 5--14.

Digital Library

[16]

Matt J. Kusner, Yu Sun, Nicholas I. Kolkin, and Kilian Q. Weinberger. 2015. From word embeddings to document distances. In Proceedings of ICML 2015, 957--966.

Digital Library

[17]

Liangda Li, Hongbo Deng, Anlei Dong, Yi Chang, and Hongyuan Zha. 2014. Identifying and labeling search tasks via query-based Hawkes processes. In Proceedingsof KDD 2014, 731--740.

Digital Library

[18]

Zhen Liao, Yang Song, Yalou Huang, Li-wei He, and Qi He. 2014. Task trail: An effective segmentation of user search behavior. IEEE Trans. Knowl. Data Eng., Vol. 26, 12 (2014), 3090--3102.

[19]

Zheng Lu, Hongyuan Zha, Xiaokang Yang, Weiyao Lin, and Zhaohui Zheng. 2013. A new algorithm for inferring user search goals with feedback sessions. IEEE Trans. Knowl. Data Eng., Vol. 25, 3 (2013), 502--513.

Digital Library

[20]

Claudio Lucchese, Salvatore Orlando, Raffaele Perego, Fabrizio Silvestri, and Gabriele Tolomei. 2011. Identifying task-based sessions in search engine query logs. In Proceedings of WSDM 2011, 277--286.

Digital Library

[21]

Claudio Lucchese, Salvatore Orlando, Raffaele Perego, Fabrizio Silvestri, and Gabriele Tolomei. 2013. Discovering tasks from search engine query logs. ACM Trans. Inf. Syst., Vol. 31, 3 (2013), 14.

Digital Library

[22]

Rishabh Mehrotra, Prasanta Bhattacharya, and Emine Yilmaz. 2016. Deconstructing complex search tasks: A Bayesian nonparametric approach for extracting sub-tasks. In Proceedings of NAACL 2016, 599--605.

[23]

Rishabh Mehrotra and Emine Yilmaz. 2015. Terms, topics & tasks: Enhanced user modelling for better personalization. In Proceedings of ICTIR 2015, 131--140.

Digital Library

[24]

Rishabh Mehrotra and Emine Yilmaz. 2017a. Extracting hierarchies of search tasks & subtasks via a Bayesian nonparametric approach. In Proceedings of SIGIR 2017, 285--294.

Digital Library

[25]

Rishabh Mehrotra and Emine Yilmaz. 2017b. Task embeddings: Learning query embeddings using task context. In Proceedings of CIKM 2017, 2199--2202.

Digital Library

[26]

Donald Metzler, Susan T. Dumais, and Christopher Meek. 2007. Similarity measures for short segments of text. In Proceedings of ECIR 2007. 16--27.

Digital Library

[27]

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv, Vol. abs/1301.3781 (2013).

[28]

Greg Pass, Abdur Chowdhury, and Cayley Torgeson. 2006. A picture of search. In Proceedings of Infoscale 2006, 1.

Digital Library

[29]

Procheta Sen, Debasis Ganguly, and Gareth J. F. Jones. 2018. Tempo-lexical context driven word embedding for cross-session search task extraction. In Proceedings of NAACL 2018. 283--292.

[30]

Amanda Spink, Minsoo Park, Bernard J. Jansen, and Jan O. Pedersen. 2006. Multitasking during web search sessions. Inf. Process. Manage., Vol. 42, 1 (2006), 264--275.

Digital Library

[31]

Manisha Verma and Emine Yilmaz. 2014. Entity oriented task extraction from query logs. Proceedings of CIKM 2014, 1975--1978.

Digital Library

[32]

Manisha Verma and Emine Yilmaz. 2016. Category oriented task extraction. In Proceedings of CHIIR 2016, 333--336.

Digital Library

[33]

Zi Yang and Eric Nyberg. 2015. Leveraging procedural knowledge for task-oriented search. In Proceedings of SIGIR 2015, 513--522.

Digital Library

Cited By

Ates NYaslan Y(2025)Search task extraction using k-contour based recurrent deep graph clusteringEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109501139(109501)Online publication date: Jan-2025
https://doi.org/10.1016/j.engappai.2024.109501
Otto CRokicki MPardi GGritz WHienert DYu Rvon Hoyer JHoppe ADietze SHoltz PKammerer YEwerth RElsweiler DKruschwitz ULudwig B(2022)SaL-Lightning Dataset: Search and Eye Gaze Behavior, Resource Interactions and Knowledge Gain during Web SearchACM SIGIR Conference on Human Information Interaction and Retrieval10.1145/3498366.3505835(347-352)Online publication date: 14-Mar-2022
https://doi.org/10.1145/3498366.3505835
Dosso CMoreno JChevalier ATamine LDemartini GZuccon GCulpepper JHuang ZTong H(2021)CoST: An annotated Data Collection for Complex SearchProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3481998(4455-4464)Online publication date: 26-Oct-2021
https://dl.acm.org/doi/10.1145/3459637.3481998
Show More Cited By

Index Terms

Query-Task Mapping
1. Information systems
  1. Information retrieval

Recommendations

User Behaviour and Task Characteristics: A Field Study of Daily Information Behaviour
CHIIR '17: Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval

Previous studies investigating task based search often take the form of lab studies or large scale log analysis. In lab studies, users typically perform a designed task under a controlled environment, which may not reflect their natural behaviour. While ...
Query Reformulation for Task-Oriented Web Searches
WI-IAT '11: Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03

Web searches are driven by information needs and intend the accomplishment of specific tasks. Information needs are determined by the topical subject of queries, i.e. what we search, while tasks are determined by the user motives that induce the ...
Generating Query Suggestions to Support Task-Based Search
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

We address the problem of generating query suggestions to support users in completing their underlying tasks (which motivated them to search in the first place). Given an initial query, these query suggestions should provide a coverage of possible ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

July 2019

1512 pages

ISBN:9781450361729

DOI:10.1145/3331184

General Chairs:
Benjamin Piwowarski
CNRS - Sorbonne Universite, France
,
Max Chevalier
Universite de Toulouse, CNRS, France
,
Eric Gaussier
Universite Grenoble Alpes, CNRS, France
,
Program Chairs:
Yoelle Maarek
Amazon Research, Israel
,
Jian-Yun Nie
University of Montreal, Canada
,
Falk Scholer
RMIT University, Australia

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 July 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

Bundesministerium für Bildung und Forschung

Conference

SIGIR '19

Sponsor:

SIGIR

SIGIR '19: The 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

July 21 - 25, 2019

Paris, France

Acceptance Rates

SIGIR'19 Paper Acceptance Rate 84 of 426 submissions, 20%;

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
248
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Reflects downloads up to 16 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Ates NYaslan Y(2025)Search task extraction using k-contour based recurrent deep graph clusteringEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109501139(109501)Online publication date: Jan-2025
https://doi.org/10.1016/j.engappai.2024.109501
Otto CRokicki MPardi GGritz WHienert DYu Rvon Hoyer JHoppe ADietze SHoltz PKammerer YEwerth RElsweiler DKruschwitz ULudwig B(2022)SaL-Lightning Dataset: Search and Eye Gaze Behavior, Resource Interactions and Knowledge Gain during Web SearchACM SIGIR Conference on Human Information Interaction and Retrieval10.1145/3498366.3505835(347-352)Online publication date: 14-Mar-2022
https://doi.org/10.1145/3498366.3505835
Dosso CMoreno JChevalier ATamine LDemartini GZuccon GCulpepper JHuang ZTong H(2021)CoST: An annotated Data Collection for Complex SearchProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3481998(4455-4464)Online publication date: 26-Oct-2021
https://dl.acm.org/doi/10.1145/3459637.3481998
Lugo LMoreno JHubert G(2021)Modeling User Search Tasks with a Language-Agnostic Unsupervised ApproachAdvances in Information Retrieval10.1007/978-3-030-72113-8_27(405-418)Online publication date: 27-Mar-2021
https://doi.org/10.1007/978-3-030-72113-8_27
Lugo LMoreno JHubert G(2021)Extracting Search Tasks from Query Logs Using a Recurrent Deep Clustering ArchitectureAdvances in Information Retrieval10.1007/978-3-030-72113-8_26(391-404)Online publication date: 27-Mar-2021
https://doi.org/10.1007/978-3-030-72113-8_26

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten