research-article

ReQue: A Configurable Workflow and Dataset Collection for Query Refinement

Authors:

Ebrahim BagheriAuthors Info & Claims

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management

Pages 3165 - 3172

https://doi.org/10.1145/3340531.3412775

Published: 19 October 2020 Publication History

Get Access

Abstract

In this paper, we implement and publicly share a configurable software workflow and a collection of gold standard datasets for training and evaluating supervised query refinement methods. Existing datasets such as AOL and MS MARCO, which have been extensively used in the literature for this purpose, are based on the weak assumption that users' input queries improve gradually within a search session, i.e., the last query where the user ends her information seeking session is the best reconstructed version of her initial query. In practice, such an assumption is not necessarily accurate for a variety of reasons, e.g., topic drift. The objective of our work is to enable researchers to build gold standard query refinement datasets without having to rely on such weak assumptions. Our software workflow, which generates such gold standard query datasets, takes three inputs: (1) a dataset of queries along with their associated relevance judgements (e.g. TREC topics), (2) an information retrieval method (e.g., BM25), and (3) an evaluation metric (e.g., MAP), and outputs a gold standard dataset. The produced gold standard dataset includes a list of revised queries for each query in the input dataset, each of which effectively improves the performance of the specified retrieval method in terms of the desirable evaluation metric. Since our workflow can be used to generate gold standard datasets for any input query set, in this paper, we have generated and publicly shared gold standard datasets for TREC queries associated with Robust04, Gov2, ClueWeb09, and ClueWeb12. The source code of our software workflow, the generated gold datasets, and benchmark results for three state-of-the-art supervised query refinement methods over these datasets are made publicly available for reproducibility purposes.

Supplementary Material

MP4 File (3340531.3412775.mp4)

In this presentation, we illustrate the implementation of our configurable software workflow and a collection of gold standard datasets for training and evaluating supervised query refinement methods. The main objective is to enable researchers to build gold standard query refinement datasets which guarantees the trustworthiness of the dataset in search improvement of the query reformulations. Our workflow can be used to generate gold standard datasets for any input query set by taking three inputs: a dataset of queries along with their associated relevance judgments (e.g.TREC topics), an information retrieval method (e.g., BM25), and an evaluation metric (e.g., MAP). We present the generated gold standard datasets for TREC queries (Robust04, Gov2, ClueWeb09, and ClueWeb12). The dataset includes a list of revised queries for each query in the input dataset, each of which effectively improves the performance of the specified retrieval method in terms of the desirable evaluation metric.

Download
69.79 MB

References

[1]

Wasi Uddin Ahmad, Kai-Wei Chang, and Hongning Wang. 2019. Context Attentive Document Ranking and Query Suggestion. In 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR. 385--394.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

RePair: An Extensible Toolkit to Generate Large-Scale Datasets for Query Refinement via Transformers

Matches Made in Heaven: Toolkit and Large-Scale Datasets for Supervised Query Reformulation

Query Reformulation for Content Based Multimedia Retrieval in MARS

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations