poster

Evaluation of methods for relative comparison of retrieval systems based on clickthroughs

Published: 02 November 2009

Abstract

The Cranfield evaluation method has several disadvantages, including its high labor cost and its inadequacy for evaluating interactive retrieval techniques. As a very promising alternative, automatic comparison of retrieval systems based on the observed clicking behavior of users has recently been studied. Several methods have been proposed, but there has so far been no systematic way to assess which strategy is better, making it difficult to choose a good method for real applications. In this paper, we propose a general way to evaluate these relative comparison methods with two measures: utility to users (UtU) and effectiveness of differentiation (EoD). We evaluate two state-of-the-art methods by systematically simulating different retrieval scenarios. Motivated by the weaknesses of these methods revealed through our evaluation, we further propose a novel method that considers the positions of clicked documents. Experimental results show that our new method outperforms the existing methods.
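The comparison paradigm the abstract describes can be made concrete with a small sketch. The code below is not the paper's method: it shows a simplified team-draft-style interleaving of two rankings (in the spirit of the interleaving-evaluation literature this work builds on) together with a hypothetical position-discounted click credit, to illustrate what "considering the positions of clicked documents" can mean. All function names and the 1/(rank+1) weighting are illustrative assumptions.

```python
import random

def interleave(ranking_a, ranking_b, seed=None):
    """Simplified team-draft-style interleaving (illustrative only):
    the two rankers alternate turns, each contributing its
    highest-ranked document not yet in the merged list."""
    rng = random.Random(seed)
    merged, team, seen = [], [], set()
    ia = ib = 0
    a_turn = rng.random() < 0.5  # coin flip for which ranker starts
    while ia < len(ranking_a) or ib < len(ranking_b):
        if a_turn:
            while ia < len(ranking_a) and ranking_a[ia] in seen:
                ia += 1  # skip documents already placed
            if ia < len(ranking_a):
                merged.append(ranking_a[ia])
                team.append('A')
                seen.add(ranking_a[ia])
                ia += 1
        else:
            while ib < len(ranking_b) and ranking_b[ib] in seen:
                ib += 1
            if ib < len(ranking_b):
                merged.append(ranking_b[ib])
                team.append('B')
                seen.add(ranking_b[ib])
                ib += 1
        a_turn = not a_turn
    return merged, team

def position_weighted_score(team, clicks):
    """Hypothetical position-aware credit: a click at rank r (0-based)
    adds 1/(r+1) to the ranker whose document sits there. A positive
    total favors ranker A; a negative total favors ranker B."""
    score = 0.0
    for r in clicks:
        weight = 1.0 / (r + 1)
        score += weight if team[r] == 'A' else -weight
    return score

merged, team = interleave(['d1', 'd2', 'd3', 'd4'],
                          ['d3', 'd1', 'd5', 'd2'], seed=0)
print(merged)                                 # interleaved result list
print(position_weighted_score(team, [0, 2]))  # clicks at ranks 0 and 2
```

Discounting a click by its rank captures the intuition that clicks high in the list carry more weight than clicks a user only reaches after scanning past many results; a flat (unweighted) credit would treat both the same.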



    Published In

    CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
    November 2009
    2162 pages
    ISBN:9781605585123
    DOI:10.1145/1645953


    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. evaluation
    2. implicit feedback
    3. information retrieval

    Qualifiers

    • Poster

    Conference

CIKM '09

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%



    Cited By

• (2023) Validating Synthetic Usage Data in Living Lab Environments. Journal of Data and Information Quality. DOI: 10.1145/3623640. Published: 24-Sep-2023.
• (2023) Stat-Weight: Improving the Estimator of Interleaved Methods Outcomes with Statistical Hypothesis Testing. Advances in Information Retrieval. DOI: 10.1007/978-3-031-28241-6_2, pp. 20-34. Published: 16-Mar-2023.
• (2022) Debiased Balanced Interleaving at Amazon Search. Proceedings of the 31st ACM International Conference on Information & Knowledge Management. DOI: 10.1145/3511808.3557123, pp. 2913-2922. Published: 17-Oct-2022.
• (2020) Counterfactual Online Learning to Rank. Advances in Information Retrieval. DOI: 10.1007/978-3-030-45439-5_28, pp. 415-430. Published: 8-Apr-2020.
• (2019) Position Bias Estimation for Unbiased Learning-to-Rank in eCommerce Search. String Processing and Information Retrieval. DOI: 10.1007/978-3-030-32686-9_4, pp. 47-64. Published: 3-Oct-2019.
• (2019) Optimizing Ranking Models in an Online Setting. Advances in Information Retrieval. DOI: 10.1007/978-3-030-15712-8_25, pp. 382-396. Published: 7-Apr-2019.
• (2018) Differentiable Unbiased Online Learning to Rank. Proceedings of the 27th ACM International Conference on Information and Knowledge Management. DOI: 10.1145/3269206.3271686, pp. 1293-1302. Published: 17-Oct-2018.
• (2017) Sensitive and Scalable Online Evaluation with Theoretical Guarantees. Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. DOI: 10.1145/3132847.3132895, pp. 77-86. Published: 6-Nov-2017.
• (2016) Constructing Reliable Gradient Exploration for Online Learning to Rank. Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. DOI: 10.1145/2983323.2983774, pp. 1643-1652. Published: 24-Oct-2016.
• (2016) Interleaved Evaluation for Retrospective Summarization and Prospective Notification on Document Streams. Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval. DOI: 10.1145/2911451.2911494, pp. 175-184. Published: 7-Jul-2016.
