short-paper

On modifying evaluation measures to deal with ties in ranked lists

Authors:
Sourav Saha, Indian Statistical Institute, Kolkata, India
Dwaipayan Roy, Indian Institute of Science Education and Research, Kolkata, India
Mandar Mitra, Indian Statistical Institute, Kolkata, India

JCDL '22: Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries, June 2022, Article No.: 12, Pages 1–4. https://doi.org/10.1145/3529372.3533291

Published: 20 June 2022

ABSTRACT

Evaluation metrics for search and ranking systems are generally designed for a linear list of ranked items that contains no ties. However, ties arise naturally in the ranked lists produced by certain systems or techniques. Evaluation protocols typically break such ties arbitrarily and then compute the standard metrics. If the number of ties is non-trivial, it is more principled to use modified, tie-aware formulations of these metrics. For most commonly used metrics, McSherry and Najork [5] present tie-aware definitions that are more appropriate for assessing systems that retrieve multiple distinct results at the same rank. Hit@k is another evaluation measure that is widely used for some tasks but is not covered in [5]. This paper proposes a tie-aware version of Hit@k, which we call ta-Hit@k. We also empirically compare the values of ta-Hit@k and Hit@k for a single example system on a standard benchmark task.
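
The abstract does not give the definition of ta-Hit@k, so the following is only an illustrative sketch of one plausible tie-aware formulation, not necessarily the paper's measure. It assumes that Hit@k is 1 when at least one relevant item appears in the top k and 0 otherwise, and it computes the expected Hit@k when each group of tied items is ordered uniformly at random, which is one common way of averaging a metric over tie-breakings. The function name `expected_hit_at_k` and the representation of a ranking as a list of tie groups are hypothetical choices made for this example.

```python
from math import comb
from typing import Hashable, Sequence, Set


def expected_hit_at_k(
    tie_groups: Sequence[Sequence[Hashable]],
    relevant: Set[Hashable],
    k: int,
) -> float:
    """Expected Hit@k under uniformly random tie-breaking (illustrative only).

    tie_groups: ranked list given as groups of distinct items, best group
                first; items inside a group share the same rank.
    relevant:   set of relevant items.
    k:          rank cutoff.
    """
    if k <= 0:
        return 0.0
    seen = 0  # number of items ranked strictly above the current group
    for group in tie_groups:
        slots = k - seen  # top-k positions still unfilled
        if slots <= 0:
            break
        n = len(group)
        r = sum(1 for item in group if item in relevant)
        if n <= slots:
            # The whole group lies inside the top k, so its order is irrelevant.
            if r > 0:
                return 1.0
            seen += n
            continue
        # The group straddles the cutoff: only `slots` of its n items make it
        # into the top k. P(no relevant item among them) is hypergeometric.
        return 1.0 - comb(n - r, slots) / comb(n, slots)
    return 0.0


# Usage: item "a" is alone at rank 1; "b", "c", "d" are tied below it.
groups = [["a"], ["b", "c", "d"], ["e"]]
print(expected_hit_at_k(groups, {"c"}, 2))  # one of three tied items fills slot 2
print(expected_hit_at_k(groups, {"c"}, 4))  # the tied group is fully inside the top 4
```

Under this assumption, arbitrary tie-breaking would report either 0 or 1 for the first call depending on which tied item happened to land at rank 2, whereas the expectation gives a single value that does not depend on the tie-breaking order.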

References

[1] Philipp Christmann, Rishiraj Saha Roy, Abdalghani Abujabal, Jyotsna Singh, and Gerhard Weikum. 2019. Look before You Hop: Conversational Question Answering over Knowledge Graphs Using Judicious Context Expansion. In Proc. of 28th ACM CIKM (CIKM '19). Association for Computing Machinery, New York, NY, USA, 729–738.
[2] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proc. of NAACL. 4171–4186.
[3] Denys Katerenchuk and Andrew Rosenberg. 2016. RankDCG: Rank-Ordering Evaluation Measure. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16). European Language Resources Association (ELRA), Portorož, Slovenia, 3675–3680. https://www.aclweb.org/anthology/L16-1583
[4] Xiaolu Lu, Soumajit Pramanik, Rishiraj Saha Roy, Abdalghani Abujabal, Yafang Wang, and Gerhard Weikum. 2019. Answering Complex Questions by Joining Multi-Document Evidence with Quasi Knowledge Graphs. In Proc. of 42nd SIGIR (SIGIR'19). 105–114.
[5] Frank McSherry and Marc Najork. 2008. Computing Information Retrieval Performance Measures Efficiently in the Presence of Tied Scores. In Advances in Information Retrieval, 30th European Conference on IR Research, ECIR 2008, Glasgow, UK, March 30-April 3, 2008. Proceedings (Lecture Notes in Computer Science), Craig Macdonald, Iadh Ounis, Vassilis Plachouras, Ian Ruthven, and Ryen W. White (Eds.), Vol. 4956. Springer, 414–421.
[6] Zhiqing Sun, Shikhar Vashishth, Soumya Sanyal, Partha Talukdar, and Yiming Yang. 2020. A Re-evaluation of Knowledge Graph Completion Methods. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 5516–5522.

Index Terms

1. Information systems
  1. Information retrieval
    1. Evaluation of retrieval results
      1. Presentation of retrieval results

Published in
JCDL '22: Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries
June 2022
392 pages
ISBN: 9781450393454
DOI: 10.1145/3529372
General Chairs: Akiko Aizawa (National Institute of Informatics, Japan), Thomas Mandl (University of Hildesheim, Germany), Zeljko Carevic (GESIS - Leibniz Institute for the Social Sciences, Germany)
Program Chairs: Annika Hinze (University of Waikato, New Zealand), Philipp Mayr (GESIS - Leibniz Institute for the Social Sciences, Germany), Philipp Schaer (TH Köln (University of Applied Sciences), Germany)
Copyright © 2022 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
evaluation measure
ranking
ties

Acceptance Rates
JCDL '22 Paper Acceptance Rate: 35 of 132 submissions, 27%
Overall Acceptance Rate: 415 of 1,482 submissions, 28%