Article

Quantify query ambiguity using ODP metadata

Authors: Guang Qiu, Kangmiao Liu, Jiajun Bu, Chun Chen, Zhiming Kang
Zhejiang University, Hangzhou, China

SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
July 2007
Pages 697–698
https://doi.org/10.1145/1277741.1277864

Published: 23 July 2007

ABSTRACT

Query ambiguity prevents existing retrieval systems from returning reasonable results for every query. Because much work has already been done on resolving ambiguity, vague queries could be handled separately with the corresponding approaches if they can be identified in advance. Quantifying the degree of (lack of) ambiguity lays the groundwork for this identification. In this poster, we propose such a measure using query topics based on a topic structure selected from the Open Directory Project (ODP) taxonomy. We introduce a clarity score to quantify the lack of ambiguity with respect to data sets constructed from the TREC collections. Rank correlation test results demonstrate a strong positive association between the clarity scores and retrieval precision for queries.
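The abstract does not give the poster's formula, so the following is only a minimal sketch under stated assumptions: clarity is taken to be the relative entropy (KL divergence) between a query's distribution over topics and the collection-wide topic distribution, and the rank correlation test is taken to be Spearman's. The topic names, the distributions, and the function names `clarity_score`, `ranks`, and `spearman` are hypothetical and chosen for illustration.

```python
import math


def clarity_score(query_topics, collection_topics):
    """Relative entropy (in bits) of a query's topic distribution
    against the collection's topic distribution.

    ASSUMPTION: this KL-divergence form is an illustrative stand-in;
    the poster's exact definition is not stated in the abstract.
    Every topic with nonzero query mass must have nonzero collection mass.
    """
    score = 0.0
    for topic, p_query in query_topics.items():
        if p_query > 0:
            score += p_query * math.log2(p_query / collection_topics[topic])
    return score


def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the ranks.
    Assumes neither input is constant (otherwise the denominator is zero)."""
    rx, ry = ranks(xs), ranks(ys)
    n = len(xs)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in rx))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in ry))
    return cov / (sd_x * sd_y)


# Hypothetical topic structure with a uniform collection distribution.
collection = {"Arts": 0.25, "Computers": 0.25, "Science": 0.25, "Sports": 0.25}
focused = {"Computers": 1.0}                    # all mass on one topic
ambiguous = {"Arts": 0.5, "Computers": 0.5}     # mass split across two topics
uniform = dict(collection)                      # no topical preference

focused_score = clarity_score(focused, collection)
ambiguous_score = clarity_score(ambiguous, collection)
uniform_score = clarity_score(uniform, collection)
```

Under these assumptions a query concentrated on one topic scores higher than one spread over several, and a query whose topics mirror the collection scores zero. Given per-query clarity scores and per-query precision values, `spearman(scores, precisions)` would give the kind of rank correlation the abstract refers to.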

References

S. Cronen-Townsend and W. B. Croft. Quantifying query ambiguity. In Proceedings of Human Language Technology, pages 94–98, 2002.
M. Sanderson and K. van Rijsbergen. The impact on retrieval effectiveness of skewed frequency distributions. ACM Transactions on Information Systems, 17(4):440–465, 1999.
H. Schütze and J. Pedersen. Information retrieval based on word senses. In Proceedings of the 4th Annual Symposium on Document Analysis and Information Retrieval, pages 161–175, 1995.
I. Soboroff. Overview of the TREC 2004 novelty track. In Proceedings of the Thirteenth Text REtrieval Conference (TREC 2004), NIST Special Publication 500-261, 2004.
E. Voorhees. Overview of the TREC 2003 robust retrieval track. In Proceedings of the Twelfth Text REtrieval Conference (TREC 2003), NIST Special Publication 500-255, 2003.

Index Terms

Quantify query ambiguity using ODP metadata
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing

Published in
SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
July 2007
946 pages
ISBN:9781595935977
DOI:10.1145/1277741
General Chairs: Wessel Kraaij (TNO, The Netherlands), Arjen P. de Vries (CWI, The Netherlands)
Program Chairs: Charles L. A. Clarke (University of Waterloo, Canada), Norbert Fuhr (University of Duisburg-Essen, Germany), Noriko Kando (National Institute of Informatics, Japan)
Copyright © 2007 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
ODP
ambiguity
quantification
rank correlation test
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate: 792 of 3,983 submissions, 20%
Article Metrics
- Total Citations: 15
- Total Downloads: 465
- Downloads (Last 12 months): 2
- Downloads (Last 6 weeks): 0