poster

Language models, probability of relevance and relevance likelihood

Authors:
Richard Bache (University of Strathclyde, Glasgow, Scotland, United Kingdom)
Mark Baillie (University of Strathclyde, Glasgow, Scotland, United Kingdom)
Fabio Crestani (University of Strathclyde, Glasgow, Scotland, United Kingdom)

CIKM '07: Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, November 2007, Pages 853–856, https://doi.org/10.1145/1321440.1321559

Published: 06 November 2007

ABSTRACT

This paper proposes a measure of relevance likelihood derived specifically for language models. Such a measure may be used to guide a user on how far to browse through the list of retrieved items or for pseudo-relevance feedback. To derive this measure, it is necessary to make the assumption that a user is seeking an ideal (usually non-existent) document and the actual relevant documents in the collection will contain fragments of this ideal document. Thus, in deriving this measure we propose a novel way of capturing relevance in Language Modelling.

Index Terms

1. Information systems
  1. Information retrieval

Published in
CIKM '07: Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
November 2007
1048 pages
ISBN:9781595938039
DOI:10.1145/1321440
Co-chair: Alberto H. F. Laender
Conference Chairs: André O. Falcão (Universidade de Lisboa, Portugal), Øystein Haug Olsen
General Chair: Mário J. Silva (Universidade de Lisboa, Portugal)
Program Chairs: Ricardo Baeza-Yates, Deborah L. McGuinness, Bjorn Olstad
Copyright © 2007 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
language modelling
ranking function
relevance likelihood
Acceptance Rates
Overall Acceptance Rate: 1,861 of 8,427 submissions, 22%