skip to main content
10.1145/1076034.1076116acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

An exploration of axiomatic approaches to information retrieval

Published: 15 August 2005 Publication History

Abstract

Existing retrieval models generally do not offer any guarantee for optimal retrieval performance. Indeed, it is even difficult, if not impossible, to predict a model's empirical performance analytically. This limitation is at least partly caused by the way existing retrieval models are developed where relevance is only coarsely modeled at the level of documents and queries as opposed to a finer granularity level of terms. In this paper, we present a new axiomatic approach to developing retrieval models based on direct modeling of relevance with formalized retrieval constraints defined at the level of terms. The basic idea of this axiomatic approach is to search in a space of candidate retrieval functions for one that can satisfy a set of reasonable retrieval constraints. To constrain the search space, we propose to define a retrieval function inductively and decompose a retrieval function into three component functions. Inspired by the analysis of the existing retrieval functions with the inductive definition, we derive several new retrieval functions using the axiomatic retrieval framework. Experiment results show that the derived new retrieval functions are more robust and less sensitive to parameter settings than the existing retrieval functions with comparable optimal performance.

References

[1]
P. D. Bruza and T. Huibers. investigating aboutness axioms using information fields. In Proceedings of the 1994 ACM SIGIR Conference on Research and Development in Information Retrieval, 1994.
[2]
H. Fang, T. Tao, and C. Zhai. A formal study of information retrieval heuristics. In Proceedings of the 2004 ACM SIGIR Conference on Research and Development in Information Retrieval, 2004.
[3]
N. Fuhr. Probabilistic models in information retrieval. The Computer Journal, 35(3):243--255, 1992.
[4]
W. R. Grieff. A theory of term weighting based on exploratory data analysis. In Proceedings of the 1998 ACM SIGIR Conference on Research and Development in Information Retrieval, 1998.
[5]
F. Hartiwig and B. E. Dearing. Exploratory Data Analysis. Sage Publications, 1979.
[6]
T. Huibers. Towards an axiomatic aboutness theory for information retrieval. Information Retrieval, Uncertainty and Logics-Advanced Models for the representation and retrieval for information, 1998.
[7]
J. Kleinberg. An impossibility theorem for clustering. In Advances in NIPS 15, 2002.
[8]
J. Lafferty and C. Zhai. Probabilistic relevance models based on document and query generation. In W. B. Croft and J. Lafferty, editors, Language Modeling and Information Retrieval. Kluwer Academic Publishers, 2003.
[9]
J. Ponte and W. B. Croft. A language modeling approach to information retrieval. In Proceedings of the ACM SIGIR'98, pages 275--281, 1998.
[10]
S. Robertson and K. Sparck Jones. Relevance weighting of search terms. Journal of the American Society for Information Science, 27:129--146, 1976.
[11]
S. Robertson and S. Walker. Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In Proceedings of SIGIR'94, pages 232--241, 1994.
[12]
S. E. Robertson, S. Walker, S. Jones, M. M.Hancock-Beaulieu, and M. Gatford. Okapi at TREC-3. In D. K. Harman, editor, The Third Text REtrieval Conference (TREC-3), pages 109--126, 1995.
[13]
G. Salton. Automatic Text Processing: The Transformation, Analysis and Retrieval of Information by Computer. Addison-Wesley, 1989.
[14]
G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Information Processing and Management, 24:513--523, 1988.
[15]
G. Salton and M. McGill. Introduction to Modern Information Retrieval. McGraw-Hill, 1983.
[16]
G. Salton, C. S. Yang, and C. T. Yu. A theory of term importance in automatic text analysis. Journal of the American Society for Information Science, 26(1):33--44, Jan-Feb 1975.
[17]
A. Singhal. Modern information retrieval: A brief overview. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 24(4):35--43, 2001.
[18]
A. Singhal, C. Buckley, and M. Mitra. Pivoted document length normalization. In Proceedings of the 1996 ACM SIGIR Conference on Research and Development in Information Retrieval, pages 21--29, 1996.
[19]
K. Sparck Jones. A statistical interpretation of term specifity and its application in retrieval. Journal of Documentation, 28(1):11--22, 1972.
[20]
H. Turtle and W. B. Croft. Evaluation of an inference network-based retrieval model. ACM Transactions on Information Systems, 9(3):187--222, 1991.
[21]
C. J. van Rijbergen. A theoretical basis for theuse of co-occurrence data in information retrieval. Journal of Documentation, pages 106--119, 1977.
[22]
K.-F. Wong, D. Song, P. Bruza, and C.-H. Cheng. Application of aboutness to functional benchmarking in information retrieval. ACM Transactions on Information Systems, 19(4):337--370, 2001.
[23]
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of SIGIR'01, pages 334--342, Sept 2001.
[24]
J. Zobel and A. Moffat. Exploring the similarity space. SIGIR Forum, 31(1):18--34, 1998.

Cited By

View all
  • (2024)Secure semantic search using deep learning in a blockchain-assisted multi-user settingJournal of Cloud Computing10.1186/s13677-023-00578-513:1Online publication date: 30-Jan-2024
  • (2024)Query Variability and Experimental Consistency: A Concerning Case StudyProceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3664190.3672519(35-41)Online publication date: 2-Aug-2024
  • (2024)Group Validation in Recommender Systems: Framework for Multi-layer Performance EvaluationACM Transactions on Recommender Systems10.1145/36408202:1(1-25)Online publication date: 19-Jan-2024
  • Show More Cited By

Index Terms

  1. An exploration of axiomatic approaches to information retrieval

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
    August 2005
    708 pages
    ISBN:1595930345
    DOI:10.1145/1076034
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 August 2005

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. TF-IDF weighting
    2. asxiomatic model
    3. constraints
    4. formal models
    5. retrieval heuristics

    Qualifiers

    • Article

    Conference

    SIGIR05
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)47
    • Downloads (Last 6 weeks)17
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Secure semantic search using deep learning in a blockchain-assisted multi-user settingJournal of Cloud Computing10.1186/s13677-023-00578-513:1Online publication date: 30-Jan-2024
    • (2024)Query Variability and Experimental Consistency: A Concerning Case StudyProceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3664190.3672519(35-41)Online publication date: 2-Aug-2024
    • (2024)Group Validation in Recommender Systems: Framework for Multi-layer Performance EvaluationACM Transactions on Recommender Systems10.1145/36408202:1(1-25)Online publication date: 19-Jan-2024
    • (2024)Course Recommender Systems Need to Consider the Job MarketProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657847(522-532)Online publication date: 10-Jul-2024
    • (2024)An Intrinsic Framework of Information Retrieval Evaluation MeasuresIntelligent Systems and Applications10.1007/978-3-031-47721-8_47(692-713)Online publication date: 10-Jan-2024
    • (2024)How much freedom does an effectiveness metric really have?Journal of the Association for Information Science and Technology10.1002/asi.24874Online publication date: 15-Feb-2024
    • (2023)Dense Text Retrieval based on Pretrained Language Models: A SurveyACM Transactions on Information Systems10.1145/3637870Online publication date: 18-Dec-2023
    • (2023)Information Retrieval Evaluation Measures Defined on Some Axiomatic Models of PreferencesACM Transactions on Information Systems10.1145/363217142:3(1-35)Online publication date: 8-Nov-2023
    • (2023)Towards Query Performance Prediction for Neural Information Retrieval: Challenges and OpportunitiesProceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3578337.3605142(51-63)Online publication date: 9-Aug-2023
    • (2023)Automatic and Analytical Field Weighting for Structured Document RetrievalAdvances in Information Retrieval10.1007/978-3-031-28244-7_31(489-503)Online publication date: 17-Mar-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media