Retrieval performance and information theory

https://doi.org/10.1016/0306-4573(77)90034-6Get rights and content

Abstract

This paper challenges the meaningfulness of precision and recall values as a measure of performance of a retrieval system. Instead, it advocates the use of a normalised form of Shannon's functions (entropy and mutual information). Shannon's four axioms are replaced by an equivalent set of five axioms which are more readily shown to be pertinent to document retrieval.

The applicability of these axioms and the conceptual and operational advantages of Shannon's functions are the central points of the work.

The applicability of the results to any automatic classification is also outlined.

References (12)

  • C.E. Shannon

    The mathematical theory of communication

    Bell Syst. Tech. J.

    (1948)
  • C.E. Shannon et al.

    The Mathematical Theory of Communication

    (1949)
  • A.I. Khinchin

    Mathematical Foundations of Information Theory

    (1957)
  • G. Salton

    Automatic Information Organisation and Retrieval

    (1968)
  • J.A. Swets

    Effectiveness of information retrieval methods

    Am. Docum.

    (1976)
There are more references available in the full text version of this article.

Cited by (11)

  • Human subjectivity and performance limits in document retrieval

    1996, Information Processing and Management
  • Ten years progress in quantitative research on libraries

    1978, Socio-Economic Planning Sciences
View all citing articles on Scopus
View full text