Abstract
Most previous investigations comparing the performance of different representations have used recall and precision as performance measures. However, there is evidence to show that these measures are insensitive to an important difference between representations. To explain, two representations may perform similarly on these measures, while retrieving very different sets of documents. Equivalence of representations should be decided on the basis of similarity in performance and similarity in the documents retrieved. This study compared the performance of four representations in the PsycAbs database. In addition, overlap between retrieved sets was also computed where overlap is the proportion of retrieved documents that are the same for pairs of document representations. Results indicate that for any two representations considered, performance values differed slightly while overlap scores were also low, thus supporting the evidence that recall and precision as performance measures mask differences between the sets of retrieved documents. Results are interpreted to propose an optimal ordering of the representations and to examine the contribution of each representation given this combination.
- Katzer, Jeffrey, et al. A Study of the Overlap Among Document Representations. Information Technology, 1982. i, pp. 201--273.Google Scholar
- Katzer, Jeffrey. A Study of the Impact of Representations in Information Retrieval Systems. Final report for Grant NSF-IST- 79-21468 to the National Science Foundation, July 1982.Google Scholar
- McGill, Michael J. et al. An Evaluation of Factors Affecting Document Ranking by Information Retrieval Systems. Final Report For Grant NSF-IST-78-10454 to the National Science Foundation, October 1979.Google Scholar
Index Terms
- A Study of the Overlap Among Document Representations
Recommendations
A study of the overlap among document representations
Most previous investigations comparing the performance of different representations have used recall and precision as performance measures. However, there is evidence to show that these measures are insensitive to an important difference between ...
A study of the overlap among document representations
SIGIR '83: Proceedings of the 6th annual international ACM SIGIR conference on Research and development in information retrievalMost previous investigations comparing the performance of different representations have used recall and precision as performance measures. However, there is evidence to show that these measures are insensitive to an important difference between ...
Improving document representations using relevance feedback: the RFA algorithm
CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge managementIn this paper we present a document representation improvement technique, named the Relevance Feedback Accumulation (RFA) algorithm. Using prior relevance feedback assessments and a data mining measure called "support", the algorithm's learning function ...
Comments