ABSTRACT
This paper presents our novel relevance feedback (RF) algorithm that uses the probabilistic document-context based retrieval model with limited relevance judgments for document re-ranking. Probabilities of the document-context based retrieval model are estimated from the top N (=20) documents in the initial retrieval. We use document-context based cosine similarity measure to find similar data for better probability estimation in order to reduce the data scarcity problem and the negative weighting problem. Our RF algorithm is promising because its mean average precision is statistically significantly better than the baseline using TREC-6 and TREC-7 data collections.
- J. J. Rocchio, Relevance feedback in information retrieval, The Smart retrieval system: Experiments in automatic document processing, Prentice Hall, pp. 313--323, 1971Google Scholar
- M. Iwayama, Relevance Feedback with a small Number of Relevance Judgements: Incremental Relevance Feedback vs. Document Clustering, Proc. ACM SIGIR, pp. 10--16, 2000 Google ScholarDigital Library
- H. C. Wu, R. W. P. Luk, K. F. Wong, K. L. Kwok and W. J. Li, A retrospective study of probabilistic context-based retrieval, Proc. ACM SIGIR, pp. 663--664, 2005 Google ScholarDigital Library
- R. H. Warren and T. Liu, A Review of Relevance Feedback Experiments at the 2003 Reliable Information Access (RIA) Workshop, Proc. ACM SIGIR, pp. 570--571, 2004 Google ScholarDigital Library
- C. Buckley and G. Salton, Optimization of Relevance Feedback Weights, Proc. ACM SIGIR, pp. 351--357, 1995 Google ScholarDigital Library
- C. Zhai and J. Lafferty, Model-based Feedback in the Language Modeling Approach to Information Retrieval, Proc. ACM CIKM, pp. 403--410, 2001 Google ScholarDigital Library
- O. Vechtomova, S. E. Robertson and S. Jones, Query Expansion with Long-Span Collocates, Journal of Information Retrieval, 6, 251--273, 2003 Google ScholarDigital Library
- J. P. Callan, Passage-based evidence in document retrieval, Proc. ACM SIGIR, pp. 302--310, 1994 Google ScholarDigital Library
- Y. K. Kong, R. W. P. Luk, W. Lam, K. S. Ho and F. L. Chung, Passage-based retrieval based on parameterized fuzzy operators, ACM SIGIR Workshop on Mathematical/Formal Methods for Information Retrieval, 2004Google Scholar
- V. Lavrenko and W. B. Croft, Relevance based language models, Proc. ACM SIGIR, pp. 120--127, 2001 Google ScholarDigital Library
- S. E. Robertson and K. Sparck Jones, Relevance Weighting of Search Terms, Journal of the American Society for Information Science, 27, 129--146, 1976Google ScholarCross Ref
- W. S. Cooper, Some inconsistencies and misidentified modeling assumptions in probabilistic information retrieval, ACM TOIS, 13, 1, 100--111, 1995 Google ScholarDigital Library
- D. Harman, Relevance Feedback Revisited, Proc. ACM SIGIR, pp. 1--10, 1992 Google ScholarDigital Library
- S. E. Robertson and S. Walker, Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted Retrieval, Proc. ACM SIGIR, pp. 232--241, 1992 Google ScholarDigital Library
- K. Sparck Jones, Summary Performance Comparisons TREC-2 Through TREC-7, Proc. TREC-7, pp. B-1, 1998Google Scholar
- G. V. Cormack, C. L. A. Clarke, C. R. Palmer and S. S. L. To, Passage-Based Refinement, Proc. TREC-6, pp. 303--320, 1997Google Scholar
Index Terms
- Probabilistic document-context based relevance feedback with limited relevance judgments
Recommendations
Document-based and term-based linear methods for pseudo-relevance feedback
Query expansion is a successful approach for improving Information Retrieval effectiveness. This work focuses on pseudo-relevance feedback (PRF) which provides an automatic method for expanding queries without explicit user feedback. These techniques ...
Query dependent pseudo-relevance feedback based on wikipedia
SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrievalPseudo-relevance feedback (PRF) via query-expansion has been proven to be e®ective in many information retrieval (IR) tasks. In most existing work, the top-ranked documents from an initial search are assumed to be relevant and used for PRF. One problem ...
Image retrieval based on indexing and relevance feedback
In content based image retrieval (CBIR) system, search engine retrieves the images similar to the query image according to a similarity measure. It should be fast enough and must have a high precision of retrieval. Indexing scheme is used to achieve a ...
Comments