Abstract
Traditional relevance feedback methods, which usually use the most frequent terms in the relevant documents as expansion terms to enrich the user’s initial query, could help improve retrieval performance. However, in reality, many expansion terms identified in traditional approaches are indeed unrelated to the query and even harmful to the retrieval. This paper introduces a new method based on the relative word-frequency to select good expansion terms. The relative word-frequency defined in this paper is a new feature and can help discriminate relevant documents from irrelevant ones. The new approach selects good expansion terms according to the relative word-frequency and uses them to reformulate the initial query. We compare a set of existing relevance feedback methods with our proposed approach, including the representative vector space models and language models. Our experiments on several TREC collections demonstrate that retrieval effectiveness can be much improved when the proposed approach is used. Experimental results show that the improvement of our proposed approach is more than 30% over traditional relevance feedback techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Smeaton, A.F., Van Rijsbergen, C.J.: The retrieval effects of query expansion on a feedback document retrieval system. Computer Journal 26(3), 239–246 (1983)
Peat, H.J., Willett, P.: The limitations of term co-occurrence data for query expansion in document retrieval systems. Journal of the American Society for Information Science 42(5), 378–383 (1991)
Cao, G.H., Nie, J.Y., Gao, J.F., Robertson, S.: Selecting Good Expansion Terms for Pseudo-Relevance Feedback. In: The Proceedings of SIGIR 2008, pp. 243–250 (2008)
Bin, T., Velivellia, Hui, F., et al.: Term feedback for information retrieval with language models. In: The Proceedings of SIGIR 2007, pp. 263–270 (2007)
Rocchio, J.J.: Relevance feedback in information retrieval. In: The SMART Retrieval System. Experiments in Automatic Document Processing, pp. 313–323 (1971)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)
Amo, P., Ferreras, F.L., Cruz, F., et al.: Smoothing functions for automatic relevance feedback in information retrieval. In: Proc of the 11th International Workshop on Database and Expert Systems Applications, pp. 115–119 (2000)
Lavrenko, V., Croft, B.: Relevance-based language models. In: The Proceedings of SIGIR 2001, pp. 120–128 (2001)
Robertson, S.E.: On term selection for query expansion. Journal of Documentation 46(4), 359–364 (1990)
Wang, X., Fang, H., Zhai, C.: Improve retrieval accuracy for difficult queries using negative feedback. In: CIKM, pp. 991–994 (2007)
Zhai, C., Lafferty, J.: Model-based feedback in the language modeling approach to information retrieval. In: Proceedings of ACM CIKM 2001, pp. 403–410 (2001)
Miller, D., Leek, T., Schwartz, R.: A hidden markov model information retrieval system. In: The Proceedings of SIGIR 1999, pp. 214–221 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, Z., Lu, Y. (2010). A Relative Word-Frequency Based Method for Relevance Feedback. In: Dicheva, D., Dochev, D. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 2010. Lecture Notes in Computer Science(), vol 6304. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15431-7_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-15431-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15430-0
Online ISBN: 978-3-642-15431-7
eBook Packages: Computer ScienceComputer Science (R0)