Abstract
The application of relevance feedback techniques has been shown to improve retrieval performance for a number of information retrieval tasks. This paper explores incremental relevance feedback for ad hoc Japanese text retrieval, examining, separately and in combination, the utility of term reweighting and query expansion using a probabilistic retrieval model. Retrieval performance is evaluated in terms of standard precision-recall measures, and also using “number-to-view” graphs. Experimental results on the standard BMIR-J2 Japanese language retrieval collection show that both term reweighting and query expansion improve retrieval performance. This is reflected not only in improvements in both precision and recall, but also in a reduction in the average number of documents that must be viewed to find a selected number of relevant items. In particular, using a simple simulation of user searching, incremental application of relevance information is shown to lead to progressively improved retrieval performance and an overall reduction in the number of documents that a user must view to find relevant ones.
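The "number-to-view" idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes the measure is the count of documents a user reads from the top of a ranked list before finding n relevant ones; the function name `number_to_view` and the document identifiers are hypothetical and are not taken from the paper.

```python
from typing import Optional, Sequence, Set


def number_to_view(ranking: Sequence[str], relevant: Set[str], n: int) -> Optional[int]:
    """Count the documents viewed, top down, until n relevant ones are found.

    Returns None when the ranking holds fewer than n relevant documents.
    This is an illustrative definition, not the paper's exact formulation.
    """
    if n <= 0:
        return 0
    found = 0
    for viewed, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            found += 1
            if found == n:
                return viewed
    return None


# Hypothetical ranked list and relevance judgements for one query.
ranking = ["d3", "d7", "d1", "d9", "d4"]
relevant = {"d7", "d4", "d8"}

for n in (1, 2, 3):
    print(n, number_to_view(ranking, relevant, n))
```

Averaging this value over all queries for each n would give one point per n on a number-to-view graph; a feedback method that moves relevant documents up the ranking lowers the curve.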
Cite this article
Jones, G., Sakai, T., Kajiura, M. et al. Incremental Relevance Feedback in Japanese Text Retrieval. Information Retrieval 2, 361–384 (2000). https://doi.org/10.1023/A:1009932512781
DOI: https://doi.org/10.1023/A:1009932512781