Abstract
Query expansion has been widely used to improve the effectiveness of conceptual search. In this paper, pseudo-relevance feedback is combined with equi-width and equi-frequency partitioning techniques. The proposed method uses the position and frequency of the query terms to identify a region within the retrieved documents that is expected to contain expansion terms. This region is the intersection of the regions obtained by partitioning the retrieved documents with the equi-width and equi-frequency techniques. Initial results indicate that the words falling in this intersecting region are good candidate terms for query expansion. The experiments were performed on the FIRE 2011 Ad-hoc Hindi and English data, using Terrier as the retrieval engine. The initial experiments show an improvement in average precision of 12-14% on the English data and 12.75% on the Hindi data.
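The abstract does not say how the two partitions are built or how the region is chosen, so the following Python sketch is only one plausible reading, not the authors' implementation. It assumes a single tokenized feedback document, picks the equal-width bin with the most query-term occurrences, picks the equal-frequency group of occurrences with the narrowest span, and returns the non-query words in the overlap. The function names, the parameter `k`, and both selection rules are hypothetical.

```python
from collections import Counter

def query_positions(tokens, query_terms):
    """Return the token offsets at which any query term occurs."""
    return [i for i, tok in enumerate(tokens) if tok in query_terms]

def densest_equi_width_bin(positions, doc_len, k):
    """Split [0, doc_len) into bins of equal width and return the
    (start, end) of the bin with the most query-term occurrences.
    Assumption: 'densest bin' is our selection rule, not the paper's."""
    width = -(-doc_len // k)  # ceiling division
    counts = Counter(p // width for p in positions)
    # Ties are broken in favour of the earliest bin.
    best = max(range(k), key=lambda b: (counts.get(b, 0), -b))
    return best * width, min((best + 1) * width, doc_len)

def narrowest_equi_frequency_bin(positions, k):
    """Split the sorted occurrences into groups of equal size and return
    the (start, end) span of the narrowest full-size group.
    Assumption: 'narrowest span' is our selection rule, not the paper's."""
    positions = sorted(positions)
    size = -(-len(positions) // k)  # occurrences per group
    groups = [positions[i:i + size] for i in range(0, len(positions), size)]
    full = [g for g in groups if len(g) == size]  # skip a short last group
    best = min(full, key=lambda g: g[-1] - g[0])
    return best[0], best[-1] + 1

def candidate_terms(tokens, query_terms, k=4):
    """Return non-query tokens inside the overlap of the two regions."""
    query_terms = set(query_terms)
    positions = query_positions(tokens, query_terms)
    if not positions:
        return []
    w_start, w_end = densest_equi_width_bin(positions, len(tokens), k)
    f_start, f_end = narrowest_equi_frequency_bin(positions, k)
    start, end = max(w_start, f_start), min(w_end, f_end)
    return [t for t in tokens[start:end] if t not in query_terms]

tokens = ("intro text about search query expansion query feedback "
          "other words here query more filler query end").split()
print(candidate_terms(tokens, {"query"}, k=2))  # → ['expansion']
```

In this toy document the equal-width region covers offsets 0 to 7 and the equal-frequency region covers offsets 4 to 6, so only the word between the two nearby query occurrences survives. A fuller version would aggregate candidates over all top-ranked feedback documents and weight them before adding them to the query.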
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vaidyanathan, R., Das, S., Srivastava, N. (2013). Query Expansion Based on Equi-Width and Equi-Frequency Partition. In: Majumder, P., Mitra, M., Bhattacharyya, P., Subramaniam, L.V., Contractor, D., Rosso, P. (eds) Multilingual Information Access in South Asian Languages. Lecture Notes in Computer Science, vol 7536. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40087-2_2
DOI: https://doi.org/10.1007/978-3-642-40087-2_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40086-5
Online ISBN: 978-3-642-40087-2
eBook Packages: Computer Science, Computer Science (R0)