Abstract
Searchers’ difficulty in formulating effective queries for their information needs is well known. Analysis of search session logs shows that users often pose short, vague queries and then struggle to revise them. Interactive query expansion, in which users select terms to add to their queries, dramatically improves effectiveness and satisfaction. Suggesting relevant candidate expansion terms based on the initial query enables users to satisfy their information needs faster. We find that suggesting the query phrases that other users have found it necessary to add to a given query (mined from session logs) dramatically improves the quality of suggestions over simply using co-occurrence. However, this exacerbates the sparseness problem faced when mining short queries that lack features. To mitigate this, we tag query phrases with higher-level topical categories in order to mine more general rules, and find that this enables us to make suggestions for approximately 10% more queries while maintaining an acceptable false-positive rate.
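The idea of mining added phrases and backing off to topical categories can be illustrated with a minimal sketch. It is one possible reading of the abstract, not the authors' implementation: the session format (an ordered list of query strings), the prefix test used to detect an added phrase, the raw-count ranking, and all names (`added_phrase`, `mine_rules`, `suggest`, the `categories` mapping) are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def added_phrase(first, revised):
    """Return the words appended to `first` in `revised`, or None.

    Assumption: a reformulation counts as an "addition" only when the
    revised query starts with all the words of the earlier query.
    """
    a, b = first.split(), revised.split()
    if len(b) > len(a) and b[:len(a)] == a:
        return " ".join(b[len(a):])
    return None


def mine_rules(sessions, categories):
    """Count added phrases per query and per topical category.

    `sessions` is a list of sessions, each an ordered list of query strings.
    `categories` maps a query to its topical tags (hypothetical tagging).
    """
    query_rules = defaultdict(Counter)
    category_rules = defaultdict(Counter)
    for session in sessions:
        for first, revised in zip(session, session[1:]):
            phrase = added_phrase(first, revised)
            if phrase is None:
                continue
            query_rules[first][phrase] += 1
            # Generalize the rule to every category the query is tagged with.
            for cat in categories.get(first, ()):
                category_rules[cat][phrase] += 1
    return query_rules, category_rules


def suggest(query, query_rules, category_rules, categories, k=3):
    """Suggest up to k phrases, backing off to category-level rules."""
    if query in query_rules:
        return [p for p, _ in query_rules[query].most_common(k)]
    pooled = Counter()
    for cat in categories.get(query, ()):
        pooled.update(category_rules.get(cat, Counter()))
    return [p for p, _ in pooled.most_common(k)]
```

In this sketch, a query never seen in the logs can still receive suggestions if it carries a category tag shared with logged queries, which is the sense in which category-level rules cover more queries. A real system would also need a confidence threshold to keep the false-positive rate acceptable; that step is omitted here.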
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jensen, E.C., Beitzel, S.M., Chowdhury, A., Frieder, O. (2006). Query Phrase Suggestion from Topically Tagged Session Logs. In: Larsen, H.L., Pasi, G., Ortiz-Arroyo, D., Andreasen, T., Christiansen, H. (eds) Flexible Query Answering Systems. FQAS 2006. Lecture Notes in Computer Science, vol 4027. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11766254_16
DOI: https://doi.org/10.1007/11766254_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34638-8
Online ISBN: 978-3-540-34639-5
eBook Packages: Computer Science, Computer Science (R0)