skip to main content
10.1145/1390334.1390421acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Automatically identifying localizable queries

Published: 20 July 2008 Publication History

Abstract

Personalization of web search results as a technique for improving user satisfaction has received notable attention in the research community over the past decade. Much of this work focuses on modeling and establishing a profile for each user to aid in personalization. Our work takes a more query-centric approach. In this paper, we present a method for efficient, automatic identification of a class of queries we define as localizable from a web search engine query log. We determine a set of relevant features and use conventional machine learning techniques to classify queries. Our experiments find that our technique is able to identify localizable queries with 94% accuracy.

References

[1]
C. M. Bishop. Neural Networks for Pattern Recognition. Oxford University Press, November 1995.
[2]
C. Buckley, G. Salton, J. Allan, and A. Singhal. Automatic query expansion using SMART: TREC 3. In Text REtrieval Conference, pages 0--, 1994.
[3]
C. J. C. Burges. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2(2):121--167, 1998.
[4]
Y. Freund and R. E. Schapire. Experiments with a new boosting algorithm. In International Conference on Machine Learning, pages 148--156, 1996.
[5]
C.-K. Huang, L.-F. Chien, and Y.-J. Oyang. Relevant term suggestion in interactive web search based on contextual information in query session logs. J. Am. Soc. Inf. Sci. Technol., 54(7):638--649, 2003.
[6]
B. J. Jansen, A. Spink, and T. Saracevic. Real life, real users, and real needs: a study and analysis of user queries on the web. Information Processing and Management, 36(2):207--227, 2000.
[7]
G. Jeh and J. Widom. Scaling personalized web search. In WWW '03: Proceedings of the 12th international conference on World Wide Web, pages 271--279, New York, NY, USA, 2003. ACM Press.
[8]
T. Joachims. Optimizing search engines using clickthrough data. In KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 133--142, New York, NY, USA, 2002. ACM Press.
[9]
T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting clickthrough data as implicit feedback. In SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 154--161, New York, NY, USA, 2005. ACM Press.
[10]
G. H. John and P. Langley. Estimating continuous distributions in Bayesian classifiers. In Proceedings of the Eleventh Annual Conference on Uncertainty in Artificial Intelligence (UAI '95), pages 338--345, 1995.
[11]
Y.-H. Kim, S.-Y. Hahn, and B.-T. Zhang. Text filtering by boosting naive bayes classifiers. In SIGIR '00: Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, pages 168--175, New York, NY, USA, 2000. ACM.
[12]
R. Kraft, C. C. Chang, F. Maghoul, and R. Kumar. Searching with context. In WWW '06: Proceedings of the 15th international conference on World Wide Web, pages 477--486, New York, NY, USA, 2006. ACM Press.
[13]
R. Kraft and J. Zien. Mining anchor text for query refinement, 2004.
[14]
T. Lau and E. Horvitz. Patterns of search: Analyzing and modeling web query refinement. In A. Press, editor, Proceedings of the Seventh International Conference on User Modeling, 1999.
[15]
U. Lee, Z. Liu, and J. Cho. Automatic identification of user goals in web search. In WWW2005, 2005.
[16]
Y.-S. Lee. Morphological analysis for statistical machine translation. In D. M. Susan Dumais and S. Roukos, editors, HLT-NAACL 2004: Short Papers, pages 57--60, Boston, Massachusetts, USA, May 2 - May 7 2004. Association for Computational Linguistics.
[17]
F. Liu, C. Yu, and W. Meng. Personalized web search by mapping user queries to categories. In CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management, pages 558--565, New York, NY, USA, 2002. ACM Press.
[18]
I. Mierswa, M. Wurst, R. Klinkenberg, M. Scholz, and T. Euler. Yale: rapid prototyping for complex data mining tasks. In KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 935--940, New York, NY, USA, 2006. ACM.
[19]
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.
[20]
G. Pass, A. Chowdhury, and C. Torgeson. A picture of search. The First International Conference on Scalable Information Systems, June 2006.
[21]
M. F. Porter. An algorithm for suffix stripping. Readings in information retrieval, pages 313--316, 1997.
[22]
A. Pretschner and S. Gauch. Ontology based personalized search. In ICTAI '99: Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence, page 391, Washington, DC, USA, 1999. IEEE Computer Society.
[23]
I. Rish. An empirical study of the naive bayes classifier. In IJCAI-01 workshop on "Empirical Methods in AI", 2001.
[24]
M. Speretta and S. Gauch. Personalized search based on user search histories. In WI '05: Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence, pages 622--628, Washington, DC, USA, 2005. IEEE Computer Society.
[25]
J. Xu and W. B. Croft. Query expansion using local and global document analysis. In SIGIR '96: Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, pages 4--11, New York, NY, USA, 1996. ACM Press.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
July 2008
934 pages
ISBN:9781605581644
DOI:10.1145/1390334
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 July 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. localizable query
  2. machine learning
  3. web search

Qualifiers

  • Research-article

Conference

SIGIR '08
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)3
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Deep Learning Methods for Query Auto CompletionAdvances in Information Retrieval10.1007/978-3-031-28241-6_35(341-348)Online publication date: 16-Mar-2023
  • (2020)Query ClassificationQuery Understanding for Search Engines10.1007/978-3-030-58334-7_2(15-41)Online publication date: 2-Dec-2020
  • (2020)Personalization in text information retrievalJournal of the Association for Information Science and Technology10.1002/asi.2423471:3(349-369)Online publication date: 28-Jan-2020
  • (2019)PAVAL: A location-aware virtual personal assistant for retrieving geolocated points of interest and location-based servicesEngineering Applications of Artificial Intelligence10.1016/j.engappai.2018.09.01377(70-85)Online publication date: Jan-2019
  • (2016)Automatic Identification and Contextual Reformulation of Implicit System-Related QueriesProceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval10.1145/2911451.2914701(761-764)Online publication date: 7-Jul-2016
  • (2015)A Preferences Based Approach for Better Comprehension of User Information NeedsTransactions on Computational Collective Intelligence XVIII10.1007/978-3-662-48145-5_4(67-85)Online publication date: 31-Jul-2015
  • (2014)Collaborative Language Models for Localized Query PredictionACM Transactions on Interactive Intelligent Systems10.1145/26226174:2(1-21)Online publication date: 1-Jun-2014
  • (2014)A Preferences Based Approach for Better Comprehension of User Information NeedsComputational Collective Intelligence. Technologies and Applications10.1007/978-3-319-11289-3_10(94-103)Online publication date: 2014
  • (2014)A New Preference Based Model for Relevant Dimension Identification in Contextual Mobile SearchNetworked Systems10.1007/978-3-319-09581-3_15(215-229)Online publication date: 3-Aug-2014
  • (2014)Adapting User’s Context to Understand Mobile Information NeedsModern Trends and Techniques in Computer Science10.1007/978-3-319-06740-7_29(343-354)Online publication date: 6-May-2014
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media