Skip to main content
Log in

Query ranking model for search engine query recommendation

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

In this paper, we propose a query ranking model to select and order queries for search engine query recommendations. In contrast to existing similarity-based query recommendation methods (Agglomerative clustering of a search engine query log, 2000; The query-flow graph: model and applications, 2008], this model is based on utility, and ranks a query based on the joint probability of events whereby a query is selected by the user, the search results of the query are selected by the user, and the chosen search results satisfy the user’s information needs. We thus define three utilities in our model: a query-level utility representing the attractiveness of a query to the user, a perceived utility measuring the user’s actions given the search results, and a posterior utility measuring the user’s satisfaction with the chosen search results. We propose methods to compute these three utilities from query log data. In experiments involving real query log data, our proposed query ranking model outperformed seven other baseline methods in generating useful recommendations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. In our experiments, we set μ β  = 4.

  2. In our experiments, we set μ γ  = 6.

  3. http://research.microsoft.com/users/nickcr/wscd09.

  4. The Internal Revenue Service (IRS) is an organization of the United States federal government. The IRS is responsible for collecting taxes, and the interpretation and enforcement of the Internal Revenue Code.

References

  1. Abedini M, Kirley M (2013) An enhanced XCS rule discovery module using feature ranking. Int J Mach Learn Cybernet 4(3):173–187

    Article  Google Scholar 

  2. Ageev M, Guo Q, Lagun D, Agichtein E (2011) Find it if you can: a game for modeling different types of web search success using interaction data. In: Proceedings of the 34th international ACM sigir conference on research and development in information, SIGIR’11, pp 345–354

  3. Ahsaee M, Naghibzadeh M, Naeini Y (2014) Semantic similarity assessment of words using weighted WordNet. Int J Mach Learn Cybernet 5(3):479–490

    Article  Google Scholar 

  4. Anagnostopoulos A, Becchetti L, Castillo C, Gionis A (2010) An optimization framework for query recommendation. In: Proceedings of the 3rd ACM international conference on web search and data mining, WSDM’10, pp 161–170

  5. Baraglia R, Castillo C, Donato D, Nardini F, Perego R, Silvestri F (2009) Aging effects on query flow graphs for query suggestion. In: Proceedings of the 18th ACM conference on information and knowledge management, CIKM’09, pp 1947–1950

  6. Baraglia R, Nardini FM, Castillo C, Perego R, Donato D, Silvestri F (2010) The effects of time on query flow graph-based models for query suggestion. In: Adaptivity, personalization and fusion of heterogeneous information, RIAO’10, pp 182–189

  7. Beeferman D, Berger A (2000) Agglomerative clustering of a search engine query log. In: Proceedings of the 6th ACM international conference on knowledge discovery and data mining, SIGKDD’00, pp 407–416

  8. Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  9. Boldi P, Bonchi F, Castillo C, Donato D, Gionis A, Vigna S (2008) The query-flow graph: model and applications. In: Proceedings of the 17th ACM conference on information and knowledge management, CIKM’08, pp 609–618

  10. Boldi P, Bonchi F, Castillo C, Donato D, Vigna S (2009) Query suggestions using query-flow graphs. In: Proceedings of the 2009 workshop on web search click data, WSCD’09, pp 56–63

  11. Bordino I, Castillo C, Donato D, Gionis A (2010) Query similarity by projecting the query-flow graph. In: Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval, SIGIR’10, pp 515–522

  12. Chen L (2014) EM-type method for measuring graph dissimilarity. Int J Mach Learn Cybernet 5(4):625–633

    Article  Google Scholar 

  13. Chai J, Liu J (2013) Dominance-based decision rule induction for multi-criteria ranking. Int J Mach Learn Cybernet 4(5):427–444

    Article  Google Scholar 

  14. Craswell N, Szummer M (2007) Random walks on the click graph. In: Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR’07, pp 239–246

  15. Fonseca BM, Golgher PB, de Moura ES, Ziviani N (2003) Using association rules to discover search engine-related queries. In: Proceedings of the 1st conference on latin American web congress, LA-WEB’03, pp 66–71

  16. Fox S, Karnawat K, Mydland M, Dumais S, White T (2005) Evaluating implicit measures to improve web search. ACM Trans Inform Syst 23(2):147–168

    Article  Google Scholar 

  17. Hassan A, Jones R, Klinkner KL (2010) Beyond DCG: user behavior as a predictor of a successful search. In: Proceedings of the 3rd ACM international conference on web search and data mining, WSDM’10, pp 221–230

  18. He Q, Jiang D, Liao Z, Hoi SCH, Chang K, Lim EP, Li H (2009) Web query recommendation via sequential query prediction. In: Proceedings of the 2009 IEEE international conference on data engineering, ICDE’09, pp 1443–1454

  19. Huang CK, Chien LF, Oyang YJ (2003) Relevant term suggestion in interactive web search based on contextual information in query session logs. J Am Soc Inform Sci Technol 54(7):638–649

    Article  Google Scholar 

  20. Jain A, Ozertem U, Velipasaoglu E (2011) Synthesizing high utility suggestions for rare web search queries. In: Proceedings of the 34th international ACM SIGIR conference on research and development in information, SIGIR’11, pp 805–814

  21. Jarvelin K, Kekalainen J (2002) Cumulated gain-based evaluation of IR techniques. ACM Trans Inform Syst 20(4):422–446

    Article  Google Scholar 

  22. Jones R, Rey B, Madani O, Greiner W (2006) Generating query substitutions. In: Proceedings of the 15th international conference on world wide web, WWW’06, pp 387–396

  23. Kuhn H, Tucker A (1951) Nonlinear programming. In: Proceedings of the 2nd Berkeley symposium on mathematical statistics and probability, Statistical Laboratory of the University of California, Berkeley, pp 481–492

  24. Leung KWT, Ng W, Lee DL (2008) Personalized concept-based clustering of search engine queries. IEEE Trans Knowl Data Eng 20(11):1505–1518

    Article  Google Scholar 

  25. Li L, Yang Z, Liu L, Kitsuregawa M (2008) Query-URL bipartite based approach to personalized query recommendation. In: Proceedings of the 23rd national conference on artificial intelligence, vol 2, AAAI’08, pp 1189–1194

  26. Li L, Xu G, Yang Z, Dolog P, Zhang Y, Kitsuregawa M (2013) An efficient approach to suggesting topically related web queries using hidden topic model. World Wide Web 16(3):273–297

    Article  Google Scholar 

  27. Liu Y, Song R, Chen Y, Jian-Yun N, Wen JR (2012) Adaptive query suggestion for difficult queries. In: Proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval, SIGIR’12, pp 15–24

  28. Liu Z, Sun M (2008) Asymmetrical query recommendation method based on bipartite network resource allocation. In: Proceedings of the 17th international conference on world wide web, WWW’08, pp 1049–1050

  29. http://en.wikipedia.org/wiki/Luce%27s_choice_axiom

  30. Ma H, Yang H, King I, Lyu MR (2008) Learning latent semantic relations from click through data for query suggestion. In: Proceeding of the 17th ACM conference on information and knowledge management, CIKM’08, pp 709–718

  31. Ma H, King I, Lyu M (2012) Mining web graphs for recommendations. IEEE Trans Knowl Data Eng 24(6):1051–1064

    Article  Google Scholar 

  32. Mei Q, Zhou D, Church K (2008) Query suggestion using hitting time. In: Proceeding of the 17th ACM conference on information and knowledge management, CIKM’08, pp 469–477

  33. Ozertem U, Chapelle O, Donmez P, Velipasaoglu E (2012) Learning to suggest: a machine learning framework for ranking query suggestions. In: Proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval, SIGIR’12, pp 25–34

  34. Strohmaier M, Kröll M, Körner C (2009) Intentional query suggestion: making user goals more explicit during search. In: Proceedings of the 2009 workshop on web search click data, WSCD’09, pp 68–74

  35. Tognola G, Rainer B (1999) Unlimited point algorithm for OPF problems. IEEE Trans Power Syst 14(3):1049–1052

    Article  Google Scholar 

  36. Wang B, Liang J, Qian Y (2014) Determining decision makers’ weights in group ranking: a granular computing method. Int J Mach Learn Cybernet. doi:10.1007/s13042-014-0278-5

    Google Scholar 

  37. Wang J, Huang J (2014) QRM: a probabilistic model for search engine query recommendation. In: Proceedings of PAKDD 2014 workshops, LNCS 8643, Springer, pp 665–676

  38. Zhu X, Guo J, Cheng X, Lan Y (2012) More than relevance: high utility query recommendation by mining users’ search behaviors. In: Proceedings of the 21st ACM international conference on information and knowledge management, CIKM’12, pp 37–46

  39. Zhu X, Guo J, Cheng X, Lan Y, Nejdl W (2013) Recommending high utility query via session flow graph. In: Proceedings of the 35th European conference on IR research, ECIR’13, pp 642–655

Download references

Acknowledgments

Joshua Zhexue Huang was supported by The National Natural Science Foundation of China under Grant No. 61473194.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joshua Zhexue Huang.

Additional information

This paper has been extended from our workshop paper [37].

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, J., Huang, J.Z., Guo, J. et al. Query ranking model for search engine query recommendation. Int. J. Mach. Learn. & Cyber. 8, 1019–1038 (2017). https://doi.org/10.1007/s13042-015-0362-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-015-0362-5

Keywords

Navigation