skip to main content
10.1145/2396761.2396827acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

The wisdom of advertisers: mining subgoals via query clustering

Published: 29 October 2012 Publication History

Abstract

This paper tackles the problem of mining subgoals of a given search goal from data. For example, when a searcher wants to travel to London, she may need to accomplish several subtasks such as "book flights," "book a hotel," "find good restaurants" and "decide which sightseeing spots to visit." As another example, if a searcher wants to lose weight, there may exist several alternative solutions such as "do physical exercise," "take diet pills," and "control calorie intake." In this paper, we refer to such subtasks or solutions as subgoals, and propose to utilize sponsored search data for finding subgoals of a given query by means of query clustering. Advertisements (ads) reflect advertisers' tremendous efforts in trying to match a given query with implicit user needs. Moreover, ads are usually associated with a particular action or transaction. We therefore hypothesized that they are useful for subgoal mining. To our knowledge, our work is the first to use sponsored search data for this purpose. Our experimental results show that sponsored search data is a good resource for obtaining related queries and for identifying subgoals via query clustering. In particular, our method that combines ad impressions from sponsored search data and query co-occurrences from session data outperforms a state-of-the-art query clustering method that relies on document clicks rather than ad impressions in terms of purity, NMI, Rand Index, F1-measure and subgoal recall.

References

[1]
L. M. Aiello, D. Donato, U. Ozertem, and F. Menczer. Behavior-driven clustering of queries into topics. In Proc. of CIKM, pages 1373--1382, 2011.
[2]
R. Baeza-Yates, C. Hurtado, and M. Mendoza. Query recommendation using query logs in search engines. In Current Trends in Database Technology-EDBT 2004 Workshops, pages 588--596, 2004.
[3]
D. Beeferman and A. Berger. Agglomerative clustering of a search engine query log. In Proc. of KDD, pages 407--416, 2000.
[4]
P. Boldi, F. Bonchi, C. Castillo, D. Donato, A. Gionis, and S. Vigna. The query-flow graph: model and applications. In Proc. of CIKM, pages 609--618, 2008.
[5]
A. Broder. A taxonomy of web search. ACM SIGIR Forum, 36(2):3--10, 2002.
[6]
A. Broder, P. Ciccolo, M. Fontoura, E. Gabrilovich, V. Josifovski, and L. Riedel. Search advertising using web relevance feedback. In Proc. of CIKM, pages 1013--1022, 2008.
[7]
G. Buscher, S. T. Dumais, and E. Cutrell. The good, the bad, and the random: an eye-tracking study of ad quality in web search. In Proc. of SIGIR, pages 42--49, 2010.
[8]
H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. Context-aware query suggestion by mining click-through and session data. In Proc. of KDD, pages 875--883, 2008.
[9]
H. K. Dai, L. Zhao, Z. Nie, J.-R. Wen, L.Wang, and Y. Li. Detecting online commercial intention (OCI). In Proc. of WWW, pages 829--837, 2006.
[10]
C. Danescu-Niculescu-Mizil, A. Z. Broder, E. Gabrilovich, V. Josifovski, and B. Pang. Competing for users' attention: on the interplay between organic and sponsored search results. In Proc. of WWW, pages 291--300, 2010.
[11]
D. Donato, F. Bonchi, T. Chi, and Y. Maarek. Do you want to take notes?: identifying research missions in Yahoo! search pad. In Proc. of WWW, pages 321--330, 2010.
[12]
J. Fleiss and J. Cohen. The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability. Educational and Psychological Measurement, 1973.
[13]
B. M. Fonseca, P. Golgher, B. Pôssas, B. Ribeiro-Neto, and N. Ziviani. Concept-based interactive query expansion. In Proc. of CIKM, pages 696--703, 2005.
[14]
T. Graepel, J. Candela, T. Borchert, and R. Herbrich. Web-scale bayesian click-through rate prediction for sponsored search advertising in Microsoft's Bing search engine. In Proc. of ICML, pages 13--20, 2010.
[15]
Q. Guo and E. Agichtein. Ready to buy or just browsing?: detecting web searcher goals from interaction data. In Proc. of SIGIR, pages 130--137, 2010.
[16]
D. Hillard, S. Schroedl, E. Manavoglu, H. Raghavan, and C. Leggetter. Improving ad relevance in sponsored search. In Proc. of WSDM, pages 361--370, 2010.
[17]
B. J. Jansen. The comparative effectiveness of sponsored and nonsponsored links for web e-commerce queries. ACM Transactions on the Web, 1(3), 2007.
[18]
B. J. Jansen, D. Booth, and A. Spink. Determining the informational, navigational, and transactional intent of web queries. Information Processing & Management, 44(3):1251--1266, 2008.
[19]
B. J. Jansen and M. Resnick. Examining searcher perceptions of and interactions with sponsored results. In Workshop on Sponsored Search Auctions, 2005.
[20]
K. Järvelin and J. Kekäläinen. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems, 20(4):422--446, 2002.
[21]
R. Jones and K. L. Klinkner. Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs. In Proc. of CIKM, pages 699--708, 2008.
[22]
I. Kang and G. Kim. Query type classification for web document retrieval. In Proc. of SIGIR, pages 64--71, 2006.
[23]
E. Kanoulas, B. Carterette, P. D. Clough, and M. Sanderson. Evaluating multi-query sessions. In Proc. of SIGIR, pages 1053--1062, 2011.
[24]
Y. Liu, X. Ni, J.-T. Sun, and Z. Chen. Unsupervised transactional query classification based on webpage form understanding. In Proc. of CIKM, pages 57--66, 2011.
[25]
C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008.
[26]
M. P. Kato, T. Sakai, and K. Tanaka. Structured query suggestion for specialization and parallel movement: effect on search behaviors. In Proc. of WWW, pages 389--398, 2012.
[27]
F. Radlinski, A. Broder, P. Ciccolo, E. Gabrilovich, V. Josifovski, and L. Riedel. Optimizing relevance and revenue in ad search: a query substitution approach. In Proc. of SIGIR, pages 403--410, 2008.
[28]
E. Sadikov, J. Madhavan, L. Wang, and A. Halevy. Clustering query refinements by user intent. In Proc. of WWW, pages 841--850, 2010.
[29]
X. Wang, D. Chakrabarti, and K. Punera. Mining broad latent query aspects from search sessions. In Proc. of KDD, pages 867--876, 2009.
[30]
J.-R.Wen, J.-Y. Nie, and H.-J. Zhang. Clustering user queries of a search engine. In Proc. of WWW, pages 162--168, 2001.
[31]
R. White and R. Roth. Exploratory search: Beyond the query-response paradigm. Synthesis Lectures on Information Concepts, Retrieval, and Services, 1(1):1--98, 2009.
[32]
W. Xu, E. Manavoglu, and E. Cantu-Paz. Temporal click model for sponsored search. In Proc. of SIGIR, pages 106--113, 2010.

Cited By

View all
  • (2023)Buy Eye-Mask Instead of Alarm Clock!: Graph-Based Approach to Identify Functionally Equal Alternative ProductsInformation Integration and Web Intelligence10.1007/978-3-031-48316-5_27(265-279)Online publication date: 4-Dec-2023
  • (2021)A Framework for Comparative Analysis of Intention Mining ApproachesResearch Challenges in Information Science10.1007/978-3-030-75018-3_2(20-37)Online publication date: 8-May-2021
  • (2018)Mining Alternative Actions from Community Q&A CorpusJournal of Information Processing10.2197/ipsjjip.26.42726(427-438)Online publication date: 2018
  • Show More Cited By

Index Terms

  1. The wisdom of advertisers: mining subgoals via query clustering

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
    October 2012
    2840 pages
    ISBN:9781450311564
    DOI:10.1145/2396761
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 October 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. query clustering
    2. sponsored search
    3. user intent

    Qualifiers

    • Research-article

    Conference

    CIKM'12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 07 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Buy Eye-Mask Instead of Alarm Clock!: Graph-Based Approach to Identify Functionally Equal Alternative ProductsInformation Integration and Web Intelligence10.1007/978-3-031-48316-5_27(265-279)Online publication date: 4-Dec-2023
    • (2021)A Framework for Comparative Analysis of Intention Mining ApproachesResearch Challenges in Information Science10.1007/978-3-030-75018-3_2(20-37)Online publication date: 8-May-2021
    • (2018)Mining Alternative Actions from Community Q&A CorpusJournal of Information Processing10.2197/ipsjjip.26.42726(427-438)Online publication date: 2018
    • (2017)Mining alternative actions from community Q&A corpus for task-oriented web searchProceedings of the International Conference on Web Intelligence10.1145/3106426.3106461(607-614)Online publication date: 23-Aug-2017
    • (2015)Constructing Complex Search Tasks with Coherent Subtask Search GoalsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/274254715:2(1-29)Online publication date: 11-Dec-2015
    • (2014)Heterogeneous graph-based intent learning with queries, web pages and Wikipedia conceptsProceedings of the 7th ACM international conference on Web search and data mining10.1145/2556195.2556222(23-32)Online publication date: 24-Feb-2014

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media