skip to main content
10.1145/1963405.1963452acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Consideration set generation in commerce search

Published: 28 March 2011 Publication History

Abstract

In commerce search, the set of products returned by a search engine often forms the basis for all user interactions leading up to a potential transaction on the web. Such a set of products is known as the consideration set. In this study, we consider the problem of generating consideration set of products in commerce search so as to maximize user satisfaction. One of the key features of commerce search that we exploit in our study is the association of a set of important attributes with the products and a set of specified attributes with the user queries. Those important attributes not used in the query are treated as unspecified. The attribute space admits a natural definition of user satisfaction via user preferences on the attributes and their values, viz. require that the surfaced products be close to the specified attribute values in the query, and diverse with respect to the unspecified attributes. We model this as a general Max-Sum Dispersion problem wherein we are given a set of n nodes in a metric space and the objective is to select a subset of nodes with total cost at most a given budget, and maximize the sum of the pairwise distances between the selected nodes. In our setting, each node denotes a product, the cost of a node being inversely proportional to its relevance with respect to specified attributes. The distance between two nodes quantifies the diversity with respect to the unspecified attributes. The problem is NP-hard and a 2-approximation was previously known only when all the nodes have unit cost.
In our setting, we do not make any assumptions on the cost. We label this problem as the General Max-Sum Dispersion problem. We give the first constant factor approximation algorithm for this problem, achieving an approximation ratio of 2. Further, we perform extensive empirical analysis on real-world data to show the effectiveness of our algorithm.

References

[1]
R. Agrawal, S. Gollapudi, A. Halverson, and S. Ieong. Diversifying search results. In WSDM, pages 5--14, 2009.
[2]
A. Bhaskara, M. Charikar, E. Chlamtac, U. Feige, and A. Vijayaraghavan. Detecting high log-densities: an (1/4) approximation for densest -subgraph. In STOC, pages 201--210, 2010.
[3]
B. E. Birnbaum and K. J. Goldman. An improved analysis for a greedy remote-clique algorithm using factor-revealing lps. Algorithmica, 55(1):42--59, 2009.
[4]
A. Bookstein. Information retrieval: A sequential learning process. Journal of the American Society for Information Sciences (ASIS), 34(5):331--342, 1983.
[5]
J. G. Carbonell and J. Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In SIGIR, pages 335--336, 1998.
[6]
C. Chekuri and S. Khanna. A ptas for the multiple knapsack problem. In SODA, pages 213--222, 2000.
[7]
H. Chen and D. R. Karger. Less is more: probabilistic models for retrieving fewer relevant documents. In SIGIR, pages 429--436, 2006.
[8]
E. Erkut. The discrete p-dispersion problem. Eur. J. Opnl. Res., 46:48--60, 1990.
[9]
R. Hassin, S. Rubinstein, and A. Tamir. Approximation algorithms for maximum dispersion. Oper. Res. Lett., 21(3):133--137, 1997.
[10]
D. Panigrahi and S. Gollapudi. Result enrichment in commerce search using browse trails. In WSDM, 2011.
[11]
F. Radlinski, R. Kleinberg, and T. Joachims. Learning diverse rankings with multi-armed bandits. In International Conference on Machine Learning (ICML), 2008. First presented at NIPS07 Workshop on Machine Learning for Web Search.
[12]
S. S. Ravi, D. J. Rosenkrantz, and G. K. Tayi. Heuristic and special case algorithms for dispersion problems. Operations Research, 42(2):299--310, 1994.
[13]
D. J. Rosenkrantz, G. K. Tayi, and S. S. Ravi. Facility dispersion problems under capacity and cost constraints. J. Comb. Optim., 4(1):7--33, 2000.
[14]
N. Sarkas, S. Paparizos, and P. Tsaparas. Structured annotations of web queries. In SIGMOD, pages 771--782, 2010.
[15]
E. Vee, U. Srivastava, J. Shanmugasundaram, P. Bhat, and S. Amer-Yahia. Efficient computation of diverse query results. In ICDE, pages 228--236, 2008.
[16]
J. Wallenius, J. S. Dyer, P. C. Fishburn, R. E. Steuer, S. Zionts, and K. Deb. Multiple criteria decision making, multiattribute utility theory: Recent accomplishments and what lies ahead. Management Science, 54(7):1336--1349, 2008.
[17]
D. W. Wang and Y.-S. Kuo. A study on two geometric location problems. Inf. Process. Lett., 28(6):281--286, 1988.
[18]
C. Zhai and J. D. Lafferty. A risk minimzation framework for information retrieval. Info. Processing and Management, 42(1):31--55, 2006.

Cited By

View all
  • (2024)Max-Min Diversification with Asymmetric DistancesProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671757(1440-1450)Online publication date: 25-Aug-2024
  • (2021)Maximization problems of balancing submodular relevance and supermodular diversityJournal of Global Optimization10.1007/s10898-021-01063-6Online publication date: 2-Aug-2021
  • (2019)An Improved Analysis of Local Search for Max-Sum DiversificationMathematics of Operations Research10.1287/moor.2018.0982Online publication date: 17-Sep-2019
  • Show More Cited By

Index Terms

  1. Consideration set generation in commerce search

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '11: Proceedings of the 20th international conference on World wide web
    March 2011
    840 pages
    ISBN:9781450306324
    DOI:10.1145/1963405
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 March 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. approximation algorithms
    2. facility dispersion
    3. novelty
    4. relevance

    Qualifiers

    • Research-article

    Conference

    WWW '11
    WWW '11: 20th International World Wide Web Conference
    March 28 - April 1, 2011
    Hyderabad, India

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Max-Min Diversification with Asymmetric DistancesProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671757(1440-1450)Online publication date: 25-Aug-2024
    • (2021)Maximization problems of balancing submodular relevance and supermodular diversityJournal of Global Optimization10.1007/s10898-021-01063-6Online publication date: 2-Aug-2021
    • (2019)An Improved Analysis of Local Search for Max-Sum DiversificationMathematics of Operations Research10.1287/moor.2018.0982Online publication date: 17-Sep-2019
    • (2018)A machine learning approach for product matching and categorizationSemantic Web10.3233/SW-1803009:5(707-728)Online publication date: 27-Aug-2018
    • (2017)Local search for max-sum diversificationProceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3039686.3039695(130-142)Online publication date: 16-Jan-2017
    • (2017)MapReduce and streaming algorithms for diversity maximization in metric spaces of bounded doubling dimensionProceedings of the VLDB Endowment10.14778/3055540.305554110:5(469-480)Online publication date: 1-Jan-2017
    • (2016)Linear relaxations for finding diverse elements in metric spacesProceedings of the 30th International Conference on Neural Information Processing Systems10.5555/3157382.3157556(4105-4113)Online publication date: 5-Dec-2016
    • (2016)Enriching Product Ads with Metadata from HTML AnnotationsProceedings of the 13th International Conference on The Semantic Web. Latest Advances and New Domains - Volume 967810.1007/978-3-319-34129-3_10(151-167)Online publication date: 29-May-2016
    • (2014)Composable core-sets for diversity and coverage maximizationProceedings of the 33rd ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems10.1145/2594538.2594560(100-108)Online publication date: 18-Jun-2014
    • (2013)From clicking to considerationDecision Support Systems10.5555/2747904.274824256:C(397-405)Online publication date: 1-Dec-2013
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media