skip to main content
research-article

Product Selection Problem: Improve Market Share by Learning Consumer Behavior

Authors Info & Claims
Published:29 June 2016Publication History
Skip Abstract Section

Abstract

It is often crucial for manufacturers to decide what products to produce so that they can increase their market share in an increasingly fierce market. To decide which products to produce, manufacturers need to analyze the consumers’ requirements and how consumers make their purchase decisions so that the new products will be competitive in the market. In this paper, we first present a general distance-based product adoption model to capture consumers’ purchase behavior. Using this model, various distance metrics can be used to describe different real life purchase behavior. We then provide a learning algorithm to decide which set of distance metrics one should use when we are given some accessible historical purchase data. Based on the product adoption model, we formalize the k most marketable products (or k-MMP) selection problem and formally prove that the problem is NP-hard. To tackle this problem, we propose an efficient greedy-based approximation algorithm with a provable solution guarantee. Using submodularity analysis, we prove that our approximation algorithm can achieve at least 63% of the optimal solution. We apply our algorithm on both synthetic datasets and real-world datasets (TripAdvisor.com), and show that our algorithm can easily achieve five or more orders of speedup over the exhaustive search and achieve about 96% of the optimal solution on average. Our experiments also demonstrate the robustness of our distance metric learning method, and illustrate how one can adopt it to improve the accuracy of product selection.

References

  1. S. Borzsony, Donald Kossmann, and Konrad Stocker. 2001. The skyline operator. In Proceedings of the 17th International Conference on Data Engineering. IEEE, 421--430. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Timothy M. Chan, Kasper Green Larsen, and Mihai Pătraşcu. 2011. Orthogonal range searching on the RAM, revisited. In Proceedings of the 27th Annual Symposium on Computational Geometry. ACM, 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Gerard Cornuejols, Marshall L. Fisher, and George L. Nemhauser. 1977. Location of bank accounts to optimize float. Manag. Sci. 23, 8 (1977), 789--810.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Philip E. Gill, Walter Murray, and Margaret H. Wright. 1981. Practical optimization. Academic Press, New York.Google ScholarGoogle Scholar
  5. Venky Harinarayan, Anand Rajaraman, and Jeffrey D. Ullman. 1996. Implementing data cubes efficiently. In ACM SIGMOD Record, Vol. 25. ACM, 205--216. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. David Kempe, Jon Kleinberg, and Éva Tardos. 2003. Maximizing the spread of influence through a social network. In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 137--146. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Jon Kleinberg, Christos Papadimitriou, and Prabhakar Raghavan. 1998. A microeconomic view of data mining. Data Min. Knowl. Discovery 2, 4 (1998), 311--324. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Cuiping Li, Beng Chin Ooi, Anthony K. H. Tung, and Shan Wang. 2006. Dada: A data cube for dominant relationship analysis. In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data. ACM, 659--670. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Chen-Yi Lin, Jia-Ling Koh, and Arbee L. P. Chen. 2013. Determining k-most demanding products with maximum expected number of total customers. IEEE Trans.Knowl. Data Eng. 25, 8 (2013), 1732--1747. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Xuemin Lin, Yidong Yuan, Qing Zhang, and Ying Zhang. 2007. Selecting stars: The k most representative skyline operator. In IEEE 23rd International Conference onData Engineering (ICDE 2007). IEEE, 86--95.Google ScholarGoogle ScholarCross RefCross Ref
  11. George L. Nemhauser and Laurence A. Wolsey. 1988. Integer and Combinatorial Optimization. Vol. 18. Wiley, New York, NY. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. George L. Nemhauser, Laurence A. Wolsey, and Marshall L. Fisher. 1978. An analysis of approximations for maximizing submodular set functions - I. Math. Program. 14, 1 (1978), 265--294.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Yu Peng, Raymond Chi-Wing Wong, and Qian Wan. 2012. Finding top-k preferable products. IEEE Trans. Knowl. Data Eng. 24, 10 (2012), 1774--1788. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Qian Wan, Raymond Chi-Wing Wong, Ihab F. Ilyas, M Tamer Özsu, and Yu Peng. 2009. Creating competitive products. In Proceedings of the VLDB Endowment, Vol. 2. VLDB Endowment, 898--909. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Qian Wan, R. C.-W. Wong, and Yu Peng. 2011. Finding top-k profitable products. In IEEE 27th International Conference on Data Engineering (ICDE). IEEE, 1055--1066. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Hongning Wang, Yue Lu, and Chengxiang Zhai. 2010. Latent aspect rating analysis on review text data: A rating regression approach. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 783--792. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Zhenjie Zhang, Laks V. S. Lakshmanan, and Anthony K. H. Tung. 2009. On domination game analysis for microeconomic data mining. ACM Trans. Knowl. Discovery Data (TKDD) 2, 4 (2009), 18. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Product Selection Problem: Improve Market Share by Learning Consumer Behavior

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Knowledge Discovery from Data
      ACM Transactions on Knowledge Discovery from Data  Volume 10, Issue 4
      Special Issue on SIGKDD 2014, Special Issue on BIGCHAT and Regular Papers
      July 2016
      417 pages
      ISSN:1556-4681
      EISSN:1556-472X
      DOI:10.1145/2936311
      Issue’s Table of Contents

      Copyright © 2016 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 June 2016
      • Revised: 1 March 2015
      • Accepted: 1 March 2015
      • Received: 1 October 2014
      Published in tkdd Volume 10, Issue 4

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader