ABSTRACT
As product prices become increasingly available on the World Wide Web, consumers attempt to understand how corporations vary these prices over time. However, corporations change prices based on proprietary algorithms and hidden variables (e.g., the number of unsold seats on a flight). Is it possible to develop data mining techniques that will enable consumers to predict price changes under these conditions?This paper reports on a pilot study in the domain of airline ticket prices where we recorded over 12,000 price observations over a 41 day period. When trained on this data, Hamlet --- our multi-strategy data mining algorithm --- generated a predictive model that saved 341 simulated passengers $198,074 by advising them when to buy and when to postpone ticket purchases. Remarkably, a clairvoyant algorithm with complete knowledge of future prices could save at most $320,572 in our simulation, thus HAMLET's savings were 61.8% of optimal. The algorithm's savings of $198,074 represents an average savings of 23.8% for the 341 passengers for whom savings are possible. Overall, HAMLET saved 4.4% of the ticket price averaged over the entire set of 4,488 simulated passengers. Our pilot study suggests that mining of price data available over the web has the potential to save consumers substantial sums of money per annum.
- R. Agrawal and R. Srikant. Mining sequential patterns. In P. S. Yu and A. S. P. Chen, editors, Eleventh International Conference on Data Engineering, pages 3--14, Taipei, Taiwan, 1995. IEEE Computer Society Press.]] Google ScholarDigital Library
- J. L. Ambite, G. Barish, C. A. Knoblock, M. Muslea, J. Oh, and S. Minton. Getting from here to there: Interactive planning and agent execution for optimizing travel. In Proceedings of the Fourteenth Conference on Innovative Applications of Artificial Intelligence (IAAI-2002), pages 862--869, AAAI Press, Menlo Park, CA, 2002.]] Google ScholarDigital Library
- G. Barish and C. A. Knoblock. An efficient and expressive language for information gathering on the web. In Proceedings of the AIPS-2002 Workshop on Is there life after operator sequencing? - Exploring real world planning, pages 5--12, Tolouse, France, 2002.]]Google Scholar
- D. Berndt and J. Clifford. Finding patterns in time series: a dynamic programming approach. In U. Fayyad, G. Shapiro, P. Smyth, and R. Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining. AAAI Press, 1996.]] Google ScholarDigital Library
- L. Breiman. Bagging predictors. Machine Learning, 24:123--140, 1996.]] Google ScholarDigital Library
- H. Chalupsky, Y. Gil, C. A. Knoblock, K. Lerman, J. Oh, D. V. Pynadath, T. A. Russ, and M. Tambe. Electric elves: Applying agent technology to support human organizations. In Proceedings of the Conference on Innovative Applications of Artificial Intelligence, 2001.]] Google ScholarDigital Library
- C. Chatfield. The Analysis of Time Series: An Introduction. Chapman and Hall, London, UK, 1989.]]Google Scholar
- W. W. Cohen. Fast effective rule induction. In A. Prieditis and S. Russell, editors, Proc. of the 12th International Conference on Machine Learning, pages 115--123, Tahoe City, CA, July 9--12, 1995. Morgan Kaufmann.]]Google Scholar
- F. Diebold. Elements of Forecasting. South-Western College Publishing, 2nd edition, 2000.]] Google ScholarDigital Library
- P. Domingos. MetaCost: A general method for making classifiers cost-sensitive. In Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 155--164, San Diego, CA, 1999. ACM Press.]] Google ScholarDigital Library
- R. Doorenbos, O. Etzioni, and D. Weld. A scalable comparison-shopping agent for the World-Wide Web. In Proc. First Intl. Conf. Autonomous Agents, pages 39--48, 1997.]] Google ScholarDigital Library
- Y. Freund and R. E. Schapire. Experiments with a new boosting algorithm. In Proceedings of the Thirteenth International Conference on Machine Learning, pages 148--156, Bari, Italy, 1996. Morgan Kaufmann.]]Google ScholarDigital Library
- C. W. J. Granger. Forecasting in Business and Economics. Harcourt Brace, second edition, 1989.]]Google Scholar
- J. C. Hull. Options, Futures, and Other Derivatives. Prentice Hall College Div, 5th edition, 2002.]]Google Scholar
- C. A. Knoblock, K. Lerman, S. Minton, and I. Muslea. Accurately and reliably extracting data from the web: A machine learning approach. In P. S. Szczepaniak, J. Segovia, J. Kacprzyk, and L. A. Zadeh, editors, Intelligent Exploration of the Web, pages 275--287. Springer-Verlag, Berkeley, CA, 2003.]] Google ScholarDigital Library
- N. Littlestone and M. K. Warmuth. The weighted majority algorithm. Information and Computation, 108(2):212--261, February 1994.]] Google ScholarDigital Library
- D. Lucking-Reiley, D. Bryan, N. Prasad, and D. Reeves. Pennies from ebay: The determinants of price in online auctions. Technical report, University of Arizona, 2000.]]Google Scholar
- S. Mahadevan. Average reward reinforcement learning: Foundations, algorithms, and empirical results. Machine Learning, 22(1--3):159--195, 1996.]] Google ScholarDigital Library
- S. McCartney. Airlines Rely on Technology To Manipuate Fare Structure. Wall Street Journal, November 3 1997.]]Google Scholar
- J. Moody and M. Saffell. Reinforcement learning for trading systems and portfolios. In KDD, pages 279--283, 1998.]]Google Scholar
- J. Moody and M. Saffell. Minimizing downside risk via stochastic dynamic programming. In Y. S. Abu-Mostafa, B. LeBaron, A. W. Lo, and A. S. Weigend, editors, Computational Finance 1999, Cambridge, MA, 2000. MIT Press.]]Google Scholar
- J. Moody and M. Saffell. Learning to trade via direct reinforcement. In IEEE Transactions on Neural Networks, Vol. 12, No. 4, 2001.]]Google ScholarDigital Library
- J. F. Roddick and M. Spiliopoulou. A bibliography of temporal, spatial and spatio-temporal data mining research. SIGKDD Explorations, 1(1):34--38, 1999.]] Google ScholarDigital Library
- H. S. Shah, N. R. Joshi, A. Sureka, and P. R. Wurman. Mining for bidding strategies on ebay. In Lecture Notes in Artificial Intelligence. Springer-Verlag, 2003.]]Google Scholar
- R. S. Sutton and A. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.]] Google ScholarDigital Library
- K. M. Ting and I. H. Witten. Issues in stacked generalization. Journal of Artificial Intelligence Research, 10:271--289, 1999.]]Google ScholarCross Ref
- M. P. Wellman, D. M. Reeves, K. M. Lochner, and Y. Vorobeychik. Price prediction in a trading agent competition. Technical report, University of Michigan, 2002.]]Google Scholar
- D. Wolpert. Stacked generalization. Neural Networks, 5:241--259, 1992.]] Google ScholarDigital Library
Index Terms
- To buy or not to buy: mining airfare data to minimize ticket purchase price
Recommendations
Buy Now and Match Later: Impact of Posterior Price Matching on Profit with Strategic Consumers
With a posterior price matching (PM) policy, a seller guarantees to reimburse the price difference to a consumer who buys a product before the seller marks it down. Such a policy has been widely adopted by retailers. We examine the impact of a posterior ...
Buy It Again: Modeling Repeat Purchase Recommendations
KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data MiningRepeat purchasing, i.e., a customer purchasing the same product multiple times, is a common phenomenon in retail. As more customers start purchasing consumable products (e.g., toothpastes, diapers, etc.) online, this phenomenon has also become prevalent ...
Omnichannel Retail Operations with Buy-Online-and-Pick-up-in-Store
Many retailers have recently started to offer customers the option to buy online and pick up in store BOPS. We study the impact of the BOPS initiative on store operations. We build a stylized model where a retailer operates both online and offline ...
Comments