Policy Search through Adaptive Function Approximation for Bidding in TAC SCM

Chatzidimitriou, Kyriakos C.; Symeonidis, Andreas L.; Mitkas, Pericles A.

doi:10.1007/978-3-642-40864-9_2

Policy Search through Adaptive Function Approximation for Bidding in TAC SCM

Kyriakos C. Chatzidimitriou^10,11,
Andreas L. Symeonidis^10,11 &
Pericles A. Mitkas^10,11

Conference paper

595 Accesses
2 Citations

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 136))

Abstract

Agent autonomy is strongly related to learning and adaptation. Machine learning models generated through the use of historical data or current environmental signals, provide agents with the necessary decision-making and generalization capabilities in competitive, dynamic, partially observable and stochastic environments. In this work, we discuss learning and adaptation in the context of the TAC SCM game. We apply a variety of machine learning and computational intelligence methods for generating the most efficient sales component of the agent, dealing with customer orders and production throughput. Along with utility maximization and bid acceptance probability estimation methods, we evaluate regression trees, particle swarm optimization, heuristic control and policy search via adaptive function approximation in order to build an efficient, near-real time, bidding mechanism. Results indicate that a suitable reinforcement learning setup coupled with the power of adaptive function approximation techniques is a good candidate for enabling high performance strategies.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 49.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chatzidimitriou, K.C., Symeonidis, A.L., Kontogounis, I., Mitkas, P.A.: Agent mertacor: A robust design for dealing with uncertainty and variation in scm environments. Expert Systems with Applications 35(3), 591–603 (2008) (Cited by: Collins2008)
Google Scholar
Stone, P.: Learning and multiagent reasoning for autonomous agents. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence, pp. 13–30 (January 2007)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Arunachalam, R., Sadeh, N.M.: The supply chain trading agent competition. Electronic Commerce Research and Applications 4(1), 66–84 (2005)
Article Google Scholar
Collins, J., Arunachalam, R., Sadeh, N., Eriksson, J., Finne, N., Janson, S.: The supply chain management game for the 2007 trading agent competition. Technical Report CMU-ISRI-07-100, Carnegie Mellon University (December 2006)
Google Scholar
Jaeger, H.: Tutorial on training recurrent neural networks, covering BPTT, RTRL, EKF and the “echo state network” approach. Technical Report GMD Report 159, German National Research Center for Information Technology (2002)
Google Scholar
Szita, I., Gyenes, V., Lőrincz, A.: Reinforcement learning with echo state networks. In: Kollias, S.D., Stafylopatis, A., Duch, W., Oja, E. (eds.) ICANN 2006. LNCS, vol. 4131, pp. 830–839. Springer, Heidelberg (2006)
Chapter Google Scholar
Stanley, K.O., Miikkulainen, R.: Evolving neural networks through augmenting topologies. Evolutionary Computation 10(2), 99–127 (2002)
Article Google Scholar
Chatzidimitriou, K.C., Mitkas, P.A.: A neat way for evolving echo state networks. In: European Conference on Artificial Intelligence. IOS Press (August 2010)
Google Scholar
Pardoe, D., Stone, P.: An autonomous agent for supply chain management. In: Adomavicius, G., Gupta, A. (eds.) Handbooks in Information Systems Series: Business Computing, vol. 3, pp. 141–172. Emerald Group (2009)
Google Scholar
Benisch, M., Greenwald, A., Grypari, I., Lederman, R., Naroditskiy, V., Tschantz, M.: Botticelli: A supply chain management agent designed to optimize under uncertainty. ACM Transactions on Computational Logic 4(3), 29–37 (2004)
Google Scholar
Pardoe, D., Stone, P.: Bidding for customer orders in TAC SCM. In: Faratin, P., Rodríguez-Aguilar, J.-A. (eds.) AMEC 2004. LNCS (LNAI), vol. 3435, pp. 143–157. Springer, Heidelberg (2006)
Chapter Google Scholar
Stan, M., Stan, B., Florea, A.M.: A dynamic strategy agent for supply chain management. In: Proceedings of the Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, pp. 227–232 (2006)
Google Scholar
Chatzidimitriou, K.C., Symeonidis, A.L.: Data-mining-enhanced agents in dynamic supply-chain-management environments. Intelligent Systems 24(3), 54–63 (2009); Special issue on Agents and Data Mining
Google Scholar
Hogenboom, A., Ketter, W., van Dalen, J., Kaymak, U., Collins, J., Gupta, A.: Product pricing in TAC SCM using adaptive real-time probability of acceptance estimations based on economic regimes. In: Workshop: Trading Agent Design and Analysis (TADA) at Twenty-First International Joint Conference on Artificial Intelligence (IJCAI 2009), 15–24 (July 2009)
Google Scholar
Benisch, M., Greenwald, A., Naroditskiy, V., Tschantz, M.C.: A stochastic programming approach to scheduling in TAC SCM. In: Proceedings of the 5th ACM Conference on Electronic Commerce, EC 2004, pp. 152–159. ACM, New York (2004)
Google Scholar
Breiman, L., Friedman, J., Stone, C.J., Olshen, R.: Classification and Regression Trees. Chapman and Hall (1984)
Google Scholar
Wang, Y., Witten, I.H.: Induction of model trees for predicting continuous classes. Poster Papers of the 9th European Conference on Machine Learning, pp. 128–137 (1997)
Google Scholar
Kiekintveld, C., Miller, J., Jordan, P.R., Callender, L.F., Wellman, M.P.: Forecasting market prices in a supply chain game. Electronic Commerce Research and Applications 8, 63–77 (2009)
Article Google Scholar
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of the International Conference on Neural Networks, pp. 1942–1948 (1995)
Google Scholar
Kiekintveld, C., Wellman, M.P., Singh, S., Estelle, J., Vorobeychik, Y., Soni, V., Rudary, M.: Distributed feedback control for decision making on supply chains. In: Fourteenth International Conference on Automated Planning and Scheduling (2004)
Google Scholar
He, M., Rogers, A., Luo, X., Jennings, N.R.: Designing a successful trading agent for supply chain management. In: Fifth International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2006 (2006)
Google Scholar
R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2011) ISBN 3-900051-07-0
Google Scholar
Therneau, T.M., port by Brian Ripley, B.A.R.: rpart: Recursive Partitioning, R package version 3.1-50 (2011)
Google Scholar
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: An update. SIGKDD Explorations 11(1), 10–18 (2009)
Article Google Scholar
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research 7, 1–30 (2006)
MATH Google Scholar
Wilcoxon, F.: Individual comparisons by ranking methods. Biometrics Bulletin 1(6), 80–83 (1945)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki, Greece
Kyriakos C. Chatzidimitriou, Andreas L. Symeonidis & Pericles A. Mitkas
Informatics and Telematics Institute, Centre for Research and Technology Hellas, Greece
Kyriakos C. Chatzidimitriou, Andreas L. Symeonidis & Pericles A. Mitkas

Authors

Kyriakos C. Chatzidimitriou
View author publications
You can also search for this author in PubMed Google Scholar
Andreas L. Symeonidis
View author publications
You can also search for this author in PubMed Google Scholar
Pericles A. Mitkas
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Ashkelon Academic College, Ashkelon, Israel
Esther David
Department of Computer Science, University of Texas at El Paso, El Paso, TX, USA
Christopher Kiekintveld
School of Electronics and Computer Science, University of Southampton, Southampton, UK
Valentin Robu & Sebastian Stein &
IBM Haifa Research Lab, Haifa University, Haifa, Israel
Onn Shehory

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chatzidimitriou, K.C., Symeonidis, A.L., Mitkas, P.A. (2013). Policy Search through Adaptive Function Approximation for Bidding in TAC SCM. In: David, E., Kiekintveld, C., Robu, V., Shehory, O., Stein, S. (eds) Agent-Mediated Electronic Commerce. Designing Trading Strategies and Mechanisms for Electronic Markets. AMEC TADA 2012 2012. Lecture Notes in Business Information Processing, vol 136. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40864-9_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-40864-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40863-2
Online ISBN: 978-3-642-40864-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics