Solving Multi-stage Games with Hierarchical Learning Automata That Bootstrap

Peeters, Maarten; Verbeeck, Katja; Nowé, Ann

doi:10.1007/978-3-540-77949-0_13

Maarten Peeters¹,
Katja Verbeeck² &
Ann Nowé¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4865))

Included in the following conference series:

1555 Accesses
1 Citations

Abstract

Hierarchical learning automata are shown to be an excellent tool for solving multi-stage games. However, most updating schemes used by hierarchical automata expect the multi-stage game to reach an absorbing state at which point the automata are updated in a Monte Carlo way. As such, the approach is infeasible for large multi-stage games (and even for problems with an infinite horizon) and the convergence process is slow. In this paper we propose an algorithm where the rewards don’t have to travel all the way up to the top of the hierarchy and in which there is no need for explicit end-stages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Tuyls, K.: Learning in Multi-Agent Systems: An Evolutionary Game Theoretic Approach. PhD thesis, Vrije Universiteit Brussel (2004)
Google Scholar
Claus, C., Boutilier, C.: The dynamics of reinforcement learning in cooperative multiagent systems. In: AAAI 1998. Proceedings of the Fifteenth National Conference of Artificial Intelligence, Madison, WI, pp. 746–752 (1998)
Google Scholar
Verbeeck, K., Nowé, A., Peeters, M., Tuyls, K.: Multi-agent reinforcement learning in stochastic single and multi-stage games. In: Kudenko, D., Kazakov, D., Alonso, E. (eds.) Adaptive Agents and Multi-Agent Systems II, pp. 275–294. Springer, Heidelberg (2005)
Google Scholar
Verbeeck, K.: Coordinated Exploration in Multi-Agent Reinforcement Learning. PhD thesis, Vrije Universiteit Brussel (2004)
Google Scholar
Kapetanakis, S., Kudenko, D., Strens, M.J.A.: Learning to coordinate using commitment sequences in cooperative multi-agent systems. In: Kudenko, D., Kazakov, D., Alonso, E. (eds.) Adaptive Agents and Multi-Agent Systems II, pp. 275–294. Springer, Heidelberg (2005)
Google Scholar
Tsetlin, M.L.: On the behavior of finite automata in random media. Avtomatika i Telemekhanika 22(10), 1345–1354 (1961)
Google Scholar
Narendra, K.S., Thathachar, M.A.L.: Learning automata - a survey. IEEE_J_SMC SMC-4(4), 323–334 (1974)
MathSciNet Google Scholar
Narendra, K.S., Thathachar, M.A.L.: Learning Automata: An Introduction. Prentice-Hall, Englewood Cliffs (1989)
Google Scholar
Thathachar, M.A.L., Sastry, P.S.: Networks of Learning Automata: Techniques for Online Stochastic Optimization. Kluwer Academic Publishers, Dordrecht (2004)
Google Scholar
Nowé, A., Verbeeck, K., Peeters, M.: Learning automata as a basis for multi-agent reinforcement learning. In: Tuyls, K., t Hoen, P.J., Verbeeck, K., Sen, S. (eds.) LAMAS 2005. LNCS (LNAI), vol. 3898, pp. 71–85. Springer, Heidelberg (2006)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA (1998)
Google Scholar
Verbeeck, K., Nowé, A., Parent, J., Tuyls, K.: Exploring selfish reinforcement learning in repeated games with stochastic rewards. Journal of Autonomous Agents and Multi-agent Systems (to appear)
Google Scholar
Panait, L., Luke, S.: Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent Systems 3(11), 383–434 (2005)
Google Scholar
Boutilier, C.: Sequential optimality and coordination in multiagent systems. In: Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, pp. 478–485 (1996)
Google Scholar
Thathachar, M.A.L., Ramakrishnan, K.R.: A hierarchical system of learning automata. IEEE Transactions on Systems, Man, and Cybernetics SMC-11(3), 236–241 (1981)
MathSciNet Google Scholar
Ramakrishnan, K.R.: Hierarchical systems and cooperative games of learning automata. PhD thesis, Indian Institute of Science, Bangalore, India (1982)
Google Scholar
Verbeeck, K., Nowé, A., Tuyls, K., Peeters, M.: Multi-agent reinforcement learning in stochastic single and multi-stage games. In: Kudenko, D., Kazakov, D., Alonso, E. (eds.) Adaptive Agents and Multi-Agent Systems II. LNCS (LNAI), vol. 3394, pp. 275–294. Springer, Heidelberg (2005)
Google Scholar
Watkins, C., Dayan, P.: Q-learning. Machine Learning 8(3), 279–292 (1992)
MATH Google Scholar
Shoham, Y., Powers, R., Grenager, T.: Multi-agent reinforcement learning: a critical survey. Technical report, Stanford University (2003)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Peeters, M., Verbeeck, K., Nowé, A.: The effect of bootstrapping in multi-automata reinforcement learning. In: IEEE Symposium Series on Computational Intelligence, International Symposium on Approximate Dynamic Programming and Reinforcement Learning (2007)
Google Scholar
Kaelbling, L.P., Littman, M.L., Moore, A.P.: Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)
Google Scholar
Tsitsiklis, J.: Asynchronous stochastic approximation and q-learning. Machine Learning 16, 185–202 (1994)
MATH Google Scholar
Narendra, K.S., Parthasarathy, K.: Learning automata approach to hierarchical multiobjective analysis. IEEE Transactions on Systems, Man, and Cybernetics 21(2), 263–273 (1991)
Article Google Scholar
Peeters, M., Nowé, A., Verbeeck, K.: Bootstrapping versus monte carlo in a learning automata hierarchy. Adaptive Learning Agents and Multi-Agent Systems, 61–71 (2006)
Google Scholar
Peeters, M., Nowé, A., Verbeeck, K.: Toward bootstrapping in a hierarchy of learning automata. In: Proceedings of the Seventh European Workshop on Reinforcement Learning, pp. 31–32 (2005)
Google Scholar
Van de Wege, L.: Learning automata as a framework for multi-agent reinforcement learning: Convergence issues in tree-structured multi-stage games. Master’s thesis, Vrije Universiteit Brussel (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Computational Modeling Lab, Vrije Universiteit Brussel, Pleinlaan 2, 1050, Brussel, Belgium
Maarten Peeters & Ann Nowé
MICC-IKAT, Maastricht University, P.O. Box 616, 6200 MD, Maastricht, The Netherlands
Katja Verbeeck

Authors

Maarten Peeters
View author publications
You can also search for this author in PubMed Google Scholar
Katja Verbeeck
View author publications
You can also search for this author in PubMed Google Scholar
Ann Nowé
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Karl Tuyls Ann Nowe Zahia Guessoum Daniel Kudenko

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Peeters, M., Verbeeck, K., Nowé, A. (2008). Solving Multi-stage Games with Hierarchical Learning Automata That Bootstrap. In: Tuyls, K., Nowe, A., Guessoum, Z., Kudenko, D. (eds) Adaptive Agents and Multi-Agent Systems III. Adaptation and Multi-Agent Learning. AAMAS ALAMAS ALAMAS 2005 2007 2006. Lecture Notes in Computer Science(), vol 4865. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77949-0_13

Download citation

DOI: https://doi.org/10.1007/978-3-540-77949-0_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77947-6
Online ISBN: 978-3-540-77949-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics