Abstract
Multi-agent reinforcement learning is a challenging issue in artificial intelligence researches. In this paper, the reinforcement learning model and algorithm in multi-agent system simulation context are brought forward. We suggest and validate an opponent modeling learning to the problem of finding good policies for agents accommodated in an adversarial artificial world. The feature of the algorithm exhibits in that when in a multi-player adversarial environment the immediate reward depends on not only agent’s action choose but also its opponent’s trends. Experiment results show that the learning agent finds optimal policies in accordance with the reward functions provided.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sutton, R.S., Barto, A.G.: Reinforcement learning: An Introduction. MIT Press, Cambridge (1998)
Christopher, J.C.H., Watkins, Dayan, P.: Q-learnin. Machine learning 3, 279–292 (1992)
Bowling, M.: Multi-agent learning in the presence of agents with limitations, Ph.D. dissertation, School of Computer Science, Carnegie Mellon University, Pittsburgh (2003)
Littman, M.L.: Markov games as a framework for multi-agent reinforcement learning. In: Proceedings of ICML 1994, pp. 157–163. Morgan Kaufmann, San Francisco (1994)
Suematsu, N., Hayashi, A.: A multi-agent reinforcement learning algorithm using extended optimal response. In: Proceedings of the 1st International Joint Conference on Autonomous Agents & Multi-agent Systems, Bologna, Italy, pp. 370–377 (2002)
Uther, W., Veloso, M.: Adversarial reinforcement learning, Technical Report, Carnegie Mellon University, Pittsburgh (1997)
Kaelbling, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)
Mahadevan, S.: Average reward reinforcement learning: foundations, algorithms, and empirical results. Machine Learning 22, 159–195 (1996)
Spronck, P., Kuyper, I.S., Postma, E.: Improving opponent intelligence through machine learning. In: Proceedings of the 14th Belgium-Netherlands Conference on AI, pp. 299–306 (2002)
Wendler, J.: Recognizing and predicting agent behavior with case based reasoning. In: Nardi, D., Riedmiller, M., Sammut, C., Santos-Victor, J. (eds.) RoboCup 2004. LNCS (LNAI), vol. 3276, Springer, Heidelberg (2005)
Visser, U., Weland, H.G.: Using online learning to analyze the opponents behavior. In: Kaminka, G.A., Lima, P.U., Rojas, R. (eds.) RoboCup 2002. LNCS (LNAI), vol. 2752, pp. 78–93. Springer, Heidelberg (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, J., Ye, C., Jin, S. (2006). Opponent Learning for Multi-agent System Simulation. In: Wang, GY., Peters, J.F., Skowron, A., Yao, Y. (eds) Rough Sets and Knowledge Technology. RSKT 2006. Lecture Notes in Computer Science(), vol 4062. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11795131_94
Download citation
DOI: https://doi.org/10.1007/11795131_94
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36297-5
Online ISBN: 978-3-540-36299-9
eBook Packages: Computer ScienceComputer Science (R0)