Pareto-Q Learning Algorithm for Cooperative Agents in General-Sum Games

Song, Meiping; Gu, Guochang; Zhang, Guoyin

doi:10.1007/11559221_64

Meiping Song²¹,
Guochang Gu²¹ &
Guoyin Zhang²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3690))

Included in the following conference series:

International Central and Eastern European Conference on Multi-Agent Systems

1167 Accesses
1 Citations

Abstract

Rationality and convergence are two important criterions for multi-agent learning. A novel method called Pareto-Q learning is prompted for cooperative general-sum games, with the Pareto Optimum allowing rationality and social conventions benefiting the convergence. Experiments with the grid game suggest the efficiency of Pareto-Q. Compared with the single-agent Q-learning and Nash agent Q-learning, Pareto-Q learning performs best.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bowling, M., Veloso, M.: Existence of Multiagent Equilibria with Limited Agents. Journal of Artificial Intelligence Research 22, 353–384 (2004)
MATH MathSciNet Google Scholar
Hu, J., Wellman, M.P.: Nash Q-Learning for General-sum Stochastic Games. Journal of Machine Learning Research 4, 1039–1069 (2003)
Article MathSciNet Google Scholar
Littman, M.L., Szepesvari, C.: A Generalized Reinforcement Learning Model: Convergence and Applications. In: Proceedings of the 13th International Conference on Machine Learning, Bari, Italy, pp. 310–318 (1996)
Google Scholar
Deb, K.: Multi-Objective Evolutionary Algorithms: Introducing Bias among Pareto-Optimal Solutions. KanGAL report 99002, Indian Institute of Technology, Kanpur, India (1999)
Google Scholar
Kok, J.R., Spaan, M.T.J., Vlassis, N.: An Approach to Noncommunicative Multiagent Coordination in Continuous Domains. In: Proceedings of the Twelfth Belgian-Dutch Conference on Machine Learning, Utrecht, Netherlands, pp. 46–52 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Computer Science and Technology, Harbin Engineering University, 15001, China
Meiping Song, Guochang Gu & Guoyin Zhang

Authors

Meiping Song
View author publications
You can also search for this author in PubMed Google Scholar
Guochang Gu
View author publications
You can also search for this author in PubMed Google Scholar
Guoyin Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Cybernetics, Czech Technical University in Prague, Czech Republic
Michael Pěchouček
Austrian Research Institute for Artificial Intelligence, Freyung 6/6, A-1010, Vienna, Austria
Paolo Petta
Computer and Automation Research Institute, Hungarian Academy of Sciences, Kende u. 13-17., 1111, Budapest, Hungary
László Zsolt Varga

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Song, M., Gu, G., Zhang, G. (2005). Pareto-Q Learning Algorithm for Cooperative Agents in General-Sum Games. In: Pěchouček, M., Petta, P., Varga, L.Z. (eds) Multi-Agent Systems and Applications IV. CEEMAS 2005. Lecture Notes in Computer Science(), vol 3690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11559221_64

Download citation

DOI: https://doi.org/10.1007/11559221_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29046-9
Online ISBN: 978-3-540-31731-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics