Skip to main content

Pareto-Q Learning Algorithm for Cooperative Agents in General-Sum Games

  • Conference paper
Multi-Agent Systems and Applications IV (CEEMAS 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3690))

Abstract

Rationality and convergence are two important criterions for multi-agent learning. A novel method called Pareto-Q learning is prompted for cooperative general-sum games, with the Pareto Optimum allowing rationality and social conventions benefiting the convergence. Experiments with the grid game suggest the efficiency of Pareto-Q. Compared with the single-agent Q-learning and Nash agent Q-learning, Pareto-Q learning performs best.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bowling, M., Veloso, M.: Existence of Multiagent Equilibria with Limited Agents. Journal of Artificial Intelligence Research 22, 353–384 (2004)

    MATH  MathSciNet  Google Scholar 

  2. Hu, J., Wellman, M.P.: Nash Q-Learning for General-sum Stochastic Games. Journal of Machine Learning Research 4, 1039–1069 (2003)

    Article  MathSciNet  Google Scholar 

  3. Littman, M.L., Szepesvari, C.: A Generalized Reinforcement Learning Model: Convergence and Applications. In: Proceedings of the 13th International Conference on Machine Learning, Bari, Italy, pp. 310–318 (1996)

    Google Scholar 

  4. Deb, K.: Multi-Objective Evolutionary Algorithms: Introducing Bias among Pareto-Optimal Solutions. KanGAL report 99002, Indian Institute of Technology, Kanpur, India (1999)

    Google Scholar 

  5. Kok, J.R., Spaan, M.T.J., Vlassis, N.: An Approach to Noncommunicative Multiagent Coordination in Continuous Domains. In: Proceedings of the Twelfth Belgian-Dutch Conference on Machine Learning, Utrecht, Netherlands, pp. 46–52 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Song, M., Gu, G., Zhang, G. (2005). Pareto-Q Learning Algorithm for Cooperative Agents in General-Sum Games. In: Pěchouček, M., Petta, P., Varga, L.Z. (eds) Multi-Agent Systems and Applications IV. CEEMAS 2005. Lecture Notes in Computer Science(), vol 3690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11559221_64

Download citation

  • DOI: https://doi.org/10.1007/11559221_64

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29046-9

  • Online ISBN: 978-3-540-31731-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics