Cooperative Markov decision processes: time consistency, greedy players satisfaction, and cooperation maintenance

Avrachenkov, Konstantin; Cottatellucci, Laura; Maggi, Lorenzo

doi:10.1007/s00182-012-0343-9

Published: 05 August 2012

International Journal of Game Theory, Volume 42, pages 239–262 (2013)

Abstract

We study multi-agent Markov decision processes (MDPs) in which cooperation among players is allowed. We find a cooperative payoff distribution procedure (MDP-CPDP) that distributes, in the course of the game, the payoff that players would earn in the long-run game. We show under which conditions such an MDP-CPDP fulfills a time consistency property, satisfies greedy players, and strengthens coalition cohesiveness throughout the game. Finally, we refine the concept of the Core for cooperative MDPs.

References

Bondareva ON (1963) Some applications of linear programming methods to the theory of cooperative games. Problemy Kybernetiki 10: 119–139
Filar J, Petrosjan LA (2000) Dynamic cooperative games. Int Game Theory Rev 2(1): 47–65
Filar J, Vrieze K (1997) Competitive Markov decision processes. Springer, New York
Kranich L, Perea A, Peters H (2000) Dynamic cooperative games, Discussion papers. Department of Economics, University at Albany, SUNY
Mazalov VV, Rettieva AN (2010) Fish wars and cooperation maintenance. Ecol Model 221(12): 1545–1553
Oviedo J (2000) The core of a repeated n-person cooperative game. Eur J Oper Res 127(3): 519–524
Peleg B, Sudhölter P (2007) Introduction to the theory of cooperative games, 2nd edn. Springer, Berlin
Petrosjan LA (2002) Cooperative stochastic games. In: Proceedings of the 10th international symposium on dynamic games and applications, vol. 2
Predtetchinski A (2007) The strong sequential core for stationary cooperative games. Games Econ Behav 61: 50–66
Puterman ML (1994) Markov decision processes. Wiley, New York
Shapley LS (1967) On balanced sets and cores. Naval Res Logist Q 14: 453–460
Shapley LS (1971) Cores of convex games. Int J Game Theory 1(1): 11–26
von Neumann J, Morgenstern O (1944) Theory of games and economic behavior. Princeton University Press, Princeton
Zaccour G (2008) Time consistency in cooperative differential games: a tutorial. Inf Syst Oper Res 46(1): 81–92

Author information

Authors and Affiliations

INRIA, BP95, 06902, Sophia Antipolis Cedex, France
Konstantin Avrachenkov
Eurecom, Mobile Communications, BP193, 06560, Sophia Antipolis Cedex, France
Laura Cottatellucci & Lorenzo Maggi

Corresponding author

Correspondence to Lorenzo Maggi.

About this article

Cite this article

Avrachenkov, K., Cottatellucci, L. & Maggi, L. Cooperative Markov decision processes: time consistency, greedy players satisfaction, and cooperation maintenance. Int J Game Theory 42, 239–262 (2013). https://doi.org/10.1007/s00182-012-0343-9

Accepted: 16 July 2012
Published: 05 August 2012
Issue Date: February 2013
DOI: https://doi.org/10.1007/s00182-012-0343-9
