Abstract
We study multi-agent Markov decision processes (MDPs) in which cooperation among players is allowed. We find a cooperative payoff distribution procedure (MDP-CPDP) that distributes, in the course of the game, the payoff that players would earn in the long-run game. We show under which conditions such an MDP-CPDP fulfills a time consistency property, satisfies greedy players, and strengthens coalition cohesiveness throughout the game. Finally, we refine the concept of the Core for cooperative MDPs.
Cite this article
Avrachenkov, K., Cottatellucci, L. & Maggi, L. Cooperative Markov decision processes: time consistency, greedy players satisfaction, and cooperation maintenance. Int J Game Theory 42, 239–262 (2013). https://doi.org/10.1007/s00182-012-0343-9