Efficient learning equilibrium

https://doi.org/10.1016/j.artint.2004.04.013Get rights and content
Under an Elsevier user license
open archive

Abstract

We introduce efficient learning equilibrium (ELE), a normative approach to learning in non-cooperative settings. In ELE, the learning algorithms themselves are required to be in equilibrium. In addition, the learning algorithms must arrive at a desired value after polynomial time, and a deviation from the prescribed ELE becomes irrational after polynomial time. We prove the existence of an ELE (where the desired value is the expected payoff in a Nash equilibrium) and of a Pareto-ELE (where the objective is the maximization of social surplus) in repeated games with perfect monitoring. We also show that an ELE does not always exist in the imperfect monitoring case. Finally, we discuss the extension of these results to general-sum stochastic games.

Keywords

Learning equilibrium
Ex-post equilibrium
Efficiency
Multi-agent learning
Repeated games
Stochastic games

Cited by (0)

A preliminary short version of this paper appeared at NIPS'02. This research was supported by the Israel Science Foundation under grant #91/02-1. The first author is partially supported by the Paul Ivanier Center for Robotics and Production Management.