Abstract
A calibrated strategy can be obtained by performing a strategy that has no internal regret in some auxiliary game. Such a strategy can be constructed explicitly with the use of Blackwell’s approachability theorem, in an other auxiliary game. We establish the converse: a strategy that approaches a convex B-set can be derived from the construction of a calibrated strategy.
We develop these tools in the framework of a game with partial monitoring, where players do not observe the actions of their opponents but receive random signals, to define a notion of internal regret and construct strategies that have no such regret.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aubin, J.-P., Frankowska, H.: Set-valued Analysis. Birkhäuser Boston Inc., Basel (1990)
Azuma, K.: Weighted sums of certain dependent random variables. Tôhoku Math. J. 19(2), 357–367 (1967)
Blackwell, D.: An analog of the minimax theorem for vector payoffs. Pacific J. Math. 6, 1–8 (1956)
Blackwell, D.: Controlled random walks. In: Proceedings of the International Congress of Mathematicians, 1954, Amsterdam, vol. III, pp. 336–338 (1956)
Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning, and Games. Cambridge University Press, Cambridge (2006)
Foster, D.P., Vohra, R.V.: Asymptotic calibration. Biometrika 85, 379–390 (1998)
Foster, D.P., Vohra, R.V.: Regret in the on-line decision problem. Games Econom. Behav. 29, 7–35 (1999)
Fudenberg, D., Levine, D.K.: Conditional universal consistency. Games Econom. Behav. 29, 104–130 (1999)
Hannan, J.: Approximation to Bayes risk in repeated play. In: Contributions to the theory of Games. Annals of Mathematics Studies, vol. 3(39), pp. 97–139. Princeton University Press, Princeton (1957)
Hart, S., Mas-Colell, A.: A simple adaptive procedure leading to correlated equilibrium. Econometrica 68, 1127–1150 (2000)
Hoeffding, W.: Probability inequalities for sums of bounded random variables. J. Amer. Statist. Assoc. 58, 13–30 (1963)
Lehrer, E.: A wide range no-regret theorem. Games Econom. Behav. 42, 101–115 (2003)
Lehrer, E., Solan, E.: Learning to play partially-specified equilibrium (manuscript, 2007)
Lugosi, G., Mannor, S., Stoltz, G.: Strategies for prediction under imperfect monitoring. Math. Oper. Res. 33, 513–528 (2008)
Rustichini, A.: Minimizing regret: the general case. Games Econom. Behav. 29, 224–243 (1999)
Sandroni, A., Smorodinsky, R., Vohra, R.V.: Calibration with many checking rules. Math. Oper. Res. 28, 141–153 (2003)
Sorin, S.: Lectures on Dynamics in Games. Unpublished Lecture Notes (2008)
Vovk, V.: Non-asymptotic calibration and resolution. Theoret. Comput. Sci. 387, 77–89 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Perchet, V. (2009). Calibration and Internal No-Regret with Random Signals. In: Gavaldà, R., Lugosi, G., Zeugmann, T., Zilles, S. (eds) Algorithmic Learning Theory. ALT 2009. Lecture Notes in Computer Science(), vol 5809. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04414-4_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-04414-4_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04413-7
Online ISBN: 978-3-642-04414-4
eBook Packages: Computer ScienceComputer Science (R0)