Multiplicative updates outperform generic no-regret learning in congestion games: extended abstract

Authors:

Robert Kleinberg,

Georgios Piliouras,

Eva Tardos

STOC '09: Proceedings of the forty-first annual ACM symposium on Theory of computing

Pages 533 - 542

https://doi.org/10.1145/1536414.1536487

Published: 31 May 2009

Abstract

We study the outcome of natural learning algorithms in atomic congestion games. Atomic congestion games have a wide variety of equilibria, often with vastly differing social costs. We show that in almost all such games, the well-known multiplicative-weights learning algorithm converges to pure equilibria. Our results show that natural learning behavior can avoid the bad outcomes predicted by the price of anarchy in atomic congestion games such as the load-balancing game introduced by Koutsoupias and Papadimitriou, which has a super-constant price of anarchy and correlated equilibria that are exponentially worse than any mixed Nash equilibrium.

Our results identify a set of mixed Nash equilibria that we call weakly stable equilibria. Our notion of weak stability is defined game-theoretically, but we show that this property holds whenever a stability criterion from the theory of dynamical systems is satisfied. This allows us to show that in every congestion game, the distribution of play converges to the set of weakly stable equilibria. Pure Nash equilibria are weakly stable, and we show using techniques from algebraic geometry that the converse is true with probability 1 when congestion costs are selected at random independently on each edge (from any monotonically parametrized distribution). We further extend our results to show that players can use algorithms with different (sufficiently small) learning rates, i.e., they can trade off convergence speed and long-term average regret differently.
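The kind of dynamics the abstract describes can be illustrated with a toy simulation. The sketch below is not the paper's construction. It assumes a load-balancing game on identical machines where a machine's cost equals its load, full-information feedback (each player sees the expected cost of every machine), and an update of the form weight × (1 − eps)^cost. The function names `expected_costs`, `mw_step`, and `run` are hypothetical, chosen only for this illustration.

```python
def expected_costs(strategies, player):
    """Expected cost of each machine for `player`: the player's own unit of
    load plus the expected number of other players choosing that machine.
    (Assumed cost model: identical machines, cost = load.)"""
    num_machines = len(strategies[player])
    return [
        1.0 + sum(p[j] for k, p in enumerate(strategies) if k != player)
        for j in range(num_machines)
    ]


def mw_step(strategies, eps):
    """One simultaneous multiplicative-weights update for all players.
    Every player reweights against the *old* strategies of the others."""
    updated = []
    for i, probs in enumerate(strategies):
        costs = expected_costs(strategies, i)
        # Assumed update form: scale each probability by (1 - eps) ** cost.
        weights = [p * (1.0 - eps) ** c for p, c in zip(probs, costs)]
        total = sum(weights)
        updated.append([w / total for w in weights])
    return updated


def run(strategies, eps=0.1, rounds=500):
    """Iterate the update for a fixed number of rounds."""
    for _ in range(rounds):
        strategies = mw_step(strategies, eps)
    return strategies


if __name__ == "__main__":
    # Two players, two machines, starting slightly off the uniform mix.
    start = [[0.6, 0.4], [0.4, 0.6]]
    for i, probs in enumerate(run(start)):
        print(f"player {i}: " + ", ".join(f"{p:.4f}" for p in probs))
```

In this two-player instance, starting exactly at the uniform mixed strategy leaves the strategies unchanged, while the slightly perturbed start above drifts toward the pure assignment with one player per machine. This is only a small numerical illustration of the behavior the abstract claims, under the assumptions stated above.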

References

[1]

E. Akin. The Geometry of Population Genetics. Springer-Verlag, 1979.

[2]

S. Arora, E. Hazan, and S. Kale. The multiplicative weights update method: A meta-algorithm and applications. Preprint.

[3]

A. Blum, E. Even-Dar, and K. Ligett. Routing without regret: on convergence to Nash equilibria of regret-minimizing algorithms in routing games. In Proc. of PODC, 45--52, 2006.

[4]

A. Blum, M. Hajiaghayi, K. Ligett, and A. Roth. Regret minimization and the price of total anarchy. In Proc. of STOC, 373--382, 2008.

[5]

A. Blum and Y. Mansour. Learning, Regret Minimization and Equilibria, in Algorithmic Game Theory (eds. N. Nisan, T. Roughgarden, E. Tardos and V. Vazirani), Cambridge University Press, 2007.

[6]

E. Even-Dar and Y. Mansour. Fast convergence of selfish rerouting. SODA, 2005.

[7]

S. Fischer, H. Racke, and B. Vocking. Fast convergence to Wardrop equilibria by adaptive sampling methods. STOC, 2006.

[8]

S. Fischer, B. Vocking. On the evolution of selfish routing. In Proc. of the 12th European Symposium on Algorithms (ESA '04), pages 323--334. Springer, 2004.

[9]

D. Foster and S. M. Kakade. Deterministic calibration and Nash equilibrium. In Proc. of COLT, pp. 33--48, 2004.

[10]

D. Foster, and R. Vohra. Calibrated learning and correlated equilibrium. Games and Economic Behavior 21:40--55, 1997.

[11]

D. Foster and H. P. Young. Learning, hypothesis testing, and Nash equilibrium. Games and Economic Behavior, 45:73--96, 2003.

[12]

D. Foster and H. P. Young. Regret testing: Learning to play Nash equilibrium without knowing you have an opponent. Theoretical Economics, 1:341--367, 2006.

[13]

Y. Freund, and R. Schapire. Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29:79--103, 1999.

[14]

D. Fudenberg and D. Levine. The theory of learning in games. MIT Press, 1998.

[15]

F. Germano and G. Lugosi. Global Nash convergence of Foster and Young's regret testing. Games and Economic Behavior, 60:135--154, 2007.

[16]

M. Goemans, V. Mirrokni, and A. Vetta. Sink Equilibria and Convergence. IEEE Symposium on Foundations of Computer Science (FOCS), 2005.

[17]

J. Hannan, Approximation to Bayes risk in repeated plays. In M. Dresher, A. Tucker, and P. Wolfe eds, Contributions to the Theory of Games, vol 3, pp 79--139, Princeton University Press.

[18]

S. Hart, and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127--1150, 2000.

[19]

S. Hart and A. Mas-Colell. Stochastic uncoupled dynamics and Nash equilibrium. Games and Economic Behavior, 57:286--303, 2006.

[20]

J. Hofbauer and K. Sigmund. Evolutionary Games and Population Dynamics. Cambridge Univ. Press, 1998.

[21]

E. Koutsoupias and C. H. Papadimitriou. Worst-case equilibria. In Proceedings of the 16th Annual Symposium on Theoretical Aspects of Computer Science, 404--413, 1999.

[22]

Nick Littlestone and Manfred K. Warmuth. The weighted majority algorithm. Information and Computation, 108(2):212--260, 1994.

[23]

J. Maynard Smith. Evolution and the Theory of Games. Cambridge University Press, 1982.

[24]

J. W. Milnor. Topology from the differentiable viewpoint. Princeton Landmarks in Mathematics. Princeton University Press, 1997.

[25]

V. Mirrokni and A. Vetta, Convergence Issues in Competitive Games, RANDOM-APPROX, 2004.

[26]

D. Monderer and L. S. Shapley. Potential games. Games and Economic Behavior 14: 124--143, 1996.

[27]

D. Monderer and L. S. Shapley. Fictitious play property for games with identical interests. Journal of Economic Theory 68:258--265, 1996.

[28]

L. Perko. Differential Equations and Dynamical Systems. Springer, 1991.

[29]

K. Ritzberger and J. Weibull. Evolutionary selection in normal--form games. Econometrica 63, 1371--1399, 1995.

[30]

R. W. Rosenthal. A class of games possessing pure-strategy Nash equilibria. International Journal of Game Theory 2:65--67, 1973.

[31]

T. Roughgarden. Routing Games, in Algorithmic Game Theory (eds. N. Nisan, T. Roughgarden, E. Tardos, V. Vazirani). Cambridge University Press. 2007.

[32]

T. Roughgarden. Intrinsic Robustness of the Price of Anarchy. In Proceedings of the 41st Annual ACM Symposium on Theory of Computing (STOC), 2009.

[33]

L. Samuelson and J. Zhang. Evolutionary stability in asymmetric games. Journal of Economic Theory 57:363--391, 1992.

[34]

W. H. Sandholm. Potential games with continuous player sets. J. of Economic Theory 97:81--108, 2001.

[35]

W. H. Sandholm. Population Games and Evolutionary Dynamics. Manuscript, October 2008.

[36]

I. R. Shafarevich. Basic Algebraic Geometry, volume 1. Miles Reid, translator. Springer-Verlag, 1994.

[37]

J. M. Steele. An Efron-Stein inequality for nonsymmetric statistics. Annals of Statistics 14(2):753--758, 1986.

[38]

B. Vocking. Selfish Load Balancing. In Algorithmic Game Theory (eds. N. Nisan, T. Roughgarden, E. Tardos, V. Vazirani). Cambridge Univ. Press. 2007.

[39]

J. Weibull. Evolutionary Game Theory. MIT Press, 1997.

Index Terms

Multiplicative updates outperform generic no-regret learning in congestion games: extended abstract
1. Applied computing
  1. Law, social and behavioral sciences
2. Theory of computation
  1. Design and analysis of algorithms

Information

Published In

STOC '09: Proceedings of the forty-first annual ACM symposium on Theory of computing

May 2009

750 pages

ISBN:9781605585062

DOI:10.1145/1536414

Program Chair:
Michael Mitzenmacher
Harvard University

Copyright © 2009 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 May 2009

Qualifiers

Research-article

Conference

STOC '09: Symposium on Theory of Computing

May 31 - June 2, 2009

Bethesda, MD, USA

Acceptance Rates

Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

Bibliometrics

Total Citations: 81
Total Downloads: 831
Downloads (Last 12 months): 72
Downloads (Last 6 weeks): 6

Reflects downloads up to 28 Feb 2025

Cited By

Branzei S, Mehta R, Nisan N (2025). Tit-for-tat strategies drive growth and inequality in production economies. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 481:2306. DOI: 10.1098/rspa.2024.0533. Online publication date: 22-Jan-2025.
https://doi.org/10.1098/rspa.2024.0533
Carissimo C (2024). Counter-Intuitive Effects of Q-Learning Exploration in a Congestion Dilemma. IEEE Access 12 (15984-15996). DOI: 10.1109/ACCESS.2024.3358608. Online publication date: 2024.
https://doi.org/10.1109/ACCESS.2024.3358608
Sun Y, Liu T, Zhou R, Kumar P, Shahrampour S, Oh A, Naumann T, Globerson A, Saenko K, Hardt M, Levine S (2023). Provably fast convergence of independent natural policy gradient for Markov potential games. Proceedings of the 37th International Conference on Neural Information Processing Systems (43951-43971). DOI: 10.5555/3666122.3668029. Online publication date: 10-Dec-2023.
https://dl.acm.org/doi/10.5555/3666122.3668029
Chhablani C, Sullins M, Kash I, Agmon N, An B, Ricci A, Yeoh W (2023). Multiplicative Weight Updates for Extensive Form Games. Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems (1071-1078). DOI: 10.5555/3545946.3598747. Online publication date: 30-May-2023.
https://dl.acm.org/doi/10.5555/3545946.3598747
Piliouras G, Yu F, Leyton-Brown K, Samuelson L, Hartline J (2023). Multi-agent Performative Prediction: From Global Stability and Optimality to Chaos. Proceedings of the 24th ACM Conference on Economics and Computation (1047-1074). DOI: 10.1145/3580507.3597759. Online publication date: 9-Jul-2023.
https://dl.acm.org/doi/10.1145/3580507.3597759
Cho B, Xiao Y (2023). A Repeated Unknown Game: Decentralized Task Offloading in Vehicular Fog Computing. IEEE Transactions on Vehicular Technology 72:10 (13430-13446). DOI: 10.1109/TVT.2023.3275120. Online publication date: Oct-2023.
https://doi.org/10.1109/TVT.2023.3275120
Seaton J, Brown P (2023). On the Intrinsic Fragility of the Price of Anarchy. IEEE Control Systems Letters 7 (3573-3578). DOI: 10.1109/LCSYS.2023.3335315. Online publication date: 2023.
https://doi.org/10.1109/LCSYS.2023.3335315
Chiu C, Maheshwari C, Su P, Sastry S (2023). Arc-Based Traffic Assignment: Equilibrium Characterization and Learning. 2023 62nd IEEE Conference on Decision and Control (CDC) (7751-7758). DOI: 10.1109/CDC49753.2023.10384151. Online publication date: 13-Dec-2023.
https://doi.org/10.1109/CDC49753.2023.10384151
Clarke S, Dragotto G, Fisac J, Stellato B (2023). Learning Rationality in Potential Games. 2023 62nd IEEE Conference on Decision and Control (CDC) (4261-4266). DOI: 10.1109/CDC49753.2023.10383714. Online publication date: 13-Dec-2023.
https://doi.org/10.1109/CDC49753.2023.10383714
Engel M, Piliouras G (2023). A stochastic variant of replicator dynamics in zero-sum games and its invariant measures. Physica D: Nonlinear Phenomena 456 (133940). DOI: 10.1016/j.physd.2023.133940. Online publication date: Dec-2023.
https://doi.org/10.1016/j.physd.2023.133940
