skip to main content
10.1145/1536414.1536487acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
research-article

Multiplicative updates outperform generic no-regret learning in congestion games: extended abstract

Published: 31 May 2009 Publication History

Abstract

We study the outcome of natural learning algorithms in atomic congestion games. Atomic congestion games have a wide variety of equilibria often with vastly differing social costs. We show that in almost all such games, the well-known multiplicative-weights learning algorithm results in convergence to pure equilibria. Our results show that natural learning behavior can avoid bad outcomes predicted by the price of anarchy in atomic congestion games such as the load-balancing game introduced by Koutsoupias and Papadimitriou, which has super-constant price of anarchy and has correlated equilibria that are exponentially worse than any mixed Nash equilibrium.
Our results identify a set of mixed Nash equilibria that we call weakly stable equilibria. Our notion of weakly stable is defined game-theoretically, but we show that this property holds whenever a stability criterion from the theory of dynamical systems is satisfied. This allows us to show that in every congestion game, the distribution of play converges to the set of weakly stable equilibria. Pure Nash equilibria are weakly stable, and we show using techniques from algebraic geometry that the converse is true with probability 1 when congestion costs are selected at random independently on each edge (from any monotonically parametrized distribution). We further extend our results to show that players can use algorithms with different (sufficiently small) learning rates, i.e. they can trade off convergence speed and long term average regret differently.

References

[1]
E. Akin. The Geometry of Population Genetics. Springer-Verlag, 1979.
[2]
S. Arora, E. Hazan, and S. Kale. The multiplicative weights update method: A meta-algorithm and applications. Preprint.
[3]
A. Blum, E. Even-Dar, and K. Ligett. Routing without regret: on convergence to Nash equilibria of regret-minimizing algorithms in routing games. In Proc. of PODC, 45--52, 2006.
[4]
A. Blum, M. Hajiaghayi, K. Ligett, and A. Roth. Regret Minimization and the Price of Total anarchy. In Proc. of STOC, 373--382, 2008.
[5]
A. Blum and Y. Mansour. Learning, Regret Minimization and Equilibria, in Algorithmic Game Theory (eds. N. Nisan, T. Roughgarden, E. Tardos and V. Vazirani), Cambridge University Press, 2007.
[6]
E. Even-Dar and Y. Mansour. Fast convergence of selfish rerouting. SODA, 2005.
[7]
S. Fischer, H. Racke, and B. Vocking. Fast convergence to wardrop equilibria by adaptive sampling methods. STOC, 2006.
[8]
S. Fischer, B. Vocking. On the evolution of selfish routing. In Proc. of the 12th European Symposium on Algorithms (ESA '04), pages 323--334. Springer, 2004.
[9]
D. Foster and S. M. Kakade. Deterministic calibration and Nash equilibrium. In Proc. of COLT, pp. 33--48, 2004.
[10]
D. Foster, and R. Vohra. Calibrated learning and correlated equilibrium. Games and Economic Behavior 21:40--55, 1997.
[11]
D. Foster and H. P. Young. Learning, hypothesis testing, and Nash equilibrium. Games and Economic Behavior, 45:73, 96, 2003.
[12]
D. Foster and H. P. Young. Regret testing: Learning to play Nash equilibrium without knowing you have an opponent. Theoretical Economics, 1:341--367, 2006.
[13]
Y. Freund, and R. Schapire. Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29:79--103, 1999.
[14]
D. Fudenberg and D. Levine. The theory of learning in games. MIT Press, 1998.
[15]
F. Germano and G. Lugosi. Global Nash convergence of Foster and Young's regret testing. Games and Economic Behavior, 60:135--154, 2007.
[16]
M. Goemans, V. Mirrokni, and A. Vetta, Sink Equilibria and Convergence, IEEE Symposium on Foundations of Computing, 2005.
[17]
J. Hannan, Approximation to Bayes risk in repeated plays. In M. Dresher, A. Tucker, and P. Wolfe eds, Contributions to the Theory of Games, vol 3, pp 79--139, Princeton University Press.
[18]
S. Hart, and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127--1150, 2000.
[19]
S. Hart and A. Mas-Colell. Stochastic uncoupled dynamics and Nash equilibrium. Games and Economic Behavior, 57:286--303, 2006.
[20]
J. Hofbauer and K. Sigmund. Evolutionary Games and Population Dynamics. Cambridge Univ, Press, 1998.
[21]
E. Koutsoupias and C. H. Papadimitriou. Worst-case equilibria. In Proceedings of the 16th Annual Symposium on Theoretical Aspects of Computer Science, 404--413, 1999.
[22]
Nick Littlestone and Manfred K. Warmuth. The weighted majority algorithm. Information and Computation, 108(2):212--260, 1994.
[23]
J. Maynard Smith. Evolution and the Theory of Games. Cambridge University Press, 1982.
[24]
J. W. Milnor. Topology from the differentiable viewpoint. Princeton Landmarks in Mathematics. Princeton University Press, 1997.
[25]
V. Mirrokni and A. Vetta, Convergence Issues in Competitive Games, RANDOM-APPROX, 2004.
[26]
D. Monderer and L. S. Shapley. Potential games. Games and Economic Behavior 14: 124--143, 1996.
[27]
D. Monderer and L. S. Shapley. Fictitious play property for games with identical interests. Journal of Economic Theory 68:258--265, 1996.
[28]
L. Perko. Differential Equations and Dynamical Systems. Springer, 1991.
[29]
K. Ritzberger and J. Weibull. Evolutionary selection in normal--form games. Econometrica 63, 1371--1399, 1995.
[30]
R. W. Rosenthal. A class of games possessing pure-strategy Nash equilibria. International Journal of Game Theory 2:65--67, 1973.
[31]
T. Roughgarden. Routing Games, in Algorithmc Game Theory (eds. N. Nisan, T. Roughgarden, E. Tardos, V. Vazirani). Cambridge University Press. 2007.
[32]
T. Roughgarden. Intrinsic Robustness of the Price of Anarchy. In Proceedings of the 41th Annual ACM Symposium on Theory of Computing (STOC), 2009.
[33]
L. Samuelson and J. Zhang. Evolutionary stability in asymmetric games. Journal of Economic Theory 57:363--391, 1992.
[34]
W. H. Sandholm. Potential games with continuous player sets. J. of Economic Theory 97:81--108, 2001.
[35]
W. H. Sandholm. Population Games and Evolutionary Dynamics. Manuscript, October 2008.
[36]
I. R. Shafarevich. Basic Algebraic Geometry, volume 1. Miles Reid, translator. Springer-Verlag, 1994.
[37]
J. M. Steele. An Efron-Stein inequality for nonsymmetric statistics. Annals of Statistics 14(2):753--758, 1986.
[38]
B. Vocking. Selfish Load Balancing. In Algorithmic Game Theory (eds. N. Nisan, T. Roughgarden, E. Tardos, V. Vazirani). Cambridge Univ. Press. 2007.
[39]
J. Weibull. Evolutionary Game Theory. MIT Press, 1997.

Cited By

View all
  • (2025)Tit-for-tat strategies drive growth and inequality in production economiesProceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences10.1098/rspa.2024.0533481:2306Online publication date: 22-Jan-2025
  • (2024) Counter-Intuitive Effects of Q -Learning Exploration in a Congestion Dilemma IEEE Access10.1109/ACCESS.2024.335860812(15984-15996)Online publication date: 2024
  • (2023)Provably fast convergence of independent natural policy gradient for Markov potential gamesProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668029(43951-43971)Online publication date: 10-Dec-2023
  • Show More Cited By

Index Terms

  1. Multiplicative updates outperform generic no-regret learning in congestion games: extended abstract

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      STOC '09: Proceedings of the forty-first annual ACM symposium on Theory of computing
      May 2009
      750 pages
      ISBN:9781605585062
      DOI:10.1145/1536414
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 31 May 2009

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. game theory
      2. learning theory
      3. price of anarchy

      Qualifiers

      • Research-article

      Conference

      STOC '09
      Sponsor:
      STOC '09: Symposium on Theory of Computing
      May 31 - June 2, 2009
      MD, Bethesda, USA

      Acceptance Rates

      Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

      Upcoming Conference

      STOC '25
      57th Annual ACM Symposium on Theory of Computing (STOC 2025)
      June 23 - 27, 2025
      Prague , Czech Republic

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)72
      • Downloads (Last 6 weeks)6
      Reflects downloads up to 28 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)Tit-for-tat strategies drive growth and inequality in production economiesProceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences10.1098/rspa.2024.0533481:2306Online publication date: 22-Jan-2025
      • (2024) Counter-Intuitive Effects of Q -Learning Exploration in a Congestion Dilemma IEEE Access10.1109/ACCESS.2024.335860812(15984-15996)Online publication date: 2024
      • (2023)Provably fast convergence of independent natural policy gradient for Markov potential gamesProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668029(43951-43971)Online publication date: 10-Dec-2023
      • (2023)Multiplicative Weight Updates for Extensive Form GamesProceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems10.5555/3545946.3598747(1071-1078)Online publication date: 30-May-2023
      • (2023)Multi-agent Performative Prediction: From Global Stability and Optimality to ChaosProceedings of the 24th ACM Conference on Economics and Computation10.1145/3580507.3597759(1047-1074)Online publication date: 9-Jul-2023
      • (2023)A Repeated Unknown Game: Decentralized Task Offloading in Vehicular Fog ComputingIEEE Transactions on Vehicular Technology10.1109/TVT.2023.327512072:10(13430-13446)Online publication date: Oct-2023
      • (2023)On the Intrinsic Fragility of the Price of AnarchyIEEE Control Systems Letters10.1109/LCSYS.2023.33353157(3573-3578)Online publication date: 2023
      • (2023)Arc-Based Traffic Assignment: Equilibrium Characterization and Learning2023 62nd IEEE Conference on Decision and Control (CDC)10.1109/CDC49753.2023.10384151(7751-7758)Online publication date: 13-Dec-2023
      • (2023)Learning Rationality in Potential Games2023 62nd IEEE Conference on Decision and Control (CDC)10.1109/CDC49753.2023.10383714(4261-4266)Online publication date: 13-Dec-2023
      • (2023)A stochastic variant of replicator dynamics in zero-sum games and its invariant measuresPhysica D: Nonlinear Phenomena10.1016/j.physd.2023.133940456(133940)Online publication date: Dec-2023
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media