skip to main content
10.1145/3219166.3219235acmconferencesArticle/Chapter ViewAbstractPublication PagesecConference Proceedingsconference-collections
research-article

Multiplicative Weights Update in Zero-Sum Games

Published: 11 June 2018 Publication History

Abstract

We study the classic setting where two agents compete against each other in a zero-sum game by applying the Multiplicative Weights Update (MWU) algorithm. In a twist of the standard approach of [Freund and Schapire 1999], we focus on the K-L divergence from the equilibrium but instead of providing an upper bound about the rate of increase we provide a nonnegative lower bound for games with interior equilibria. This implies movement away from equilibria and towards the boundary. In the case of zero-sum games without interior equilibria convergence to the boundary (and in fact to the minimal product of subsimplexes that contains all Nash equilibria) follows via an orthogonal argument. In that subspace divergence from the set of NE applies for all nonequilibrium initial conditions via the first argument. We argue the robustness of this non-equilibrating behavior by considering the following generalizations: Step size: Agents may be using different and even decreasing step sizes. Dynamics: Agents may be using Follow-the-Regularized-Leader algorithms and possibly apply different regularizers (e.g. MWU versus Gradient Descent). We also consider a linearized version of MWU. More than two agents: Multiple agents can interact via arbitrary networks of zero-sum polymatrix games and their affine variants. Our results come in stark contrast with the standard interpretation of the behavior of MWU (and more generally regret minimizing dynamics) in zero-sum games, which is typically referred to as "converging to equilibrium". If equilibria are indeed predictive even for the benchmark class of zero-sum games, agents in practice must deviate robustly from the axiomatic perspective of optimization driven dynamics as captured by MWU and variants and apply carefully tailored equilibrium-seeking behavioral dynamics.

Supplementary Material

MP4 File (p321.mp4)

References

[1]
Maria-Florina Balcan, Florin Constantin, and Ruta Mehta. 2012. The Weighted Majority Algorithm does not Converge in Nearly Zero-sum Games ICML Workshop on Markets, Mechanisms and Multi-Agent Models.
[2]
G.W. Brown. 1951. Iterative Solutions of Games by Fictitious Play. In Activity Analysis of Production and Allocation, T.C. Koopmans (Ed.), New York: Wiley. (1951).
[3]
Yang Cai, Ozan Candogan, Constantinos Daskalakis, and Christos Papadimitriou. 2016. Zero-Sum Polymatrix Games: A Generalization of Minmax. Mathematics of Operations Research Vol. 41, 2 (2016), 648--655.
[4]
Yang Cai and Costantinos Daskalakis. 2011. On Minmax Theorems for Multiplayer Games. In ACM-SIAM Symposium on Discrete Algorithms (SODA). 217--234.
[5]
Erick Chastain, Adi Livnat, Christos Papadimitriou, and Umesh Vazirani. 2014. Algorithms, games, and evolution. Proceedings of the National Academy of Sciences (PNAS), Vol. 111, 29 (2014), 10620--10623.
[6]
Constantinos Daskalakis, Alan Deckelbaum, and Anthony Kim. 2011. Near-optimal No-regret Algorithms for Zero-sum Games Proceedings of the Twenty-second Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '11). Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 235--254. http://dl.acm.org/citation.cfm?id=2133036.2133057
[7]
C. Daskalakis, R. Frongillo, C. Papadimitriou, G. Pierrakos, and G. Valiant. 2010. On learning algorithms for Nash equilibria. Symposium on Algorithmic Game Theory (SAGT) (2010), 114--125.
[8]
Constantinos Daskalakis, Andrew Ilyas, Vasilis Syrgkanis, and Haoyang Zeng. 2018. Training GANs with Optimism.
[9]
Constantinos Daskalakis and Christos Papadimitriou. 2009. On a Network Generalization of the Minmax Theorem. ICALP. 423--434.
[10]
Dylan J Foster, Thodoris Lykouris, Karthik Sridharan, and Eva Tardos. 2016. Learning in games: Robustness of fast convergence. Advances in Neural Information Processing Systems. 4727--4735.
[11]
Yoav Freund and Robert E Schapire. 1999. Adaptive game playing using multiplicative weights. Games and Economic Behavior Vol. 29, 1--2 (1999), 79--103.
[12]
Drew Fudenberg and David K Levine. 1995. Consistency and cautious fictitious play. Journal of Economic Dynamics and Control Vol. 19, 5--7 (1995), 1065--1089.
[13]
Drew Fudenberg and David K. Levine. 1998. The Theory of Learning in Games. The MIT Press.
[14]
R. Kleinberg, K. Ligett, G. Piliouras, and É. Tardos. 2011. Beyond the Nash equilibrium barrier. In Symposium on Innovations in Computer Science (ICS).
[15]
Tien-Yien Li and James A. Yorke. 1975. Period Three Implies Chaos. The American Mathematical Monthly Vol. 82, 10 (1975), 985--992.
[16]
Nick Littlestone and Manfred K Warmuth. 1994. The weighted majority algorithm. Information and computation Vol. 108, 2 (1994), 212--261.
[17]
T. Mai, I. Panageas, W. Ratcliff, V. V. Vazirani, and P. Yunker. 2017. Rock-Paper-Scissors, Differential Games and Biological Diversity. ArXiv e-prints (Oct. 2017). {arxiv}math.DS/1710.11249
[18]
Ruta Mehta, Ioannis Panageas, and Georgios Piliouras. 2015. Natural Selection as an Inhibitor of Genetic Diversity: Multiplicative Weights Updates Algorithm and a Conjecture of Haploid Genetics Innovations in Theoretical Computer Science.
[19]
R. Mehta, I. Panageas, G. Piliouras, and S. Yazdanbod. 2016. The Computational Complexity of Genetic Diversity. European Symposium on Algorithms (ESA) (2016).
[20]
Panayotis Mertikopoulos, Christos Papadimitriou, and Georgios Piliouras. 2018. Cycles in adversarial regularized learning. In SODA.
[21]
Gerasimos Palaiopanos, Ioannis Panageas, and Georgios Piliouras. 2017. Multiplicative Weights Update with Constant Step-Size in Congestion Games: Convergence, Limit Cycles and Chaos. In Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS'17).
[22]
Christos Papadimitriou and Georgios Piliouras. 2016. From Nash equilibria to chain recurrent sets: Solution concepts and topology ITCS.
[23]
Georgios Piliouras, Carlos Nieto-Granda, Henrik I. Christensen, and Jeff S. Shamma. 2014. Persistent Patterns: Multi-agent Learning Beyond Equilibrium and Utility AAMAS. 181--188.
[24]
Georgios Piliouras and Jeff S Shamma. 2014. Optimization despite chaos: Convex relaxations to complex limit sets via Poincaré recurrence Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms. SIAM, 861--873.
[25]
Sasha Rakhlin and Karthik Sridharan. 2013. Optimization, learning, and games with predictable sequences Advances in Neural Information Processing Systems. 3066--3074.
[26]
J. Robinson. 1951. An Iterative Method of Solving a Game. Annals of Mathematics Vol. 54 (1951), 296--301.
[27]
William H. Sandholm. 2010. Population Games and Evolutionary Dynamics. MIT Press.
[28]
Vasilis Syrgkanis, Alekh Agarwal, Haipeng Luo, and Robert E. Schapire. 2015. Fast Convergence of Regularized Learning in Games. Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS'15). MIT Press, Cambridge, MA, USA, 2989--2997. http://dl.acm.org/citation.cfm?id=2969442.2969573
[29]
Eric Van Damme. 1991. Stability and perfection of Nash equilibria. Vol. Vol. 339. Springer.
[30]
John von Neumann. 1928. Zur Theorie der Gesellschaftsspiele. Math. Ann. Vol. 100 (1928), 295--300.
[31]
John von Neumann and Oskar Morgenstern. 1944. Theory of Games and Economic Behavior. Princeton University Press.

Cited By

View all
  • (2025)An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous GamesOpen Journal of Mathematical Optimization10.5802/ojmo.376(1-66)Online publication date: 15-Jan-2025
  • (2024)Last-iterate convergence separation between extra-gradient and optimisim in constrained periodic gamesProceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence10.5555/3702676.3702739(1339-1370)Online publication date: 15-Jul-2024
  • (2024)Configurable mirror descentProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3693200(28146-28203)Online publication date: 21-Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
EC '18: Proceedings of the 2018 ACM Conference on Economics and Computation
June 2018
713 pages
ISBN:9781450358293
DOI:10.1145/3219166
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. convex analysis
  2. follow-the-regularized-leader
  3. gradient descent
  4. learning in games
  5. multiplicative weights
  6. online learning

Qualifiers

  • Research-article

Funding Sources

Conference

EC '18
Sponsor:

Acceptance Rates

EC '18 Paper Acceptance Rate 70 of 269 submissions, 26%;
Overall Acceptance Rate 664 of 2,389 submissions, 28%

Upcoming Conference

EC '25
The 25th ACM Conference on Economics and Computation
July 7 - 11, 2025
Stanford , CA , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)156
  • Downloads (Last 6 weeks)27
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous GamesOpen Journal of Mathematical Optimization10.5802/ojmo.376(1-66)Online publication date: 15-Jan-2025
  • (2024)Last-iterate convergence separation between extra-gradient and optimisim in constrained periodic gamesProceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence10.5555/3702676.3702739(1339-1370)Online publication date: 15-Jul-2024
  • (2024)Configurable mirror descentProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3693200(28146-28203)Online publication date: 21-Jul-2024
  • (2024)Prediction accuracy of learning in gamesProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692604(13278-13325)Online publication date: 21-Jul-2024
  • (2024)Complex Dynamics in Autobidding SystemsProceedings of the 25th ACM Conference on Economics and Computation10.1145/3670865.3673551(75-100)Online publication date: 8-Jul-2024
  • (2024)Periodicity in dynamical games driven by the Hedge algorithm and myopic best response2024 IEEE 63rd Conference on Decision and Control (CDC)10.1109/CDC56724.2024.10885811(3203-3208)Online publication date: 16-Dec-2024
  • (2024)Learning in games: a systematic reviewScience China Information Sciences10.1007/s11432-023-3955-x67:7Online publication date: 28-Jun-2024
  • (2024)Swim till You Sink: Computing the Limit of a GameAlgorithmic Game Theory10.1007/978-3-031-71033-9_12(205-222)Online publication date: 31-Aug-2024
  • (2023)Local convergence of gradient methods for min-max gamesProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668780(60841-60852)Online publication date: 10-Dec-2023
  • (2023)Uncoupled and convergent learning in two-player zero-sum Markov games with bandit feedbackProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667701(36364-36406)Online publication date: 10-Dec-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media