Multiplicative Weights Update in Zero-Sum Games

Authors:

James P. Bailey,

Georgios Piliouras

EC '18: Proceedings of the 2018 ACM Conference on Economics and Computation

Pages 321 - 338

https://doi.org/10.1145/3219166.3219235

Published: 11 June 2018

Abstract

We study the classic setting where two agents compete against each other in a zero-sum game by applying the Multiplicative Weights Update (MWU) algorithm. In a twist on the standard approach of [Freund and Schapire 1999], we focus on the K-L divergence from the equilibrium, but instead of providing an upper bound on its rate of increase we provide a nonnegative lower bound for games with interior equilibria. This implies movement away from equilibria and towards the boundary. In the case of zero-sum games without interior equilibria, convergence to the boundary (and in fact to the minimal product of subsimplexes that contains all Nash equilibria) follows via an orthogonal argument. In that subspace, divergence from the set of Nash equilibria (NE) applies for all nonequilibrium initial conditions via the first argument.

We argue the robustness of this non-equilibrating behavior by considering the following generalizations:

Step size: Agents may be using different and even decreasing step sizes.
Dynamics: Agents may be using Follow-the-Regularized-Leader algorithms and possibly apply different regularizers (e.g., MWU versus Gradient Descent). We also consider a linearized version of MWU.
More than two agents: Multiple agents can interact via arbitrary networks of zero-sum polymatrix games and their affine variants.

Our results come in stark contrast with the standard interpretation of the behavior of MWU (and more generally regret-minimizing dynamics) in zero-sum games, which is typically referred to as "converging to equilibrium". If equilibria are indeed predictive even for the benchmark class of zero-sum games, agents in practice must deviate robustly from the axiomatic perspective of optimization-driven dynamics as captured by MWU and variants and apply carefully tailored equilibrium-seeking behavioral dynamics.
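The divergence claim in the abstract can be illustrated numerically. The following is a minimal sketch, not the authors' code: it assumes the exponential form of MWU, simultaneous updates, a fixed step size of 0.1, and Matching Pennies as the game (chosen here only because its unique equilibrium is interior). The names `mwu_step`, `kl`, and `simulate`, and the starting strategies, are hypothetical choices made for the illustration.

```python
import numpy as np

# Matching Pennies payoff matrix for the row player (an illustrative choice,
# not taken from the paper); the column player receives the negated payoff.
A = np.array([[1.0, -1.0], [-1.0, 1.0]])

def mwu_step(p, payoffs, eps):
    """One exponential-form MWU update: reweight by exp(eps * payoff), renormalize."""
    w = p * np.exp(eps * payoffs)
    return w / w.sum()

def kl(p_star, p):
    """K-L divergence KL(p_star || p) for strictly positive probability vectors."""
    return float(np.sum(p_star * np.log(p_star / p)))

def simulate(x, y, x_star, y_star, eps=0.1, steps=200):
    """Run simultaneous MWU and record KL(x*||x_t) + KL(y*||y_t) at every step."""
    divergences = [kl(x_star, x) + kl(y_star, y)]
    for _ in range(steps):
        # Both players respond to the opponent's current (not yet updated) strategy.
        x_next = mwu_step(x, A @ y, eps)
        y_next = mwu_step(y, -A.T @ x, eps)
        x, y = x_next, y_next
        divergences.append(kl(x_star, x) + kl(y_star, y))
    return divergences

uniform = np.array([0.5, 0.5])  # the interior equilibrium of Matching Pennies
divs = simulate(np.array([0.6, 0.4]), np.array([0.45, 0.55]), uniform, uniform)
print(f"initial divergence: {divs[0]:.4f}, final divergence: {divs[-1]:.4f}")
```

Under these assumptions the recorded divergence should grow at every step, which is the qualitative behavior the abstract describes: the strategies drift away from the equilibrium rather than toward it. The step size, horizon, and initial strategies are arbitrary and can be varied.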

Supplementary Material

MP4 File (p321.mp4), 487.50 MB

References

[1]

Maria-Florina Balcan, Florin Constantin, and Ruta Mehta. 2012. The Weighted Majority Algorithm does not Converge in Nearly Zero-sum Games. ICML Workshop on Markets, Mechanisms and Multi-Agent Models.

[2]

G.W. Brown. 1951. Iterative Solutions of Games by Fictitious Play. In Activity Analysis of Production and Allocation, T.C. Koopmans (Ed.). Wiley, New York.

[3]

Yang Cai, Ozan Candogan, Constantinos Daskalakis, and Christos Papadimitriou. 2016. Zero-Sum Polymatrix Games: A Generalization of Minmax. Mathematics of Operations Research Vol. 41, 2 (2016), 648--655.

[4]

Yang Cai and Constantinos Daskalakis. 2011. On Minmax Theorems for Multiplayer Games. In ACM-SIAM Symposium on Discrete Algorithms (SODA). 217--234.

[5]

Erick Chastain, Adi Livnat, Christos Papadimitriou, and Umesh Vazirani. 2014. Algorithms, games, and evolution. Proceedings of the National Academy of Sciences (PNAS), Vol. 111, 29 (2014), 10620--10623.

[6]

Constantinos Daskalakis, Alan Deckelbaum, and Anthony Kim. 2011. Near-optimal No-regret Algorithms for Zero-sum Games. Proceedings of the Twenty-second Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '11). Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 235--254. http://dl.acm.org/citation.cfm?id=2133036.2133057

[7]

C. Daskalakis, R. Frongillo, C. Papadimitriou, G. Pierrakos, and G. Valiant. 2010. On learning algorithms for Nash equilibria. Symposium on Algorithmic Game Theory (SAGT) (2010), 114--125.

[8]

Constantinos Daskalakis, Andrew Ilyas, Vasilis Syrgkanis, and Haoyang Zeng. 2018. Training GANs with Optimism.

[9]

Constantinos Daskalakis and Christos Papadimitriou. 2009. On a Network Generalization of the Minmax Theorem. ICALP. 423--434.

[10]

Dylan J Foster, Thodoris Lykouris, Karthik Sridharan, and Eva Tardos. 2016. Learning in games: Robustness of fast convergence. Advances in Neural Information Processing Systems. 4727--4735.

[11]

Yoav Freund and Robert E Schapire. 1999. Adaptive game playing using multiplicative weights. Games and Economic Behavior Vol. 29, 1--2 (1999), 79--103.

[12]

Drew Fudenberg and David K Levine. 1995. Consistency and cautious fictitious play. Journal of Economic Dynamics and Control Vol. 19, 5--7 (1995), 1065--1089.

[13]

Drew Fudenberg and David K. Levine. 1998. The Theory of Learning in Games. The MIT Press.

[14]

R. Kleinberg, K. Ligett, G. Piliouras, and É. Tardos. 2011. Beyond the Nash equilibrium barrier. In Symposium on Innovations in Computer Science (ICS).

[15]

Tien-Yien Li and James A. Yorke. 1975. Period Three Implies Chaos. The American Mathematical Monthly Vol. 82, 10 (1975), 985--992.

[16]

Nick Littlestone and Manfred K Warmuth. 1994. The weighted majority algorithm. Information and computation Vol. 108, 2 (1994), 212--261.

[17]

T. Mai, I. Panageas, W. Ratcliff, V. V. Vazirani, and P. Yunker. 2017. Rock-Paper-Scissors, Differential Games and Biological Diversity. ArXiv e-prints (Oct. 2017). arXiv:1710.11249 [math.DS]

[18]

Ruta Mehta, Ioannis Panageas, and Georgios Piliouras. 2015. Natural Selection as an Inhibitor of Genetic Diversity: Multiplicative Weights Updates Algorithm and a Conjecture of Haploid Genetics. Innovations in Theoretical Computer Science.

[19]

R. Mehta, I. Panageas, G. Piliouras, and S. Yazdanbod. 2016. The Computational Complexity of Genetic Diversity. European Symposium on Algorithms (ESA) (2016).

[20]

Panayotis Mertikopoulos, Christos Papadimitriou, and Georgios Piliouras. 2018. Cycles in adversarial regularized learning. In SODA.

[21]

Gerasimos Palaiopanos, Ioannis Panageas, and Georgios Piliouras. 2017. Multiplicative Weights Update with Constant Step-Size in Congestion Games: Convergence, Limit Cycles and Chaos. In Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS'17).

[22]

Christos Papadimitriou and Georgios Piliouras. 2016. From Nash equilibria to chain recurrent sets: Solution concepts and topology. ITCS.

[23]

Georgios Piliouras, Carlos Nieto-Granda, Henrik I. Christensen, and Jeff S. Shamma. 2014. Persistent Patterns: Multi-agent Learning Beyond Equilibrium and Utility. AAMAS. 181--188.

[24]

Georgios Piliouras and Jeff S Shamma. 2014. Optimization despite chaos: Convex relaxations to complex limit sets via Poincaré recurrence. Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms. SIAM, 861--873.

[25]

Sasha Rakhlin and Karthik Sridharan. 2013. Optimization, learning, and games with predictable sequences. Advances in Neural Information Processing Systems. 3066--3074.

[26]

J. Robinson. 1951. An Iterative Method of Solving a Game. Annals of Mathematics Vol. 54 (1951), 296--301.

[27]

William H. Sandholm. 2010. Population Games and Evolutionary Dynamics. MIT Press.

[28]

Vasilis Syrgkanis, Alekh Agarwal, Haipeng Luo, and Robert E. Schapire. 2015. Fast Convergence of Regularized Learning in Games. Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS'15). MIT Press, Cambridge, MA, USA, 2989--2997. http://dl.acm.org/citation.cfm?id=2969442.2969573

[29]

Eric Van Damme. 1991. Stability and perfection of Nash equilibria. Vol. 339. Springer.

[30]

John von Neumann. 1928. Zur Theorie der Gesellschaftsspiele. Math. Ann. Vol. 100 (1928), 295--300.

[31]

John von Neumann and Oskar Morgenstern. 1944. Theory of Games and Economic Behavior. Princeton University Press.

Index Terms

Multiplicative Weights Update in Zero-Sum Games
1. Networks
  1. Network properties
    1. Network dynamics
2. Theory of computation
  1. Design and analysis of algorithms
    1. Online algorithms
      1. Online learning algorithms
  2. Theory and algorithms for application domains
    1. Algorithmic game theory and mechanism design
    2. Machine learning theory
      1. Multi-agent learning
      2. Online learning theory

Information & Contributors

Information

Published In

EC '18: Proceedings of the 2018 ACM Conference on Economics and Computation

June 2018

713 pages

ISBN:9781450358293

DOI:10.1145/3219166

General Chair: Eva Tardos (Cornell University, USA)
Program Chairs: Edith Elkind (University of Oxford, UK), Rakesh Vohra (University of Pennsylvania, USA)

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGecom: Special Interest Group on Economics and Computation

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

EC '18

Sponsor:

SIGecom

EC '18: ACM Conference on Economics and Computation

June 18 - 22, 2018

Ithaca, NY, USA

Acceptance Rates

EC '18 Paper Acceptance Rate 70 of 269 submissions, 26%;

Overall Acceptance Rate 664 of 2,389 submissions, 28%

Bibliometrics & Citations

Article Metrics

Total Citations: 35
Total Downloads: 876
Downloads (Last 12 months): 156
Downloads (Last 6 weeks): 27

Reflects downloads up to 28 Feb 2025

Citations

Cited By

Wang G, Chizat L (2025). An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous Games. Open Journal of Mathematical Optimization 6, 1-66. Online publication date: 15-Jan-2025.
https://doi.org/10.5802/ojmo.37
Feng Y, Li P, Panageas I, Wang X, Kiyavash N, Mooij J (2024). Last-iterate convergence separation between extra-gradient and optimism in constrained periodic games. Proceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence, 1339-1370. Online publication date: 15-Jul-2024.
https://dl.acm.org/doi/10.5555/3702676.3702739
Li P, Li S, Yang C, Wang X, Hu S, Huang X, Chan H, An B, Salakhutdinov R, Kolter Z, Heller K, Weller A, Oliver N, Scarlett J, Berkenkamp F (2024). Configurable mirror descent. Proceedings of the 41st International Conference on Machine Learning, 28146-28203. Online publication date: 21-Jul-2024.
https://dl.acm.org/doi/10.5555/3692070.3693200
Feng Y, Piliouras G, Wang X, Salakhutdinov R, Kolter Z, Heller K, Weller A, Oliver N, Scarlett J, Berkenkamp F (2024). Prediction accuracy of learning in games. Proceedings of the 41st International Conference on Machine Learning, 13278-13325. Online publication date: 21-Jul-2024.
https://dl.acm.org/doi/10.5555/3692070.3692604
Paes Leme R, Piliouras G, Schneider J, Spendlove K, Zuo S, Kleinberg R, Saban D, Bergemann D (2024). Complex Dynamics in Autobidding Systems. Proceedings of the 25th ACM Conference on Economics and Computation, 75-100. Online publication date: 8-Jul-2024.
https://dl.acm.org/doi/10.1145/3670865.3673551
Guo X, Mu Y, Yang X (2024). Periodicity in dynamical games driven by the Hedge algorithm and myopic best response. 2024 IEEE 63rd Conference on Decision and Control (CDC), 3203-3208. Online publication date: 16-Dec-2024.
https://doi.org/10.1109/CDC56724.2024.10885811
Qin R, Yu Y (2024). Learning in games: a systematic review. Science China Information Sciences 67:7. Online publication date: 28-Jun-2024.
https://doi.org/10.1007/s11432-023-3955-x
Hakim R, Milionis J, Papadimitriou C, Piliouras G (2024). Swim till You Sink: Computing the Limit of a Game. Algorithmic Game Theory, 205-222. Online publication date: 31-Aug-2024.
https://doi.org/10.1007/978-3-031-71033-9_12
Wang G, Chizat L, Oh A, Naumann T, Globerson A, Saenko K, Hardt M, Levine S (2023). Local convergence of gradient methods for min-max games. Proceedings of the 37th International Conference on Neural Information Processing Systems, 60841-60852. Online publication date: 10-Dec-2023.
https://dl.acm.org/doi/10.5555/3666122.3668780
Cai Y, Luo H, Wei C, Zheng W, Oh A, Naumann T, Globerson A, Saenko K, Hardt M, Levine S (2023). Uncoupled and convergent learning in two-player zero-sum Markov games with bandit feedback. Proceedings of the 37th International Conference on Neural Information Processing Systems, 36364-36406. Online publication date: 10-Dec-2023.
https://dl.acm.org/doi/10.5555/3666122.3667701
