An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous Games

Guillaume Wang; Lénaïc Chizat

doi:10.5802/ojmo.37

Guillaume Wang ¹; Lénaïc Chizat ¹

¹ Institute of Mathematics, École polytechnique fédérale de Lausanne (EPFL), Station Z, CH-1015 Lausanne

Open Journal of Mathematical Optimization, Volume 6 (2025), article no. 1, 66 p.

Abstract

We consider the problem of computing mixed Nash equilibria of two-player zero-sum games with continuous sets of pure strategies and with first-order access to the payoff function. This problem arises for example in game-theory-inspired machine learning applications, such as distributionally-robust learning. In those applications, the strategy sets are high-dimensional and thus methods based on discretisation cannot tractably return high-accuracy solutions. In this paper, we introduce and analyze a particle-based method that enjoys guaranteed local convergence for this problem. This method consists in parametrizing the mixed strategies as atomic measures and applying proximal point updates to both the atoms’ weights and positions. It can be interpreted as an implicit time discretization of the “interacting” Wasserstein–Fisher–Rao gradient flow.

We prove that, under non-degeneracy assumptions, this method converges at an exponential rate to the exact mixed Nash equilibrium from any initialization satisfying a natural notion of closeness to optimality. We illustrate our results with numerical experiments and discuss applications to max-margin and distributionally-robust classification using two-layer neural networks, where our method has a natural interpretation as a simultaneous training of the network’s weights and of the adversarial distribution.

Received: 2024-02-01
Revised: 2024-08-28
Accepted: 2024-09-16
Published online: 2025-01-15

DOI: 10.5802/ojmo.37

Author's affiliations:

Guillaume Wang ¹; Lénaïc Chizat ¹

¹ Institute of Mathematics, École polytechnique fédérale de Lausanne (EPFL), Station Z, CH-1015 Lausanne

License:

CC-BY 4.0

Copyrights: The authors retain unrestricted copyrights and publishing rights

@article{OJMO_2025__6__A1_0,
     author = {Guillaume Wang and L\'ena{\"\i}c Chizat},
     title = {An {Exponentially} {Converging} {Particle} {Method} for the {Mixed} {Nash} {Equilibrium} of {Continuous} {Games}},
     journal = {Open Journal of Mathematical Optimization},
     eid = {1},
     pages = {1--66},
     publisher = {Universit\'e de Montpellier},
     volume = {6},
     year = {2025},
     doi = {10.5802/ojmo.37},
     language = {en},
     url = {https://ojmo.centre-mersenne.org/articles/10.5802/ojmo.37/}
}

TY  - JOUR
AU  - Guillaume Wang
AU  - Lénaïc Chizat
TI  - An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous Games
JO  - Open Journal of Mathematical Optimization
PY  - 2025
SP  - 1
EP  - 66
VL  - 6
PB  - Université de Montpellier
UR  - https://ojmo.centre-mersenne.org/articles/10.5802/ojmo.37/
DO  - 10.5802/ojmo.37
LA  - en
ID  - OJMO_2025__6__A1_0
ER  -

%0 Journal Article
%A Guillaume Wang
%A Lénaïc Chizat
%T An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous Games
%J Open Journal of Mathematical Optimization
%D 2025
%P 1-66
%V 6
%I Université de Montpellier
%U https://ojmo.centre-mersenne.org/articles/10.5802/ojmo.37/
%R 10.5802/ojmo.37
%G en
%F OJMO_2025__6__A1_0

Guillaume Wang; Lénaïc Chizat. An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous Games. Open Journal of Mathematical Optimization, Volume 6 (2025), article  no. 1, 66 p. doi : 10.5802/ojmo.37. https://ojmo.centre-mersenne.org/articles/10.5802/ojmo.37/

References
Cited by

[1] Ehsan Amid; Manfred K. K. Warmuth Reparameterizing mirror descent as gradient descent, Adv. Neural Inf. Process. Syst., Volume 33 (2020), pp. 8430-8439

[2] Francis Bach Breaking the curse of dimensionality with convex neural networks, J. Mach. Learn. Theory, Volume 18 (2017) no. 1, pp. 629-681

[3] James P. Bailey; Georgios Piliouras Multiplicative weights update in zero-sum games, Proceedings of the 2018 ACM Conference on Economics and Computation (2018), pp. 321-338 | DOI

[4] Sébastien Bubeck ORF523: Mirror Descent, part II/II | I’m a bandit, 2013 (Accessed 2022-06-08 https://blogs.princeton.edu/imabandit/2013/04/18/orf523-mirror-descent-part-iiii/)

[5] Sébastien Bubeck ORF523: Mirror Prox | I’m a bandit, 2013 (Accessed 2022-06-08 https://blogs.princeton.edu/imabandit/2013/04/23/orf523-mirror-prox/)

[6] Shicong Cen; Yuting Wei; Yuejie Chi Fast policy extragradient methods for competitive games with entropy regularization, Adv. Neural Inf. Process. Syst., Volume 34 (2021), pp. 27952-27964

[7] Lénaïc Chizat Convergence rates of gradient methods for convex optimization in the space of measures (2021) | arXiv

[8] Lénaïc Chizat Sparse optimization on measures with over-parameterized gradient descent, Math. Program., Volume 194 (2022) no. 1, pp. 487-532 | DOI | MR | Zbl

[9] Lénaïc Chizat; Gabriel Peyré; Bernhard Schmitzer; François-Xavier Vialard Unbalanced optimal transport: Dynamic and Kantorovich formulations, J. Funct. Anal., Volume 274 (2018) no. 11, pp. 3090-3123 | DOI | MR | Zbl

[10] Constantinos Daskalakis; Ioannis Panageas Last-Iterate Convergence: Zero-Sum Games and Constrained Min-Max Optimization, 10th innovations in theoretical computer science conference, ITCS 2019, January 10–12, 2019, San Diego, CA, USA (LIPIcs – Leibniz International Proceedings in Informatics), Volume 124, Leibniz Zentrum für Informatik (2019) article no. 27 (18 pages) | MR | Zbl

[11] Jelena Diakonikolas; Constantinos Daskalakis; Michael I. Jordan Efficient methods for structured nonconvex-nonconcave min-max optimization, International Conference on Artificial Intelligence and Statistics, PMLR (2021), pp. 2746-2754

[12] Carles Domingo-Enrich; Samy Jelassi; Arthur Mensch; Grant Rotskoff; Joan Bruna A mean-field analysis of two-player zero-sum games, Adv. Neural Inf. Process. Syst., Volume 33 (2020), pp. 20215-20226

[13] Gauthier Gidel; Hugo Berard; Gaëtan Vignoud; Pascal Vincent; Simon Lacoste-Julien A Variational Inequality Perspective on Generative Adversarial Networks, International Conference on Learning Representations (2018)

[14] Irving L. Glicksberg A further generalization of the Kakutani fixed point theorem, with application to Nash equilibrium points, Proc. Am. Math. Soc., Volume 3 (1952) no. 1, pp. 170-174 | MR | Zbl

[15] Charles Guille-Escuret; Manuela Girotti; Baptiste Goujaud; Ioannis Mitliagkas A study of condition numbers for first-order optimization, International Conference on Artificial Intelligence and Statistics, PMLR (2021), pp. 1261-1269

[16] Ya-Ping Hsieh; Chen Liu; Volkan Cevher Finding mixed Nash equilibria of generative adversarial networks, International Conference on Machine Learning, PMLR (2019), pp. 2810-2819

[17] Yu-Guan Hsieh; Franck Iutzeler; Jérôme Malick; Panayotis Mertikopoulos Explore aggressively, update conservatively: Stochastic extragradient methods with variable stepsize scaling, Adv. Neural Inf. Process. Syst., Volume 33 (2020), pp. 16223-16234

[18] Mohammad Reza Karimi Jaghargh; Ya-Ping Hsieh; Andreas Krause A Dynamical System View of Langevin-Based Non-Convex Sampling, Advances in Neural Information Processing Systems 36 (NeurIPS 2023) (2023), pp. 41051-41075

[19] Mohammad Reza Karimi Jaghargh; Ya-Ping Hsieh; Andreas Krause Stochastic Approximation Algorithms for Systems of Interacting Particles, Advances in Neural Information Processing Systems 36 (NeurIPS 2023) (2023), pp. 55826-88847

[20] Stanislav Kondratyev; Léonard Monsaingeon; Dmitry Vorotnikov A new optimal transport distance on the space of finite Radon measures, Adv. Differ. Equ., Volume 21 (2016) no. 11/12, pp. 1117-1164 | MR | Zbl

[21] Tengyuan Liang; James Stokes Interaction matters: A note on non-asymptotic local convergence of generative adversarial networks, The 22nd International Conference on Artificial Intelligence and Statistics, PMLR (2019), pp. 907-915

[22] Matthias Liero; Alexander Mielke; Giuseppe Savaré Optimal entropy-transport problems and a new Hellinger–Kantorovich distance between positive measures, Invent. Math., Volume 211 (2018) no. 3, pp. 969-1117 | DOI | MR | Zbl

[23] Yulong Lu Two-Scale Gradient Descent Ascent Dynamics Finds Mixed Nash Equilibria of Continuous Games: A Mean-Field Perspective (2022) | arXiv

[24] Chao Ma; Lexing Ying Provably convergent quasistatic dynamics for mean-field two-player zero-sum games, International Conference on Learning Representations (2021)

[25] Panayotis Mertikopoulos; Ya-Ping Hsieh; Volkan Cevher Learning in games from a stochastic approximation viewpoint (2022) | arXiv

[26] Panayotis Mertikopoulos; Bruno Lecouat; Houssam Zenati; Chuan-Sheng Foo; Vijay Chandrasekhar; Georgios Piliouras Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile, International Conference on Learning Representations (2018)

[27] Panayotis Mertikopoulos; Christos Papadimitriou; Georgios Piliouras Cycles in adversarial regularized learning, Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, Association for Computing Machinery (2018), pp. 2703-2717 | Zbl

[28] Peyman Mohajerin Esfahani; Daniel Kuhn Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations, Math. Program., Volume 171 (2018) no. 1, pp. 115-166 | DOI | MR | Zbl

[29] Aryan Mokhtari; Asuman Ozdaglar; Sarath Pattathil A unified analysis of extra-gradient and optimistic gradient methods for saddle point problems: Proximal point approach, International Conference on Artificial Intelligence and Statistics, PMLR (2020), pp. 1497-1507

[30] Arkadi Nemirovski Prox-method with rate of convergence O (1/t) for variational inequalities with Lipschitz continuous monotone operators and smooth convex-concave saddle point problems, SIAM J. Optim., Volume 15 (2004) no. 1, pp. 229-251 | DOI | MR | Zbl

[31] Allan Pinkus Approximation theory of the MLP model in neural networks, Acta Numer., Volume 8 (1999), pp. 143-195 | DOI | Zbl

[32] Filippo Santambrogio ${$ Euclidean, metric, and Wasserstein $}$ gradient flows: an overview, Bull. Math. Sci., Volume 7 (2017) no. 1, pp. 87-154 | DOI | MR | Zbl

[33] Maurice Sion On general minimax theorems, Pac. J. Math., Volume 8 (1958) no. 1, pp. 171-176 | DOI | MR | Zbl

[34] Noah D. Stein; Asuman Ozdaglar; Pablo A. Parrilo Separable and low-rank continuous games, Int. J. Game Theory, Volume 37 (2008) no. 4, pp. 475-504 | DOI | MR | Zbl

[35] Paul Tseng On linear convergence of iterative methods for the variational inequality problem, J. Comput. Appl. Math., Volume 60 (1995) no. 1-2, pp. 237-252 | DOI | MR | Zbl

[36] Guillaume Wang; Lénaïc Chizat Local convergence of gradient methods for min-max games: partial curvature generically suffices, Advances in Neural Information Processing Systems 36 (NeurIPS 2023) (2023), pp. 60841-60852

[37] Chen-Yu Wei; Chung-Wei Lee; Mengxiao Zhang; Haipeng Luo Linear Last-iterate Convergence in Constrained Saddle-point Optimization, International Conference on Learning Representations (2020)

Cited by Sources: