Recursive Reasoning in Minimax Games: A Level $k$ Gradient Play Method

Liu, Zichu; Pavel, Lacra

arXiv:2210.16482 (cs)

[Submitted on 29 Oct 2022]

Abstract: Despite the success of generative adversarial networks (GANs) in generating visually appealing images, they are notoriously challenging to train. To stabilize the learning dynamics in minimax games, we propose a novel recursive reasoning algorithm: the Level $k$ Gradient Play (Lv.$k$ GP) algorithm. In contrast to many existing algorithms, our algorithm does not require sophisticated heuristics or curvature information. We show that as $k$ increases, Lv.$k$ GP converges asymptotically towards an accurate estimation of players' future strategy. Moreover, we justify that Lv.$\infty$ GP naturally generalizes a line of provably convergent game dynamics which rely on predictive updates. Furthermore, we establish its local convergence in nonconvex-nonconcave zero-sum games and its global convergence in bilinear and quadratic games. By combining Lv.$k$ GP with the Adam optimizer, our algorithm shows a clear advantage in terms of performance and computational overhead compared to other methods. Using a single Nvidia RTX 3090 GPU and 30 times fewer parameters than BigGAN on CIFAR-10, we achieve an FID of 10.17 for unconditional image generation within 30 hours, allowing GAN training on common computational resources to reach state-of-the-art performance.
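The abstract does not spell out the update rule, so the following is only a minimal sketch of what level-$k$ recursive reasoning could look like on a toy bilinear game. It assumes that at level 0 each player expects the opponent to keep its current strategy, and that at each higher level a player takes one gradient step from its current strategy against the opponent's previous-level prediction. The game $f(x, y) = xy$, the function names `level_k_step` and `run`, the step size, and the starting point are all hypothetical choices made for illustration, not details taken from the paper.

```python
import math


def grad_x(x, y):
    # Gradient of the assumed toy objective f(x, y) = x * y with respect to x.
    return y


def grad_y(x, y):
    # Gradient of the assumed toy objective f(x, y) = x * y with respect to y.
    return x


def level_k_step(x, y, k, lr):
    """One assumed level-k update for min over x, max over y of f(x, y)."""
    # Level 0: each player predicts that the opponent stays where it is.
    x_pred, y_pred = x, y
    for _ in range(k):
        # Both predictions are refreshed simultaneously from the previous
        # level: x descends against y_pred, y ascends against x_pred.
        x_pred, y_pred = (
            x - lr * grad_x(x, y_pred),
            y + lr * grad_y(x_pred, y),
        )
    return x_pred, y_pred


def run(k, steps=100, lr=0.5, x0=1.0, y0=1.0):
    """Iterate the update and return the final distance to the equilibrium (0, 0)."""
    x, y = x0, y0
    for _ in range(steps):
        x, y = level_k_step(x, y, k, lr)
    return math.hypot(x, y)


if __name__ == "__main__":
    for k in (1, 2, 4):
        print(f"k={k}: distance to (0, 0) after 100 steps = {run(k):.3e}")
```

Under these assumptions, `k=1` reduces to simultaneous gradient descent-ascent, which spirals away from the equilibrium on this game, while `k=2` already contracts towards it. This matches the abstract's claim of convergence in bilinear games only for this toy setting and this assumed rule.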

Comments:	For the code associated with this paper, see this https URL
Subjects:	Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:2210.16482 [cs.LG]
	(or arXiv:2210.16482v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.16482

Submission history

From: Zichu Liu
[v1] Sat, 29 Oct 2022 03:43:59 UTC (3,284 KB)
