Finding and Certifying (Near-)Optimal Strategies in Black-Box Extensive-Form Games

Zhang, Brian Hu; Sandholm, Tuomas

Computer Science > Computer Science and Game Theory

arXiv:2009.07384 (cs)

[Submitted on 15 Sep 2020 (v1), last revised 17 Mar 2021 (this version, v3)]

Title:Finding and Certifying (Near-)Optimal Strategies in Black-Box Extensive-Form Games

Authors:Brian Hu Zhang, Tuomas Sandholm

View PDF

Abstract:Often -- for example in war games, strategy video games, and financial simulations -- the game is given to us only as a black-box simulator in which we can play it. In these settings, since the game may have unknown nature action distributions (from which we can only obtain samples) and/or be too large to expand fully, it can be difficult to compute strategies with guarantees on exploitability. Recent work \cite{Zhang20:Small} resulted in a notion of certificate for extensive-form games that allows exploitability guarantees while not expanding the full game tree. However, that work assumed that the black box could sample or expand arbitrary nodes of the game tree at any time, and that a series of exact game solves (via, for example, linear programming) can be conducted to compute the certificate. Each of those two assumptions severely restricts the practical applicability of that method. In this work, we relax both of the assumptions. We show that high-probability certificates can be obtained with a black box that can do nothing more than play through games, using only a regret minimizer as a subroutine. As a bonus, we obtain an equilibrium-finding algorithm with $\tilde O(1/\sqrt{T})$ convergence rate in the extensive-form game setting that does not rely on a sampling strategy with lower-bounded reach probabilities (which MCCFR assumes). We demonstrate experimentally that, in the black-box setting, our methods are able to provide nontrivial exploitability guarantees while expanding only a small fraction of the game tree.

Comments:	AAAI 2021
Subjects:	Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:2009.07384 [cs.GT]
	(or arXiv:2009.07384v3 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2009.07384

Submission history

From: Brian Zhang [view email]
[v1] Tue, 15 Sep 2020 23:11:14 UTC (147 KB)
[v2] Wed, 16 Dec 2020 19:06:47 UTC (210 KB)
[v3] Wed, 17 Mar 2021 04:47:57 UTC (211 KB)

Computer Science > Computer Science and Game Theory

Title:Finding and Certifying (Near-)Optimal Strategies in Black-Box Extensive-Form Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Science and Game Theory

Title:Finding and Certifying (Near-)Optimal Strategies in Black-Box Extensive-Form Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators