Lazy-CFR: fast and near optimal regret minimization for extensive games with imperfect information

Zhou, Yichi; Ren, Tongzheng; Li, Jialian; Yan, Dong; Zhu, Jun

Computer Science > Machine Learning

arXiv:1810.04433 (cs)

[Submitted on 10 Oct 2018 (v1), last revised 25 Dec 2018 (this version, v3)]

Title:Lazy-CFR: fast and near optimal regret minimization for extensive games with imperfect information

Authors:Yichi Zhou, Tongzheng Ren, Jialian Li, Dong Yan, Jun Zhu

View PDF

Abstract:Counterfactual regret minimization (CFR) is the most popular algorithm on solving two-player zero-sum extensive games with imperfect information and achieves state-of-the-art performance in practice. However, the performance of CFR is not fully understood, since empirical results on the regret are much better than the upper bound proved in \cite{zinkevich2008regret}. Another issue is that CFR has to traverse the whole game tree in each round, which is time-consuming in large scale games. In this paper, we present a novel technique, lazy update, which can avoid traversing the whole game tree in CFR, as well as a novel analysis on the regret of CFR with lazy update. Our analysis can also be applied to the vanilla CFR, resulting in a much tighter regret bound than that in \cite{zinkevich2008regret}. Inspired by lazy update, we further present a novel CFR variant, named Lazy-CFR. Compared to traversing $O(|\mathcal{I}|)$ information sets in vanilla CFR, Lazy-CFR needs only to traverse $O(\sqrt{|\mathcal{I}|})$ information sets per round while keeping the regret bound almost the same, where $\mathcal{I}$ is the class of all information sets. As a result, Lazy-CFR shows better convergence result compared with vanilla CFR. Experimental results consistently show that Lazy-CFR outperforms the vanilla CFR significantly.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1810.04433 [cs.LG]
	(or arXiv:1810.04433v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.04433

Submission history

From: Yichi Zhou [view email]
[v1] Wed, 10 Oct 2018 09:24:39 UTC (228 KB)
[v2] Sun, 25 Nov 2018 05:17:34 UTC (231 KB)
[v3] Tue, 25 Dec 2018 04:54:33 UTC (237 KB)

Computer Science > Machine Learning

Title:Lazy-CFR: fast and near optimal regret minimization for extensive games with imperfect information

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Lazy-CFR: fast and near optimal regret minimization for extensive games with imperfect information

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators