Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback

Jordan, Michael I.; Lin, Tianyi; Zhou, Zhengyuan

arXiv:2310.14085 (cs)

[Submitted on 21 Oct 2023 (v1), last revised 28 Mar 2024 (this version, v4)]

Abstract:Online gradient descent (OGD) is well known to be doubly optimal under strong convexity or monotonicity assumptions: (1) in the single-agent setting, it achieves an optimal regret of $\Theta(\log T)$ for strongly convex cost functions; and (2) in the multi-agent setting of strongly monotone games, with each agent employing OGD, we obtain last-iterate convergence of the joint action to a unique Nash equilibrium at an optimal rate of $\Theta(\frac{1}{T})$. While these finite-time guarantees highlight its merits, OGD has the drawback that it requires knowing the strong convexity/monotonicity parameters. In this paper, we design a fully adaptive OGD algorithm, \textsf{AdaOGD}, that does not require a priori knowledge of these parameters. In the single-agent setting, our algorithm achieves $O(\log^2(T))$ regret under strong convexity, which is optimal up to a log factor. Further, if each agent employs \textsf{AdaOGD} in strongly monotone games, the joint action converges in a last-iterate sense to a unique Nash equilibrium at a rate of $O(\frac{\log^3 T}{T})$, again optimal up to log factors. We illustrate our algorithms in a learning version of the classical newsvendor problem, where due to lost sales, only (noisy) gradient feedback can be observed. Our results immediately yield the first feasible and near-optimal algorithm for both the single-retailer and multi-retailer settings. We also extend our results to the more general setting of exp-concave cost functions and games, using the online Newton step (ONS) algorithm.
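
To make concrete why classical OGD needs the strong convexity parameter, the following is a minimal sketch of projected OGD with the standard step size $\eta_t = 1/(\mu t)$ on a one-dimensional strongly convex quadratic with noisy gradient feedback. It is not the paper's \textsf{AdaOGD}, whose details are not given in the abstract. The function names (`project`, `ogd_strongly_convex`, `noisy_grad`), the target value 3.0, the noise level, and the feasible interval are all illustrative assumptions.

```python
import random


def project(x, lo, hi):
    """Euclidean projection of a scalar onto the interval [lo, hi]."""
    return max(lo, min(hi, x))


def ogd_strongly_convex(grad, mu, x0, T, lo=-10.0, hi=10.0):
    """Projected OGD with step size 1 / (mu * t).

    The schedule depends explicitly on mu, the strong convexity
    parameter, which is the prior knowledge the abstract says an
    adaptive method should not require.
    """
    x = x0
    iterates = [x]
    for t in range(1, T + 1):
        eta = 1.0 / (mu * t)  # requires knowing mu in advance
        x = project(x - eta * grad(x, t), lo, hi)
        iterates.append(x)
    return iterates


# Hypothetical test problem: f(x) = 0.5 * mu * (x - 3)^2, minimized at x = 3.
mu = 2.0
rng = random.Random(0)  # fixed seed for reproducibility


def noisy_grad(x, t):
    # Exact gradient mu * (x - 3) plus zero-mean Gaussian noise (assumed model).
    return mu * (x - 3.0) + rng.gauss(0.0, 1.0)


xs = ogd_strongly_convex(noisy_grad, mu, x0=0.0, T=5000)
final_error = abs(xs[-1] - 3.0)
print(f"final iterate: {xs[-1]:.4f}, distance to minimizer: {final_error:.4f}")
```

In this sketch the step size is only well tuned because `mu` is passed in by hand; if the curvature were unknown or misspecified, the schedule $1/(\mu t)$ could not be formed as written, which is the limitation the abstract attributes to OGD.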

Comments:	Accepted by Operations Research; 47 pages
Subjects:	Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2310.14085 [cs.GT]
	(or arXiv:2310.14085v4 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2310.14085

Submission history

From: Tianyi Lin
[v1] Sat, 21 Oct 2023 18:38:13 UTC (44 KB)
[v2] Tue, 24 Oct 2023 04:16:23 UTC (43 KB)
[v3] Wed, 15 Nov 2023 04:15:45 UTC (43 KB)
[v4] Thu, 28 Mar 2024 19:37:02 UTC (44 KB)
