$(\epsilon, u)$-Adaptive Regret Minimization in Heavy-Tailed Bandits

Genalti, Gianmarco; Marsigli, Lupo; Gatti, Nicola; Metelli, Alberto Maria

arXiv:2310.02975 (cs)

[Submitted on 4 Oct 2023 (v1), last revised 12 Feb 2024 (this version, v2)]


Abstract: Heavy-tailed distributions naturally arise in several settings, from finance to telecommunications. While regret minimization under subgaussian or bounded rewards has been widely studied, learning with heavy-tailed distributions has gained popularity only over the last decade. In this paper, we consider the setting in which the reward distributions have finite absolute raw moments of maximum order $1+\epsilon$, uniformly bounded by a constant $u<+\infty$, for some $\epsilon \in (0,1]$. In this setting, we study the regret minimization problem when $\epsilon$ and $u$ are unknown to the learner, which must therefore adapt to them. First, we show that adaptation comes at a cost and derive two negative results proving that the regret guarantees of the non-adaptive case cannot be achieved without further assumptions. Then, we devise and analyze a fully data-driven trimmed mean estimator and propose a novel adaptive regret minimization algorithm, AdaR-UCB, that leverages this estimator. Finally, we show that AdaR-UCB is the first algorithm that, under a known distributional assumption, enjoys regret guarantees nearly matching those of the non-adaptive heavy-tailed case.
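For orientation, the non-adaptive baseline mentioned in the abstract can be illustrated with the classical truncated empirical mean from the heavy-tailed bandit literature, which requires $\epsilon$ and $u$ as inputs. The sketch below is not the paper's data-driven estimator and not AdaR-UCB; the function name `truncated_mean`, its signature, and the confidence parameter `delta` are assumptions made for illustration only. It shows why knowing $\epsilon$ and $u$ matters: both enter the truncation threshold directly.

```python
import math


def truncated_mean(samples, eps, u, delta):
    """Truncated empirical mean for the NON-adaptive case (eps and u known).

    Illustrative sketch only: the i-th sample (1-indexed) contributes to the
    sum only if |x_i| <= (u * i / log(1/delta)) ** (1 / (1 + eps));
    otherwise it is replaced by 0. The sum is divided by the sample count.
    """
    if not samples:
        raise ValueError("need at least one sample")
    if not (0.0 < eps <= 1.0) or u <= 0.0 or not (0.0 < delta < 1.0):
        raise ValueError("require eps in (0, 1], u > 0, delta in (0, 1)")

    log_term = math.log(1.0 / delta)
    total = 0.0
    for i, x in enumerate(samples, start=1):
        # The threshold grows with i, so later samples are trimmed less aggressively.
        threshold = (u * i / log_term) ** (1.0 / (1.0 + eps))
        if abs(x) <= threshold:
            total += x
    return total / len(samples)
```

Because the threshold depends on both `eps` and `u`, a learner that does not know them cannot compute this estimator as written, which is the gap the adaptive setting in the abstract addresses.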

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2310.02975 [cs.LG]
	(or arXiv:2310.02975v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.02975

Submission history

From: Gianmarco Genalti
[v1] Wed, 4 Oct 2023 17:11:15 UTC (30 KB)
[v2] Mon, 12 Feb 2024 10:39:44 UTC (59 KB)