(Bandit) Convex Optimization with Biased Noisy Gradient Oracles

Xiaowei Hu, Prashanth L.A., András György, Csaba Szepesvári

arXiv:1609.07087 (cs)

[Submitted on 22 Sep 2016 (v1), last revised 4 Jul 2020 (this version, v2)]

Abstract: Algorithms for bandit convex optimization and online learning often rely on constructing noisy gradient estimates, which then replace the actual gradients in appropriately adjusted first-order algorithms. Depending on the properties of the function to be optimized and the nature of the "noise" in the bandit feedback, the bias and variance of the gradient estimates exhibit various tradeoffs. In this paper we propose a novel framework that replaces the specific gradient estimation methods with an abstract oracle. With the help of the new framework we unify previous works, reproducing their results in a clean and concise fashion. Perhaps more importantly, the framework also allows us to formally show that, to achieve the optimal root-$n$ rate, either the algorithms that use existing gradient estimators or the proof techniques used to analyze them have to go beyond what exists today.
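
To make the setting concrete, the following is a minimal sketch of one common way a gradient estimate can be built from noisy function values and plugged into plain gradient descent. It is an illustration only and is not the abstract oracle or any algorithm from the paper: the two-point estimator, the names `noisy_value`, `two_point_gradient`, `gradient_descent`, and `quadratic`, and all parameter values are assumptions chosen for the example. The smoothing radius `delta` hints at the tradeoff mentioned in the abstract: the observation noise enters the estimate divided by `delta`, so shrinking `delta` inflates the variance, while for non-quadratic functions a larger `delta` generally increases the bias.

```python
import random


def quadratic(x):
    """Hypothetical test objective f(x) = ||x||^2, minimized at the origin."""
    return sum(v * v for v in x)


def noisy_value(f, x, rng, sigma):
    """Bandit feedback: the function value at x plus zero-mean Gaussian noise."""
    return f(x) + rng.gauss(0.0, sigma)


def two_point_gradient(f, x, delta, rng, sigma):
    """Estimate the gradient of f at x from two noisy function evaluations.

    A direction u is drawn uniformly from the unit sphere and the estimate is
    d * (y_plus - y_minus) / (2 * delta) * u, where y_plus and y_minus are
    noisy values of f at x + delta * u and x - delta * u.
    """
    d = len(x)
    g = [rng.gauss(0.0, 1.0) for _ in range(d)]
    norm = sum(v * v for v in g) ** 0.5
    u = [v / norm for v in g]
    y_plus = noisy_value(f, [xi + delta * ui for xi, ui in zip(x, u)], rng, sigma)
    y_minus = noisy_value(f, [xi - delta * ui for xi, ui in zip(x, u)], rng, sigma)
    scale = d * (y_plus - y_minus) / (2.0 * delta)
    return [scale * ui for ui in u]


def gradient_descent(f, x0, steps, eta, delta, sigma, seed=0):
    """Run gradient descent with the estimated gradient in place of the true one."""
    rng = random.Random(seed)
    x = list(x0)
    for _ in range(steps):
        grad = two_point_gradient(f, x, delta, rng, sigma)
        x = [xi - eta * gi for xi, gi in zip(x, grad)]
    return x  # last iterate


if __name__ == "__main__":
    # Noise-free feedback: the iterate approaches the minimizer at the origin.
    print(gradient_descent(quadratic, [1.0, -1.0], 500, 0.05, 0.1, sigma=0.0))
    # Noisy feedback: the iterate keeps fluctuating around the minimizer.
    print(gradient_descent(quadratic, [1.0, -1.0], 500, 0.05, 0.1, sigma=0.1))
```

With `sigma=0.0` the iterate contracts toward the origin; with `sigma > 0` the noise term, scaled by `1 / delta`, keeps the iterate fluctuating, which is why step sizes and `delta` are typically tuned together in this kind of scheme.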

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1609.07087 [cs.LG]
(or arXiv:1609.07087v2 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.1609.07087

Submission history

From: L.A. Prashanth
[v1] Thu, 22 Sep 2016 17:56:38 UTC (424 KB)
[v2] Sat, 4 Jul 2020 22:16:51 UTC (382 KB)
