Online learning with kernel losses

Pacchiano, Aldo; Chatterji, Niladri S.; Bartlett, Peter L.

arXiv:1802.09732 (stat)

[Submitted on 27 Feb 2018]

Abstract: We present a generalization of the adversarial linear bandits framework, where the underlying losses are kernel functions (with an associated reproducing kernel Hilbert space) rather than linear functions. We study a version of the exponential weights algorithm and bound its regret in this setting. Under conditions on the eigendecay of the kernel we provide a sharp characterization of the regret for this algorithm. When we have polynomial eigendecay $\mu_j \le \mathcal{O}(j^{-\beta})$, we find that the regret is bounded by $\mathcal{R}_n \le \mathcal{O}(n^{\beta/(2(\beta-1))})$; while under the assumption of exponential eigendecay $\mu_j \le \mathcal{O}(e^{-\beta j})$, we get an even tighter bound on the regret $\mathcal{R}_n \le \mathcal{O}(n^{1/2}\log(n)^{1/2})$. We also study the full information setting when the underlying losses are kernel functions and present an adapted exponential weights algorithm and a conditional gradient descent algorithm.
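To make the full information setting concrete, here is a minimal sketch of standard exponential weights (Hedge) run on kernel losses. It is not the algorithm analyzed in the paper: it assumes a finite grid of actions, a Gaussian kernel, uniformly random adversary points, and the textbook learning rate for losses in [0, 1]. The names `gaussian_kernel`, `exponential_weights`, `actions`, and `adversary_points` are hypothetical and chosen only for illustration.

```python
import numpy as np


def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) kernel with values in (0, 1]; an illustrative choice (assumption)."""
    diff = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return float(np.exp(-np.dot(diff, diff) / (2.0 * bandwidth ** 2)))


def exponential_weights(actions, adversary_points, eta, kernel=gaussian_kernel):
    """Full-information exponential weights over a finite action set (assumption).

    In round t the loss of action a is kernel(a, w_t), where w_t is the
    adversary's point. Returns the learner's cumulative expected loss and
    the cumulative loss of the best fixed action in hindsight.
    """
    cumulative_losses = np.zeros(len(actions))
    learner_loss = 0.0
    for w in adversary_points:
        # Play a with probability proportional to exp(-eta * past loss of a).
        logits = -eta * cumulative_losses
        logits -= logits.max()  # shift for numerical stability
        probs = np.exp(logits)
        probs /= probs.sum()
        # Full information: the loss of every action is observed.
        losses = np.array([kernel(a, w) for a in actions])
        learner_loss += float(probs @ losses)
        cumulative_losses += losses
    return learner_loss, float(cumulative_losses.min())


rng = np.random.default_rng(0)
actions = np.linspace(-1.0, 1.0, 21).reshape(-1, 1)  # hypothetical action grid
n_rounds = 500
adversary_points = rng.uniform(-1.0, 1.0, size=(n_rounds, 1))  # hypothetical adversary
eta = np.sqrt(8.0 * np.log(len(actions)) / n_rounds)  # textbook rate for losses in [0, 1]

learner_loss, best_loss = exponential_weights(actions, adversary_points, eta)
regret = learner_loss - best_loss
print(f"expected regret after {n_rounds} rounds: {regret:.3f}")
```

With this learning rate, the classical Hedge analysis bounds the expected regret by $\sqrt{(n/2)\ln N}$ for $N$ actions and losses in [0, 1]. The bandit setting of the abstract is harder because only the loss of the played action is observed, so the learner must also build loss estimates.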

Comments:	40 pages, 4 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1802.09732 [stat.ML]
	(or arXiv:1802.09732v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1802.09732

Submission history

From: Niladri Chatterji
[v1] Tue, 27 Feb 2018 06:07:54 UTC (512 KB)
