Learning on the Edge: Online Learning with Stochastic Feedback Graphs

Esposito, Emmanuel; Fusco, Federico; van der Hoeven, Dirk; Cesa-Bianchi, Nicolò

Computer Science > Machine Learning

arXiv:2210.04229 (cs)

[Submitted on 9 Oct 2022]

Title:Learning on the Edge: Online Learning with Stochastic Feedback Graphs

Authors:Emmanuel Esposito, Federico Fusco, Dirk van der Hoeven, Nicolò Cesa-Bianchi

View PDF

Abstract:The framework of feedback graphs is a generalization of sequential decision-making with bandit or full information feedback. In this work, we study an extension where the directed feedback graph is stochastic, following a distribution similar to the classical Erdős-Rényi model. Specifically, in each round every edge in the graph is either realized or not with a distinct probability for each edge. We prove nearly optimal regret bounds of order $\min\bigl\{\min_{\varepsilon} \sqrt{(\alpha_\varepsilon/\varepsilon) T},\, \min_{\varepsilon} (\delta_\varepsilon/\varepsilon)^{1/3} T^{2/3}\bigr\}$ (ignoring logarithmic factors), where $\alpha_{\varepsilon}$ and $\delta_{\varepsilon}$ are graph-theoretic quantities measured on the support of the stochastic feedback graph $\mathcal{G}$ with edge probabilities thresholded at $\varepsilon$. Our result, which holds without any preliminary knowledge about $\mathcal{G}$, requires the learner to observe only the realized out-neighborhood of the chosen action. When the learner is allowed to observe the realization of the entire graph (but only the losses in the out-neighborhood of the chosen action), we derive a more efficient algorithm featuring a dependence on weighted versions of the independence and weak domination numbers that exhibits improved bounds for some special cases.

Subjects:	Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:2210.04229 [cs.LG]
	(or arXiv:2210.04229v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.04229
Journal reference:	Advances in Neural Information Processing Systems 35 (NeurIPS 2022)

Submission history

From: Emmanuel Esposito [view email]
[v1] Sun, 9 Oct 2022 11:21:08 UTC (71 KB)

Computer Science > Machine Learning

Title:Learning on the Edge: Online Learning with Stochastic Feedback Graphs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning on the Edge: Online Learning with Stochastic Feedback Graphs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators