Robust Multi-Agent Bandits Over Undirected Graphs

Vial, Daniel; Shakkottai, Sanjay; Srikant, R.

Computer Science > Machine Learning

arXiv:2203.00076 (cs)

[Submitted on 28 Feb 2022 (v1), last revised 26 Jan 2023 (this version, v2)]

Title:Robust Multi-Agent Bandits Over Undirected Graphs

Authors:Daniel Vial, Sanjay Shakkottai, R. Srikant

View PDF

Abstract:We consider a multi-agent multi-armed bandit setting in which $n$ honest agents collaborate over a network to minimize regret but $m$ malicious agents can disrupt learning arbitrarily. Assuming the network is the complete graph, existing algorithms incur $O( (m + K/n) \log (T) / \Delta )$ regret in this setting, where $K$ is the number of arms and $\Delta$ is the arm gap. For $m \ll K$, this improves over the single-agent baseline regret of $O(K\log(T)/\Delta)$.
In this work, we show the situation is murkier beyond the case of a complete graph. In particular, we prove that if the state-of-the-art algorithm is used on the undirected line graph, honest agents can suffer (nearly) linear regret until time is doubly exponential in $K$ and $n$. In light of this negative result, we propose a new algorithm for which the $i$-th agent has regret $O( ( d_{\text{mal}}(i) + K/n) \log(T)/\Delta)$ on any connected and undirected graph, where $d_{\text{mal}}(i)$ is the number of $i$'s neighbors who are malicious. Thus, we generalize existing regret bounds beyond the complete graph (where $d_{\text{mal}}(i) = m$), and show the effect of malicious agents is entirely local (in the sense that only the $d_{\text{mal}}(i)$ malicious agents directly connected to $i$ affect its long-term regret).

Subjects:	Machine Learning (cs.LG); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
Cite as:	arXiv:2203.00076 [cs.LG]
	(or arXiv:2203.00076v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.00076
Journal reference:	Proceedings of the ACM on Measurement and Analysis of Computing Systems, December 2022

Submission history

From: Daniel Vial [view email]
[v1] Mon, 28 Feb 2022 20:21:55 UTC (2,027 KB)
[v2] Thu, 26 Jan 2023 19:39:16 UTC (3,436 KB)

Computer Science > Machine Learning

Title:Robust Multi-Agent Bandits Over Undirected Graphs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Robust Multi-Agent Bandits Over Undirected Graphs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators