The Gossiping Insert-Eliminate Algorithm for Multi-Agent Bandits

Chawla, Ronshee; Sankararaman, Abishek; Ganesh, Ayalvadi; Shakkottai, Sanjay

Computer Science > Machine Learning

arXiv:2001.05452 (cs)

[Submitted on 15 Jan 2020 (v1), last revised 2 Jul 2024 (this version, v4)]

Abstract: We consider a decentralized multi-agent Multi-Armed Bandit (MAB) setup consisting of $N$ agents solving the same MAB instance, each aiming to minimize its individual cumulative regret. In our model, agents collaborate by exchanging messages through pairwise gossip-style communications on an arbitrary connected graph. We develop two novel algorithms in which each agent plays only from a subset of all the arms. Agents use the communication medium to recommend only arm-IDs (not samples), and thereby update the set of arms from which they play. We establish that, if agents communicate $\Omega(\log(T))$ times through any connected pairwise gossip mechanism, then every agent's regret is a factor of order $N$ smaller than in the case of no collaboration. Furthermore, we show that the communication constraints have only a second-order effect on the regret of our algorithm. We then analyze this second-order term of the regret to derive bounds on the regret-communication tradeoffs. Finally, we empirically evaluate our algorithm and conclude that the insights are fundamental and not artifacts of our bounds. We also show a lower bound which establishes that the regret scaling obtained by our algorithm cannot be improved even in the absence of any communication constraints. Our results thus demonstrate that even a minimal level of collaboration among agents greatly reduces regret for all agents.
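The mechanism described in the abstract (each agent plays from a small set of arms and exchanges only arm-IDs over a gossip graph) can be illustrated with a toy simulation. The following is a minimal sketch and not the paper's algorithm. It makes several assumptions that are not in the abstract: rewards are Bernoulli, each agent runs a standard UCB index on its active arms, communication happens at time steps that are powers of two (roughly $\log_2 T$ rounds), each agent asks one uniformly random neighbor for that neighbor's most-played arm of the phase, and the receiver evicts its own least-played arm of the phase to make room. The names `Agent`, `simulate`, `means`, and `neighbors` are hypothetical, and the sketch assumes there are at least as many arms as agents.

```python
import math
import random


class Agent:
    """One agent running a UCB index over a small active set of arm IDs."""

    def __init__(self, active_arms, num_arms):
        self.active = set(active_arms)
        self.pulls = [0] * num_arms    # local pull counts (own samples only)
        self.sums = [0.0] * num_arms   # local reward sums (own samples only)
        self.phase_pulls = {}          # pulls made during the current phase
        self.t = 0

    def choose(self):
        self.t += 1
        # Try every active arm once before using the index.
        for arm in sorted(self.active):
            if self.pulls[arm] == 0:
                return arm

        def ucb(arm):
            mean = self.sums[arm] / self.pulls[arm]
            return mean + math.sqrt(2.0 * math.log(self.t) / self.pulls[arm])

        return max(sorted(self.active), key=ucb)

    def update(self, arm, reward):
        self.pulls[arm] += 1
        self.sums[arm] += reward
        self.phase_pulls[arm] = self.phase_pulls.get(arm, 0) + 1

    def recommend(self):
        # Only an arm ID leaves the agent, never a reward sample.
        return max(sorted(self.phase_pulls), key=self.phase_pulls.get)

    def receive(self, arm):
        if arm in self.active:
            return
        # Assumed eviction rule: drop the active arm played least this phase.
        victim = min(sorted(self.active),
                     key=lambda a: self.phase_pulls.get(a, 0))
        self.active.discard(victim)
        self.active.add(arm)


def simulate(means, neighbors, horizon, seed=0):
    """Run the toy simulation; neighbors[i] lists the agents adjacent to agent i."""
    rng = random.Random(seed)
    n, k = len(neighbors), len(means)
    # Assumed initialization: split the arms round-robin across the agents.
    agents = [Agent([a for a in range(k) if a % n == i], k) for i in range(n)]
    best = max(means)
    regret = [0.0] * n
    next_comm = 2  # assumed schedule: communicate at t = 2, 4, 8, ...
    for t in range(1, horizon + 1):
        for i, agent in enumerate(agents):
            arm = agent.choose()
            reward = 1.0 if rng.random() < means[arm] else 0.0
            agent.update(arm, reward)
            regret[i] += best - means[arm]
        if t == next_comm:
            # Collect all recommendations first so the exchange is simultaneous.
            recs = [agent.recommend() for agent in agents]
            for i, agent in enumerate(agents):
                j = rng.choice(neighbors[i])  # pairwise gossip with one neighbor
                agent.receive(recs[j])
            for agent in agents:
                agent.phase_pulls = {}
            next_comm *= 2
    return agents, regret
```

For instance, `simulate([0.9, 0.5, 0.4, 0.3, 0.2, 0.1], [[1, 2], [0, 2], [0, 1]], horizon=4096)` runs three fully connected agents over six arms and returns the agents together with each agent's cumulative pseudo-regret. Because the sketch has no mechanism guaranteeing that every arm stays in some agent's active set, it should be read only as an illustration of arm-ID gossip and not as a way to reproduce the paper's guarantees.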

Comments:	To appear in AISTATS 2020. The first two authors contributed equally.
Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
Cite as:	arXiv:2001.05452 [cs.LG]
	(or arXiv:2001.05452v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2001.05452

Submission history

From: Ronshee Chawla
[v1] Wed, 15 Jan 2020 17:49:29 UTC (772 KB)
[v2] Tue, 11 Feb 2020 00:09:46 UTC (772 KB)
[v3] Wed, 12 Feb 2020 21:11:46 UTC (772 KB)
[v4] Tue, 2 Jul 2024 23:36:25 UTC (772 KB)
