N-Agent Ad Hoc Teamwork

Wang, Caroline; Rahman, Arrasy; Durugkar, Ishan; Liebman, Elad; Stone, Peter

arXiv:2404.10740 (cs)

[Submitted on 16 Apr 2024 (v1), last revised 4 Oct 2024 (this version, v3)]

Abstract: Current approaches to learning cooperative multi-agent behaviors assume relatively restrictive settings. In standard fully cooperative multi-agent reinforcement learning, the learning algorithm controls $\textit{all}$ agents in the scenario, while in ad hoc teamwork, the learning algorithm usually assumes control over only a $\textit{single}$ agent in the scenario. However, many cooperative settings in the real world are much less restrictive. For example, in an autonomous driving scenario, a company might train its cars with the same learning algorithm, yet once on the road, these cars must cooperate with cars from another company. Towards expanding the class of scenarios that cooperative learning methods may optimally address, we introduce $N$-agent ad hoc teamwork (NAHT), where a set of autonomous agents must interact and cooperate with dynamically varying numbers and types of teammates. This paper formalizes the problem and proposes the Policy Optimization with Agent Modelling (POAM) algorithm. POAM is a policy gradient, multi-agent reinforcement learning approach to the NAHT problem that enables adaptation to diverse teammate behaviors by learning representations of those behaviors. Empirical evaluation on tasks from the multi-agent particle environment and StarCraft II shows that POAM improves cooperative task returns compared to baseline approaches and enables out-of-distribution generalization to unseen teammates.

Subjects:	Artificial Intelligence (cs.AI)
ACM classes:	I.2.11; I.2.1; I.2.6; I.2.8
Cite as:	arXiv:2404.10740 [cs.AI]
	(or arXiv:2404.10740v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2404.10740

Submission history

From: Caroline Wang
[v1] Tue, 16 Apr 2024 17:13:08 UTC (1,767 KB)
[v2] Sat, 3 Aug 2024 05:50:47 UTC (2,018 KB)
[v3] Fri, 4 Oct 2024 16:08:52 UTC (1,465 KB)
