Concept Learning for Interpretable Multi-Agent Reinforcement Learning

Zabounidis, Renos; Campbell, Joseph; Stepputtis, Simon; Hughes, Dana; Sycara, Katia

arXiv:2302.12232 (cs)

[Submitted on 23 Feb 2023]

Abstract: Multi-agent robotic systems are increasingly operating in real-world environments in close proximity to humans, yet are largely controlled by policy models with inscrutable deep neural network representations. We introduce a method for incorporating interpretable concepts from a domain expert into models trained through multi-agent reinforcement learning, by requiring the model to first predict such concepts then utilize them for decision making. This allows an expert to both reason about the resulting concept policy models in terms of these high-level concepts at run-time, as well as intervene and correct mispredictions to improve performance. We show that this yields improved interpretability and training stability, with benefits to policy performance and sample efficiency in a simulated and real-world cooperative-competitive multi-agent game.
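
The two-stage idea in the abstract (predict concepts first, then decide from them, with room for an expert to overwrite a mispredicted concept) can be illustrated with a minimal sketch. This is not the authors' architecture or training procedure: the class `ConceptPolicy`, its linear layers, the random untrained weights, and the `interventions` argument are all hypothetical names and choices made purely for illustration.

```python
import math
import random


def sigmoid(x):
    """Squash a real value into (0, 1) so it reads as a concept probability."""
    return 1.0 / (1.0 + math.exp(-x))


def matvec(weights, vec):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]


class ConceptPolicy:
    """Hypothetical concept-bottleneck policy: observation -> concepts -> action.

    Weights are random and untrained; a real system would learn them.
    """

    def __init__(self, obs_dim, n_concepts, n_actions, seed=0):
        rng = random.Random(seed)
        self.w_concept = [[rng.uniform(-1, 1) for _ in range(obs_dim)]
                          for _ in range(n_concepts)]
        self.w_action = [[rng.uniform(-1, 1) for _ in range(n_concepts)]
                         for _ in range(n_actions)]

    def predict_concepts(self, obs):
        # Stage 1: map the raw observation to interpretable concept values.
        return [sigmoid(z) for z in matvec(self.w_concept, obs)]

    def act(self, obs, interventions=None):
        concepts = self.predict_concepts(obs)
        # Optional expert intervention: overwrite selected concept values.
        for idx, value in (interventions or {}).items():
            concepts[idx] = value
        # Stage 2: the action depends on the observation only through concepts.
        scores = matvec(self.w_action, concepts)
        action = max(range(len(scores)), key=scores.__getitem__)
        return action, concepts


policy = ConceptPolicy(obs_dim=4, n_concepts=3, n_actions=2)
obs = [0.5, -1.0, 0.25, 2.0]
action, concepts = policy.act(obs)
# An expert corrects concept 0 to "definitely true" before the action is chosen.
corrected_action, corrected = policy.act(obs, interventions={0: 1.0})
```

Because the action head sees only the concept vector, inspecting `concepts` shows what the decision was based on, and an intervention changes the decision only through the corrected concept.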

Comments:	Accepted to the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2302.12232 [cs.LG]
	(or arXiv:2302.12232v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.12232

Submission history

From: Renos Zabounidis
[v1] Thu, 23 Feb 2023 18:53:09 UTC (1,036 KB)
