Self-Organized Polynomial-Time Coordination Graphs

Yang, Qianlan; Dong, Weijun; Ren, Zhizhou; Wang, Jianhao; Wang, Tonghan; Zhang, Chongjie

Computer Science > Machine Learning

arXiv:2112.03547 (cs)

[Submitted on 7 Dec 2021 (v1), last revised 16 Sep 2022 (this version, v4)]

Title:Self-Organized Polynomial-Time Coordination Graphs

Authors:Qianlan Yang, Weijun Dong, Zhizhou Ren, Jianhao Wang, Tonghan Wang, Chongjie Zhang

View PDF

Abstract:Coordination graph is a promising approach to model agent collaboration in multi-agent reinforcement learning. It conducts a graph-based value factorization and induces explicit coordination among agents to complete complicated tasks. However, one critical challenge in this paradigm is the complexity of greedy action selection with respect to the factorized values. It refers to the decentralized constraint optimization problem (DCOP), which and whose constant-ratio approximation are NP-hard problems. To bypass this systematic hardness, this paper proposes a novel method, named Self-Organized Polynomial-time Coordination Graphs (SOP-CG), which uses structured graph classes to guarantee the accuracy and the computational efficiency of collaborated action selection. SOP-CG employs dynamic graph topology to ensure sufficient value function expressiveness. The graph selection is unified into an end-to-end learning paradigm. In experiments, we show that our approach learns succinct and well-adapted graph topologies, induces effective coordination, and improves performance across a variety of cooperative multi-agent tasks.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
Cite as:	arXiv:2112.03547 [cs.LG]
	(or arXiv:2112.03547v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2112.03547

Submission history

From: Qianlan Yang [view email]
[v1] Tue, 7 Dec 2021 07:42:40 UTC (1,361 KB)
[v2] Sun, 13 Mar 2022 17:31:36 UTC (997 KB)
[v3] Mon, 20 Jun 2022 11:30:48 UTC (1,174 KB)
[v4] Fri, 16 Sep 2022 18:07:46 UTC (1,171 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-12

Change to browse by:

cs
cs.AI
cs.MA

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zhizhou Ren
Jianhao Wang
Tonghan Wang
Chongjie Zhang

export BibTeX citation

Computer Science > Machine Learning

Title:Self-Organized Polynomial-Time Coordination Graphs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Self-Organized Polynomial-Time Coordination Graphs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators