DACOOP-A: Decentralized Adaptive Cooperative Pursuit via Attention

Zhang, Zheng; Zhang, Dengyu; Zhang, Qingrui; Pan, Wei; Hu, Tianjiang

arXiv:2310.15699 (cs)

[Submitted on 24 Oct 2023 (v1), last revised 28 Oct 2023 (this version, v2)]

Abstract: Integrating rule-based policies into reinforcement learning promises to improve data efficiency and generalization in cooperative pursuit problems. However, most implementations do not properly distinguish the influence of individual neighboring robots in the observation embedding or in the inter-robot interaction rules, which leads to information loss and inefficient cooperation. This paper proposes a cooperative pursuit algorithm named Decentralized Adaptive COOperative Pursuit via Attention (DACOOP-A), which augments reinforcement learning with artificial potential fields and attention mechanisms. An attention-based framework is developed to emphasize important neighbors by concurrently integrating the learned attention scores into the observation embedding and the inter-robot interaction rules. A KL divergence regularization is introduced to alleviate the resulting learning stability issue. Improvements in data efficiency and generalization are demonstrated through numerical simulations. Extensive quantitative analysis and ablation studies illustrate the advantages of the proposed modules. Real-world experiments demonstrate the feasibility of deploying DACOOP-A on physical systems.
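The abstract gives no equations, so the following is only a minimal sketch of the general idea it describes, not the authors' implementation. It assumes scaled dot-product attention over neighbor features, an inverse-square repulsive term for the inter-robot rule, and a KL penalty between two attention distributions (for example, scores from the current and a previous network). All function names, parameters (`eta`, `eps`), and these modeling choices are hypothetical.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D array.
    e = np.exp(x - np.max(x))
    return e / e.sum()

def attention_scores(query, keys):
    # Assumption: scaled dot-product attention of one robot over its neighbors.
    d = query.shape[-1]
    return softmax(keys @ query / np.sqrt(d))

def embed_neighbors(scores, values):
    # Observation embedding: attention-weighted sum of neighbor features.
    return scores @ values

def weighted_repulsion(scores, rel_positions, eta=1.0, eps=1e-6):
    # Assumption: inverse-square repulsion pointing away from each neighbor,
    # with each neighbor's contribution scaled by its attention score.
    # rel_positions[i] = neighbor_i position minus own position.
    dist = np.linalg.norm(rel_positions, axis=1, keepdims=True)
    forces = -eta * rel_positions / (dist**3 + eps)
    return (scores[:, None] * forces).sum(axis=0)

def kl_divergence(p, q, eps=1e-12):
    # KL(p || q) between two discrete attention distributions.
    return float(np.sum(p * (np.log(p + eps) - np.log(q + eps))))

# Toy usage with three neighbors (all values are made up).
rng = np.random.default_rng(0)
query = rng.normal(size=4)
keys = rng.normal(size=(3, 4))
values = rng.normal(size=(3, 8))
rel = np.array([[1.0, 0.0], [0.0, 2.0], [-3.0, 0.0]])

scores = attention_scores(query, keys)
embedding = embed_neighbors(scores, values)
force = weighted_repulsion(scores, rel)
```

The point of the sketch is that the same `scores` vector feeds both the embedding and the interaction rule, which is the "concurrent integration" the abstract refers to. How the KL term is actually defined and weighted in DACOOP-A is specified in the paper, not here.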

Comments:	8 pages; this manuscript has been accepted by IEEE Robotics and Automation Letters
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2310.15699 [cs.RO]
	(or arXiv:2310.15699v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2310.15699

Submission history

From: Qingrui Zhang
[v1] Tue, 24 Oct 2023 10:15:07 UTC (3,362 KB)
[v2] Sat, 28 Oct 2023 13:47:41 UTC (3,362 KB)
