Safe Policy Synthesis in Multi-Agent POMDPs via Discrete-Time Barrier Functions

Ahmadi, Mohamadreza; Singletary, Andrew; Burdick, Joel W.; Ames, Aaron D.

Computer Science > Robotics

arXiv:1903.07823 (cs)

[Submitted on 19 Mar 2019 (v1), last revised 12 Sep 2019 (this version, v2)]

Title:Safe Policy Synthesis in Multi-Agent POMDPs via Discrete-Time Barrier Functions

Authors:Mohamadreza Ahmadi, Andrew Singletary, Joel W. Burdick, Aaron D. Ames

View PDF

Abstract:A multi-agent partially observable Markov decision process (MPOMDP) is a modeling paradigm used for high-level planning of heterogeneous autonomous agents subject to uncertainty and partial observation. Despite their modeling efficiency, MPOMDPs have not received significant attention in safety-critical settings. In this paper, we use barrier functions to design policies for MPOMDPs that ensure safety. Notably, our method does not rely on discretization of the belief space, or finite memory. To this end, we formulate sufficient and necessary conditions for the safety of a given set based on discrete-time barrier functions (DTBFs) and we demonstrate that our formulation also allows for Boolean compositions of DTBFs for representing more complicated safe sets. We show that the proposed method can be implemented online by a sequence of one-step greedy algorithms as a standalone safe controller or as a safety-filter given a nominal planning policy. We illustrate the efficiency of the proposed methodology based on DTBFs using a high-fidelity simulation of heterogeneous robots.

Comments:	8 pages and 4 figures
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:1903.07823 [cs.RO]
	(or arXiv:1903.07823v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1903.07823

Submission history

From: Mohamadreza Ahmadi [view email]
[v1] Tue, 19 Mar 2019 04:23:22 UTC (2,520 KB)
[v2] Thu, 12 Sep 2019 05:19:48 UTC (16,481 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.RO

< prev | next >

new | recent | 2019-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Mohamadreza Ahmadi
Andrew Singletary
Joel W. Burdick
Aaron D. Ames

export BibTeX citation

Computer Science > Robotics

Title:Safe Policy Synthesis in Multi-Agent POMDPs via Discrete-Time Barrier Functions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Safe Policy Synthesis in Multi-Agent POMDPs via Discrete-Time Barrier Functions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators