Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate

Yu, Dongjie; Zou, Wenjun; Yang, Yujie; Ma, Haitong; Li, Shengbo Eben; Duan, Jingliang; Chen, Jianyu

Computer Science > Robotics

arXiv:2210.07553 (cs)

[Submitted on 14 Oct 2022]

Title:Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate

Authors:Dongjie Yu, Wenjun Zou, Yujie Yang, Haitong Ma, Shengbo Eben Li, Jingliang Duan, Jianyu Chen

View PDF

Abstract:Safe reinforcement learning (RL) that solves constraint-satisfactory policies provides a promising way to the broader safety-critical applications of RL in real-world problems such as robotics. Among all safe RL approaches, model-based methods reduce training time violations further due to their high sample efficiency. However, lacking safety robustness against the model uncertainties remains an issue in safe model-based RL, especially in training time safety. In this paper, we propose a distributional reachability certificate (DRC) and its Bellman equation to address model uncertainties and characterize robust persistently safe states. Furthermore, we build a safe RL framework to resolve constraints required by the DRC and its corresponding shield policy. We also devise a line search method to maintain safety and reach higher returns simultaneously while leveraging the shield policy. Comprehensive experiments on classical benchmarks such as constrained tracking and navigation indicate that the proposed algorithm achieves comparable returns with much fewer constraint violations during training.

Comments:	12 pages, 6 figures
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2210.07553 [cs.RO]
	(or arXiv:2210.07553v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2210.07553

Submission history

From: Dongjie Yu [view email]
[v1] Fri, 14 Oct 2022 06:16:53 UTC (4,185 KB)

Computer Science > Robotics

Title:Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators