Probabilistic Constrained Reinforcement Learning with Formal Interpretability

Wang, Yanran; Qian, Qiuchen; Boyle, David

Computer Science > Machine Learning

arXiv:2307.07084 (cs)

[Submitted on 13 Jul 2023 (v1), last revised 17 Jun 2024 (this version, v4)]

Title:Probabilistic Constrained Reinforcement Learning with Formal Interpretability

Authors:Yanran Wang, Qiuchen Qian, David Boyle

View PDF HTML (experimental)

Abstract:Reinforcement learning can provide effective reasoning for sequential decision-making problems with variable dynamics. Such reasoning in practical implementation, however, poses a persistent challenge in interpreting the reward function and the corresponding optimal policy. Consequently, representing sequential decision-making problems as probabilistic inference can have considerable value, as, in principle, the inference offers diverse and powerful mathematical tools to infer the stochastic dynamics whilst suggesting a probabilistic interpretation of policy optimization. In this study, we propose a novel Adaptive Wasserstein Variational Optimization, namely AWaVO, to tackle these interpretability challenges. Our approach uses formal methods to achieve the interpretability for convergence guarantee, training transparency, and intrinsic decision-interpretation. To demonstrate its practicality, we showcase guaranteed interpretability with an optimal global convergence rate in simulation and in practical quadrotor tasks. In comparison with state-of-the-art benchmarks including TRPO-IPO, PCPO and CRPO, we empirically verify that AWaVO offers a reasonable trade-off between high performance and sufficient interpretability.

Comments:	25 pages, 9 figures, containing Appendix
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO); Systems and Control (eess.SY)
Cite as:	arXiv:2307.07084 [cs.LG]
	(or arXiv:2307.07084v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2307.07084

Submission history

From: Yanran Wang [view email]
[v1] Thu, 13 Jul 2023 22:52:22 UTC (20,427 KB)
[v2] Thu, 8 Feb 2024 18:09:25 UTC (16,155 KB)
[v3] Sat, 10 Feb 2024 21:28:19 UTC (18,448 KB)
[v4] Mon, 17 Jun 2024 12:56:53 UTC (7,983 KB)

Computer Science > Machine Learning

Title:Probabilistic Constrained Reinforcement Learning with Formal Interpretability

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Probabilistic Constrained Reinforcement Learning with Formal Interpretability

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators