DRL-ORA: Distributional Reinforcement Learning with Online Risk Adaption

Wu, Yupeng; Huang, Wenjie; Ho, Chin Pang

Computer Science > Machine Learning

arXiv:2310.05179 (cs)

[Submitted on 8 Oct 2023 (v1), last revised 9 Feb 2025 (this version, v3)]

Title:DRL-ORA: Distributional Reinforcement Learning with Online Risk Adaption

Authors:Yupeng Wu, Wenjie Huang, Chin Pang Ho

View PDF HTML (experimental)

Abstract:One of the main challenges in reinforcement learning (RL) is that the agent has to make decisions that would influence the future performance without having complete knowledge of the environment. Dynamically adjusting the level of epistemic risk during the learning process can help to achieve reliable policies in safety-critical settings with better efficiency. In this work, we propose a new framework, Distributional RL with Online Risk Adaptation (DRL-ORA). This framework quantifies both epistemic and implicit aleatory uncertainties in a unified manner and dynamically adjusts the epistemic risk levels by solving a total variation minimization problem online. The selection of risk levels is performed efficiently via a grid search using a Follow-The-Leader-type algorithm, where the offline oracle corresponds to a "satisficing measure" under a specially modified loss function. We show that DRL-ORA outperforms existing methods that rely on fixed risk levels or manually designed risk level adaptation in multiple classes of tasks.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2310.05179 [cs.LG]
	(or arXiv:2310.05179v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.05179

Submission history

From: Yupeng Wu [view email]
[v1] Sun, 8 Oct 2023 14:32:23 UTC (395 KB)
[v2] Mon, 11 Mar 2024 15:36:19 UTC (475 KB)
[v3] Sun, 9 Feb 2025 01:03:33 UTC (589 KB)

Computer Science > Machine Learning

Title:DRL-ORA: Distributional Reinforcement Learning with Online Risk Adaption

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:DRL-ORA: Distributional Reinforcement Learning with Online Risk Adaption

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators