Robust Reinforcement Learning for Risk-Sensitive Linear Quadratic Gaussian Control

Cui, Leilei; Başar, Tamer; Jiang, Zhong-Ping

Electrical Engineering and Systems Science > Systems and Control

arXiv:2212.02072 (eess)

[Submitted on 5 Dec 2022 (v1), last revised 6 Dec 2023 (this version, v2)]

Title:Robust Reinforcement Learning for Risk-Sensitive Linear Quadratic Gaussian Control

Authors:Leilei Cui, Tamer Başar, Zhong-Ping Jiang

View PDF

Abstract:This paper proposes a novel robust reinforcement learning framework for discrete-time linear systems with model mismatch that may arise from the sim-to-real gap. A key strategy is to invoke advanced techniques from control theory. Using the formulation of the classical risk-sensitive linear quadratic Gaussian control, a dual-loop policy optimization algorithm is proposed to generate a robust optimal controller. The dual-loop policy optimization algorithm is shown to be globally and uniformly convergent, and robust against disturbances during the learning process. This robustness property is called small-disturbance input-to-state stability and guarantees that the proposed policy optimization algorithm converges to a small neighborhood of the optimal controller as long as the disturbance at each learning step is relatively small. In addition, when the system dynamics is unknown, a novel model-free off-policy policy optimization algorithm is proposed. Finally, numerical examples are provided to illustrate the proposed algorithm.

Comments:	27 Pages, 13 Figures
Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:2212.02072 [eess.SY]
	(or arXiv:2212.02072v2 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2212.02072

Submission history

From: Leilei Cui [view email]
[v1] Mon, 5 Dec 2022 07:36:22 UTC (260 KB)
[v2] Wed, 6 Dec 2023 18:02:58 UTC (2,283 KB)

Electrical Engineering and Systems Science > Systems and Control

Title:Robust Reinforcement Learning for Risk-Sensitive Linear Quadratic Gaussian Control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Systems and Control

Title:Robust Reinforcement Learning for Risk-Sensitive Linear Quadratic Gaussian Control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators