Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models

Westenbroek, Tyler; Levy, Jacob; Fridovich-Keil, David

Computer Science > Machine Learning

arXiv:2307.08168 (cs)

[Submitted on 16 Jul 2023 (v1), last revised 6 Nov 2023 (this version, v2)]

Title:Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models

Authors:Tyler Westenbroek, Jacob Levy, David Fridovich-Keil

View PDF

Abstract:We focus on developing efficient and reliable policy optimization strategies for robot learning with real-world data. In recent years, policy gradient methods have emerged as a promising paradigm for training control policies in simulation. However, these approaches often remain too data inefficient or unreliable to train on real robotic hardware. In this paper we introduce a novel policy gradient-based policy optimization framework which systematically leverages a (possibly highly simplified) first-principles model and enables learning precise control policies with limited amounts of real-world data. Our approach $1)$ uses the derivatives of the model to produce sample-efficient estimates of the policy gradient and $2)$ uses the model to design a low-level tracking controller, which is embedded in the policy class. Theoretical analysis provides insight into how the presence of this feedback controller overcomes key limitations of stand-alone policy gradient methods, while hardware experiments with a small car and quadruped demonstrate that our approach can learn precise control strategies reliably and with only minutes of real-world data.

Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2307.08168 [cs.LG]
	(or arXiv:2307.08168v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2307.08168

Submission history

From: Tyler Westenbroek [view email]
[v1] Sun, 16 Jul 2023 22:36:36 UTC (1,517 KB)
[v2] Mon, 6 Nov 2023 15:15:38 UTC (1,917 KB)

Computer Science > Machine Learning

Title:Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators