Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration

Xie, Christopher; Patil, Sachin; Moldovan, Teodor; Levine, Sergey; Abbeel, Pieter

Computer Science > Machine Learning

arXiv:1509.06824v1 (cs)

[Submitted on 23 Sep 2015 (this version), latest version 15 Mar 2016 (v2)]

Title:Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration

Authors:Christopher Xie, Sachin Patil, Teodor Moldovan, Sergey Levine, Pieter Abbeel

View PDF

Abstract:In this paper, we present a robotic model-based reinforcement learning method that combines ideas from model identification and model predictive control. We use a feature-based representation of the dynamics that allows the dynamics model to be fitted with a simple least squares procedure, and the features are identified from a high-level specification of the robot's morphology, consisting of the number and connectivity structure of its links. Model predictive control is then used to choose the actions under an optimistic model of the dynamics, which produces an efficient and goal-directed exploration strategy. We present real time experimental results on standard benchmark problems involving the pendulum, cartpole, and double pendulum systems. Experiments indicate that our method is able to learn a range of benchmark tasks substantially faster than the previous best methods. To evaluate our approach on a realistic robotic control task, we also demonstrate real time control of a simulated 7 degree of freedom arm.

Comments:	8 pages
Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:1509.06824 [cs.LG]
	(or arXiv:1509.06824v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1509.06824

Submission history

From: Christopher Xie [view email]
[v1] Wed, 23 Sep 2015 02:04:18 UTC (1,909 KB)
[v2] Tue, 15 Mar 2016 07:53:33 UTC (957 KB)

Computer Science > Machine Learning

Title:Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators