Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning

Rudin, Nikita; Hoeller, David; Reist, Philipp; Hutter, Marco

Computer Science > Robotics

arXiv:2109.11978 (cs)

[Submitted on 24 Sep 2021 (v1), last revised 19 Aug 2022 (this version, v3)]

Title:Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning

Authors:Nikita Rudin, David Hoeller, Philipp Reist, Marco Hutter

View PDF

Abstract:In this work, we present and study a training set-up that achieves fast policy generation for real-world robotic tasks by using massive parallelism on a single workstation GPU. We analyze and discuss the impact of different training algorithm components in the massively parallel regime on the final policy performance and training times. In addition, we present a novel game-inspired curriculum that is well suited for training with thousands of simulated robots in parallel. We evaluate the approach by training the quadrupedal robot ANYmal to walk on challenging terrain. The parallel approach allows training policies for flat terrain in under four minutes, and in twenty minutes for uneven terrain. This represents a speedup of multiple orders of magnitude compared to previous work. Finally, we transfer the policies to the real robot to validate the approach. We open-source our training code to help accelerate further research in the field of learned legged locomotion.

Comments:	CoRL 2021 Project website: : this https URL Video: this https URL
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG)
Cite as:	arXiv:2109.11978 [cs.RO]
	(or arXiv:2109.11978v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2109.11978

Submission history

From: Nikita Rudin [view email]
[v1] Fri, 24 Sep 2021 14:04:19 UTC (48,407 KB)
[v2] Sat, 30 Oct 2021 14:35:58 UTC (48,402 KB)
[v3] Fri, 19 Aug 2022 07:52:32 UTC (48,402 KB)

Computer Science > Robotics

Title:Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators