Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing

Tjanaka, Bryon; Fontaine, Matthew C.; Lee, David H.; Kalkar, Aniruddha; Nikolaidis, Stefanos

arXiv:2210.02622v3 (cs)

[Submitted on 6 Oct 2022 (v1), last revised 16 Sep 2023 (this version, v3)]

Abstract: Pre-training a diverse set of neural network controllers in simulation has enabled robots to adapt online to damage in robot locomotion tasks. However, finding diverse, high-performing controllers requires expensive network training and extensive tuning of a large number of hyperparameters. On the other hand, Covariance Matrix Adaptation MAP-Annealing (CMA-MAE), an evolution strategies (ES)-based quality diversity (QD) algorithm, does not have these limitations and has achieved state-of-the-art performance on standard QD benchmarks. However, CMA-MAE cannot scale to modern neural network controllers due to its quadratic complexity. We leverage efficient approximation methods in ES to propose three new CMA-MAE variants that scale to high dimensions. Our experiments show that the variants outperform ES-based baselines in benchmark robotic locomotion tasks, while being comparable with or exceeding state-of-the-art deep reinforcement learning-based quality diversity algorithms.
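
The scaling limitation mentioned in the abstract can be illustrated with a little arithmetic. The sketch below is not taken from the paper. It assumes only the general fact that a covariance-matrix-adapting ES stores a full n-by-n covariance matrix for n search parameters, and it compares that with a diagonal approximation used here purely as one illustrative alternative (the abstract does not say which approximations the three variants use). The function names and the 20,000-parameter controller size are hypothetical.

```python
def covariance_entries(num_params: int, diagonal: bool = False) -> int:
    """Count the stored covariance entries for a Gaussian search distribution.

    A full covariance matrix has num_params * num_params entries;
    a diagonal approximation (an illustrative assumption) keeps num_params.
    """
    if num_params < 0:
        raise ValueError("num_params must be non-negative")
    return num_params if diagonal else num_params * num_params


def memory_gib(entries: int, bytes_per_entry: int = 8) -> float:
    """Convert an entry count to GiB, assuming 8-byte floats by default."""
    return entries * bytes_per_entry / 2**30


if __name__ == "__main__":
    n = 20_000  # hypothetical controller size, not a figure from the paper
    full = covariance_entries(n)
    diag = covariance_entries(n, diagonal=True)
    print(f"full covariance: {full} entries, {memory_gib(full):.2f} GiB")
    print(f"diagonal only:   {diag} entries, {memory_gib(diag):.6f} GiB")
```

Under these assumptions, storage for the full matrix grows with the square of the parameter count while the diagonal form grows linearly, which is the kind of gap that motivates approximation methods for high-dimensional controllers.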

Comments: Source code and videos available at this https URL
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as: arXiv:2210.02622 [cs.RO] (or arXiv:2210.02622v3 [cs.RO] for this version)
DOI: https://doi.org/10.48550/arXiv.2210.02622

Submission history

From: Bryon Tjanaka
[v1] Thu, 6 Oct 2022 01:03:01 UTC (589 KB)
[v2] Sat, 13 May 2023 03:36:32 UTC (241 KB)
[v3] Sat, 16 Sep 2023 02:17:57 UTC (242 KB)
