Collaborative Evolutionary Reinforcement Learning

Khadka, Shauharda; Majumdar, Somdeb; Nassar, Tarek; Dwiel, Zach; Tumer, Evren; Miret, Santiago; Liu, Yinyin; Tumer, Kagan


arXiv:1905.00976 (cs)

[Submitted on 2 May 2019 (v1), last revised 6 May 2019 (this version, v2)]


Abstract: Deep reinforcement learning algorithms have been successfully applied to a range of challenging control tasks. However, these methods typically struggle to explore effectively and are extremely sensitive to the choice of hyperparameters. One reason is that most approaches explore with a noisy version of their operating policy, which limits the range of exploration. In this paper, we introduce Collaborative Evolutionary Reinforcement Learning (CERL), a scalable framework that comprises a portfolio of policies that simultaneously explore and exploit diverse regions of the solution space. A collection of learners, typically proven algorithms like TD3, optimize over varying time-horizons, leading to this diverse portfolio. All learners contribute to and use a shared replay buffer to achieve greater sample efficiency. Computational resources are dynamically distributed to favor the best learners as a form of online algorithm selection. Neuroevolution binds this entire process to generate a single emergent learner that exceeds the capabilities of any individual learner. Experiments in a range of continuous control benchmarks demonstrate that the emergent learner significantly outperforms its composite learners while remaining overall more sample-efficient, notably solving the MuJoCo Humanoid benchmark, where all of its composite learners (TD3) fail entirely in isolation.
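The abstract says that computational resources are dynamically distributed to favor the best learners, but it does not state the rule used. The following is a minimal sketch of one way such online algorithm selection could work, assuming a UCB-style bandit score over a portfolio of learners that differ only in discount factor. The names `LearnerStats`, `ucb_scores`, `allocate`, and `update_value`, the exploration constant `c`, the smoothing factor `alpha`, and the discount values in the usage note are all illustrative assumptions, not taken from this page.

```python
import math
from dataclasses import dataclass


@dataclass
class LearnerStats:
    """Running statistics for one learner in the portfolio (hypothetical structure)."""
    gamma: float        # discount factor, i.e. the learner's effective time horizon
    value: float = 0.0  # running estimate of return, assumed normalized to [0, 1]
    visits: int = 0     # number of rollouts allocated to this learner so far


def ucb_scores(learners, c=0.9):
    """Return one UCB-style score per learner: value estimate plus exploration bonus."""
    total = sum(l.visits for l in learners)
    scores = []
    for l in learners:
        if l.visits == 0:
            # Learners that were never tried get priority.
            scores.append(math.inf)
        else:
            scores.append(l.value + c * math.sqrt(math.log(total) / l.visits))
    return scores


def allocate(learners, num_rollouts, c=0.9):
    """Hand out rollouts one at a time to the learner with the highest score.

    Returns the number of rollouts given to each learner and updates visit counts.
    """
    counts = [0] * len(learners)
    for _ in range(num_rollouts):
        scores = ucb_scores(learners, c)
        best = max(range(len(learners)), key=scores.__getitem__)
        counts[best] += 1
        learners[best].visits += 1
    return counts


def update_value(stats, episode_return, alpha=0.2):
    """Exponentially smooth a learner's value estimate with a new (normalized) return."""
    stats.value = alpha * episode_return + (1 - alpha) * stats.value
```

As a usage example, `allocate([LearnerStats(g) for g in (0.9, 0.99, 0.997, 0.9995)], 4)` gives each untried learner one rollout. Once `update_value` has raised one learner's estimate above the others, later calls to `allocate` shift most rollouts toward that learner, while the exploration bonus still revisits the rest occasionally.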

Comments:	Added link to public Github repo. Minor editorial changes. Order of authors modified to reflect ICML submission
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1905.00976 [cs.LG]
	(or arXiv:1905.00976v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.00976
Journal reference:	Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97, 2019

Submission history

From: Somdeb Majumdar
[v1] Thu, 2 May 2019 21:45:03 UTC (1,445 KB)
[v2] Mon, 6 May 2019 21:44:24 UTC (1,445 KB)
