Wasserstein Reinforcement Learning

Pacchiano, Aldo; Parker-Holder, Jack; Tang, Yunhao; Choromanska, Anna; Choromanski, Krzysztof; Jordan, Michael

Computer Science > Machine Learning

arXiv:1906.04349v1 (cs)

[Submitted on 11 Jun 2019 (this version), latest version 4 Mar 2020 (v4)]

Title:Wasserstein Reinforcement Learning

Authors:Aldo Pacchiano, Jack Parker-Holder, Yunhao Tang, Anna Choromanska, Krzysztof Choromanski, Michael Jordan

View PDF

Abstract:We propose behavior-driven optimization via Wasserstein distances (WDs) to improve several classes of state-of-the-art reinforcement learning (RL) algorithms. We show that WD regularizers acting on appropriate policy embeddings efficiently incorporate behavioral characteristics into policy optimization. We demonstrate that they improve Evolution Strategy methods by encouraging more efficient exploration, can be applied in imitation learning and to speed up training of Trust Region Policy Optimization methods. Since the exact computation of WDs is expensive, we develop approximate algorithms based on the combination of different methods: dual formulation of the optimal transport problem, alternating optimization and random feature maps, to effectively replace exact WD computations in the RL tasks considered. We provide theoretical analysis of our algorithms and exhaustive empirical evaluation in a variety of RL settings.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1906.04349 [cs.LG]
	(or arXiv:1906.04349v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.04349

Submission history

From: Jack Parker-Holder [view email]
[v1] Tue, 11 Jun 2019 02:06:51 UTC (5,600 KB)
[v2] Wed, 19 Jun 2019 14:57:54 UTC (5,600 KB)
[v3] Sun, 29 Sep 2019 16:02:32 UTC (7,565 KB)
[v4] Wed, 4 Mar 2020 08:27:28 UTC (7,787 KB)

Computer Science > Machine Learning

Title:Wasserstein Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Wasserstein Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators