Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs

Sahni, Himanshu; Buckley, Toby; Abbeel, Pieter; Kuzovkin, Ilya

Computer Science > Artificial Intelligence

arXiv:1901.11529 (cs)

[Submitted on 31 Jan 2019 (v1), last revised 30 Oct 2019 (this version, v2)]

Title:Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs

Authors:Himanshu Sahni, Toby Buckley, Pieter Abbeel, Ilya Kuzovkin

View PDF

Abstract:Reinforcement Learning (RL) algorithms typically require millions of environment interactions to learn successful policies in sparse reward settings. Hindsight Experience Replay (HER) was introduced as a technique to increase sample efficiency by reimagining unsuccessful trajectories as successful ones by altering the originally intended goals. However, it cannot be directly applied to visual environments where goal states are often characterized by the presence of distinct visual features. In this work, we show how visual trajectories can be hallucinated to appear successful by altering agent observations using a generative model trained on relatively few snapshots of the goal. We then use this model in combination with HER to train RL agents in visual settings. We validate our approach on 3D navigation tasks and a simulated robotics application and show marked improvement over baselines derived from previous work.

Comments:	To appear in Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada. Code available at this https URL
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1901.11529 [cs.AI]
	(or arXiv:1901.11529v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1901.11529

Submission history

From: Himanshu Sahni [view email]
[v1] Thu, 31 Jan 2019 18:50:44 UTC (3,109 KB)
[v2] Wed, 30 Oct 2019 02:23:49 UTC (5,827 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2019-01

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Himanshu Sahni
Toby Buckley
Pieter Abbeel
Ilya Kuzovkin

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators