Sparse Graphical Memory for Robust Planning

Emmons, Scott; Jain, Ajay; Laskin, Michael; Kurutach, Thanard; Abbeel, Pieter; Pathak, Deepak

arXiv:2003.06417 (cs)

[Submitted on 13 Mar 2020 (v1), last revised 12 Nov 2020 (this version, v3)]

Abstract: To operate effectively in the real world, agents should be able to act from high-dimensional raw sensory input such as images and achieve diverse goals across long time horizons. Current deep reinforcement and imitation learning methods can learn directly from high-dimensional inputs but do not scale well to long-horizon tasks. In contrast, classical graphical methods like A* search are able to solve long-horizon tasks, but assume that the state space is abstracted away from raw sensory input. Recent works have attempted to combine the strengths of deep learning and classical planning; however, dominant methods in this domain are still quite brittle and scale poorly with the size of the environment. We introduce Sparse Graphical Memory (SGM), a new data structure that stores states and feasible transitions in a sparse memory. SGM aggregates states according to a novel two-way consistency objective, adapting classic state aggregation criteria to goal-conditioned RL: two states are redundant when they are interchangeable both as goals and as starting states. Theoretically, we prove that merging nodes according to two-way consistency leads to an increase in shortest path lengths that scales only linearly with the merging threshold. Experimentally, we show that SGM significantly outperforms current state-of-the-art methods on long-horizon, sparse-reward visual navigation tasks. Project video and code are available at this https URL
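
The two-way consistency idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a precomputed pairwise distance table `dist` (in practice this would come from a learned distance estimate), and it assumes that "interchangeable" means the largest gap in distances, taken over all other states, stays within a merging threshold `tau`. The names `two_way_consistent`, `sparsify`, `dist`, and `tau` are hypothetical and chosen for illustration only.

```python
from typing import List, Sequence

Matrix = Sequence[Sequence[float]]


def two_way_consistent(dist: Matrix, a: int, b: int, tau: float) -> bool:
    """Return True if states a and b look redundant under threshold tau.

    Assumption: dist[i][j] is an estimated cost of going from state i to j.
    """
    n = len(dist)
    # Interchangeable as starting states: similar cost from a or b to any w.
    out_gap = max(abs(dist[a][w] - dist[b][w]) for w in range(n))
    # Interchangeable as goals: similar cost from any w to a or b.
    in_gap = max(abs(dist[w][a] - dist[w][b]) for w in range(n))
    return out_gap <= tau and in_gap <= tau


def sparsify(dist: Matrix, tau: float) -> List[int]:
    """Greedily keep a state only if no already-kept state makes it redundant."""
    kept: List[int] = []
    for s in range(len(dist)):
        if not any(two_way_consistent(dist, s, k, tau) for k in kept):
            kept.append(s)
    return kept


# Toy example: four states on a line, with distance = absolute difference.
positions = [0.0, 0.1, 1.0, 1.05]
dist = [[abs(p - q) for q in positions] for p in positions]
kept = sparsify(dist, tau=0.2)
print(kept)
```

In this toy setting the two tight clusters each collapse to a single representative, while a threshold of zero keeps every state. With a symmetric distance the two checks coincide; the incoming check only adds information when costs are directional.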

Comments:	Accepted at NeurIPS 2020. Video and code at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as:	arXiv:2003.06417 [cs.LG]
	(or arXiv:2003.06417v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2003.06417

Submission history

From: Deepak Pathak
[v1] Fri, 13 Mar 2020 17:59:32 UTC (7,075 KB)
[v2] Tue, 12 May 2020 18:55:04 UTC (7,075 KB)
[v3] Thu, 12 Nov 2020 21:37:49 UTC (4,752 KB)
