Hierarchical Imitation and Reinforcement Learning

Le, Hoang M.; Jiang, Nan; Agarwal, Alekh; Dudík, Miroslav; Yue, Yisong; Daumé III, Hal

Computer Science > Machine Learning

arXiv:1803.00590v1 (cs)

[Submitted on 1 Mar 2018 (this version), latest version 9 Jun 2018 (v2)]

Title:Hierarchical Imitation and Reinforcement Learning

Authors:Hoang M. Le, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, Hal Daumé III

View PDF

Abstract:We study the problem of learning policies over long time horizons. We present a framework that leverages and integrates two key concepts. First, we utilize hierarchical policy classes that enable planning over different time scales, i.e., the high level planner proposes a sequence of subgoals for the low level planner to achieve. Second, we utilize expert demonstrations within the hierarchical action space to dramatically reduce cost of exploration. Our framework is flexible and can incorporate different combinations of imitation learning (IL) and reinforcement learning (RL) at different levels of the hierarchy. Using long-horizon benchmarks, including Montezuma's Revenge, we empirically demonstrate that our approach can learn significantly faster compared to hierarchical RL, and can be significantly more label- and sample-efficient compared to flat IL. We also provide theoretical analysis of the labeling cost for certain instantiations of our framework.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1803.00590 [cs.LG]
	(or arXiv:1803.00590v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1803.00590

Submission history

From: Hoang M. Le [view email]
[v1] Thu, 1 Mar 2018 19:12:27 UTC (3,934 KB)
[v2] Sat, 9 Jun 2018 08:41:37 UTC (1,153 KB)

Computer Science > Machine Learning

Title:Hierarchical Imitation and Reinforcement Learning

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Hierarchical Imitation and Reinforcement Learning

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators