PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards

Goyal, Prasoon; Niekum, Scott; Mooney, Raymond J.

arXiv:2007.15543 (cs)

[Submitted on 30 Jul 2020 (v1), last revised 19 Nov 2020 (this version, v2)]

Abstract: Reinforcement learning (RL), particularly in sparse reward settings, often requires prohibitively large numbers of interactions with the environment, thereby limiting its applicability to complex problems. To address this, several prior approaches have used natural language to guide the agent's exploration. However, these approaches typically operate on structured representations of the environment, and/or assume some structure in the natural language commands. In this work, we propose a model that directly maps pixels to rewards, given a free-form natural language description of the task, which can then be used for policy learning. Our experiments on the Meta-World robot manipulation domain show that language-based rewards significantly improve the sample efficiency of policy learning, in both sparse and dense reward settings.
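
As a rough illustration of how a language-based reward might be used alongside an environment reward during policy learning, the sketch below adds a weighted language score to the environment's own reward. This is only one simple way to combine the two signals and is not taken from the paper: the names `shaped_reward`, `toy_score`, and `weight` are hypothetical, and `toy_score` is a keyword-matching placeholder standing in for a learned model that would score image frames against a description.

```python
from typing import Callable, Sequence

# Hypothetical stand-in for a sequence of image frames (pixels).
Frames = Sequence[Sequence[float]]


def shaped_reward(
    env_reward: float,
    frames: Frames,
    description: str,
    score_fn: Callable[[Frames, str], float],
    weight: float = 1.0,
) -> float:
    """Add a weighted language-based score to the environment reward.

    Assumption: a simple additive combination, chosen for illustration only.
    """
    return env_reward + weight * score_fn(frames, description)


def toy_score(frames: Frames, description: str) -> float:
    """Placeholder scorer: NOT a learned model, ignores the frames entirely."""
    return 0.5 if "button" in description else 0.0


# Sparse setting: the environment gives no reward yet, the language term does.
r = shaped_reward(0.0, [], "push the button", toy_score, weight=2.0)
```

In a sparse-reward task the environment term is zero for most steps, so the added term is what provides intermediate feedback to the learner in this sketch.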

Comments:	Conference on Robot Learning (CoRL), 2020
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2007.15543 [cs.LG]
	(or arXiv:2007.15543v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2007.15543

Submission history

From: Prasoon Goyal
[v1] Thu, 30 Jul 2020 15:50:38 UTC (1,891 KB)
[v2] Thu, 19 Nov 2020 13:42:41 UTC (1,911 KB)
