ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models

Ma, Runyu; Luijkx, Jelle; Ajanovic, Zlatan; Kober, Jens

Computer Science > Robotics

arXiv:2403.09583 (cs)

[Submitted on 14 Mar 2024 (v1), last revised 20 Sep 2024 (this version, v3)]

Title:ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models

Authors:Runyu Ma, Jelle Luijkx, Zlatan Ajanovic, Jens Kober

View PDF HTML (experimental)

Abstract:In robot manipulation tasks with large observation and action spaces, reinforcement learning (RL) often suffers from low sample efficiency and uncertain convergence. As an alternative, foundation models have shown promise in zero-shot and few-shot applications. However, these models can be unreliable due to their limited reasoning and challenges in understanding physical and spatial contexts. This paper introduces ExploRLLM, a method that combines the commonsense reasoning of foundation models with the experiential learning capabilities of RL. We leverage the strengths of both paradigms by using foundation models to obtain a base policy, an efficient representation, and an exploration policy. A residual RL agent learns when and how to deviate from the base policy while its exploration is guided by the exploration policy. In table-top manipulation experiments, we demonstrate that ExploRLLM outperforms both baseline foundation model policies and baseline RL policies. Additionally, we show that this policy can be transferred to the real world without further training. Supplementary material is available at this https URL.

Comments:	6 pages, 6 figures
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2403.09583 [cs.RO]
	(or arXiv:2403.09583v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2403.09583

Submission history

From: Jelle Luijkx [view email]
[v1] Thu, 14 Mar 2024 17:18:15 UTC (5,880 KB)
[v2] Fri, 15 Mar 2024 08:47:48 UTC (5,865 KB)
[v3] Fri, 20 Sep 2024 09:08:03 UTC (7,575 KB)

Computer Science > Robotics

Title:ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators