Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

Islam, Riashat; Tomar, Manan; Lamb, Alex; Efroni, Yonathan; Zang, Hongyu; Didolkar, Aniket; Misra, Dipendra; Li, Xin; van Seijen, Harm; Combes, Remi Tachet des; Langford, John

Computer Science > Machine Learning

arXiv:2211.00164v1 (cs)

[Submitted on 31 Oct 2022 (this version), latest version 14 Aug 2023 (v2)]

Title:Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

Authors:Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford

View PDF

Abstract:Learning to control an agent from data collected offline in a rich pixel-based visual observation space is vital for real-world applications of reinforcement learning (RL). A major challenge in this setting is the presence of input information that is hard to model and irrelevant to controlling the agent. This problem has been approached by the theoretical RL community through the lens of exogenous information, i.e, any control-irrelevant information contained in observations. For example, a robot navigating in busy streets needs to ignore irrelevant information, such as other people walking in the background, textures of objects, or birds in the sky. In this paper, we focus on the setting with visually detailed exogenous information, and introduce new offline RL benchmarks offering the ability to study this problem. We find that contemporary representation learning techniques can fail on datasets where the noise is a complex and time dependent process, which is prevalent in practical applications. To address these, we propose to use multi-step inverse models, which have seen a great deal of interest in the RL theory community, to learn Agent-Controller Representations for Offline-RL (ACRO). Despite being simple and requiring no reward, we show theoretically and empirically that the representation created by this objective greatly outperforms baselines.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2211.00164 [cs.LG]
	(or arXiv:2211.00164v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2211.00164

Submission history

From: Manan Tomar Mr. [view email]
[v1] Mon, 31 Oct 2022 22:12:48 UTC (38,967 KB)
[v2] Mon, 14 Aug 2023 00:16:23 UTC (19,175 KB)

Computer Science > Machine Learning

Title:Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators