Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping.

Scholarly articles for Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping.

scholar.google.com › citations

… conditioned policies offline with self-supervised reward …
Mezghani · Cited by 13

Learning Goal-Conditioned Policies Offline with Self-Supervised ... - arXiv

Jan 5, 2023 · In this work, we propose a novel self-supervised learning phase on the pre-collected dataset to understand the structure and the dynamics of the ...

[PDF] Learning Goal-Conditioned Policies Offline with Self-Supervised ...

proceedings.mlr.press › ...

In this work, we present a self-supervised reward shaping method that enables building an offline dataset with dense rewards. To this end, we develop a self ...

Learning Goal-Conditioned Policies Offline with Self-Supervised ...

openreview.net › forum

Sep 10, 2022 · We propose a self-supervised reward shaping method for training goal-conditioned policies on a pre-collected dataset without performing a ...

Goal-conditioned Offline Planning from Curious Exploration

Rethinking Goal-Conditioned Supervised Learning and Its ...

Goal-Conditioned Predictive Coding for Offline Reinforcement Learning

GOPlan: Goal-conditioned Offline Reinforcement Learning by ...

Go-Fresh: Learning Goal-Conditioned Policies Offline with Self ... - GitHub

github.com › facebookresearch › go-fresh

This is the original implementation of the paper Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping [Project Page] [Paper].

[PDF] Learning Goal-Conditioned Policies Offline with Self-Supervised ...

proceedings.mlr.press › ...

Relabel transition with goal g and reward rt, and. Push (st ... We first list the hyper-parameters for the self-supervised reward shaping phase in Table 1.

[PDF] Learning Goal-Conditioned Policies Offline with Self-Supervised ...

openreview.net › pdf

Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping. Rebuttal Document. Anonymous Author(s). Affiliation. Address email. 1 ...

Learning Goal-Conditioned Policies Offline with Self-Supervised ...

www.researchgate.net › ... › Reward

These methods suffer from the issue of sparsity of rewards, and fail at long-horizon tasks. In this work, we propose a novel self-supervised learning phase on ...

Learning Goal-Conditioned Policies Offline with Self-Supervised ...

www.semanticscholar.org › paper › Lear...

Semantic Scholar extracted view of "Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping - Supplementary Material" by Lina ...

Learning Goal-Conditioned Policies Offline with Self ... - arxiv-sanity

arxiv-sanity-lite.com › ...

This paper proposes a novel magnetic field-based reward shaping (MFRS) method for goal-conditioned RL tasks with dynamic target and obstacles. Inspired by the ...

Learning Goal-Conditioned Policies Offline with Self-Supervised ...

deepai.org › publication › learning-goal-...

Jan 5, 2023 · These methods suffer from the issue of sparsity of rewards, and fail at long-horizon tasks. In this work, we propose a novel self-supervised ...