Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning

Khadilkar, Harshad; Meisheri, Hardik

Computer Science > Machine Learning

arXiv:2210.17296 (cs)

[Submitted on 28 Oct 2022]

Title:Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning

Authors:Harshad Khadilkar, Hardik Meisheri

View PDF

Abstract:A significant challenge in reinforcement learning is quantifying the complex relationship between actions and long-term rewards. The effects may manifest themselves over a long sequence of state-action pairs, making them hard to pinpoint. In this paper, we propose a method to link transitions with significant deviations in state with unusually large variations in subsequent rewards. Such transitions are marked as possible causal effects, and the corresponding state-action pairs are added to a separate replay buffer. In addition, we include \textit{contrastive} samples corresponding to transitions from a similar state but with differing actions. Including this Contrastive Experience Replay (CER) during training is shown to outperform standard value-based methods on 2D navigation tasks. We believe that CER can be useful for a broad class of learning tasks, including for any off-policy reinforcement learning algorithm.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.17296 [cs.LG]
	(or arXiv:2210.17296v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.17296

Submission history

From: Harshad Khadilkar [view email]
[v1] Fri, 28 Oct 2022 11:21:17 UTC (1,336 KB)

Computer Science > Machine Learning

Title:Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators