Policy Optimization with Sparse Global Contrastive Explanations

Yao, Jiayu; Parbhoo, Sonali; Pan, Weiwei; Doshi-Velez, Finale

arXiv:2207.06269 (cs)

[Submitted on 13 Jul 2022]

Abstract: We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes. Our goal is to make minimal changes while gaining as much benefit as possible. We define a minimal change as having a sparse, global contrastive explanation between the original and proposed policy. We improve the current policy with the constraint of keeping that global contrastive explanation short. We demonstrate our framework with a discrete MDP and a continuous 2D navigation domain.

Comments:	Accepted at IMLH Workshop, ICML 2022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2207.06269 [cs.LG]
	(or arXiv:2207.06269v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2207.06269

Submission history

From: Jiayu Yao
[v1] Wed, 13 Jul 2022 15:17:26 UTC (850 KB)
