Explaining Classifiers using Adversarial Perturbations on the Perceptual Ball

Elliott, Andrew; Law, Stephen; Russell, Chris

arXiv:1912.09405 (cs)

[Submitted on 19 Dec 2019 (v1), last revised 30 Mar 2021 (this version, v4)]

Abstract: We present a simple regularization of adversarial perturbations based upon the perceptual loss. While the resulting perturbations remain imperceptible to the human eye, they differ from existing adversarial perturbations in that they are semi-sparse alterations that highlight objects and regions of interest while leaving the background unaltered. As semantically meaningful adversarial perturbations, they form a bridge between counterfactual explanations and adversarial perturbations in the space of images. We evaluate our approach on several standard explainability benchmarks, namely weak localization, insertion-deletion, and the pointing game, demonstrating that perceptually regularized counterfactuals are an effective explanation for image-based classifiers.
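
The pointing game named in the abstract is commonly scored by checking whether the most salient pixel of an explanation falls inside the annotated object region. The following is a minimal sketch of that scoring only, not the authors' evaluation code: the function names, the inclusive (row_min, col_min, row_max, col_max) box format, and the omission of the pixel tolerance used in some formulations are all illustrative assumptions.

```python
# Hypothetical helpers: a simplified pointing-game score for saliency maps.
# Assumption: each map is a 2-D list indexed as saliency[row][col], and each
# annotation is a single box (row_min, col_min, row_max, col_max), inclusive.


def pointing_game_hit(saliency, box):
    """Return True if the maximum of the saliency map lies inside the box."""
    best_row, best_col, best_val = 0, 0, float("-inf")
    for r, row in enumerate(saliency):
        for c, value in enumerate(row):
            # Keep the first location that attains the largest value.
            if value > best_val:
                best_row, best_col, best_val = r, c, value
    row_min, col_min, row_max, col_max = box
    return row_min <= best_row <= row_max and col_min <= best_col <= col_max


def pointing_game_accuracy(saliency_maps, boxes):
    """Fraction of images whose saliency maximum hits the annotated box."""
    hits = sum(pointing_game_hit(s, b) for s, b in zip(saliency_maps, boxes))
    return hits / len(boxes)
```

Calling pointing_game_accuracy with one saliency map per image and the matching boxes gives a score between 0 and 1, where higher values mean the explanation's peak lands on the object more often.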

Comments:	CVPR 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1912.09405 [cs.CV]
	(or arXiv:1912.09405v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.09405

Submission history

From: Dr Stephen Law
[v1] Thu, 19 Dec 2019 17:25:07 UTC (7,859 KB)
[v2] Fri, 1 May 2020 14:55:40 UTC (7,606 KB)
[v3] Thu, 21 May 2020 18:50:43 UTC (7,607 KB)
[v4] Tue, 30 Mar 2021 21:51:19 UTC (9,508 KB)
