Towards Explanation of DNN-based Prediction with Guided Feature Inversion

Du, Mengnan; Liu, Ninghao; Song, Qingquan; Hu, Xia

Computer Science > Computer Vision and Pattern Recognition

arXiv:1804.00506 (cs)

[Submitted on 19 Mar 2018 (v1), last revised 26 May 2018 (this version, v2)]

Title:Towards Explanation of DNN-based Prediction with Guided Feature Inversion

Authors:Mengnan Du, Ninghao Liu, Qingquan Song, Xia Hu

View PDF

Abstract:While deep neural networks (DNN) have become an effective computational tool, the prediction results are often criticized by the lack of interpretability, which is essential in many real-world applications such as health informatics. Existing attempts based on local interpretations aim to identify relevant features contributing the most to the prediction of DNN by monitoring the neighborhood of a given input. They usually simply ignore the intermediate layers of the DNN that might contain rich information for interpretation. To bridge the gap, in this paper, we propose to investigate a guided feature inversion framework for taking advantage of the deep architectures towards effective interpretation. The proposed framework not only determines the contribution of each feature in the input but also provides insights into the decision-making process of DNN models. By further interacting with the neuron of the target category at the output layer of the DNN, we enforce the interpretation result to be class-discriminative. We apply the proposed interpretation model to different CNN architectures to provide explanations for image data and conduct extensive experiments on ImageNet and PASCAL VOC07 datasets. The interpretation results demonstrate the effectiveness of our proposed framework in providing class-discriminative interpretation for DNN-based prediction.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1804.00506 [cs.CV]
	(or arXiv:1804.00506v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1804.00506

Submission history

From: Mengnan Du [view email]
[v1] Mon, 19 Mar 2018 17:35:26 UTC (2,836 KB)
[v2] Sat, 26 May 2018 04:47:32 UTC (2,853 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Explanation of DNN-based Prediction with Guided Feature Inversion

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Explanation of DNN-based Prediction with Guided Feature Inversion

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators