DOI: 10.1145/3534678.3539419
Research Article · Public Access

RES: A Robust Framework for Guiding Visual Explanation

Published: 14 August 2022

Abstract

Despite the fast progress of explanation techniques in modern Deep Neural Networks (DNNs), where the main focus is handling "how to generate the explanations", advanced research questions that examine the quality of the explanation itself (e.g., "whether the explanations are accurate") and improve the explanation quality (e.g., "how to adjust the model to generate more accurate explanations when explanations are inaccurate") are still relatively under-explored. To guide the model toward better explanations, techniques in explanation supervision, which add supervision signals on the model explanation, have started to show promising effects on improving both the generalizability and intrinsic interpretability of Deep Neural Networks. However, the research on supervising explanations, especially in vision-based applications represented through saliency maps, is in its early stage due to several inherent challenges: 1) inaccuracy of the human explanation annotation boundary, 2) incompleteness of the human explanation annotation region, and 3) inconsistency of the data distribution between the human annotation and the model explanation maps. To address these challenges, we propose a generic RES framework for guiding visual explanation by developing a novel objective that handles inaccurate boundaries, incomplete regions, and inconsistent distributions of human annotations, together with a theoretical justification of model generalizability. Extensive experiments on two real-world image datasets demonstrate the effectiveness of the proposed framework in enhancing both the reasonability of the explanations and the performance of the backbone DNN model.
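
The core idea of explanation supervision can be illustrated with a short, hedged sketch: alongside the usual task loss, a gradient-based saliency map is compared against a human annotation mask, and the mismatch is added as an extra loss term. The code below is a minimal PyTorch illustration under assumed names (a resnet18 backbone, a plain MSE alignment term, random toy tensors); it is not the authors' RES objective, which additionally handles inaccurate boundaries, incomplete regions, and distribution mismatch.

```python
import torch
import torch.nn.functional as F
from torchvision.models import resnet18

# Hypothetical backbone; explanation supervision is model-agnostic, so any
# differentiable classifier could take its place.
model = resnet18(num_classes=2)

def saliency_map(images, labels):
    """Gradient-of-input saliency, normalized to [0, 1] per image."""
    images = images.clone().requires_grad_(True)
    logits = model(images)
    # Score of the ground-truth class for each image in the batch.
    score = logits.gather(1, labels.view(-1, 1)).sum()
    # create_graph=True lets the explanation loss itself be backpropagated.
    grads, = torch.autograd.grad(score, images, create_graph=True)
    sal = grads.abs().sum(dim=1, keepdim=True)               # (B, 1, H, W)
    sal = sal / (sal.amax(dim=(2, 3), keepdim=True) + 1e-8)  # scale to [0, 1]
    return sal, logits

def explanation_supervised_loss(images, labels, human_masks, lam=1.0):
    """Task loss plus a simple saliency/annotation alignment term.

    The plain MSE used here is only a placeholder; the RES objective replaces
    it with a formulation robust to noisy and incomplete annotations.
    """
    sal, logits = saliency_map(images, labels)
    task_loss = F.cross_entropy(logits, labels)
    exp_loss = F.mse_loss(sal, human_masks)
    return task_loss + lam * exp_loss

# Toy usage with random tensors (shapes only; no real dataset is implied).
images = torch.randn(4, 3, 64, 64)
labels = torch.randint(0, 2, (4,))
masks = torch.rand(4, 1, 64, 64)   # stand-in for human attention annotations
loss = explanation_supervised_loss(images, labels, masks)
loss.backward()
```

The sketch only shows where an explanation-supervision signal enters the training loss; in the paper, the alignment term is the robust RES objective rather than a dense MSE.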

Published In

KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2022
5033 pages
ISBN:9781450393850
DOI:10.1145/3534678
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. explainability
  2. interpretability
  3. robustness
  4. visual explanation

Qualifiers

  • Research-article

Conference

KDD '22

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%
