Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3488904.3493379acmconferencesArticle/Chapter ViewAbstractPublication PagescscsConference Proceedingsconference-collections
research-article

Leveraging Interpretability: Concept-based Pedestrian Detection with Deep Neural Networks

Published: 30 November 2021 Publication History

Abstract

The automation of driving systems relies on proof of the correct functioning of perception. Arguing the safety of deep neural networks (DNNs) must involve quantifiable evidence. Currently, the application of DNNs suffers from an incomprehensible behavior. It is still an open question if post-hoc methods mitigate the safety concerns of trained DNNs. Our work proposes a method for inherently interpretable and concept-based pedestrian detection (CPD). CPD explicitly structures the latent space with concept vectors that learn features for body parts as predefined concepts. The distance-based clustering and separation of latent representations build an interpretable reasoning process. Hence, CPD predicts a body part segmentation based on distances of latent representations to concept vectors. A non-interpretable 2d bounding box prediction for pedestrians complements the segmentation. The proposed CPD generates additional information that can be of great value in a safety argumentation of a DNN for pedestrian detection. We report competitive performance for the task of pedestrian detection. Finally, CPD enables concept-based tests to quantify evidence of a safe perception in automated driving systems.

References

[1]
Plamen Angelov and Eduardo Soares. 2020a. Towards Deep Machine Reasoning: a Prototype-based Deep Neural Network with Decision Tree Inference. In 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE, 2092–2099.
[2]
Plamen Angelov and Eduardo Soares. 2020b. Towards explainable deep neural networks (xDNN). Neural Networks 130(2020), 185–194. Publisher: Elsevier.
[3]
André Araujo, Wade Norris, and Jack Sim. 2019. Computing Receptive Fields of Convolutional Neural Networks. Distill (2019). https://doi.org/10.23915/distill.00021
[4]
Chaofan Chen, Oscar Li, Chaofan Tao, Alina Jade Barnett, Jonathan Su, and Cynthia Rudin. 2019. This Looks Like That: Deep Learning for Interpretable Image Recognition. In Advances in Neural Information Processing Systems (NeurIPS).
[5]
Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, and Hartwig Adam. 2018. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. In Proceedings of the European Conference on Computer Vision (ECCV). 801–818.
[6]
Jacob Cohen. 2013. Statistical power analysis for the behavioral sciences. Academic press.
[7]
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition. Ieee, 248–255.
[8]
Piotr Dollar, Christian Wojek, Bernt Schiele, and Pietro Perona. 2011. Pedestrian Detection: An Evaluation of the State of the Art. IEEE Transactions on Pattern Analysis and Machine Intelligence 34, 4(2011), 743–761. Publisher: IEEE.
[9]
Finale Doshi-Velez and Been Kim. 2017. Towards a Rigorous Science of Interpretable Machine Learning. arXiv preprint arXiv:1702.08608(2017).
[10]
Patrick Feifel, Frank Bonarens, and Frank Koster. 2021. Reevaluating the Safety Impact of Inherent Interpretability on Deep Neural Networks for Pedestrian Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 29–37.
[11]
Ruth Fong and Andrea Vedaldi. 2018. Net2Vec: Quantifying and Explaining How Concepts are Encoded by Filters in Deep Neural Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 8730–8738.
[12]
Kamaledin Ghiasi-Shirazi. 2019. Generalizing the Convolution Operator in Convolutional Neural Networks. Neural Processing Letters 50, 3 (Dec. 2019), 2627–2646. https://doi.org/10.1007/s11063-019-10043-7 arXiv:1707.09864.
[13]
Xavier Glorot and Yoshua Bengio. 2010. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the Conference on Artificial Intelligence and Statistics (AISTATS). JMLR Workshop and Conference Proceedings, 249–256.
[14]
Chuan Guo, Geoff Pleiss, Yu Sun, and Kilian Q. Weinberger. 2017. On Calibration of Modern Neural Networks. In International Conference on Machine Learning (ICML). PMLR, 1321–1330.
[15]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770–778.
[16]
Been Kim, Martin Wattenberg, Justin Gilmer, Carrie Cai, James Wexler, and Fernanda Viegas. 2018. Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV). In International Conference on Machine Learning (ICML). PMLR, 2668–2677.
[17]
Diederik P. Kingma and Jimmy Ba. 2014. ADAM: A Method for Stochastic Optimization. arXiv preprint arXiv:1412.6980(2014).
[18]
Jinpeng Li, Shengcai Liao, Hangzhi Jiang, and Ling Shao. 2020. Box Guided Convolution for Pedestrian Detection. In Proceedings of the 28th ACM International Conference on Multimedia. 1615–1624.
[19]
Guosheng Lin, Anton Milan, Chunhua Shen, and Ian Reid. 2017b. RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1925–1934.
[20]
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017a. Focal Loss for Dense Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2980–2988.
[21]
Wei Liu, Shengcai Liao, Weidong Hu, Xuezhi Liang, and Xiao Chen. 2018. Learning Efficient Single-Stage Pedestrian Detectors by Asymptotic Localization Fitting. In Proceedings of the European Conference on Computer Vision (ECCV). 618–634.
[22]
Wei Liu, Shengcai Liao, Weiqiang Ren, Weidong Hu, and Yinan Yu. 2019. High-level semantic feature detection: A new perspective for pedestrian detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 5187–5196.
[23]
Leland McInnes, John Healy, and James Melville. 2018. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv preprint arXiv:1802.03426(2018).
[24]
Claudio Michaelis, Benjamin Mitzkus, Robert Geirhos, Evgenia Rusak, Oliver Bringmann, Alexander S. Ecker, Matthias Bethge, and Wieland Brendel. 2019. Benchmarking robustness in object detection: Autonomous driving when winter is coming. arXiv preprint arXiv:1907.07484(2019).
[25]
Keivan Nalaie, Kamaledin Ghiasi-Shirazi, and Modhammad-R. Akbarzadeh-T. 2017. Efficient implementation of a generalized convolutional neural networks based on weighted euclidean distance. In International Conference on Computer and Knowledge Engineering (ICCKE). IEEE, 211–216.
[26]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d’ Alché-Buc, E. Fox, and R. Garnett (Eds.). Curran Associates, Inc., 8024–8035. http://papers.neurips.cc/paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf
[27]
Hanyang Peng and Shiqi Yu. 2021. Beyond softmax loss: Intra-concentration and inter-separability loss for classification. Neurocomputing 438 (May 2021), 155–164. https://doi.org/10.1016/j.neucom.2020.11.030
[28]
Peter J. Rousseeuw. 1987. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20 (Nov. 1987), 53–65. https://doi.org/10.1016/0377-0427(87)90125-7
[29]
Cynthia Rudin. 2019. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence 1, 5 (2019), 206–215. Publisher: Nature Publishing Group.
[30]
Cynthia Rudin, Chaofan Chen, Zhi Chen, Haiyang Huang, Lesia Semenova, and Chudi Zhong. 2021. Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges. arXiv preprint arXiv:2103.11251(2021).
[31]
Daniel Smilkov, Nikhil Thorat, Been Kim, Fernanda Viégas, and Martin Wattenberg. 2017. Smoothgrad: Removing Noise by Adding Noise. arXiv preprint arXiv:1706.03825(2017).
[32]
Xiaolin Song, Kaili Zhao, Wen-Sheng Chu, Honggang Zhang, and Jun Guo. 2020. Progressive Refinement Network for Occluded Pedestrian Detection. In Computer Vision – ECCV 2020, Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Vol. 12368. Springer International Publishing, Cham, 32–48. https://doi.org/10.1007/978-3-030-58592-1_3 Series Title: Lecture Notes in Computer Science.
[33]
Antti Tarvainen and Harri Valpola. 2017. Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. arXiv preprint arXiv:1703.01780(2017).
[34]
Xinlong Wang, Tete Xiao, Yuning Jiang, Shuai Shao, Jian Sun, and Chunhua Shen. 2018. Repulsion Loss: Detecting Pedestrians in a Crowd. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 7774–7783.
[35]
Oliver Willers, Sebastian Sudholt, Shervin Raafatnia, and Stephanie Abrecht. 2020. Safety Concerns and Mitigation Approaches Regarding the Use of Deep Learning in Safety-Critical Perception Tasks. In International Conference on Computer Safety, Reliability, and Security. Springer, 336–350.
[36]
Fisher Yu, Dequan Wang, Evan Shelhamer, and Trevor Darrell. 2018. Deep Layer Aggregation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, Salt Lake City, UT, 2403–2412. https://doi.org/10.1109/CVPR.2018.00255
[37]
Jialiang Zhang, Lixiang Lin, Jianke Zhu, Yang Li, Yun-chen Chen, Yao Hu, and CH Steven Hoi. 2020b. Attribute-aware Pedestrian Detection in a Crowd. IEEE Transactions on Multimedia(2020). Publisher: IEEE.
[38]
Jie M. Zhang, Mark Harman, Lei Ma, and Yang Liu. 2020a. Machine Learning Testing: Survey, Landscapes and Horizons. IEEE Transactions on Software Engineering(2020). Publisher: IEEE.
[39]
Shanshan Zhang, Rodrigo Benenson, and Bernt Schiele. 2017. CityPersons: A Diverse Dataset for Pedestrian Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 3213–3221.

Cited By

View all
  • (2023)The Thousand Faces of Explainable AI Along the Machine Learning Life Cycle: Industrial Reality and Current State of ResearchArtificial Intelligence in HCI10.1007/978-3-031-35891-3_13(184-208)Online publication date: 23-Jul-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CSCS '21: Proceedings of the 5th ACM Computer Science in Cars Symposium
November 2021
101 pages
ISBN:9781450391399
DOI:10.1145/3488904
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 November 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. automated driving
  2. body part segmentation
  3. interpretability
  4. neural networks
  5. pedestrian detection
  6. safety
  7. segmentation

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

Conference

CSCS '21
Sponsor:
CSCS '21: Computer Science in Cars Symposium
November 30, 2021
Ingolstadt, Germany

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)16
  • Downloads (Last 6 weeks)1
Reflects downloads up to 16 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)The Thousand Faces of Explainable AI Along the Machine Learning Life Cycle: Industrial Reality and Current State of ResearchArtificial Intelligence in HCI10.1007/978-3-031-35891-3_13(184-208)Online publication date: 23-Jul-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media