Trustworthy Conceptual Explanations for Neural Networks in Robot Decision-Making

Sagar, Som; Taparia, Aditya; Mankodiya, Harsh; Bidare, Pranav; Zhou, Yifan; Senanayake, Ransalu

Computer Science > Robotics

arXiv:2409.10733 (cs)

[Submitted on 16 Sep 2024]

Title:Trustworthy Conceptual Explanations for Neural Networks in Robot Decision-Making

Authors:Som Sagar, Aditya Taparia, Harsh Mankodiya, Pranav Bidare, Yifan Zhou, Ransalu Senanayake

View PDF HTML (experimental)

Abstract:Black box neural networks are an indispensable part of modern robots. Nevertheless, deploying such high-stakes systems in real-world scenarios poses significant challenges when the stakeholders, such as engineers and legislative bodies, lack insights into the neural networks' decision-making process. Presently, explainable AI is primarily tailored to natural language processing and computer vision, falling short in two critical aspects when applied in robots: grounding in decision-making tasks and the ability to assess trustworthiness of their explanations. In this paper, we introduce a trustworthy explainable robotics technique based on human-interpretable, high-level concepts that attribute to the decisions made by the neural network. Our proposed technique provides explanations with associated uncertainty scores by matching neural network's activations with human-interpretable visualizations. To validate our approach, we conducted a series of experiments with various simulated and real-world robot decision-making models, demonstrating the effectiveness of the proposed approach as a post-hoc, human-friendly robot learning diagnostic tool.

Comments:	19 pages, 25 figures
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG)
Cite as:	arXiv:2409.10733 [cs.RO]
	(or arXiv:2409.10733v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2409.10733

Submission history

From: Som Sagar [view email]
[v1] Mon, 16 Sep 2024 21:11:12 UTC (32,091 KB)

Computer Science > Robotics

Title:Trustworthy Conceptual Explanations for Neural Networks in Robot Decision-Making

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Trustworthy Conceptual Explanations for Neural Networks in Robot Decision-Making

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators