Assessing the Local Interpretability of Machine Learning Models

Slack, Dylan; Friedler, Sorelle A.; Scheidegger, Carlos; Roy, Chitradeep Dutta

arXiv:1902.03501 (cs)

[Submitted on 9 Feb 2019 (v1), last revised 2 Aug 2019 (this version, v2)]

Abstract: The increasing adoption of machine learning tools has led to calls for accountability via model interpretability. But what does it mean for a machine learning model to be interpretable by humans, and how can this be assessed? We focus on two definitions of interpretability that have been introduced in the machine learning literature: simulatability (a user's ability to run a model on a given input) and "what if" local explainability (a user's ability to correctly determine a model's prediction under local changes to the input, given knowledge of the model's original prediction). Through a user study with 1,000 participants, we test whether humans perform well on tasks that mimic the definitions of simulatability and "what if" local explainability on models that are typically considered locally interpretable. To track the relative interpretability of models, we employ a simple metric, the runtime operation count on the simulatability task. We find evidence that as the number of operations increases, participant accuracy on the local interpretability tasks decreases. In addition, this evidence is consistent with the common intuition that decision trees and logistic regression models are interpretable and are more interpretable than neural networks.
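
To make the "runtime operation count" idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hypothetical counting rule in which each comparison, multiplication, and addition counts as one operation; the paper's own counting rules may differ. The `Node` class and both `simulate_*` functions are illustrative names introduced here.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass
class Node:
    """Hypothetical binary decision-tree node (illustrative, not from the paper)."""
    feature: int = 0
    threshold: float = 0.0
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    label: Optional[int] = None  # set only on leaves


def simulate_tree(root: Node, x: Sequence[float]) -> Tuple[int, int]:
    """Run the tree on x; return (prediction, operation count).

    Assumed rule: one threshold comparison = one operation.
    """
    ops = 0
    node = root
    while node.label is None:
        ops += 1  # one comparison per internal node visited
        node = node.left if x[node.feature] <= node.threshold else node.right
    return node.label, ops


def simulate_logistic(weights: Sequence[float], bias: float,
                      x: Sequence[float]) -> Tuple[int, int]:
    """Run a logistic-regression decision rule; return (prediction, operation count).

    Assumed rule: one multiply and one add per feature, plus one final
    comparison of the score with 0 (equivalent to sigmoid(score) >= 0.5).
    """
    ops = 0
    score = bias
    for w, xi in zip(weights, x):
        score += w * xi
        ops += 2  # one multiplication, one addition
    ops += 1  # final comparison against the decision boundary
    return int(score >= 0), ops


# Simulatability: run the model by hand on one input.
tree = Node(feature=0, threshold=2.0,
            left=Node(label=0),
            right=Node(feature=1, threshold=1.0,
                       left=Node(label=0), right=Node(label=1)))
x = [3.0, 0.5]
tree_pred, tree_ops = simulate_tree(tree, x)

# "What if" local explainability: change one feature and predict again.
x_what_if = [3.0, 1.5]
what_if_pred, what_if_ops = simulate_tree(tree, x_what_if)

lr_pred, lr_ops = simulate_logistic([1.0, -2.0], 0.5, x)
```

Under these assumptions the tree needs two operations for this input, while the two-feature logistic rule needs five. A count of this kind can be compared across model types for the same input.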

Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC); Machine Learning (stat.ML)
Cite as: arXiv:1902.03501 [cs.LG] (or arXiv:1902.03501v2 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.1902.03501

Submission history

From: Dylan Slack
[v1] Sat, 9 Feb 2019 21:49:36 UTC (4,841 KB)
[v2] Fri, 2 Aug 2019 23:17:38 UTC (6,187 KB)
