Linking Model Intervention to Causal Interpretation in Model Explanation

Cheng, Debo; Xu, Ziqi; Li, Jiuyong; Liu, Lin; Yu, Kui; Le, Thuc Duy; Liu, Jixue

arXiv:2410.15648 (cs)

[Submitted on 21 Oct 2024]

Abstract: Intervention intuition is often used in model explanation: the intervention effect of a feature on the outcome is quantified by the difference in the model's prediction when the feature value is changed from its current value to a baseline value. Such a model intervention effect is inherently associational. In this paper, we study the conditions under which an intuitive model intervention effect has a causal interpretation, i.e., when it indicates whether a feature is a direct cause of the outcome. This work links the model intervention effect to the causal interpretation of a model. Such an interpretation capability is important because it indicates whether a machine learning model is trustworthy to domain experts. The conditions also reveal the limitations of using a model intervention effect for causal interpretation in an environment with unobserved features. Experiments on semi-synthetic datasets validate the theorems and show the potential of the model intervention effect for model interpretation.
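
The quantity described in the abstract, the change in a model's prediction when one feature is moved from its current value to a baseline value, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `model_intervention_effect`, the toy model `toy_predict`, the baseline of 0.0, and the sign convention (prediction at the current value minus prediction at the baseline) are all assumptions made for illustration. The sketch computes only the associational quantity; the conditions under which it has a causal reading are the subject of the paper and are not reproduced here.

```python
from typing import Callable, List, Sequence


def model_intervention_effect(
    predict: Callable[[Sequence[float]], float],
    x: Sequence[float],
    feature: int,
    baseline: float,
) -> float:
    """Prediction at x minus prediction with x[feature] replaced by baseline.

    The sign convention is an assumption; the abstract only says "difference".
    """
    x_base: List[float] = list(x)   # copy so the caller's input is not modified
    x_base[feature] = baseline      # "intervene" on a single feature
    return predict(x) - predict(x_base)


def toy_predict(x: Sequence[float]) -> float:
    """Hypothetical model: depends on feature 0 and ignores feature 1."""
    return 2.0 * x[0] + 0.0 * x[1] + 1.0


instance = [3.0, 5.0]
effect_0 = model_intervention_effect(toy_predict, instance, feature=0, baseline=0.0)
effect_1 = model_intervention_effect(toy_predict, instance, feature=1, baseline=0.0)
print(effect_0, effect_1)
```

In this toy setting the effect is nonzero only for the feature the model actually uses. Whether a nonzero effect indicates a direct cause of the outcome depends on the conditions the paper studies, which this sketch does not check.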

Subjects:	Machine Learning (cs.LG); Methodology (stat.ME)
Cite as:	arXiv:2410.15648 [cs.LG]
	(or arXiv:2410.15648v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.15648

Submission history

From: Ziqi Xu
[v1] Mon, 21 Oct 2024 05:16:59 UTC (662 KB)
