Slicing Through Bias: Explaining Performance Gaps in Medical Image Analysis using Slice Discovery Methods

Olesen, Vincent; Weng, Nina; Feragen, Aasa; Petersen, Eike

arXiv:2406.12142v1 (cs)

[Submitted on 17 Jun 2024 (this version), latest version 22 Oct 2024 (v2)]

Abstract: Machine learning models have achieved high overall accuracy in medical image analysis. However, performance disparities on specific patient groups pose challenges to their clinical utility, safety, and fairness. Such disparities can affect known patient groups, such as those based on sex, age, or disease subtype, as well as previously unknown and unlabeled groups. Furthermore, the root cause of such observed performance disparities is often challenging to uncover, hindering mitigation efforts. In this paper, to address these issues, we leverage Slice Discovery Methods (SDMs) to identify interpretable underperforming subsets of data and formulate hypotheses regarding the cause of observed performance disparities. We introduce a novel SDM and apply it in a case study on the classification of pneumothorax and atelectasis from chest X-rays. Our study demonstrates the effectiveness of SDMs in hypothesis formulation and yields an explanation of previously observed but unexplained performance disparities between male and female patients in widely used chest X-ray datasets and models. Our findings indicate shortcut learning in both classification tasks, through the presence of chest drains and ECG wires, respectively. Sex-based differences in the prevalence of these shortcut features appear to cause the observed classification performance gap, representing a previously underappreciated interaction between shortcut learning and model fairness analyses.
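
The mechanism described in the abstract can be illustrated with a small sketch. This is not the SDM introduced in the paper; it is a deliberately naive stand-in that only ranks single-attribute slices by error rate. The data, the attribute names `sex` and `chest_drain`, the error rates, and the prevalences (including which group sees the feature more often) are all invented for illustration. The toy classifier's errors depend only on whether the shortcut feature is present, never on sex.

```python
from collections import defaultdict


def slice_error_rates(records, attributes):
    """Map each (attribute, value) slice to (error_rate, sample_count)."""
    stats = defaultdict(lambda: [0, 0])  # slice -> [n_errors, n_samples]
    for rec in records:
        for attr in attributes:
            entry = stats[(attr, rec[attr])]
            entry[0] += 0 if rec["correct"] else 1
            entry[1] += 1
    return {key: (errs / n, n) for key, (errs, n) in stats.items()}


def make_group(sex, n_drain, n_no_drain):
    """Toy data (invented): errors depend only on the drain, never on sex."""
    records = []
    for i in range(n_drain):       # 1 in 10 wrong when a drain is visible
        records.append({"sex": sex, "chest_drain": True, "correct": i % 10 != 0})
    for i in range(n_no_drain):    # 4 in 10 wrong when no drain is visible
        records.append({"sex": sex, "chest_drain": False, "correct": i % 10 >= 4})
    return records


# Invented prevalences: the drain is more common in one group than the other.
records = make_group("M", 80, 20) + make_group("F", 40, 60)

rates = slice_error_rates(records, ["sex", "chest_drain"])
worst_slice = max(rates, key=lambda key: rates[key][0])

# Stratify by the suspected shortcut: the sex gap should vanish within a stratum.
with_drain = [r for r in records if r["chest_drain"]]
rates_with_drain = slice_error_rates(with_drain, ["sex"])

for key in sorted(rates, key=lambda k: -rates[k][0]):
    print(key, rates[key])
```

In this toy data the two sex slices have different marginal error rates even though sex never enters the error model. The worst slice is the one without the shortcut feature, and within the stratum where the feature is present the two groups have identical error rates. This is the pattern of prevalence-driven gaps that the abstract attributes to shortcut learning.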

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
Cite as:	arXiv:2406.12142 [cs.LG]
	(or arXiv:2406.12142v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.12142

Submission history

From: Eike Petersen
[v1] Mon, 17 Jun 2024 23:08:46 UTC (4,490 KB)
[v2] Tue, 22 Oct 2024 13:32:34 UTC (630 KB)
