Class Introspection: A Novel Technique for Detecting Unlabeled Subclasses by Leveraging Classifier Explainability Methods

Kage, Patrick; Andreadis, Pavlos

arXiv:2107.01657 (cs)

[Submitted on 4 Jul 2021 (v1), last revised 8 Nov 2021 (this version, v2)]

Abstract: Detecting latent structure within a dataset is a crucial step in analyzing it. However, existing state-of-the-art techniques for subclass discovery are limited: they either detect only very small numbers of outliers or lack the statistical power to deal with complex data such as images or audio. This paper proposes a solution to this subclass discovery problem: by leveraging instance explanation methods, an existing classifier can be extended to detect latent classes via differences in the classifier's internal decisions about each instance. This works not only with simple classification techniques but also with deep neural networks, allowing for a powerful and flexible approach to detecting latent structure within datasets. Effectively, this represents a projection of the dataset into the classifier's "explanation space," and preliminary results show that this technique outperforms the baseline for the detection of latent classes even with limited processing. The paper also presents a pipeline for analyzing classifiers automatically and a web application for interactively exploring the results of this technique.
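
The idea of projecting instances into an "explanation space" and looking for structure there can be illustrated with a toy example. The sketch below is not the authors' pipeline: it assumes a hypothetical linear classifier with hand-picked weights, uses the per-feature contribution w_i * x_i as a stand-in for an instance explanation method, and groups the explanation vectors with a tiny hand-written 2-means routine. All names (`WEIGHTS`, `BIAS`, `explain`, `two_means`, `positives`) and all data values are assumptions made for illustration. For a linear model this attribution is only a rescaling of the input, so the example shows the shape of the procedure rather than its benefit on complex data.

```python
from typing import List, Sequence, Tuple

Vector = Tuple[float, ...]

# Hypothetical linear classifier (assumption): score = w . x + b, positive if score > 0.
WEIGHTS: Vector = (2.0, 0.5)
BIAS = -1.0


def predict(x: Sequence[float]) -> int:
    """Return 1 if the linear score is positive, else 0."""
    score = sum(w * xi for w, xi in zip(WEIGHTS, x)) + BIAS
    return 1 if score > 0 else 0


def explain(x: Sequence[float]) -> Vector:
    """Stand-in explanation: each feature's contribution w_i * x_i to the score."""
    return tuple(w * xi for w, xi in zip(WEIGHTS, x))


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def two_means(points: List[Vector], iters: int = 10) -> List[int]:
    """Deterministic 2-means: seed with the first point and the point farthest from it."""
    centers = [points[0], max(points, key=lambda p: sq_dist(p, points[0]))]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center.
        labels = [
            0 if sq_dist(p, centers[0]) <= sq_dist(p, centers[1]) else 1
            for p in points
        ]
        # Move each center to the mean of its members.
        for k in (0, 1):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centers[k] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels


# Toy data (assumption): one labeled class that hides two unlabeled subclasses.
positives: List[Vector] = [
    (1.0, 0.1), (1.1, 0.0), (0.9, 0.2),  # hidden subclass A: driven by feature 0
    (0.1, 4.0), (0.0, 4.2), (0.2, 3.9),  # hidden subclass B: driven by feature 1
]

# Project every instance into explanation space, then cluster there.
explanations = [explain(x) for x in positives]
subclass_labels = two_means(explanations)
print(subclass_labels)
```

Every instance in `positives` receives the same predicted label, yet the explanation vectors split into two groups because the classifier reaches its decision through different features for each group. With a deep network, the `explain` function would be replaced by an instance explanation method suited to that model, and the clustering step by whatever grouping method fits the data.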

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2107.01657 [cs.LG]
	(or arXiv:2107.01657v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2107.01657

Submission history

From: Patrick Kage
[v1] Sun, 4 Jul 2021 14:58:29 UTC (3,178 KB)
[v2] Mon, 8 Nov 2021 18:19:32 UTC (3,179 KB)
