LLM2Loss: Leveraging Language Models for Explainable Model Diagnostics

Ardeshir, Shervin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.03212 (cs)

[Submitted on 4 May 2023 (v1), last revised 17 May 2023 (this version, v2)]

Title:LLM2Loss: Leveraging Language Models for Explainable Model Diagnostics

Authors:Shervin Ardeshir

View PDF

Abstract:Trained on a vast amount of data, Large Language models (LLMs) have achieved unprecedented success and generalization in modeling fairly complex textual inputs in the abstract space, making them powerful tools for zero-shot learning. Such capability is extended to other modalities such as the visual domain using cross-modal foundation models such as CLIP, and as a result, semantically meaningful representation are extractable from visual inputs.
In this work, we leverage this capability and propose an approach that can provide semantic insights into a model's patterns of failures and biases. Given a black box model, its training data, and task definition, we first calculate its task-related loss for each data point. We then extract a semantically meaningful representation for each training data point (such as CLIP embeddings from its visual encoder) and train a lightweight diagnosis model which maps this semantically meaningful representation of a data point to its task loss. We show that an ensemble of such lightweight models can be used to generate insights on the performance of the black-box model, in terms of identifying its patterns of failures and biases.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.03212 [cs.CV]
	(or arXiv:2305.03212v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.03212

Submission history

From: Shervin Ardeshir [view email]
[v1] Thu, 4 May 2023 23:54:37 UTC (215 KB)
[v2] Wed, 17 May 2023 22:36:03 UTC (673 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LLM2Loss: Leveraging Language Models for Explainable Model Diagnostics

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LLM2Loss: Leveraging Language Models for Explainable Model Diagnostics

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators