Interpretable Measures of Conceptual Similarity by Complexity-Constrained Descriptive Auto-Encoding

Alessandro Achille, Greg Ver Steeg, Tian Yu Liu, Matthew Trager, Carson Klingenberg, Stefano Soatto

arXiv:2402.08919 (cs)

[Submitted on 14 Feb 2024]

Abstract: Quantifying the degree of similarity between images is a key copyright issue for image-based machine learning. In legal doctrine, however, determining the degree of similarity between works requires subjective analysis, and fact-finders (judges and juries) can demonstrate considerable variability in these subjective judgement calls. Images that are structurally similar can be deemed dissimilar, whereas images of completely different scenes can be deemed similar enough to support a claim of copying. We seek to define and compute a notion of "conceptual similarity" among images that captures high-level relations even among images that do not share repeated elements or visually similar components. The idea is to use a base multi-modal model to generate "explanations" (captions) of visual data at increasing levels of complexity. Then, similarity can be measured by the length of the caption needed to discriminate between the two images: two highly dissimilar images can be discriminated early in their description, whereas conceptually similar ones will need more detail to be distinguished. We operationalize this definition and show that it correlates with subjective (averaged human evaluation) assessment, and beats existing baselines on both image-to-image and text-to-text similarity benchmarks. Beyond just providing a number, our method also offers interpretability by pointing to the specific level of granularity of the description where the source data are differentiated.
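
The idea in the abstract can be illustrated with a toy Python sketch. This is not the authors' implementation: the abstract describes generating captions with a multi-modal model, whereas the sketch below assumes the captions at each complexity level are already available as plain strings, and it compares them by exact (case-insensitive) match. The function names `discrimination_level` and `conceptual_similarity`, the normalization to [0, 1], and the example captions are all hypothetical.

```python
from typing import Sequence


def discrimination_level(desc_a: Sequence[str], desc_b: Sequence[str]) -> int:
    """Return the first (1-indexed) complexity level at which two caption
    sequences differ. If they agree at every shared level, return depth + 1.

    Assumption: desc[k] is the caption of an image at complexity level k + 1,
    ordered from coarsest to most detailed.
    """
    depth = min(len(desc_a), len(desc_b))
    for level in range(depth):
        # Exact string comparison stands in for a real discrimination test.
        if desc_a[level].strip().lower() != desc_b[level].strip().lower():
            return level + 1
    return depth + 1


def conceptual_similarity(desc_a: Sequence[str], desc_b: Sequence[str]) -> float:
    """Fraction of shared levels that fail to tell the two images apart:
    0.0 if they differ at the coarsest level, 1.0 if they never differ."""
    depth = min(len(desc_a), len(desc_b))
    if depth == 0:
        raise ValueError("each image needs at least one caption")
    return (discrimination_level(desc_a, desc_b) - 1) / depth


# Hypothetical hand-written captions at three levels of complexity.
cat_sofa = ["an animal", "a cat", "a cat on a sofa"]
cat_grass = ["an animal", "a cat", "a cat on grass"]
car_road = ["a vehicle", "a car", "a red car on a road"]

print(discrimination_level(cat_sofa, cat_grass))  # differ only at level 3
print(discrimination_level(cat_sofa, car_road))   # differ already at level 1
```

In this sketch, the level returned by `discrimination_level` plays the role of the interpretable output mentioned in the abstract: it points to the granularity of description at which the two inputs stop being indistinguishable.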

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2402.08919 [cs.CV]
	(or arXiv:2402.08919v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.08919

Submission history

From: Alessandro Achille [view email]
[v1] Wed, 14 Feb 2024 03:31:17 UTC (3,249 KB)
