A Self-Supervised Descriptor for Image Copy Detection

Pizzi, Ed; Roy, Sreya Dutta; Ravindra, Sugosh Nagavara; Goyal, Priya; Douze, Matthijs

arXiv:2202.10261 (cs)

[Submitted on 21 Feb 2022 (v1), last revised 25 Mar 2022 (this version, v2)]

Abstract: Image copy detection is an important task for content moderation. We introduce SSCD, a model that builds on a recent self-supervised contrastive training objective. We adapt this method to the copy detection task by changing the architecture and training objective, including a pooling operator from the instance matching literature, and adapting contrastive learning to augmentations that combine images.
Our approach relies on an entropy regularization term, promoting consistent separation between descriptor vectors, and we demonstrate that this significantly improves copy detection accuracy. Our method produces a compact descriptor vector, suitable for real-world web scale applications. Statistical information from a background image distribution can be incorporated into the descriptor.
On the recent DISC2021 benchmark, SSCD outperforms both baseline copy detection models and self-supervised architectures designed for image classification by large margins, in all settings. For example, SSCD outperforms SimCLR descriptors by 48% absolute. Code is available at this https URL.
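
The abstract does not spell out the form of the entropy regularization term. As an illustration only, the sketch below assumes a nearest-neighbor (Kozachenko-Leonenko style) differential entropy estimate over L2-normalized descriptors: the penalty grows when a descriptor sits very close to its nearest neighbor in the batch, which pushes descriptors apart. The function names `l2_normalize` and `entropy_regularizer` and the `eps` parameter are hypothetical and are not taken from the SSCD code. A training setup would presumably exclude augmented copies of the same image from the neighbor search; this sketch omits that step.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def entropy_regularizer(descriptors, eps=1e-8):
    """Assumed regularizer: negative mean log distance to the nearest neighbor.

    Lower values mean the descriptors are more spread out in the batch.
    """
    zs = [l2_normalize(z) for z in descriptors]
    total = 0.0
    for i, zi in enumerate(zs):
        # Distance from descriptor i to its closest other descriptor in the batch.
        nearest = min(math.dist(zi, zj) for j, zj in enumerate(zs) if j != i)
        total += math.log(nearest + eps)
    return -total / len(zs)


# Toy batches: one tightly clustered, one spread over the unit circle.
clustered = [[1.0, 0.0], [0.99, 0.01], [0.98, 0.02]]
spread = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]

# The clustered batch receives the larger penalty.
print(entropy_regularizer(clustered) > entropy_regularizer(spread))
```

Under this assumption the term would be added to the contrastive loss with a weighting factor, so that minimizing the total loss both pulls copies together and keeps unrelated descriptors separated.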

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2202.10261 [cs.CV]
	(or arXiv:2202.10261v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2202.10261

Submission history

From: Matthijs Douze
[v1] Mon, 21 Feb 2022 14:25:32 UTC (5,754 KB)
[v2] Fri, 25 Mar 2022 18:15:44 UTC (5,754 KB)
