Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images

Santosh; Lin, Li; Amerini, Irene; Wang, Xin; Hu, Shu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.12908 (cs)

[Submitted on 19 Apr 2024 (v1), last revised 8 Sep 2024 (this version, v2)]

Title:Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images

Authors:Santosh, Li Lin, Irene Amerini, Xin Wang, Shu Hu

View PDF HTML (experimental)

Abstract:Diffusion models (DMs) have revolutionized image generation, producing high-quality images with applications spanning various fields. However, their ability to create hyper-realistic images poses significant challenges in distinguishing between real and synthetic content, raising concerns about digital authenticity and potential misuse in creating deepfakes. This work introduces a robust detection framework that integrates image and text features extracted by CLIP model with a Multilayer Perceptron (MLP) classifier. We propose a novel loss that can improve the detector's robustness and handle imbalanced datasets. Additionally, we flatten the loss landscape during the model training to improve the detector's generalization capabilities. The effectiveness of our method, which outperforms traditional detection techniques, is demonstrated through extensive experiments, underscoring its potential to set a new state-of-the-art approach in DM-generated image detection. The code is available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2404.12908 [cs.CV]
	(or arXiv:2404.12908v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.12908

Submission history

From: Li Lin [view email]
[v1] Fri, 19 Apr 2024 14:30:41 UTC (1,485 KB)
[v2] Sun, 8 Sep 2024 04:46:00 UTC (1,486 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators