How to Backdoor Diffusion Models?

Chou, Sheng-Yen; Chen, Pin-Yu; Ho, Tsung-Yi

Computer Science > Computer Vision and Pattern Recognition

arXiv:2212.05400v2 (cs)

[Submitted on 11 Dec 2022 (v1), revised 21 Apr 2023 (this version, v2), latest version 9 Jun 2023 (v3)]

Title:How to Backdoor Diffusion Models?

Authors:Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho

View PDF

Abstract:Diffusion models are state-of-the-art deep learning empowered generative models that are trained based on the principle of learning forward and reverse diffusion processes via progressive noise-addition and denoising. To gain a better understanding of the limitations and potential risks, this paper presents the first study on the robustness of diffusion models against backdoor attacks. Specifically, we propose BadDiffusion, a novel attack framework that engineers compromised diffusion processes during model training for backdoor implantation. At the inference stage, the backdoored diffusion model will behave just like an untampered generator for regular data inputs, while falsely generating some targeted outcome designed by the bad actor upon receiving the implanted trigger signal. Such a critical risk can be dreadful for downstream tasks and applications built upon the problematic model. Our extensive experiments on various backdoor attack settings show that BadDiffusion can consistently lead to compromised diffusion models with high utility and target specificity. Even worse, BadDiffusion can be made cost-effective by simply finetuning a clean pre-trained diffusion model to implant backdoors. We also explore some possible countermeasures for risk mitigation. Our results call attention to potential risks and possible misuse of diffusion models. Our code is available on this https URL.

Comments:	Accepted by CVPR 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2212.05400 [cs.CV]
	(or arXiv:2212.05400v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.05400

Submission history

From: Sheng-Yen Chou [view email]
[v1] Sun, 11 Dec 2022 03:44:38 UTC (34,505 KB)
[v2] Fri, 21 Apr 2023 08:19:51 UTC (37,676 KB)
[v3] Fri, 9 Jun 2023 01:20:27 UTC (37,676 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:How to Backdoor Diffusion Models?

Submission history

Access Paper:

References & Citations

1 blog link

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:How to Backdoor Diffusion Models?

Submission history

Access Paper:

References & Citations

1 blog link

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators