
Ablating Concepts in Text-to-Image Diffusion Models

CMU    Tsinghua University    Adobe Research

ICCV 2023

Our method can ablate (remove) copyrighted materials and memorized images from pretrained text-to-image diffusion models. We map the target concept's distribution to that of an anchor concept, e.g., Van Gogh paintings to paintings, or Grumpy Cat to cat. Our method can also prevent the generation of memorized images.


Abstract

Large-scale text-to-image diffusion models can generate high-fidelity images with powerful compositional ability. However, these models are typically trained on an enormous amount of Internet data, often containing copyrighted material, licensed images, and personal photos. Furthermore, they have been found to replicate the style of various living artists or memorize exact training samples. How can we remove such copyrighted concepts or images without retraining the model from scratch? To achieve this goal, we propose an efficient method of ablating concepts in the pretrained model, i.e., preventing the generation of a target concept. Our algorithm learns to match the image distribution for a target style, instance, or text prompt we wish to ablate to the distribution corresponding to an anchor concept. This prevents the model from generating target concepts given their text conditions. Extensive experiments show that our method can successfully prevent the generation of the ablated concept while preserving closely related concepts in the model.


Algorithm

Given a target concept Grumpy Cat to ablate and the anchor concept Cat, we fine-tune the model to produce the same prediction for the target concept prompt "A cute little Grumpy Cat" as for the anchor prompt "A cute little cat". The algorithm minimizes the KL divergence between the image distribution generated for the target concept and the image distribution of the anchor concept.
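Below is a minimal PyTorch sketch of this fine-tuning objective in the spirit of a diffusers-style Stable Diffusion training loop. The function and the component names (unet, frozen_unet, tokenizer, text_encoder, noise_scheduler) are illustrative placeholders, not the official training code.

# Sketch of a concept-ablation objective: the fine-tuned model's prediction
# for the TARGET prompt is regressed toward the frozen pretrained model's
# prediction for the ANCHOR prompt. All names below are assumed components
# of a diffusers-style Stable Diffusion setup.
import torch
import torch.nn.functional as F

def encode(prompt, tokenizer, text_encoder, device):
    ids = tokenizer(prompt, padding="max_length",
                    max_length=tokenizer.model_max_length,
                    truncation=True, return_tensors="pt").input_ids.to(device)
    return text_encoder(ids)[0]

def ablation_loss(latents, unet, frozen_unet, tokenizer, text_encoder,
                  noise_scheduler, target_prompt, anchor_prompt, device):
    # latents: VAE latents of anchor-concept images (e.g., generic cats).
    noise = torch.randn_like(latents)
    t = torch.randint(0, noise_scheduler.config.num_train_timesteps,
                      (latents.shape[0],), device=device).long()
    noisy = noise_scheduler.add_noise(latents, noise, t)

    # Teacher: frozen pretrained model conditioned on the anchor prompt.
    with torch.no_grad():
        anchor_emb = encode(anchor_prompt, tokenizer, text_encoder, device)
        anchor_pred = frozen_unet(noisy, t, anchor_emb).sample

    # Student: fine-tuned model conditioned on the target prompt.
    target_emb = encode(target_prompt, tokenizer, text_encoder, device)
    pred = unet(noisy, t, target_emb).sample

    # Matching the two predictions pulls the target concept's generated
    # distribution onto the anchor concept's distribution.
    return F.mse_loss(pred, anchor_pred.detach())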


Ablating Instances

We show results of ablating various instances and overwriting each with a general-category anchor concept. The ablated model generates anchor-concept images, whereas the pretrained model generates the target concept. All our experiments are based on Stable Diffusion. For comparisons to baselines, please refer to our Gallery page. Click images to see more examples.
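As a hypothetical illustration of how such a side-by-side comparison could be run with the diffusers library, one might sample the same prompt from the pretrained checkpoint and from an ablated checkpoint; the ablated-model path below is a placeholder, not a released file.

# Compare a pretrained Stable Diffusion checkpoint against an ablated one
# on a target-concept prompt. "path/to/ablated-model" is a placeholder.
import torch
from diffusers import StableDiffusionPipeline

prompt = "a cute little Grumpy Cat"
for name in ["runwayml/stable-diffusion-v1-5", "path/to/ablated-model"]:
    pipe = StableDiffusionPipeline.from_pretrained(
        name, torch_dtype=torch.float16).to("cuda")
    image = pipe(prompt, num_inference_steps=50, guidance_scale=7.5).images[0]
    image.save(f"{name.split('/')[-1]}_grumpy_cat.png")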



Ablating Styles

We show results of ablating different target style concepts and generating normal paintings instead. The ablated model generates images that differ from those of the pretrained model for the given target concept. Click images to see more examples.



Ablating Memorized Images

Diffusion models have been shown to generate exact (or close) copies of training images [1,2]. Our method can also be used to ablate these memorized training images and instead generate variations. Click images to see more examples.



Compositional Ablation

We show that our method can be used to ablate the composition of two concepts while still preserving the meaning of each concept, for example, ablating "kids with guns" while still generating each category individually.


Limitations

Our method still has several limitations. It sometimes fails to ablate certain famous paintings and can degrade closely related surrounding concepts.

In figure (a) below, our method fails to remove certain paintings generated from the paintings' titles. We can further ablate these concepts, as shown in (b). In figure (c), after we remove the target concept (e.g., Van Gogh), results for surrounding concepts (e.g., Monet) sometimes degrade slightly compared to the pretrained model in figure (d).


Citation

@inproceedings{kumari2023conceptablation,
  author = {Kumari, Nupur and Zhang, Bingliang and Wang, Sheng-Yu and Shechtman, Eli and Zhang, Richard and Zhu, Jun-Yan},
  title = {Ablating Concepts in Text-to-Image Diffusion Models},
  booktitle = {International Conference on Computer Vision (ICCV)},
  year = {2023},
}

Concurrent Work


Related Works


Acknowledgements

We are grateful to Gaurav Parmar, Daohan Lu, Muyang Li, Songwei Ge, Jingwan Lu, Sylvain Paris, and Bryan Russell for their helpful discussions, and to Aniruddha Mahapatra and Kangle Deng for paper proofreading. The work is partly supported by Adobe and NSF IIS-2239076. The website template is taken from the Custom Diffusion project page.