CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit Directions

Abdal, Rameen; Zhu, Peihao; Femiani, John; Mitra, Niloy J.; Wonka, Peter

arXiv:2112.05219 (cs)

[Submitted on 9 Dec 2021]

Abstract: The success of StyleGAN has enabled unprecedented semantic editing capabilities on both synthesized and real images. However, such editing operations are either trained with semantic supervision or described using human guidance. In another development, the CLIP architecture has been trained with internet-scale image and text pairings and has been shown to be useful in several zero-shot learning settings. In this work, we investigate how to effectively link the pretrained latent spaces of StyleGAN and CLIP, which in turn allows us to automatically extract semantically labeled edit directions from StyleGAN, finding and naming meaningful edit operations without any additional human guidance. Technically, we propose two novel building blocks: one for finding interesting CLIP directions and one for labeling arbitrary directions in CLIP latent space. The setup does not assume any predetermined labels, and hence we do not require any additional supervised text or attributes to build the editing framework. We evaluate the effectiveness of the proposed method and demonstrate that extracting disentangled, labeled StyleGAN edit directions is indeed possible and reveals interesting, non-trivial edit directions.
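The abstract does not say how a direction in CLIP latent space is labeled. As a rough illustration of the general idea only, and not the authors' method, the sketch below assumes a direction can be named by picking the candidate word whose text embedding has the highest cosine similarity with it. The function names, the candidate words, and the 3-dimensional toy vectors are all hypothetical stand-ins; real CLIP embeddings are much higher-dimensional and would come from a pretrained model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def label_direction(direction, text_embeddings):
    """Return the candidate label most aligned with the given direction.

    Assumption: nearest-neighbor lookup by cosine similarity is used here
    purely for illustration; the paper's labeling block may differ.
    """
    return max(
        text_embeddings,
        key=lambda name: cosine(direction, text_embeddings[name]),
    )


# Hypothetical toy vectors standing in for text embeddings of candidate words.
toy_text_embeddings = {
    "beard": [0.9, 0.1, 0.0],
    "glasses": [0.0, 1.0, 0.1],
    "smile": [0.1, 0.0, 0.95],
}

# Hypothetical direction, e.g. a difference between two image embeddings.
toy_direction = [0.05, 0.8, 0.2]

best_label = label_direction(toy_direction, toy_text_embeddings)
print(best_label)
```

In this toy setup the direction lies mostly along the second axis, so it is matched to the candidate whose embedding also lies along that axis. A realistic pipeline would also need a way to propose the candidate vocabulary, which this sketch leaves out.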

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Cite as:	arXiv:2112.05219 [cs.CV]
	(or arXiv:2112.05219v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2112.05219

Submission history

From: Rameen Abdal
[v1] Thu, 9 Dec 2021 21:26:03 UTC (20,364 KB)
