StableDrag: Stable Dragging for Point-based Image Editing

Cui, Yutao; Zhao, Xiaotong; Zhang, Guozhen; Cao, Shengming; Ma, Kai; Wang, Limin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.04437 (cs)

[Submitted on 7 Mar 2024]

Title:StableDrag: Stable Dragging for Point-based Image Editing

Authors:Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, Limin Wang

View PDF HTML (experimental)

Abstract:Point-based image editing has attracted remarkable attention since the emergence of DragGAN. Recently, DragDiffusion further pushes forward the generative quality via adapting this dragging technique to diffusion models. Despite these great success, this dragging scheme exhibits two major drawbacks, namely inaccurate point tracking and incomplete motion supervision, which may result in unsatisfactory dragging outcomes. To tackle these issues, we build a stable and precise drag-based editing framework, coined as StableDrag, by designing a discirminative point tracking method and a confidence-based latent enhancement strategy for motion supervision. The former allows us to precisely locate the updated handle points, thereby boosting the stability of long-range manipulation, while the latter is responsible for guaranteeing the optimized latent as high-quality as possible across all the manipulation steps. Thanks to these unique designs, we instantiate two types of image editing models including StableDrag-GAN and StableDrag-Diff, which attains more stable dragging performance, through extensive qualitative experiments and quantitative assessment on DragBench.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.04437 [cs.CV]
	(or arXiv:2403.04437v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.04437

Submission history

From: Yutao Cui [view email]
[v1] Thu, 7 Mar 2024 12:11:02 UTC (5,765 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:StableDrag: Stable Dragging for Point-based Image Editing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:StableDrag: Stable Dragging for Point-based Image Editing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators