SVGDreamer: Text Guided SVG Generation with Diffusion Model

Xing, Ximing; Zhou, Haitao; Wang, Chuang; Zhang, Jing; Xu, Dong; Yu, Qian

arXiv:2312.16476 (cs)

[Submitted on 27 Dec 2023 (v1), last revised 2 Apr 2024 (this version, v5)]

Abstract: Recently, text-guided scalable vector graphics (SVG) synthesis has shown promise in domains such as iconography and sketch. However, existing text-to-SVG generation methods lack editability and struggle with visual quality and result diversity. To address these limitations, we propose a novel text-guided vector graphics synthesis method called SVGDreamer. SVGDreamer incorporates a semantic-driven image vectorization (SIVE) process that enables the decomposition of synthesis into foreground objects and background, thereby enhancing editability. Specifically, the SIVE process introduces attention-based primitive control and an attention-mask loss function for effective control and manipulation of individual elements. Additionally, we propose a Vectorized Particle-based Score Distillation (VPSD) approach to address issues of shape over-smoothing, color over-saturation, limited diversity, and slow convergence of the existing text-to-SVG generation methods by modeling SVGs as distributions of control points and colors. Furthermore, VPSD leverages a reward model to re-weight vector particles, which improves aesthetic appeal and accelerates convergence. Extensive experiments are conducted to validate the effectiveness of SVGDreamer, demonstrating its superiority over baseline methods in terms of editability, visual quality, and diversity. Project page: this https URL

Comments:	Accepted by CVPR 2024. project link: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2312.16476 [cs.CV]
	(or arXiv:2312.16476v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.16476

Submission history

From: XiMing Xing
[v1] Wed, 27 Dec 2023 08:50:01 UTC (93,615 KB)
[v2] Wed, 3 Jan 2024 14:40:49 UTC (93,617 KB)
[v3] Sun, 17 Mar 2024 09:12:58 UTC (48,269 KB)
[v4] Mon, 25 Mar 2024 11:24:45 UTC (45,667 KB)
[v5] Tue, 2 Apr 2024 13:25:04 UTC (45,663 KB)
